EMC distortion about capacity efficiency - Around the Storage Block Blog -
EMC distortion about capacity efficiency

By Warren Smith, HP StorageWorks

In an all too familiar world, we are compelled to report that the information in Chuck's blog on capacity efficiency is a distortion as far as the EVA is concerned. Read Chuck's blog post 

That this is a distortion, we are certain and details are provided below for the readers to evaluate for themselves.  But we wonder if Chuck and his team of EMC testing engineers, in a rush to post this, have not allowed their own ambition to deliberately ignore HP EVA technical guidance, or inadvertently done so.  In either case, you will find details below why we think that Chuck should have used the long weekend to re-consider the facts, before dashing off this blog.

Real story about EVA Usable Capacity

The best place to start a response to Chuck Hollis is to say that the EVA configuration cited in EMC's "fair comparison" test is a radical departure from published EVA best practices that can be found by clicking here.

Major points from these EVA best practices that undermine EMC's "fair comparison" test are:

CONCISE SUMMARY:  If EMC had followed HP's best practices for EVA, they would have configured this as a 2 disk group, rather than deciding to use 7 disk groups for this example.   The 2 disk groups would have consisted of 1 large disk group for Exchange data and 1 small disk group for logs.   Using 146GB drives example, then we would actually set aside the equivalent of 2 146GB drives for the single level of redundancy.  This is regardless of VRAID1 or VRAID5.  If EMC had followed this one single adherence to the EVA best practices, the resulting capacity rate shoots up dramatically to 74.73% capacity efficient with vRAID 5. 

For those with a taste for the details, the following bullets parse out how EMC got it wrong with the EVA Capacity Efficiency description and calculations.  But we add here that EMC and Chuck himself formerly whined about how NetApp had evaluated one their systems prejudicially by picking a configuration that would deliberately weaken the EMC system performance in the comparison.  We think that Chuck should practice what he preaches, because that is exactly what EMC has done in this comparison with the EVA, ie. ignored the EVA best practices guidance and implemented a worst case configuration.  Or maybe this was just an oversight by Chuck.

Disk Groups.  There is an assumption in Chuck's analysis that since Microsoft Exchange recommends separating Data and Logs into different EVA disk groups that every application's data must be separated.  We believe that is what led him to using the 7 disk groups with EVA choice, and he simply added up the number of data volumes needed for the "application example" he chose and said 1 per disk group.  The reality is that the separation is not for performance reasons but for some perceived data protection issues.  Microsoft Exchange relies on the Log files to protect and recover the Data files so they do not feel comfortable placing them in the same disk group lest they both be affected by some disk issues / failure / recovery.  However, if two Microsoft Exchange Servers have their data stored on the same EVA, it is perfectly acceptable for one EVA Disk Group to contain the Data from Exchange Server 1 *and* the Logs from Exchange Server 2.  Another EVA Disk Group would then contain the Logs from Exchange Server 1 *and* the Data from Exchange Server 2.  From the point of view of each Exchange Server, their separation has been accomplished.  From the EVA's point of view, the number of Disk groups has been kept to a minimum, achieving the performance recommendations.

  • Hot spares. Chuck's description of the Distributed Sparing method used by EVA is not quite correct. The EVA spares virtual volume, as opposed to dedicated spare physical drives, across disk groups to deliver greater, rather than less capacity utilization, as suggested. The added advantages of EVA Distributed Sparing include the ability to incorporate added spindles (for the virtual distributed spare volume) in all disk group transactions to raise performance, while simultaneously providing virtual capacity for sparing use in the event of disk failure. This method also provides for room to rebuild in the event of failures that is faster than the comparative physical hot spare approach. And finally, the choice to use hot spinning spare drives, raises the prospect (admittedly small but still existing) of failure of the hot spares themselves. HP has asked the question: If the spare drive mechanics must be hot and available, why not have that contribute to the disk group performance? This is what EVA's Distributed Sparing method provides.

Also, "Virtual Spares" and "Proactive Disk Management" are two separate things so Chuck has also double counted the spares for the EVA example.  Since the "Protection Level" can be for 1 or 2 failures, the number of spares can either be 2 or 4 spindles per disk group.  Most of the EVA examples we show use Protection Level 1 (2 disks per group).

  • Snapshots. Chuck is wrong in his statement that "HP EVA's ‘Virtually Capacity Free' snapshots, require close to the same amount of capacity as a CX4 would." The EVA uses two techniques for local replication, Cloning, snapclones, and mirrorclones - for complete point in time copies and Snapshots for efficient point in time copies. EVA also supports two types of snapshots and this may be the source of confusion. The first type, I fully reserved, i.e. reserves enough space for a fully diverged snapshot. The other type, more appropriate for this scenario, is space efficient, which only allocates space as required and keeps track of unchanged data via pointers. This type of snapshot has the potential to be virtually capacity free and can also be mounted and written to. The possible efficiency of this second snapshot method is a function of the planning effort by the user. If the user does no planning, and has no idea of the rate of change of his data, then the SnapClone, or fully reserved snapshots are the best and least efficient choice. But if the user has an intelligent or practiced sense of the rate of change for his data, then he can use the Space Efficient Snapshot, with a user selected allowance for Snapshot capacity growth. In the case of Exchange data, a typical rate of change that might be seen is on the order of 20-25% in the first 24 hours and nearly zero after that. And so the EVA's Space Efficient Snapshot method conserves capacity for original data use better than the CX4.
  • Overhead. Chuck mis-states the overhead capacity that is required for meta-data. Such EVA meta-data consists of less than ½ of 1% of the total capacity. Thus, for 120 data disks, the overhead would be on the order of one disk, not five.

Occupancy Alarms.    Chuck has taken an EVA positive here and cast it as a negative.  We are happy to correct any misunderstanding on EVA's ability to allow for dynamic volume expansion.  Chuck states that "EVA best practice recommends a further 10% of each disk group be given to the EVA operating system, known as ‘Occupancy Alarm'".  Well, that does not remove that 10% from the available usable capacity.  Again, user planning and confidence in their planning comes into play here, as it did with Distributed Sparing.  If a user creates and mounts a LUN of a specific capacity on the EVA and has confidence that this capacity is adequate. In this case, there will be no Occupancy alarm, no extra Occupancy reserve, and no unusable capacity.  If however, the user wants the added assurance that should his LUN capacity require space beyond the original LUN capacity, he can elect to allow the LUN to dynamically grow within user selected limits. 

The reserve for such LUN expansion is active and usable space and in no way reduces the usable capacity of the EVA LUN.    The purpose of the "Occupancy Alarm' is not to withhold capacity, but to tell the user that a capacity threshold has been passed so they can proactively increase the capacity of a disk group by simply adding a disk(s) and letting the EVA transparently and non-disruptively incorporate the new disk(s) and capacity into the disk group.  There is no capacity use, reservation, or other impact to capacity efficiency related to "Occupancy Alarm," but simply a straight forward use model to allow a user to proactively and non disruptively adjust the capacity of a disk group.

We actually want to thank Chuck for stimulating this examination of EVA Capacity Efficiency, despite the errors in his initial reporting.  For the inquiring minds reading this, there is a also HP StorageWorks Usable Capacity Calculator tool that HP has developed that can be accessed through any direct, Partner or VAR authorized to sell the EVA.

Until Chuck holds another of these great public discussions, thanks for reading and HP looks forward to addressing such storage questions, as always, in a factual and objective manner.


Posted 08-29-2008 8:43 PM by CalvinZ
Filed under: , ,

Comments

Stephen Foskett wrote re: EMC distortion about capacity efficiency
on 08-29-2008 9:57 PM

Calvin,

Thanks so much for posing this detailed response. I am learning more about EVA every day!

Like you, I immediately saw some holes in the comparison, as documented on my blog:

blog.fosketts.net/.../grapples-and-tangelos-why-its-impossible-to-compare-fairly

I suggest, though, that you include a simple correct answer to Chuck's scenario: How much is the overhead of a similarly-configured EVA given the rest of his specs?

I'd love it if each vendor would go through the exercise themselves as a way to come up with something closer to apples-to-apples than EMC can do themselves.

Thanks,

Stephen

CalvinZ wrote re: EMC distortion about capacity efficiency
on 08-29-2008 11:25 PM

Hi Stephen,

I'll try to find someone to make that comparison and explain it (I'm not technical enough to do it myself) but one of my pre-sales EVA experts did send me a screen shot of the calculation he did with our EVA capacity calculator tool that we have.  He configued a 240 drive EVA using VRAID5 and had around 74% efficiency.  Its not an apples to apples comparison but probably better reflects real-world than what Chuck did.

I also got an email from an EVA customer that saw what Chuck was saying and while I don't want to quote all of it, he said  "who in their right mind would ever configure their array like this ... (guy)"?  I'll omit the actual word he used as my intent isn't to offend Chuck.  But this customer has 15 EVA's and couldn't believe what he was reading from Chuck.

I also got an email from another pre-sales storage rep; he mentioned that EMC is passing around a white paper to prospects giving an example of an EVA with 16 disk groups of 8 drives each with double sparing turned on. Good luck finding any EVA customer that has more than 5 disk groups and most probably have 1-3.  

One last EVA point - regardless of the efficiency, every drive in an EVA contributes to array performance.  Even if someone configured some huge capacity as spare, since the EVA stipes across the entire array (including spare space), every drive is busy doing I/O's.  I wouldn't expect Chuck or EMC to make this point but I think I'll do a series of blogs in the coming weeks about virtualization in the EVA and what it does for our customers.

In the end, I'm glad that I work for a company like HP that takes the high road and doesn't use tactics like I continually hear coming from Hopkinton.

Thanks for posting Stephen!

Cleanur wrote re: EMC distortion about capacity efficiency
on 11-28-2008 5:08 PM

Equivalent of 120x146GB = 17.5TB required total usable (inc Raid 5 + Sparing)

Single DIsk group (all VR5 as per Chuck) so odd disks allowed

173x146GB = 25,258GB RAW

173x146GB = 24,666GB after Rightsizing

173x146GB =  23,643GB after Metadata

Set 90% occupancy alarm = 10% reserve = 21,279GB

Add single sparing = 21,033GB

Vraid 5 available = 17,527GB

% Available = 17,527GB / 24,666GB  = 71% utilization

173 Disks = 13 Enclosures

A more realistic config would be.

Two DIsk groups (all VR5 as per Chuck)

144x146GB = 21,024GB RAW

144x146GB = 20,531GB after Rightsizing

173x146GB =  19,680GB after Metadata

Set 90% occupancy alarm = 10% reserve = 17,712GB

Add single sparing = 17,466GB

Vraid 5 available = 14,555GB

% Available = 14,555GB / 20,531GB  = 70%

Disk group 2 = 32x144GB = 4672GB RAW

32x144GB = 4,562GB after Rightsizing

8x144GB = 4,373GB after Metadata

Set 90% occupancy alarm = 10% reserve = 3,936GB

Add single sparing = 3,690GB

Vraid 5 available = 3,075GB

% Available = 3,075GB / 4,562GB = 67%

70% + 67% = 137 / 2 Disk groups = 68.5% utilization

144 Disks + 32 Disks = 176 = 13 Enclosures

14,555GB + 3,075GB = 17,630GB

A key thing about the use of disk groups and vdisks is that of the 17.xGB available on each configuration, every GB displayed as usable, is absolutely usable within th disk group i.e No stranded capacity as is more than likely the case for non virtualized array's.

CalvinZ wrote re: EMC distortion about capacity efficiency
on 12-02-2008 10:50 PM

Hi Cleanur - thanks for your analysis.  You're data is good to have.  Thanks again!  Calvin

Michael wrote re: EMC distortion about capacity efficiency
on 12-08-2008 6:07 PM

> 173x146GB = 24,666GB after Rightsizing

Is there any right-sizing going on in an EVA, I'm not clear on this? The above isn't right-sizing to anyone I've ever talked to, that's simply converting pure decimal GB to some kind of binary GB (well almost). No space is lost in this process, while right-sizing means actually truncating space (blocks) from drives.

173*146 = 25258

25258*10^3/2^10 = 24666

I don't quite understand what this calculation actually tells us, and why it would be called right-sizing of disks. It seems to be a numerical conversion from 1GB = 1000 MB -> 1GB = 1024 MB.

If we define a GB as 2^30 bytes (octets).

Then the conversion would give

173*146*10^9/2^30 = 23523 GB

Surely most ppl would agree that at this point, efficiency is still 100% the EVA.

There seems to be an overhead of ~4% for metadata:

1-23643/24666 = 0.0415

That's assuming it's the same definition of "GB" in those two rows...

Then the sparing. Two (2) times the size of the largest disk in the Disk Group is subtracted from available raw free space, for Protection Level 1. How large is a 146G disk in this step? In the calculation above, it doesn't easily add up for me.

AFAIK, before Occupancy Alarm Level is set, the reconstruction space is already deducted from total available raw space, so calculating backwards from the above figures would give: (23643-21033/0.9)/2 = 136.5 GB

OK, so the only sense I can make out of this is:

146*1000/1024*(23643/24666) = 136.664

There are some rounding errors playing a role here, but that seems close enough at least -- it implies that the space reserved for reconstruction is first deduced by the overhead for EVA metadata, and that's logical to me anyway.  Comments?

I don't know exactly how vRAID5 is implemented (yet), but some info I have states that it behaves in principal like any other RAID5: one disk per stripe set is lost to parity data. In the EVA the nominal redundancy set is 8 disks (optimal RSS size), and I'm guessing there's a connection to this but I don't fully understand it.   In this case above we have

17527/21033 = 0.8333 = 5/6 (i.e. the factor is *exactly* 1.20)

I'd be much obliged if someone could explain to me why this is, and if it's affected by (and how) some user settable parameters in the EVA.

Anyway...this is quite interesting,  when I calculate volume efficiency for a disk array I don't do it the way shown or talked about in this post and followup-thread -- there seems to be a lot of unnecessary confusion on this subject

Regards,

/M

Add a Comment

(required)  
(optional)
(required)  
Remember Me?

Type the numbers and letters above:
Powered by Community Server (Non-Commercial Edition), by Telligent Systems