Professional Documents
Culture Documents
NetApp, Inc.
495 East Java Drive
Sunnyvale, CA 94089
U.S.
Copyright Information
Copyright 19942014 NetApp, Inc. All rights reserved. Printed in the U.S
No part of this document covered by copyright may be reproduced in any form or by any meansgraphic,
electronic, or mechanical, including photocopying, recording, taping, or storage in an electronic retrieval
systemwithout prior written permission of the copyright owner.
Software derived from copyrighted NetApp material is subject to the following license and disclaimer:
THIS SOFTWARE IS PROVIDED BY NETAPP "AS IS" AND WITHOUT ANY EXPRESS OR IMPLIED
WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY
AND FITNESS FOR A PARTICULAR PURPOSE, WHICH ARE HEREBY DISCLAIMED. IN NO EVENT
SHALL NETAPP BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR
CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE
GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE
USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
NetApp reserves the right to change any products described herein at any time, and without notice. NetApp
assumes no responsibility or liability arising from the use of products described herein, except as expressly
agreed to in writing by NetApp. The use or purchase of this product does not convey a license under any
patent rights, trademark rights, or any other intellectual property rights of NetApp.
The product described in this manual may be protected by one or more U.S. patents, foreign patents, or
pending applications.
RESTRICTED RIGHTS LEGEND: Use, duplication, or disclosure by the government is subject to restrictions
as set forth in subparagraph (c)(1)(ii) of the Rights in Technical Data and Computer Software clause at
DFARS 252.277-7103 (October 1988) and FAR 52-227-19 (June 1987).
Trademark Information
NetApp, the NetApp logo, Network Appliance, the Network Appliance logo, Akorri, ApplianceWatch, ASUP,
AutoSupport, BalancePoint, BalancePoint Predictor, Bycast, Campaign Express, ComplianceClock,
Cryptainer, CryptoShred, Data ONTAP, DataFabric, DataFort, Decru, Decru DataFort, DenseStak, Engenio,
Engenio logo, E-Stack, FAServer, FastStak, FilerView, FlexCache, FlexClone, FlexPod, FlexScale,
FlexShare, FlexSuite, FlexVol, FPolicy, GetSuccessful, gFiler, Go further, faster, Imagine Virtually Anything,
Lifetime Key Management, LockVault, Manage ONTAP, MetroCluster, MultiStore, NearStore, NetCache,
NOW (NetApp on the Web), Onaro, OnCommand, ONTAPI, OpenKey, PerformanceStak, RAID-DP,
ReplicatorX, SANscreen, SANshare, SANtricity, SecureAdmin, SecureShare, Select, Service Builder, Shadow
Tape, Simplicity, Simulate ONTAP, SnapCopy, SnapDirector, SnapDrive, SnapFilter, SnapIntegrator,
SnapLock, SnapManager, SnapMigrator, SnapMirror, SnapMover, SnapProtect, SnapRestore, Snapshot,
SnapSuite, SnapValidator, SnapVault, StorageGRID, StoreVault, the StoreVault logo, SyncMirror, Tech
OnTap, The evolution of storage, Topio, vFiler, VFM, Virtual File Manager, VPolicy, WAFL, Web Filer, and
XBB are trademarks or registered trademarks of NetApp, Inc. in the United States, other countries, or both.
Apple is a registered trademark and QuickTime is a trademark of Apple, Inc. in the United States and/or other
countries. Microsoft is a registered trademark and Windows Media is a trademark of Microsoft Corporation in
the United States and/or other countries. RealAudio, RealNetworks, RealPlayer, RealSystem, RealText, and
RealVideo are registered trademarks and RealMedia, RealProxy, and SureStream are trademarks of
RealNetworks, Inc. in the United States and/or other countries.
All other brands or products are trademarks or registered trademarks of their respective holders and should
be treated as such.
NetApp, Inc. is a licensee of the CompactFlash and CF Logo trademarks.
NetApp, Inc. NetCache is certified RealSystem compatible.
Table of Contents
1
1.1
1.2
2
2.1
2.1.1
2.1.2
2.1.3
2.1.4
2.2
2.2.1
2.2.2
2.3
2.4
2.5
3
3.1
3.2
3.3
4
4.1
5
5.1
5.2
5.3
5.4
5.5
5.6
5.7
6
6.1
6.2
6.3
7
7.1
7.2
7.3
7.4
7.5
7.6
7.7
7.8
7.9
8
9
3 of 50
Preface
This document describes how to perform sizing for NetApp FAS systems.
AUDIENCE
The primary audience for this document is technical personnel such as systems administrators,
engineers, professional services engineers, and professional services consultants at NetApp.
KEYBOARD AND FORMATTING CONVENTIONS
You can use your product more effectively when you understand how this document uses keyboard
and formatting conventions to present information.
Keyboard conventions
Convention
What it means
Enter, enter
Used to refer to the key that generates a carriage return; the key is named
Return on some keyboards.
Used to mean pressing one or more keys on the keyboard and then pressing
the Enter key, or clicking in a field in a graphical interface and then typing
information into the field.
Formatting conventions
Convention
What it means
Italic font
Monospaced font
Bold
monospaced font
Words or characters you type. What you type is always shown in lowercase
letters; unless your program is case-sensitive and uppercase letters are necessary
for it to work properly.
SPECIAL MESSAGES
This document might contain the following types of messages to alert you to conditions you must be
aware of.
Note: A note contains important information that helps you install or operate the system efficiently.
HOW TO SEND YOUR COMMENTS
You can help us to improve the quality of our documentation by sending us your feedback.
Your feedback is important in helping us provide the most accurate and high-quality information.
If you have suggestions for improving this document, send us your comments by e-mail to
doccomments@netapp.com. To help us direct your comments to the correct division, include the name
of your product in the subject line.
DEFINITIONS, ACRONYMS, AND ABBREVIATIONS
Abbreviation/Phrase
Explanation/ Definition
SPM
CSI
ASUP
AutoSupport
4 of 50
Customer: Displays the customers name as entered in the New Sizing wizard
5 of 50
Sizing Title: Displays the sizing title as entered in the New Sizing wizard
Workloads: Lists all the workloads added during the sizing workflow
New Sizing: Initiates the workflow for a new sizing exercise
Search: Finds previous sizings based on multiple search criteria
View: Opens previous sizing requests located in history or saved configurations. Sizing templates
are also available from this menu
Help: Shows where additional information can be found, as well as a way to provide feedback to
the SPM team
Toggle: Allows toggling between workflows
Save For Later: Saves the workload and options for later use. The saved date can be retrieved
from the Saved Inputs option from the View menu
Perform Sizing: Allows the user to actually perform the sizing. In the case of the Forward Sizing
workflow, it opens the hardware pre-filter window. In the case of the Reverse Sizing workflow, it
submits the sizing to CSI
SPM is designed to be more intuitive than the previous sizers, and it supports multiple workloads within
a single sizing by combining the various independent workload sizers into modules within the workflow.
The following application modules are supported by SPM:
6 of 50
7 of 50
For more information about using the Reverse Sizing workflow, see Performing a new reverse sizing
For more information about using the Reverse Sizing workflow, see Perform a New Reverse Sizing
System Performance Modeler User Guide
8 of 50
9 of 50
10 of 50
A convenient feature of SPM is the ability to size both with and without flash acceleration. This
provides the results in a single report and illustrates the effect of the acceleration modules on the
predicted performance of the system.
When sizing for Flash Pool, SPM will identify the number of SSD drives necessary to achieve the
performance requirements of the workloads created in the subsequent steps. This step does not place
any specific workload on a Flash Pool aggregate. All options that are selected will be modeled.
Forward sizing prefilter hardware configuration window is as shown:
At this point, you can either click Calculate Sizing or select the Detailed Inputs checkbox to use non
default options in your sizing. Whether you decide to use some of the detailed inputs or not, click
Calculate Sizing after you are satisfied with the options you have set.
The link to flash recommender tool is included under Flash Acceleration Options
2.2.1 Detailed inputs
Selecting detailed inputs gives you the following options:
Allow disk/shelf type selection per workload
In this section you can over write the global drive and shelf selection. You can also select a new drive
and shelf for each workload.
Disk shelf and drives
In this section, you can select the exact disk shelf and disks that you would like to use.
The detailed disk selection is as shown:
11 of 50
Flash
Selecting the Flash Acceleration Options checkbox in the Pre-Filter Hardware Configuration window
enables the Detailed Inputs dialog box.
The Auto_Suggest feature has been newly added to the flash acceleration options. This is the default
option, and when this option is selected, the CSI will determine the optimal number, type, and capacity
of the Flash Cache card and also the optimal number and capacity for Flash Pool, based on the
workload characteristics.
You can manually select the type of Flash Cache card, the number of cards, and the capacity of each
card, if you prefer to not use the automatic suggestion option.
Also, you can manually select the type of SSD disk and the amount of capacity of the Flash Pool card
per controller, if you prefer to not use the automatic suggestion option.
The detailed flash acceleration options are as shown:
12 of 50
System headroom (%): Amount of CPU and other system resources that should be reserved
and unused while sizing to allow for future growth.
Increasing the headroom can increase the platform count if the supplied workloads exceed the
headroom threshold, even if the workload can be serviced with fewer systems.
Map to full shelves: Select this option if SPM should produce disk requirements equal to the
number of disks in a full shelf. The final disk count can be increased if the number of disks
required for performance, capacity, and spares does not equal full shelves.
Capacity reserve (%): Amount of disk space that should be reserved for future growth. This
can increase the number of disks required to meet capacity requirements.
Spare disks per node: The number of disks that should be added as spares.
System age: As system age increases, I/O operations can become less optimal, which
ultimately increases disk utilization. Adjusting the system age can increase the amount of disks
required to support the workload. The Empty System setting represents a new storage
environments age.
13 of 50
14 of 50
15 of 50
Estimate resource utilizations and latencies, which answers the what if questions
Estimate maximum throughput, which provides the exact opposite of a forward sizinglike
functionality
Starting with SPM 1.4, the reverse (formerly advanced) sizing workflow is supported for both 7-Mode
and clustered Data ONTAP.
When using the Reverse Sizing modes, only a single platform model can be selected.
Also, instead of solving for the number of disks, the aggregate sizes and types are user defined using
the aggregate attributes feature. There are no additional advanced parameters, such as capacity
reserves of spare disks, because aggregates are user defined and have already been determined.
Note: You can toggle between the forward and Reverse Sizing workflows by clicking the Toggle button
at the bottom of the SPM window.
The sizing method selection is as shown:
16 of 50
This section describes how to complete a reverse sizing for both resource estimation and maximum
throughput calculations.
Steps
1. Enter opportunity information.
After clicking New Sizing, this step is the same as in a forward sizing. This window allows you
to enter information about the opportunity, which is then included in the output report and
enables easier searching.
2. Select sizing workflow and data entry
In a reverse sizing, you must first select whether the inputs are to be manually entered or
loaded from a perfstat file. Then, a choice for the attribute on which you are going to estimate
performance is needed. The selection depends on the end result that is desired. The workflow
for both options is similar, although the outputs are different.
Resource Utilization and IO Latencies: This option is useful for answering a what if
question. For example, if a system is already deployed and a customer wants to
determine what would happen if another workload were deployed on it, using this sizing
method will determine what the customer should expect for overall system utilization
and latency.
Maximum Throughput: This option is the exact opposite of a forward sizing. SPM
provides an estimated maximum throughput given a system configuration and one or
more application workloads.
17 of 50
If this is a clustered Data ONTAP configuration, the number of HA pairs needs to be set.
Entering the number of HA pairs in clustered Data ONTAP is as shown:
The FlashCache options section enables you to select no Flash Cache or specify the exact type,
number, and size of the cards.
The flash cache options in reverse sizing are as shown:
In this step, aggregate configurations must be defined. For each aggregate that is part of the system
being modeled, add an aggregate and the number of disks in the Aggregate Attributes window using
the New Aggregate button. SPM assumes that the RAID type is NetApp RAID-DP technology and
uses the default RAID group size (16). For guidance on RAID group and disk spare configurations,
refer to the Storage Subsystem Technical FAQ.
Adding an aggregate in reverse sizing is as shown:
18 of 50
19 of 50
20 of 50
The results of the reverse sizing are also captured in the report available from the results window.
For more information about the report, see Understanding SPM sizing output
21 of 50
When the Detailed Inputs checkbox is selected, the following more detailed options are allowed to be
selected.
Flash acceleration option - Auto_Suggest
The flash acceleration options section of the SPM wizard now includes the Auto_Suggest option, in
addition to the existing No and Manual options.
The Auto_Suggest option is enabled by default. When this option is enabled, CSI will determine the
optimal number, type, and capacity of Flash Cache and also the optimal number and capacity for Flash
Pool, based on the workload characteristics.
CSI will generate two outputs per platform/drive combination selected: one output with only Flash
Cache and the other with only Flash Pool, based on the workload characteristics.
Auto_Suggest in Flash acceleration options as shown:
22 of 50
When both the Flash Cache and Flash Pool manual modes are selected and inputs provided, CSI will
generate two outputs per platform/drive combination selected: one output with only Flash Cache and
the other output with only Flash Pool, based on the workload characteristics.
Manual option in Flash Acceleration is as shown:
23 of 50
When both the Flash Cache and Flash Pool options are selected, either in Auto_Suggest or Manual
mode, CSI will generate two outputs per platform/drive combination selected: one output with only
Flash Cache and the other with only Flash Pool, based on the workload characteristics.
24 of 50
4.1 Perfstat
Perfstat is a NetApp tool used to capture performance and configuration information from an existing
installation. SPM supports uploading perfstats to automatically fill in the controller and workload
information necessary for sizing. After a perfstat is uploaded, its possible to modify the controller and
workload to model changes to an existing system. SPM allows you to submit perfstat files in two ways:
online and offline.
Perfstat parser options are as shown:
25 of 50
After the perfstat file is parsed, a list of controllers in the perfstat file is made available.
2. Select the desired controller configurations and then select the workload characteristics.
System Performance Modeler User Guide
26 of 50
3. Select the desired iteration and option (Min, Max, or Average IOPS and CPU).
Perfstat parser workload characteristics dialog box is as shown:
After the controllers and workloads are selected, the parser prompts for a location in which to save an
intermediate file (.spm file).
4. Save the file.
5. Close the parsing window after the file is saved. SPM then displays the Import workloads from
Perfstat page. Use the Browse button to browse for the intermediate file (the .spm file
generated in step 3).
Offline perfstat parser intermediate file upload is as shown:
27 of 50
The information in the perfstat file to be used for the sizing should now be visible on the main SPM
window as a controller configuration and a workload configuration.
28 of 50
The basic window will have the mandatory fields, and the details will all be set to
default values; however, the user can expand the detailed section and set personal
values.
Free aggregate space and free snap reserve define the amount of free space that must
be kept in the aggregate and the amount that should be reserved for NetApp
Snapshot copies.
The number of input/output operations per second (IOPS) is the amount of I/O each
user is estimated to produce.
The C drive size (in GB) is the size of the main VM operating system drive.
VM memory size refers to the amount of memory per VM and is used to factor in the
vswap storage requirements.
Unique data per VM is the estimated space unique to each VM. Because of cloning
and deduplication technologies, VMs might not require much additional space. This
helps define additional storage requirements for the life of the VM.
Note: Disk capacity requirements can vary significantly depending on the cloning and Snapshot
technologies employed. Make sure to understand these differences when sizing for VDI environments.
The read and write workload estimates are the percentage of read and write I/O
expected from the VMs.
Working set size defines the percentage of the total data that is considered active and
can vary depending on the customer environment.
Read and write I/O size defines the I/O size for the workload.
Random read latency defines the maximum allowed latency for reads.
VDI workload inputs are as shown:
29 of 50
30 of 50
Sizing output can vary significantly based on differences found in vendors cloning technologies, so
NetApp recommends reviewing TR-3949: NetApp and VMware View Performance Report.
Detailed instructions on the deployment characteristics of VDI technologies are beyond the scope of
this document; therefore, NetApp recommends engaging with a consulting systems engineer (CSE) in
your area when sizing any VDI opportunity.
Under Backup Configuration, select Yes for Database and Log Isolation
Configured. Otherwise, the database and log will be placed on the same LUN,
which is against NetApp best practices and will result in a configuration that will
not work with SnapManager for Exchange.
If the database size is less than the NetApp best practice of 2TB minimum, it will
adversely affect the performance of the system, because each database performs
maintenance. Make sure that you set the Maximum Database Size Configuration
value to Custom. Set the Maximum Database Size (GB) to a value greater than
2TB until the actual database size is close to 2048GB. This value can be viewed
in the LUN Requirements (in Exchange 2010 calculator) and Volume
Requirements (in Exchange 2013 calculator) and in column E under DB size +
overhead.
Under Backup Configuration, select the VSS Hardware Provider for Backup
Methodology.
Under Tier-1 User Mailbox Configuration, select Yes for Desktop Search Engines
Enabled (for Online Mode Clients).
Under User Mailbox Configuration, selecting Yes for Desktop Search Engines
Enabled will affect the IOPS accordingly.
2. Upload the completed copy of the Mailbox calculator by clicking the Browse button in the basic
inputs section.
31 of 50
3. After the application module workflow is completed, two workloads are generated within SPM, a
primary site and a secondary site. Modify the controller constraints and use the checkboxes
next to the generated workload to model the primary and secondary workloads independently.
User Type: Select from light, low, medium, or heavy; based on this, the other
dependent values will be set to default optimal values.
Home Directory Size (GB): The amount of space required for each user.
The following list describes various user parameters. The number of users along with the size of the
home directory defines the capacity requirements; the number of users along with type of user and
concurrency defines the performance requirements. The user type is based on the data transfer rate
requirement per user.
32 of 50
The CIFS home directory sizing workload module was implemented based on TR-3564i: Sizing of
CIFS-Based Home Directories. Although this document is no longer current, refer to it for additional
reference material.
33 of 50
If a statistics file is not used, the workload characteristics can be entered manually. The performance
inputs fields are for specifying the basic characteristics of the database workload, such as throughput,
operating-mix percentages, throughput growth, and maximum acceptable latency.
Performance inputs fields options are as shown:
34 of 50
App Specification Inputs fields allow you to select the database type to use for this sizing request, the
protocol to use, and the project life, which aids in the definition of growth requirements.
App specification inputs fields are as shown:
The file system inputs section of the database sizer helps to define the capacity requirements for the
database application. The working set size parameter defines what percentage of the capacity is active
at any point in time.
File system inputs fields are as shown:
The performance of the storage system also depends on the block size used by the application. The
default is 8KB, but other sizes can be specified, if necessary.
System Performance Modeler User Guide
35 of 50
Because detailed instruction on the technical and deployment characteristics of the various databases
supported by SPM is beyond the scope of this document, NetApp recommends engaging with a CSE in
your area when sizing any database application.
The rest of the fields covered are defaulted, but can be modified by clicking the Detailed Inputs check
box.
When specifying a random read latency, make sure that it is reasonable and also meets the
requirements of your application. Enter the desired latency in milliseconds. This defaults to 20ms, which
is considered a reasonable value.
Workloads associated with aggregates that only have hard disks (HDDs) have a floor of a 8ms latency.
Workloads associated with aggregates, which have SSDs, have a floor of a 3ms latency. Latencies set
below these values might return an error.
Random Read Latency in custom application module is as shown:
The I/O percent section defines the read-write breakdown of the workload. The options at the top of the
I/O percent parameters are presets that quickly adjust the percentages. If you know the breakdown of
your workload in greater detail, the percentages are adjustable. This defaults to 25% of each of the four
categories.
I/O percent workload parameters are as follows:
36 of 50
The working set size, which is generally applicable to workloads with random I/O, defines the size of
the active portion of data. This value affects how much data can be cached, which affects how much of
the workload is forced to disk.
The smaller the working set, the greater the chance that data is kept in the cache. The working set size
is not often known for many applications, but its typically a smaller percentage of the total data.
Application working set size recommendations
Guidelines to determine working set sizes based on workload type.
Application
Home directories
Databases (OLTP)
Microsoft Exchange
Changing the size of the I/Os in the workload can affect the number of IOPS that the system will
achieve. However, total data throughput in MB/s might be greater. Set these values to be as close as
possible to the customer workload characteristics.
I/O size parameters in custom application module are as shown:
If Flash Pool is used for this application workload, the random overwrite percent selection is made
available. Random overwrite percent determines the impact that the random overwrite caching in SSD
will have for the workload and system performance.
Random overwrite percent selection is as shown:
37 of 50
After the application-specific portions of the workload are complete, there are additional points to
consider that are common to all applications, such as layout hints, aggregate types, and SnapMirror
functionality.
For more information, see Additional sizing options
38 of 50
When defining workloads to be placed on a clustered Data ONTAP system, additional options are
available. The flexibility of clustered Data ONTAP allows client access to be split across nodes
regardless of whether the data is stored on one node or on many nodes.
Additional clustered Data ONTAP layout hints are as shown:
The following combinations are possible depending on how the layout options are set for the workload:
39 of 50
40 of 50
Reverse sizing
For each workload in a clustered Data ONTAP sizing, during reverse sizing, there is an option to
include SnapMirror parameters related to the workload.
SnapMirror features are as shown:
41 of 50
42 of 50
43 of 50
The node containing the aggregate and actually serving the data
Each workload shows how much system utilization it is contributing to the node, and the overall system
utilization, under node details, reflects the sum of all the workload utilizations along with additional
overhead.
The node receiving client traffic and forwarding indirectly will have a lower utilization than the node
receiving the indirect traffic and accessing the storage.
Multiple workloads are adjusted depending on the layout hints.
Node-perspective system usage (clustered Data ONTAP, direct only) is as shown:
44 of 50
Node-perspective system usage (clustered Data ONTAP, distributed across cluster) is as shown:
If Flash Pool is used in the sizing, this table changes to show the disks required for both the SSD and
HDD portions of the Flash Pool aggregate. The expected impact of read caching of Flash Pool is shown
below.
Flash Pool drive calculations are as shown:
The shelf details section elaborates on the number of shelves, drive types, and number of drives
needed for the configuration.
45 of 50
Other outputs can also be produced if they are relevant to the sizing request. Some examples include
information about the size and number of Flash Cache modules or the disk-OPS-to-host-OPS ratio.
Additional information outputs are as shown:
46 of 50
When sizing for maximum throughput, SPM creates a table. In addition to maximum throughput, SPM
also lists the reason for the bottleneck that is limiting the performance of the configured system.
Reverse maximum throughput sizing report is as shown:
47 of 50
8 Additional resources
Additional information can be found in the SPM help menu, as well as in the SPM FAQ. For more
information, contact the sizing team or visit the Sizers Community page.
48 of 50
9 Version history
Version
Date
Version 0.1
March 2012
Original version
Version 1.0
March 2012
Version 1.1
September 2012
Version 1.2
December 2012
Version 1.3
December 2013
Version 1.4
January 2014
Version 1.5
May 2014
49 of 50