NOTICE
This product brief may contain proprietary information protected by copyright. Information in this
product brief is subject to change without notice and does not represent a commitment on the part
of Quantum. Although using sources deemed to be reliable, Quantum assumes no liability for any
inaccuracies that may be contained in this product brief. Quantum makes no commitment to update
or keep current the information in this product brief, and reserves the right to make changes to or
discontinue this product brief and/or products without notice. No part of this document may be
reproduced or transmitted in any form or by any means, electronic or mechanical, including
photocopying, recording, or information storage and retrieval systems, for any person other than
the purchaser's personal use, without the express written permission of Quantum.
PRODUCT BRIEF
Executive Summary
Keeping up with the rapid growth of unstructured file data while maintaining fast access to those
business-critical digital assets is a monumental challenge for many organizations today. Consequently, IT Managers
and Storage Administrators are typically forced to choose between the shared-storage simplicity of NAS and
the high-performance scalability of SAN. With Quantum StorNext data management software, you get the
best of both worlds. StorNext combines high-performance file sharing with cost-effective data management
and protection, delivering unrivaled performance, scalability, and data preservation.
The StorNext shared SAN file system accelerates business operations by storing information rapidly, then
sharing it simultaneously across heterogeneous platforms, including Linux, Windows, UNIX, and Mac OS
operating systems (see Figure 1). Via a single shared file system instance, files can remain on the SAN storage
while being presented simultaneously to multiple OS types running various applications accessing the same file.
This eliminates the need to physically move files or convert formats to accommodate a native OS file system.
Over time, the required cost and performance characteristics of those files will likely diminish. StorNext
enables cost savings and management simplicity with its policy-based tiered storage and archiving
capability. The data movement is all done transparently within the StorNext file system, keeping the file
online and accessible while significantly reducing storage and management costs. In addition, deploying
this automated tiered storage solution avoids the added cost and complexity introduced to the environment
by 3rd party applications required to move files from tier to tier.
With enterprise features, such as deduplication, replication, Distributed Data Movers, and tiered storage &
archiving, StorNext delivers powerful solutions that address current and future IT challenges.
Replication: Enables disaster recovery and business continuity by replicating business-critical data to
remote locations. Business operations can continue on a secondary site in the event the primary site is
unavailable. In addition, the distribution of data to remote campuses enables branch offices to utilize a
copy of the data for enhanced business operations without impact to the primary location.
Data Deduplication: Reduces the storage requirements for growing amounts of unstructured data. As
data within an enterprise continues to proliferate, the likelihood of multiple copies of the same file is an
increasing burden on IT resources. Data Deduplication intelligently saves only the changed data for more
efficient storage capacity utilization and reduced management overhead.
Tiered Storage & Archiving: Delivers a transparent, policy-based storage tiering and archiving solution
to better correlate the cost and performance characteristics of your data with your storage. Store the right
data, on the right storage tier, at the right cost. In addition, as environments grow, StorNext Distributed Data
Movers (DDM) enable unparalleled scale and performance for your tiered storage and archiving environment.
Figure 1 (diagram labels: Primary Tier; Secondary Tier; CIFS/NFS; StorNext SAN Clients; FC/iSCSI; GigE TCP/IP; Infiniband IP; Optional StorNext Policy Engine; Vault; Metadata Controller)
Methodology
StorNext software delivers flexible enterprise data management solutions, such as high speed data access
and cost-effective storage tiering and archiving with two core components, the StorNext shared SAN file
system (SN SAN Clients) and StorNext Storage Manager.
StorNext File System is enabled by SAN clients which reside on a Windows, Linux, UNIX, or Mac OS server
connected to a SAN. A StorNext Metadata Controller (MDC) sits on a Windows or Linux server and acts as
the traffic cop for shared heterogeneous server access to the data pool on the SAN storage. Where the SAN
shares out the storage to a server community, the StorNext file system, enabled by the MDC, shares a common
name space, and data within, out to the applications simultaneously across different server platforms.
StorNext Storage Manager, residing on the MDC, is a policy-based data mover that automatically and
transparently copies or moves data to another tier of storage. Files can be truncated in the primary file
system, leaving behind a pointer, or can remain whole with a copy in a lower storage tier. Since this
movement is all done within the StorNext instance, the data remains local to the application accessing the
file system. If an archived file is requested, the Storage Manager fulfils the request and presents the file
back to the application as if it were resident in the primary file system.
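The truncate-and-retrieve behavior described above can be illustrated with a minimal Python sketch. It is a toy model, not StorNext code: the class names, the `idle_days` field, and the `truncate_after_days` policy parameter are all hypothetical, and the two tiers are plain dictionaries.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class ManagedFile:
    """A file entry in the primary file system (hypothetical model)."""
    name: str
    data: Optional[bytes]      # None once the primary copy has been truncated
    idle_days: int = 0         # days since last access (hypothetical attribute)


@dataclass
class TieredStore:
    """Toy policy engine with one primary tier and one secondary tier."""
    truncate_after_days: int   # hypothetical policy parameter
    primary: Dict[str, ManagedFile] = field(default_factory=dict)
    secondary: Dict[str, bytes] = field(default_factory=dict)

    def write(self, name: str, data: bytes) -> None:
        self.primary[name] = ManagedFile(name, data)

    def apply_policy(self) -> None:
        """Copy resident files to the secondary tier; truncate the idle ones."""
        for f in self.primary.values():
            if f.data is None:
                continue
            self.secondary[f.name] = f.data
            if f.idle_days >= self.truncate_after_days:
                f.data = None  # only the entry (the "pointer") remains

    def read(self, name: str) -> bytes:
        """Return file content, restoring it from the secondary tier if needed."""
        f = self.primary[name]
        if f.data is None:
            f.data = self.secondary[name]  # transparent retrieval
        f.idle_days = 0
        return f.data
```

In this sketch the caller of `read` never learns which tier served the request, which is the transparency property the text describes.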
There are many other features, including replication and deduplication, which further enhance these two
core components' ability to manage enterprise data.
Figure 2 (diagram labels: Primary Tier; Metadata Controller; GigE TCP/IP; Infiniband IP; StorNext SAN Clients; FC/iSCSI)
An additional performance factor taken into consideration by StorNext, but not by most traditional
file systems, is metadata location. A file is composed of content (the actual data) and metadata (the
information about the file which includes attributes, permissions, etc.). Whenever a file is written, updated,
or deleted the metadata is changed. This means that whenever a file is written, not only must the data be
written but metadata must be written as well. With larger files, write performance can be reduced because
a file may be written sequentially for high performance, but the metadata write operations require disk
head movement in bursts to another location on disk. For this reason, StorNext allows metadata to be
written to a different disk volume so that the sequential writes of data are isolated from more random
metadata writes, providing the best possible throughput.
In addition, StorNext includes cache coherency mechanisms to ensure that data is coherent and up-to-date
even when data is changed by multiple clients. These mechanisms also limit the required metadata
operations by having the clients communicate with only the MDC for certain requests, such as file creation,
deletes or allocations.
Figure 3 (diagram labels: Primary Tier, Vendor Agnostic; Secondary Tier, Vendor Agnostic; StorNext DLC; Gateway Servers; FC/iSCSI; GigE TCP/IP; Infiniband IP; Optional StorNext Policy Engine; Tape Library Archive; Metadata Controller)
Load balancing and clustering are provided when the DLC requests I/O across the attached gateways. If a
gateway system is unavailable or the network connection is lost, DLC transparently fails over I/O to another
attached gateway. Additionally, a DLC load balances data requests across all available network interfaces
on the client. In the chart below you can see a comparison between NFS, DLC and SAN Client performance
using the exact same system (see Figure 4). Those numbers are not maximum throughput numbers;
however, they show how throughput increases dramatically from NFS to DLC (nearly 3x the throughput on
the same network).
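As a rough illustration of the gateway load balancing and failover described above, the following Python sketch rotates requests across gateways and skips any that fail. It is an assumption-laden toy: the `GatewayBalancer` class, the `transport` callback, and the use of `ConnectionError` to signal a lost gateway are hypothetical and do not reflect the actual DLC protocol.

```python
from itertools import cycle
from typing import Callable, Iterable


class NoGatewayAvailable(Exception):
    """Raised when every attached gateway failed for a request."""


class GatewayBalancer:
    """Round-robin I/O across gateways, failing over when one is unreachable."""

    def __init__(self, gateways: Iterable[str]):
        self.gateways = list(gateways)
        self._ring = cycle(self.gateways)

    def send(self, request: str, transport: Callable[[str, str], str]) -> str:
        # Try each gateway at most once, starting with the next in rotation.
        for _ in range(len(self.gateways)):
            gateway = next(self._ring)
            try:
                return transport(gateway, request)
            except ConnectionError:
                continue  # fail over to the next gateway
        raise NoGatewayAvailable(request)
```

Because the rotation continues from wherever the last request ended, healthy gateways share the load while an unreachable one only costs a single failed attempt per pass.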
Distributed LAN Client is an important part of the StorNext data sharing offering, allowing lower-cost, high-speed
LAN connectivity for applications connected via the network. This is especially attractive to customers
with High Performance Computing (HPC) and rendering projects where a large dataset is broken into
segments and processed by multiple servers.
For more information on Distributed LAN Client please see the Quantum StorNext Distributed LAN Client
Product Brief (PB00021).
Figure 4 (chart; y-axis: Throughput (MB/s), 0 to 350)
This is where a tiered data approach becomes important. Tiered strategies are aimed at reducing the cost
of storing content while still keeping it accessible to users for re-purposing. Archiving is a specific example
of how data retention can be implemented to prevent expensive primary storage from being overwhelmed
by inactive data. Archiving in this regard is not necessarily for compliance adherence but instead for
cost-effective data storage. Data management software helps control storage costs by moving data off of
primary storage and on to lower-cost secondary storage tiers. This allows users to reduce the amount of
expensive, primary disk in play and instead use more cost-effective mass storage devices for infrequently
used content (e.g., tape technologies, VTLs, or MAID). Tiered and archived data is kept in a near-line state
and accessible to users so it can be re-purposed for future revenue-generating projects.
StorNext Replication
General file-based replication has many use cases including data consolidation, data distribution,
repurposing, and data protection for disaster recovery, just to name a few. StorNext Replication is an
asynchronous, host-based replication feature that is highly flexible and configurable. Replication is
implemented at the directory level and is policy driven to initiate timed or user-initiated copies of data.
The policy engine within StorNext also includes the capability to specify how copies are maintained and
managed on both the source and target locations. For example, replicated files that have been deleted on
the source can be kept at the target location. With StorNext Replication, environments can be designed to
replicate up to five source StorNext file systems into one centralized StorNext target location (see Figure 5).
Similarly, StorNext replication can also be configured to replicate from one StorNext file system to as many
as three target StorNext file systems (see Figure 6). The target file system can be the same machine as the
source, different machines on the SAN, or in a remote StorNext environment over a LAN or WAN.
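The fan-in and fan-out limits quoted above (up to five sources per target, up to three targets per source) can be checked for a planned topology with a short Python sketch. The function name and the `(source, target)` pair representation are hypothetical, and treating the two limits as independent checks is an assumption for illustration.

```python
from collections import defaultdict
from typing import Iterable, List, Tuple

MAX_SOURCES_PER_TARGET = 5  # from the brief: up to five sources into one target
MAX_TARGETS_PER_SOURCE = 3  # from the brief: one source to as many as three targets


def validate_topology(links: Iterable[Tuple[str, str]]) -> List[str]:
    """Return a list of limit violations for (source_fs, target_fs) pairs."""
    fan_out = defaultdict(set)  # source -> set of targets
    fan_in = defaultdict(set)   # target -> set of sources
    for source, target in links:
        fan_out[source].add(target)
        fan_in[target].add(source)

    problems = []
    for source, targets in sorted(fan_out.items()):
        if len(targets) > MAX_TARGETS_PER_SOURCE:
            problems.append(
                f"{source}: {len(targets)} targets exceeds limit of {MAX_TARGETS_PER_SOURCE}"
            )
    for target, sources in sorted(fan_in.items()):
        if len(sources) > MAX_SOURCES_PER_TARGET:
            problems.append(
                f"{target}: {len(sources)} sources exceeds limit of {MAX_SOURCES_PER_TARGET}"
            )
    return problems
```

An empty result means the planned links stay within both limits.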
Figure 5 (diagram labels: Remote Office; WAN; Central Datacenter; Target)
Figure 6 (diagram labels: Remote Office; WAN; Central Datacenter; Target)
For customers who already have a StorNext Storage Manager environment and want to ingest those files
into a replicated environment, files will need to be retrieved from the disk or tape device and passed
through the Replication policy engine to be replicated to a new target location. Environments can be
replicated in many different combinations, including:
StorNext File System to StorNext File System
StorNext File System to StorNext Storage Manager
StorNext Storage Manager to StorNext Storage Manager
Replication policies can be assigned to more than one directory at a time for easy management. For added
flexibility, StorNext Replication supports the concept of cascading (or multi-hop) replication, meaning
any directory can be designated as a source and/or target for replication. Files are replicated only when
the primary copy changes, and StorNext can truncate (remove) the primary copy of the file once the
replicated copy is stored, thus freeing up valuable capacity on primary disk systems.
Benefits of StorNext Replication over array-based or 3rd party replication products include:
Flexibility: StorNext can replicate multiple OS types and multiple vendor SANs, and can be combined with
tiered data environments
Cost: StorNext Replication is licensed on a per-server basis, versus a capacity-based model
Web services in the StorNext Management Console can be utilized to integrate Replication more closely
with 3rd party and custom applications
Deduplication used with Replication provides enhanced replication performance in low bandwidth
environments, making recovery time much faster because you don't have to pull as much data back from
the remote site to rebuild the files
StorNext Deduplication
Quantum has long been an innovator in data management software, including data deduplication
technology. We developed the pioneering technology known as variable-length hash-based data
deduplication that has become an industry leader in the deduplication market. The introduction of
Deduplication in the StorNext File System now embraces this technology in the primary storage tier and is
changing the way tiered storage is used in file systems.
StorNext Deduplication reduces the amount of capacity consumed by data on disk. Unlike compression
technologies that use static pattern table methodologies based on an individual file being compressed,
StorNext Deduplication builds its redundancy index based on all the data being deduplicated, not just the
file being processed at that moment. StorNext Deduplication will also find redundancies within a file, making it
extremely efficient at saving storage space (up to 93% in some documented use cases).
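The general idea of variable-length, hash-based deduplication with an index shared across all files can be sketched in Python as follows. This is a generic content-defined chunking illustration, not Quantum's algorithm: the rolling hash, the `GEAR` table, the chunk size constants, and the `BlockPool` class are all assumptions chosen for brevity.

```python
import hashlib
from typing import Dict, List

# Illustrative parameters (assumptions, not StorNext values)
MIN_CHUNK = 32    # never cut a chunk shorter than this
MAX_CHUNK = 256   # always cut once a chunk reaches this length
MASK = 0x3F       # cut when six selected hash bits are all zero

# Deterministic per-byte values used by the rolling hash
GEAR = [int.from_bytes(hashlib.sha256(bytes([i])).digest()[:4], "big")
        for i in range(256)]


def chunk(data: bytes) -> List[bytes]:
    """Split data into variable-length chunks at content-defined boundaries."""
    chunks, start, h = [], 0, 0
    for i, byte in enumerate(data):
        # Shifting left ages out old bytes, so h depends on recent content only.
        h = ((h << 1) + GEAR[byte]) & 0xFFFFFFFF
        length = i - start + 1
        if (length >= MIN_CHUNK and (h >> 16) & MASK == 0) or length >= MAX_CHUNK:
            chunks.append(data[start:i + 1])
            start, h = i + 1, 0
    if start < len(data):
        chunks.append(data[start:])
    return chunks


class BlockPool:
    """Index of unique chunks shared by every file that is stored."""

    def __init__(self) -> None:
        self.blocks: Dict[str, bytes] = {}  # digest -> chunk

    def store(self, data: bytes) -> List[str]:
        """Store a file and return its recipe (ordered list of chunk digests)."""
        recipe = []
        for c in chunk(data):
            digest = hashlib.sha256(c).hexdigest()
            self.blocks.setdefault(digest, c)  # keep only the first copy
            recipe.append(digest)
        return recipe

    def restore(self, recipe: List[str]) -> bytes:
        return b"".join(self.blocks[d] for d in recipe)

    def stored_bytes(self) -> int:
        return sum(len(c) for c in self.blocks.values())
```

Because the index is keyed by chunk digest rather than by file, a second copy of a file stored through the same `BlockPool` adds recipe entries but no new chunk data.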
With StorNext Deduplication (see Figure 7), any area of primary storage can become the repository for
the deduplicated dataset. StorNext Deduplication utilizes the same file system policy engine running on a
Linux MDC that is put in place for StorNext Replication. The system allows the administrator to set policies
through the Management Console or command line based upon:
Directories to deduplicate
How quickly deduplication will occur after files are created or changed
When, and if, removal of the primary file occurs
File size requirements that a file must meet in order to be deduplicated
A typical use case for StorNext Deduplication is for reducing the capacity utilization due to having multiple
copies of the same, or even similar, file. Imagine how much disk space can be saved by deduplicating large
static files, images, and video that have been intentionally or unknowingly stored in multiple locations.
Capacity reductions also occur for customers who push pre-production copies of deduplicated data to target
locations (using StorNext Replication), then push production copies of the same data to the target location
later. With StorNext Deduplication only small amounts of production data are transferred at this time.
StorNext Deduplication can also take advantage of WAN acceleration techniques during replication. Before
newly created data is replicated from the source to the target location, the replication engine will check to
see what deduplicated data already exists at the target site and only send the unique bits of the file across
the WAN, greatly reducing bandwidth and transfer time.
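The check-then-send exchange described above can be sketched in Python. This is a simplified model under stated assumptions: the file is already split into deduplicated blocks, the target's block pool is a plain dictionary, and the `replicate` function and its return values are hypothetical names for illustration.

```python
import hashlib
from typing import Dict, List, Tuple


def digest(block: bytes) -> str:
    return hashlib.sha256(block).hexdigest()


def replicate(blocks: List[bytes], target_pool: Dict[str, bytes]) -> Tuple[List[str], int]:
    """Send only the blocks the target does not already hold.

    blocks: the file as an ordered list of deduplicated blocks.
    target_pool: digest -> block at the target site (updated in place).
    Returns the file recipe and the number of block bytes sent.
    """
    recipe = [digest(b) for b in blocks]
    # Step 1: determine which digests are missing at the target.
    missing = {d for d in recipe if d not in target_pool}
    # Step 2: transfer only those blocks.
    bytes_sent = 0
    for block, d in zip(blocks, recipe):
        if d in missing:
            target_pool[d] = block
            missing.discard(d)  # a block repeated within the file is sent once
            bytes_sent += len(block)
    return recipe, bytes_sent
```

In the pre-production and production scenario from the text, the second call would transfer only the blocks that differ between the two versions, since the rest are already present in `target_pool`.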
For those who need the maximum file retrieval performance available, StorNext Deduplication allows
application and user access to the file before the entire file is completely retrieved. Similarly, utilizing Solid
State Disk (SSD) storage devices to house the block pool index can optimize performance.
For customers who utilize StorNext Deduplication as part of a tiered storage methodology with Storage
Manager, file retrieval will occur from the Deduplication block pool before retrieving from Storage
Manager. The only reason files would be retrieved from Storage Manager is if the block pool is unavailable.
This increases the performance of file retrievals while maintaining the highest possible file preservation
techniques. Protection of the block pool is achieved through the use of StorNext Replication.
Figure 7 (diagram labels: Primary Tier, Vendor Agnostic; Secondary Tier, Vendor Agnostic; StorNext SAN Clients; StorNext Policy Engine; Deduplicated Repository, Vendor Agnostic)
Affinities
The simplest example of a tiered storage practice using the StorNext File System is the use of Affinities.
As mentioned before, StorNext can simultaneously manage multiple types of storage (FC, iSCSI, SAS, and
SATA) from different vendors under one namespace. Affinities are used to steer data within the file system
to a specific stripe group which is made of one or multiple LUNs. Assigned at a directory level, Affinities
can be used to steer data to the most appropriate device. An example is steering very important, high
throughput files to an FC disk volume while temporary files are stored in a directory that is tied to SATA disk.
Users and applications do not see this operation: they simply see the directory structure of the file system,
thus reaping the benefit of a single namespace combined with cost-optimal data location.
The example below shows various data flows in a StorNext environment (see Figure 8). In this example,
data is first stored in directories 1 and 2. Affinities steer data stored in Directory 1 to Storage
Group A, which is a high-performance, mirrored FC storage system, while data in Directory 2 is stored on
Storage Group B, which is a lower-cost SATA-based storage system.
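Directory-level steering of this kind can be modeled as a lookup from a file's path to a stripe group. The Python sketch below is only an illustration: the directory paths, the group labels, and the assumption that a file inherits the affinity of its nearest ancestor directory are all hypothetical.

```python
from pathlib import PurePosixPath
from typing import Dict, Optional

# Hypothetical affinity table: directory -> stripe group label
AFFINITIES: Dict[str, str] = {
    "/projects/dir1": "StorageGroupA",  # high-performance, mirrored FC
    "/projects/dir2": "StorageGroupB",  # lower-cost SATA
}


def stripe_group_for(path: str, affinities: Dict[str, str] = AFFINITIES) -> Optional[str]:
    """Return the stripe group of the nearest ancestor directory with an affinity.

    Returns None when no ancestor directory has an affinity assigned.
    """
    # PurePosixPath.parents yields ancestors from nearest to farthest.
    for ancestor in PurePosixPath(path).parents:
        group = affinities.get(str(ancestor))
        if group is not None:
            return group
    return None
```

Callers only ever pass a path in the single namespace; the placement decision stays hidden behind the lookup, mirroring how users see just the directory structure.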
Figure 8 (diagram labels: StorNext Client; Single Namespace; File System A; Directory 1; Directory 2; Storage Group A; Storage Group B)
Figure 9 (diagram labels: Primary Tier; Secondary Tier; SAN Client; StorNext SAN Clients with DDMs; DDM Server; MDC & DDM; FC/iSCSI; FC)
It is important to note that all Storage Manager features are tightly integrated with the StorNext File
System to provide complete transparency. When files are moved from primary storage to a storage tier, the
StorNext File System metadata is updated to point to the storage tier location. From an application's or user's
perspective, the StorNext File System provides the mechanism to make the entire namespace seem as though
it's on primary disk, even if it's an archived copy on tape. When a moved file is subsequently requested,
the StorNext Storage Manager moves the file back to the StorNext File System's primary disk for access.
For protection purposes, up to four copies of a file can be transparently managed by Storage Manager.
Imagine having a duplicate copy of a file on tier 2 SATA disk for nearline access, a copy in the tape library
and an offsite copy (vaulted copy) in a remote site. In this scenario, there is no need to back up the copy
on SATA, as duplicate copies exist in protected areas. If the SATA failed or became inaccessible, Storage
Manager would automatically and transparently retrieve the data from the next fastest storage area. A
typical use case for this is for protection of large StorNext File Systems where utilizing a backup application
is not feasible due to incompatibility or lengthy backup windows.
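The "next fastest storage area" behavior described above can be sketched as choosing the best-ranked copy that is still reachable. The Python below is a toy under stated assumptions: the `Copy` class, the numeric `rank` ordering, and the `available` flag are hypothetical; only the four-copy limit comes from the text.

```python
from dataclasses import dataclass
from typing import List

MAX_COPIES = 4  # from the brief: up to four copies of a file


@dataclass
class Copy:
    location: str            # e.g. "sata", "tape", "vault"
    rank: int                # lower rank = faster retrieval (hypothetical ordering)
    available: bool = True


def pick_copy(copies: List[Copy]) -> Copy:
    """Return the fastest copy that is currently available."""
    if len(copies) > MAX_COPIES:
        raise ValueError(f"at most {MAX_COPIES} copies are managed per file")
    candidates = [c for c in copies if c.available]
    if not candidates:
        raise LookupError("no copy of the file is available")
    return min(candidates, key=lambda c: c.rank)
```

If the SATA copy is marked unavailable, the same call falls through to the tape copy without the caller changing anything.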
Summary
No longer do IT Managers and Storage Administrators have to choose between performance and scale.
Quantum StorNext File System delivers high-performance shared pools of storage to optimize file sharing to
increase productivity and scale with the demands of your organization, while providing enterprise solutions
for managing and protecting business critical data.
High Performance Data Sharing
SAN File System: delivers high-performance file creation, storing, & sharing
Distributed LAN Client: provides NAS-like scalability to thousands of server nodes
Shared File System: offers simultaneous file access across platforms
Platform Independence: supports Windows, Linux, Mac and UNIX
Enterprise Data Management & Protection
Storage Vendor Agnostic: supports all major Disk and Tape systems
Replication: enables flexible data protection and data distribution solutions
Deduplication: optimizes capacity and cost of nearline and archived storage
Management Console: simplifies data management
Storage Manager: drives transparent tiered storage and archive
Distributed Data Mover (DDM): improves access performance and scalability of storage tiers
The data management challenges experienced today will only become increasingly complex and
overwhelming over time if the right solutions are not deployed now. StorNext is designed to meet the
demands of the enterprise organization today and into the future.
Additional Information
For additional details on StorNext File System and StorNext Storage Manager, including integration options
for your environment, consult Quantum's Software Architecture Group or contact the Software team at
softwareinfo@quantum.com.
About Quantum
Jan 2010