You are on page 1of 100

MODULE 2

CLASSIC DATA CENTER

Copyright 2011 EMC Corporation. All Rights Reserved. 1


Module 2: Classic Data Center (CDC)

Upon completion of this module, you should be able to:


Describe the key elements of a CDC (compute, storage, and network)
Describe the common storage networking technologies in a CDC
Explain business continuity technologies commonly used in a CDC
Discuss CDC management

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 2
Module 2: Classic Data Center (CDC)

Lesson 1: Application, DBMS, Compute, and Storage


Topics covered in this lesson:
Application and DBMS
Physical and logical components of a compute system
Storage device options
RAID technology and Intelligent storage system

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 3
Classic Data Center (CDC)
CDC

A CDC is a facility containing physical IT resources including compute,


network, and storage

Classic Data Center

Core elements of CDC

Application
Database Management System
(DBMS)
Compute

Storage

Network

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 4
Key Requirements of a Data Center

Availability

Data Integrity Performance

Manageability

Security Flexibility

Scalability

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 5
Application
Commonly deployed applications in a CDC
Business applications
E-mail, Enterprise Resource Planning (ERP), Decision Support System
(DSS), Data Warehouse (DW)
Management applications
Resource management, performance tuning
Data protection applications
Backup, replication
Security applications
Authentication, antivirus
Key I/O characteristics of an application
Read intensive vs. write intensive
Sequential vs. random

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 6
Database Management System (DBMS)
Database is a structured way to store data in logically organized
tables that are interrelated
Helps to optimize the storage and retrieval of data
DBMS is a collection of computer programs that control the
creation, maintenance, and use of databases
Processes an applications request for data
Instructs the OS to retrieve the appropriate data from storage
Popular DBMS examples are MySQL, Oracle RDBMS, SQL Server,
etc.

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 7
Compute
Compute

A resource that runs applications with the help of underlying computing


components

Compute consists of physical components (hardware devices) and logical


components (software and protocols)
Physical components of compute are CPU, Memory, and Input/Output (I/O)
devices
I/O devices facilitate the following types of communication:
User to compute: Handled by basic I/O devices such as keyboard, mouse, etc.
Compute to compute/storage: Enabled using host controller or host adapter

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 8
Examples of Compute System
Examples of compute systems:
Laptops/Desktops
Blade servers
Complex cluster of servers
Mainframes
Bladed server technology is commonly used to deploy compute
systems in a CDC
Consolidates power- and system-level function into a single,
integrated chassis
Enables the addition of server modules as hot-pluggable
components
Provides increased server performance and availability without
increase in size, cost, or complexity

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 9
Server Clustering
Multiple servers (nodes ) are brought together in a cluster to
improve availability and performance
When a failure occurs on one node in a cluster, resources and
workload are redirected to another node
Exchange heartbeat is a checkup mechanism between two nodes
To see whether a node is up and running
A failover is initiated, if heartbeat fails

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 10
Logical Components of Compute

Applications

File System

Operating System
Logical Components
Volume Manager

Device Drivers

Storage

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 11
Storage
Storage

It is a resource that stores data persistently for subsequent use.

Data created by individuals/businesses must be stored for further


processing
The type of storage device used is based on the type of data and
the rate at which it is created and used
A storage device may use magnetic, optical, or solid state media
Examples: Disk drive (magnetic), CD (optical), Flash drive (solid state)

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 12
Storage Device Options

Low cost solution for long term data storage


Tape Drive Sequential data access, physical wear and
tear, and storage/retrieval overheads

Write Once and Read Many (WORM): CD,


DVD
Optical Disk Limited in capacity and speed
Popular in small, single-user environments
Random read/write access
Uses mechanical parts for data access
Disk Drive
Most popular storage device with large
storage capacity
Provides ultra high performance required by
Solid State Drive mission-critical applications
Very low latency per I/O, low power
requirements, and very high throughput per
drive

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 13
Redundant Array of Independent Disks (RAID)
RAID

A technology which utilizes multiple disk drives as a set to provide


protection, capacity, and/or performance benefits

Overcomes limitations of disk drives


Improves storage system performance
By serving I/Os from multiple disks simultaneously
RAID techniques are:
Striping, mirroring, and parity

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 14
RAID Techniques

Striping Mirroring Parity

A A A

RAID Controller RAID Controller RAID Controller

A1 A2 A3 A A A1 A2 Ap

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 15
RAID Levels

RAID Levels Definition


RAID 0 Striping with no fault tolerance
RAID 1 Disk mirroring
Nested Combinations of RAID levels; Example: RAID 1 + RAID 0
RAID 3 Parity RAID with dedicated parity disk
Parity RAID with distributed parity across all the disks in the
RAID 5
set
RAID 6 Distributed parity RAID with dual parity

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 16
Intelligent Storage System
Is a RAID array highly optimized for I/O processing
Have large amounts of cache for improving I/O performance
Have operating environments that provide:
Intelligence for managing cache
Optimal management, allocation, and utilization of storage
resources

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 17
Components of an Intelligent Storage System

Intelligent Storage System

Compute Front-end Back-end Physical Disks

Cache

Network

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 18
Presenting Storage to Compute System

Intelligent Storage System

LUN 0 Front-end Back-end


Physical Disks

Cache
LUN 0

Network
Compute

LUN 1

LUN 1
Controller RAID Set

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 19
Module 2: Classic Data Center (CDC)

Lesson 2: Storage Networking Technologies -1


Topics covered in this lesson:
Compute to compute communication
Compute to storage communication
Direct Attached Storage (DAS)
Fibre Channel SAN (FC-SAN)

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 20
Compute to Compute Communication
Typically uses Ethernet or TCP/IP protocol
LAN, MAN, and WAN
Communication is enabled using various components:
Network Interface Card (NIC)
Has unique MAC address
Switches and routers
Switch provides scalability and interconnection between multiple
compute systems
Routers allow different networks to communicate with each other
Cables
Twisted pair, co-axial cable, optical fiber

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 21
Compute to Storage Communication
Communication is enabled using various hardware components
(HBA, CNA, NIC, switch, router, gateway ,and cables) and
protocols
Communication between compute and storage can be done
using channel or network technologies
Channel Technology Network Technology
Compute system and peripheral Compute system and peripheral devices
devices are connected through are connected over a network
channel
Provides low protocol overhead due to High protocol overhead due to network
tight coupling connection
Supports transmission only over short Supports transmission over long
distances distances
Protocol examples: PCI, IDE/ATA, Protocol examples: iSCSI( SCSI over
SCSI, etc. IP), FCoE ( Fibre Channel over
Ethernet), and FC

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 22
Communication Protocols
Peripheral Component Interconnect (PCI)
Provides interconnection between CPU and attached devices
Latest PCI Express bus provides throughput of 133 MB/sec
Integrated Device Electronics/Advanced Technology Attachment
(IDE/ATA)
Popular protocol to connect to disk drives
Supports 16-bit parallel transmission
Serial version is called Serial ATA (SATA)
Both versions offer good performance at a relatively low cost

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 23
Communication Protocols (contd.)
Small Computer System Interface (SCSI)
Preferred storage connectivity option for high-end
environments
Improved performance, scalability, and high cost when
compared to ATA
Serial version is called Serial Attached SCSI (SAS)
Transmission Control Protocol/Internet Protocol (TCP/IP)
Traditionally used for compute to compute communication
Now used for compute to storage communication also
iSCSI (SCSI over IP) and FCoE (Fibre Channel over Ethernet) are
examples

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 24
Data Access by Compute

Compute Application Application


Compute
File System

File level
Request Network
Block level
Network Request

File System

Storage
Storage Storage Storage

File level Access Block level Access

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 25
Direct Attached Storage (DAS)
DAS

An internal or external storage device, which connects directly to a


compute system

DAS is classified as internal or external based on the location of


the storage device with respect to the compute system
Benefits:
Simple to deploy and ideal for local data provisioning
Low capital expense and less complexity
Challenges:
Limited scalability
Limited ability to share resources
Islands of over and under utilized storage resources

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 26
Emergence of Storage Networking Technologies
Just-in-time information for business users
Flexible and resilient storage architecture
DAS is inefficient to fulfill these requirements
Storage networking technologies emerged as a solution
Fibre Channel SAN (FC SAN)
Network Attached Storage (NAS)
Internet Protocol SAN (IP SAN)
Fibre Channel over Ethernet (FCoE)
Object Based storage
Unified storage

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 27
What is FC SAN ?
Dedicated high speed network of Servers Servers Servers

compute systems and shared storage


devices
Uses SCSI over FC protocol
Provides block level data access FC SAN

Benefits

Enables storage consolidation and


sharing
Enables centralized Management
Provides scalability and high
performance
Storage Array
Reduces storage and administration Storage Array

cost

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 28
Components of FC SAN
Node ports
Cables
Connectors
Interconnecting Devices
Storage Arrays
SAN Management software

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 29
Fibre Channel Fabric
FC Fabric Servers

One or more interconnected FC switches


through which multiple SAN nodes can
communicate

In a switched fabric, the link between any


two switches is called Inter Switch Link
Inter switch links
(ISL)

Storage Array Storage Array

Server Server

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 30
Port Types

N-Port

FC Switch FC Switch
Compute

F-Port

F-Port E-Port E-Port F-Port

Storage Array Storage Array


N-Port N-Port

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 31
FC SAN Addressing
Fibre Channel Address
Used to communicate between nodes within SAN
Similar in functionality to an IP address on NICs
24 bit address, dynamically assigned

World Wide Name: Unique 64 bit identifier


Static to the port, similar to NICs MAC address
Used to physically identify ports or nodes within SAN
World Wide Name Array
5 0 0 6 0 1 6 0 0 0 6 0 0 1 B 2
0101 0000 0000 0110 0000 0001 0110 0000 0000 0000 0110 0000 0000 0001 1011 0010

24 bits Model Seed 32 bits

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 32
Zoning
Switch Domain ID = 15
Server 1

Port 5 Storage Array


WWN 10:00:00:00:C9:20:DC:40 Zone 2 Port 7

Zone 3 Port 1

Server 2

Port 12
Port 9
WWN 50:06:04:82:E8:91:2B:9E
WWN 10:00:00:00:C9:20:DC:56

Zone 1

Server 3 Zone 1 (WWN Zone) = 10:00:00:00:C9:20:DC:82 ; 50:06:04:82:E8:91:2B:9E


WWN 10:00:00:00:C9:20:DC:82 Zone 2 (Port Zone) = 15,5 ; 15,7
Zone 3 (Mixed Zone) = 10:00:00:00:C9:20:DC:56 ; 15,12

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 33
Module 2: Classic Data Center (CDC)

Lesson 3: Storage Networking Technologies -2


Topics covered in this lesson:
Internet Protocol SAN (IP-SAN)
Fibre Channel over Ethernet (FCoE)
Network Attached Storage (NAS)

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 34
IP-SAN
IP-SAN

It is a technology that provides transfer of block level data over an IP network.

IP is being positioned as a storage transport because:


Offers easier management
Allows existing network infrastructure to can be leveraged
Reduces cost compared to new SAN hardware and software
Supports multi-vendor interoperability
Many long-distance disaster recovery solutions already leverage IP-
based networks
Many robust and mature security options are available for IP
networks

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 35
Block Storage Over IP Protocol Options
Storage Array
iSCSI (SCSI over IP) iSCSI Gateway

Encapsulation of SCSI data (a) iSCSI Implementation


in IP packets
Ethernet NIC card Server FC Port

TOE card
IP
iSCSI HBA
Hardware-based gateway iSCSI HBA
to Fibre Channel storage
iSCSI Port
Used to connect compute
systems
FCIP Storage Array
(b) FCIP Implementation
Fibre Channel-to-IP FCIP
FCIP
bridge/tunnel (point-to- gateway
gateway

point)
IP
Fibre Channel end
points
FC HBA FC Port
Used in DR Storage Array Storage Array

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 36
iSCSI Topologies
Native Native iSCSI Port Storage Array

No FC components Servers

iSCSI Initiators IP
connect directly to the
Array iSCSI HBA

Bridged Servers
iSCSI Gateway

Translates iSCSI/IP to
IP Storage Array
FC
iSCSI initiator
iSCSI HBA
configured with Servers
bridge as target FC SAN

Bridge acts as virtual


FC initiator FC Port

FC HBA

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 37
Fibre Channel over IP (FCIP)
IP-based storage networking technology
Combines advantages of Fibre Channel and IP
Creates virtual FC link that connects devices in a different fabric
Distance extension solution
Used for data sharing over geographically dispersed SAN

FCIP gateway FCIP gateway

FC SAN IP FC SAN

Host

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 38
Fibre Channel over Ethernet (FCoE)
A protocol that encapsulates Fibre Channel frames for transport
over Enhanced Ethernet networks
Enables the consolidation of SAN traffic and Ethernet traffic onto
a common 10 Gigabit Ethernet infrastructure
Consolidates compute to compute and compute to storage
communication over a single channel

Benefits

Lowers capital expenditure


Reduces power and cooling requirements
Enables consolidation of network infrastructure
Lowers Total Cost of Ownership (TCO)

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 39
I/O Consolidation with FCoE


Hosts

FCoE links

FCoE Switch

LAN


Storage Arrays

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 40
Components of FCoE
Converged Network Adapter(CNA)
Multi function adapter
Performs the data networking of NIC and storage networking of
HBA
FCoE Switch
Contains Ethernet bridge and Fibre Channel Forwarder (FCF)
FCF encapsulates FC frames into FCoE frames and de-capsulates
FCoE frames to FC frames
Converged Enhanced Ethernet (CEE)
Extensions to conventional Ethernet standard to eliminate its
lossy nature

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 41
File Sharing Technology Evolution
Networked File Sharing

Portable Media Networked PCs


Stand Alone PC for File Sharing

Network Attached Storage (NAS)

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 42
What is Network Attached Storage(NAS)?
NAS

A storage device connected to a network that provides file level data


access to heterogeneous clients Clients

Application Print
Server Server NAS Device

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 43
Benefits of NAS
Supports comprehensive access to information
Provides improved efficiency
Provides improved flexibility
Provides centralized storage
Simplifies management
Enables scalability
High availability through native clustering
Provides security integration to environment (user
authentication and authorization)

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 44
Components of NAS

UNIX
NFS Network Interface
NAS Head
NFS CIFS
IP
NAS Device OS

CIFS Storage Interface

Windows
Storage Array

NAS

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 45
Module 2: Classic Data Center (CDC)

Lesson 4: Object Based and Unified Storage Technologies


Topics covered in this lesson:
Object Based Storage
Unified Storage

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 46
Object Based Storage
Object Based Storage combines data with rich metadata to
create an object
Object Based Storage stores data in a flat address space
There are no hierarchies or nested directories
Each object is identified by an unique ID (Object ID)
Generated by a hashing function

OID Object Object

Object

Data Metadata Object Object

Object
Object Object

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 47
Object Based Storage (contd.)
Object Based Storage uses HTTP communication as its standard
interface
SOAP and REST are the protocols commonly used in object based
communication in Cloud
Simple Object Access Protocol (SOAP ) is used for communication
between peers in a distributed environment
Uses Extensible Markup Language ( XML) framework
Representational State Transfer (REST) is used to retrieve
information from a Website by reading Web pages

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 48
Why Object Based Storage ?
Increasing amount of unstructured
data
SAN is highly scalable and supports
data access at a block level
Not a good option for data sharing
NAS is a good option for applications
which need to share data
Limited scalability due to hierarchical
structure
Object Based approach potentially
eliminates SAN and NAS limitations
Highly scalable with data sharing
capabilities

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 49
Benefits of Object Based Storage
Automates and simplifies storage management
Ensures data integrity
Ensures compliance and auditability
Enables easy data migration
Enables self healing
Facilitates intelligent replication
Allows flexible scalability

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 50
Unified Storage
Provides Consolidated Management interface for NAS, SAN,
iSCSI, FCoE, and object based technologies

SOAP and REST


FC FCoE NFS and CIFS

LAN/WAN

Unified Storage

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 51
Benefits of Unified Storage
Provides consolidated multi-protocol storage
File: NFS, CIFS
Block: iSCSI, FC, FCoE
Object: REST, SOAP
Simplifies administration
Integrated management interface
Reduces cost of storage assets, along with power, cooling, and
space
Provides a highly scalable architecture

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 52
Module 2: Classic Data Center (CDC)

Lesson 5: Business Continuity Overview and Backup


Topics covered in this lesson:
Business Continuity (BC) Terminologies
Backup Granularity
Backup Components and Operation
Deduplication: Types and methods

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 53
Business Continuity
BC

Processes and/or procedures for ensuring continued business


operations

BC solutions address unavailability and degraded application


performance
BC is an integrated and enterprise wide process and set of
activities to ensure information availability

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 54
BC Terminologies

BC Terminologies Description

Disaster Recovery(DR) Coordinated process of restoring systems, data, and


infrastructure required to support ongoing business
operations in the event of a disaster
Hot Site A site where an enterprises operations can be moved
in the event of disaster
DR site infrastructure is up and running all the times
Cold Site The IT infrastructure required to support DR is not
activated
Cluster A group of servers and other necessary resources,
coupled to operate as a single system

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 55
RTO and RPO
Recovery Point Objective (RPO) Recovery Time Objective (RTO)
Point in time to which systems Time within which systems,
and data must be recovered applications, or functions must
after an outage be recovered after an outage
Amount of data loss that a Amount of downtime that a
business can endure business can endure and survive

Weeks Weeks
Tape Backup Tape Restore
Days Days
Periodic Replication Disk Restore
Hours Hours
Asynchronous Replication Manual Migration
Minutes Minutes

Seconds
Synchronous Replication Seconds Global Cluster

Recovery-point objective Recovery-time objective

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 56
BC Technology Solutions
Following are the solutions and supporting technologies that
enable business continuity and uninterrupted data availability:
Eliminating single points of failure
Multi-pathing software
Backup
Backup/restore
Replication
Local replication
Remote replication

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 57
Backup and Recovery
Backup is an additional copy of data that can be used for restore
and recovery purposes
The Backup copy is used when the primary copy is lost or
corrupted
This Backup copy can be created by:
Simply copying data (there can be one or more copies)
Mirroring data
The Backup purposes are:
Disaster Recovery
Operational backup
Archival

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 58
Backup Granularity

Full Backup

Su Su Su Su Su

Cumulative (Differential) Backup

Su M T W T F S Su M T W T F S Su M T W T F S Su M T W T F S Su

Incremental Backup

Su M T W T F S Su M T W T F S Su M T W T F S Su M T W T F S Su

Amount of data backup

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 59
Backup Components
Backup client
Sends backup data to
backup server or storage
node
Backup server
Manages backup
Storage Array
operations and Backup Data

maintains backup catalog


Storage node Application Server/ Backup Server/
Responsible for writing Backup Client Storage Node

data to backup device


Backup device
Backup device
Stores backup data

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 60
Backup and Restore Operation
Backup operation
Backup server initiates a scheduled backup
Instructs storage node to load backup media and instructs clients to
send backup data to the storage node
Storage node sends backup data to backup device and media
information to backup server
Backup server updates catalog and records the status
Restore operation
Backup client initiates the restore
Backup server scans backup catalog to identify data to be restored
and the client that will receive the data
Backup server instructs storage node to load backup media
Storage node restores the backup data to the client and sends
metadata to the backup server

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 61
Backup Technology Options
Backup to Tape
Physical tape library
Backup to Disk
Backup to virtual tape
Virtual tape library

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 62
Backup Optimization: Deduplication
Deduplication

Technology that conserves storage capacity and/or network traffic by


eliminating duplicate data.

Can be implemented at:


File level
Block/chunk level
Deduplication can be:
Source Based (client)
Target Based (Storage Device)

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 63
Benefits of Deduplication
By eliminating redundant data, far less infrastructure is required
to hold the backup images
Lowers infrastructure costs
Reduces the amount of redundant content in the daily backup
Enables longer retention periods
Reduces backup window and enables faster restore
Less data to be backed up
Enables creation of daily full backup images

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 64
Where does Deduplication Occur?

Source Based Deduplication Target Based Deduplication


Data is Deduplicated at the source Data is Deduplicated at the target
(backup client) (backup device)
Backup client sends only new, unique Backup client sends native data to the
segments across the network to the backup device
backup device Increased network bandwidth and
Reduced storage capacity and network storage capacity requirements
bandwidth requirements and
increased overhead on the backup
client
Deduplication at Source Deduplication at Target

SAN
SAN

Backup client Backup device Backup client Backup device

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 65
Deduplication : Methods
Single Instance Storage (SIS)
Detects and removes redundant copies of identical files
After a file is stored in the SIS system, all other references to the
same file refer to the original copy
Sub-file Deduplication
Identifies and filters repeated data segments stored in files
Within a single system and across multiple systems
Compression
Reduces file size
Identifies and removes blank spaces and repeated data chunks
Can be performed at source(client) or target(storage device)

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 66
Module 2: Classic Data Center (CDC)

Lesson 6: Replication Technologies


Topics covered in this lesson:
Types of Replication
Local Replication Methods
Remote Replication Methods

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 67
What is Replication?
Process of creating an exact copy of data
Drivers for replication
Alternate source for backup
Fast recovery
Decision support
Testing platform
Restart from replica
Classification of Replication:
Local replication
Remote replication
REPLICATION

Source Replica (Target)

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 68
Replica: Types and Characteristics
Types of Replica: Choice of replica ties back to RPO
Point-in-Time (PIT)
Non-zero RPO
Continuous
Near-zero RPO
Characteristics of a good replica:
Recoverability
Replica should be able to restore data on the source device
Restartability
Restart business operation from replica
Consistency
Consistent replica ensures that the data buffered in the compute
system is properly captured on the disk when the replica is created

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 69
Local Replication
Process of replicating data within the same array or the same
data center
Compute based replication
Replication is performed by using CPU resources of the compute
system
Types: LVM based mirroring and File system Snapshot
Storage array based replication
Replication is performed by using CPU resources of the storage
array
Types of Storage array based replication techniques:
Full volume mirroring
Pointer based full volume replication
Pointer based virtual replication

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 71
Compute Based Replication
Logical Volume Manager based mirroring
Each logical partition in a logical volume is mapped to two physical
partitions on two different physical volumes
Write to a logical partition is written to the two physical partitions
File System Snapshot
Pointer-based local replication uses Copy on the First Write (CoFW)
principle
Uses bit map and block map
Requires a fraction of the space used by the production FS

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 72
Full Volume Mirroring
Target is a full physical copy of the source device
Target is attached to the source and data from the source is
copied to the target
Target is unavailable while it is attached
Target device is as large as the source device

Attached

Read/Write Not Ready

Source Target
Array

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 73
Pointer Based Full Volume Replication
Provides a full copy of the source data on the target
Target device is made accessible for business operations as soon
as the replication session has started
Point-in-Time (PIT) is determined by the time of session
activation
Two modes
Copy on First Access (deferred)
Full Copy mode
Target device is at least as large as the source device

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 74
Pointer Based Virtual Replication
Targets do not hold actual data, but hold pointers to where the
data is located
Target requires only a small fraction of the size of the source
volumes
Target devices are accessible at the start of session activation
Pointer based virtual replication uses Copy on First Write (CoFW)
technology
When a write is issued for the first time to source or target:
Original data at that address is copied to a predefined area in the
storage array called Save location
Pointers in the source/target are updated to point to Save
location

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 75
Remote Replication
Remote Replication

A process of creating and maintaining copies of data from a production site


to remote site(s)

Addresses risks associated with regionally driven outages


Modes of remote replication (based on RPO requirements)
Synchronous
Replica is identical to source at all times near zero RPO
Asynchronous
Replica is behind the source by a finite time finite RPO
Network infrastructure is required between source and target

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 76
Synchronous Vs. Asynchronous Replication

Synchronous Replication Asynchronous Replication


A write must be committed to the Write is committed to the source and
source and remote replica before it is immediately acknowledged to the
acknowledged to the compute system compute system. Data is buffered at the
source and transmitted to the remote
replica later
Application response time will be Application response time is unaffected
extended
To minimize impact on response time, Needs only average network bandwidth
maximum network bandwidth must be
provided at all times
Rarely deployed beyond 200 km Deployed over long distances

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 77
Compute-based Remote Replication
Replication is done by using the CPU resources of the compute
system, using a software that is running on the compute
The remote replication methods are:
LVM-based
All writes to the source Volume Group are replicated to the target
Volume Group by the LVM
Can be in synchronous or asynchronous mode
Database Log Shipping
Transactions to the source database are captured in logs, which are
periodically transmitted by the source compute system to the
remote compute system
Remote compute system applies these logs to the remote database

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 78
Storage Array Based Remote Replication
Performed by array operating environment
Synchronous Replication
Asynchronous Replication
Disk buffered Replication (shown in the figure)
Combination of local and remote replications
RPO usually in the order of hours
Low bandwidth requirements
Extended distance solution

Source storage array Target storage array

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 79
Advanced Replication Technologies
Three site replication
Eliminates disadvantages of two site replication
Replicates data to two remote sites
SAN based replication
Allows replication between heterogeneous vendor storage arrays
over SAN/WAN
Continuous Data Protection (CDP)
Changes to data are continuously captured or tracked

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 80
Continuous Data Protection
All data changes are stored in a location separate from the primary
storage
Recovery point objectives are arbitrary and need not be defined in
advance of the actual recovery
CDP Elements are:
CDP appliance
Runs the CDP software and manages all the aspects of local and
remote replication
Storage Volumes
Repository volume, journal volume, and replication volume
Write Splitters
Intercepts write from initiator and splits each write into two copies

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 81
Module 2: Classic Data Center (CDC)

Lesson 7: CDC Management


Topics covered in this lesson:
Key Management activities in a CDC
Information Lifecycle Management (ILM)

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 82
Overview of CDC Management Activities
Key management activities in a CDC:
Monitoring and Alerting
Reporting
Availability Management
Capacity Management
Performance Management
Security Management

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 83
Monitoring
Compute systems, storage, and networks are the key
components to be monitored
Key Parameters to be Description
Monitored
Accessibility Availability of a component to
perform a desired operation
Capacity Amount of resources available
For ex: free space available on a
file system or RAID group
Performance How efficiently different
components are performing
Security Mechanisms to track and prevent
unauthorized access

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 84
Alerting of Events
Alerting is an integral part of monitoring
Monitoring tools enables administrators to assign different
severity levels for different alerts
Levels of Alerts based on Description
Severity
Information alert Provides useful information and
may not require administrator
intervention
For ex: creation of zone or LUN
Warning alert Require administrative attention
For ex: file systems becoming full
Fatal alert Require immediate administrative
attention
For ex: power failures/disk
failures/memory failures

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 86
Reporting
Reporting on CDC resources involves keeping track and gathering
information from various components/processes

Type of Report Description


Capacity Planning Provides current and historic information about
utilization of storage, file system, database
tablespace, ports, and so on
Chargeback Provides information about the allocation or
utilization of the CDC infrastructure components
by various departments or user groups
Performance Provides details about the performance of various
infrastructure components in a CDC

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 87
Availability Management
Establishes guidelines for all configurations to achieve high
availability based on service level requirements
Ensures high availability by:
Eliminating single points of failure by configuring
Two or more HBAs/NICs
Multipathing software
RAID protection
Redundant Fabrics
Performing data backup and replication

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 88
Capacity Management
Ensures adequate availability of resources based on their service
level requirements
Manages resource allocation
Key activities
Trend and Capacity analysis
Storage provisioning
Examples
Compute: Compute configuration and file system/DB management
Storage: Device configuration and LUN Masking
SAN: Unused Ports and Zoning

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 89
Performance Management
Configure/design for optimal operational efficiency
Performance analysis
Identify bottlenecks
Fine tuning for performance enhancement
Key activities:
Compute: Volume management, database/application layout
Storage Array: Choice of RAID type and layout of devices (LUNs)
and choice of front-end ports
SAN: Designing sufficient ISLs with adequate bandwidth

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 90
Security Management
Prevents unauthorized activities or access
Key activities:
Compute
Creation of user logins and user privileges
Storage Array
LUN masking prevents data corruption on the storage array by
restricting compute access to a defined set of logical devices
SAN
Configuration of zoning to restrict unauthorized HBAs

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 91
Managing Information in CDC Challenges
Exploding digital universe
Multifold increase of information growth
Increasing dependency on information
The strategic use of information plays an important role in
determining the success of an organization
Changing value of information
Information that is valuable today may become less important
tomorrow

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 92
Information Lifecycle Management (ILM)
ILM
A proactive strategy that enables an IT organization to effectively align the
business value of information with the most appropriate and cost-effective
infrastructure, from the time information is created, through its final
disposition

Protect

New Process Deliver Warranty


order order order claim
Time
Value

Fulfilled Aged Warranty


order data Voided

Create Access Migrate Archive Dispose

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 93
Module 2: Classic Data Center (CDC)

Concepts in Practice
Topics covered in this lesson:
EMC Symmetrix VMAX
EMC VNX
EMC Connectrix
EMC Backup Solutions
EMC Replication Solutions
EMC Storage Management Solutions

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 95
Concept in Practice: Storage Array Solutions
EMC Symmetrix VMAX
Intelligent Storage System built on Virtual
Matrix Architecture
EMC VNX
Unified storage platform for multi protocol
file, block and object storage
VNXe series and VNX series

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 96
Concept in Practice: EMC Connectrix
The Connectrix family consists of:
Enterprise directors
Departmental switches
Multi-protocol routers

DCX Director
MDS-9513 Director MP-7800B

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 97
Concepts in Practice: EMC Backup solutions
EMC Avamar
Disk-based backup and recovery solution that provides inherent
source-based data deduplication
EMC Data Domain
Target-based data Deduplication solution
Data Invulnerability Architecture
EMC Networker
Provides centralized, automated backup and recovery operations
across an enterprise
Diverse computing and storage environments
Provides both source and target-based deduplication capabilities
by integrating with Avamar and Data Domain respectively

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 98
Concepts in Practice: EMC Replication Solutions
Name of the product Description
EMC Symmetrix Remote Data Facility SRDF offers a family of technology solutions to
(SRDF) implement storage array based remote
replication technologies
EMC MirrorView MirrorView is a VNX-based software that
enables storage arraybased remote replication
EMC Open Replicator Open Replicator is a Symmetrix software that
performs SAN-based remote replication
between Symmetrix and qualified storage
arrays
It has full or incremental copy capabilities
EMC Timefinder EMC TimeFinder family of products is used for
full volume and pointer-based local replication
in Symmetrix storage array
EMC Snapview SnapView is a VNX based local replication
software that creates point-in-time views or
point-in-time copies of logical volumes
EMC Recover point It is a product which offers CDP functionality

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 99
Concepts in Practice: EMC Storage Resource
Management Solution
EMC ProSphere
Storage management solution for the data center
Monitoring and alerting
Performance analysis
Capacity reporting and analysis
Configuration reporting and validation
EMC Unisphere
Provides a flexible, integrated management interface for managing
unified storage
Provides intuitive task-based controls, customizable dash boards,
and single-click self help access to real-time support tools

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 100
Module 2: Summary
Key points covered in this module:
Key components of a CDC
Storage networking technologies in a CDC
BC technologies in a CDC
Key management activities in a CDC

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 101
Check Your Knowledge
1. What are the common protocols used in channel and network
communication?
2. What are the different storage device options?
3. What are the components of FC SAN?
4. What are the main drivers for the emergence of IP-SAN?
5. What are the characteristics of a good replica?
6. What are the storage array based remote replication
techniques?
7. What are the key management activities in a CDC?

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 102
MODULE 2 QUIZ

Copyright 2011 EMC Corporation. All Rights Reserved. Classic Data Center 103

You might also like