Gary Leong
HPCRD Systems Engineer
High Performance Computing Research
Lawrence Berkeley National Laboratory
High Performance Computing
Research Department
COMPUTATIONAL RESEARCH DIVISION
ZFS – Why?
ZFS – At a glance
ZFS is like “Virtual Memory”
ZFS – VM similarity
ZFS – Volumes and Pool Storage
Traditional Volumes
ZFS – is like a “merged FS w/ RAID/Volume manager”
ZFS – is like an attached “NAS”
Think of having a NAS with its integrated filesystem, RAID, and other features attached
locally, directly to VFS instead of through the network.
ZFS – “NAS” like elaborated
ZFS - Copy on Write (COW)
ZFS - Central Administration
ZFS - Other notable features
ZFS - Data checksum
Patterned after a Merkle tree: the checksum at each level validates everything below it
Similar in spirit to ECC memory
Checksums are kept separate from the data they protect
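The Merkle-tree idea above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not ZFS's on-disk format: the `Block` and `BlockPointer` classes and the `point_to` and `verify` helpers are hypothetical names, and SHA-256 stands in for whichever checksum the pool is configured to use. The point it shows is that each checksum is held by the parent, apart from the data it covers, so a corrupted block cannot vouch for itself.

```python
import hashlib
from dataclasses import dataclass, field
from typing import List


def checksum(payload: bytes) -> str:
    """SHA-256 is an assumption here, standing in for the real checksum."""
    return hashlib.sha256(payload).hexdigest()


@dataclass
class Block:
    """Hypothetical block: leaves carry data, interior blocks carry pointers."""
    data: bytes = b""
    children: List["BlockPointer"] = field(default_factory=list)

    def payload(self) -> bytes:
        # An interior block's contents include its children's checksums,
        # so a checksum over this block covers everything beneath it.
        return self.data + "".join(p.expected for p in self.children).encode()


@dataclass
class BlockPointer:
    """Pointer held by the parent: the checksum lives here, not in the child."""
    target: Block
    expected: str


def point_to(block: Block) -> BlockPointer:
    """Build a pointer that records the block's current checksum."""
    return BlockPointer(target=block, expected=checksum(block.payload()))


def verify(ptr: BlockPointer) -> bool:
    """Check a block against its parent's checksum, then recurse downward."""
    if checksum(ptr.target.payload()) != ptr.expected:
        return False
    return all(verify(child) for child in ptr.target.children)


leaf_a = Block(data=b"hello")
leaf_b = Block(data=b"world")
interior = Block(children=[point_to(leaf_a), point_to(leaf_b)])
root = point_to(interior)

intact = verify(root)
leaf_b.data = b"w0rld"  # simulate silent corruption of one leaf
corrupted = verify(root)
print(intact, corrupted)  # prints "True False"
```

Because the walk starts from the root pointer, one trusted checksum at the top is enough to validate the whole tree.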
ZFS - ZIL
ZFS - RAIDZ
ZFS - Truths (no marketing)
ZFS - Experimental Results
Writer Performance: ZFS/UFS (Disksuite)
[Figure: two surface plots, one for ZFS and one for UFS (Disksuite). Axes: throughput (kB/sec), file size (64 to 524288 kB), record size (4 to 16384 kB).]
Re-writer Performance: ZFS/UFS (Disksuite)
[Figure: two surface plots, one for ZFS and one for UFS (Disksuite). Axes: throughput (kB/sec), file size (64 to 524288 kB), record size (4 to 16384 kB).]
Reader Performance: ZFS/UFS (Disksuite)
[Figure: two surface plots, one for ZFS and one for UFS (Disksuite). Axes: throughput (kB/sec), file size (64 to 524288 kB), record size (4 to 16384 kB).]
Re-reader Performance: ZFS/UFS (Disksuite)
[Figure: two surface plots, one for ZFS and one for UFS (Disksuite). Axes: throughput (kB/sec), file size (64 to 524288 kB), record size (4 to 16384 kB).]
Random Read Performance: ZFS/UFS (Disksuite)
[Figure: two surface plots, "ZFS: Random Read Performance - 5 disks" and "UFS: Random Read Performance - 5 disks". Axes: throughput (kB/sec), file size (64 to 524288 kB).]
Random Write Performance: ZFS/UFS (Disksuite)
[Figure: two surface plots, one for ZFS and one for UFS (Disksuite). Axes: throughput (kB/sec), file size (64 to 524288 kB), record size (4 to 16384 kB).]
ZFS – Summary/Conclusions
ZFS - Upcoming features
ZFS - References
Jeff Bonwick. ZFS: The Last Word in File Systems. Sun Microsystems.
Jeff Bonwick. ZFS: The Last Word in Filesystems. Jeff Bonwick's Blog
(http://blogs.sun.com/roller/page/bonwick?entry=raid_z)
Neil Perrin. ZFS: The Lumberjack. Neil Perrin’s Weblog
(http://blogs.sun.com/roller/page/perrin?entry=the_lumberjack)
ZFS. Wikipedia, the free encyclopedia (http://en.wikipedia.org/wiki/ZFS)
Matthew Ahrens. What is ZFS? Matthew Ahrens’s Weblog
(http://blogs.sun.com/roller/page/ahrens?catname=%2FZFS)
NewsForge. Sun’s ZFS builds on promise of RAID
(http://os.newsforge.com/os/06/01/11/1921211.shtml?tid=16)
Jeff Darcy. In ZFS’s Defense, RAID-Z Redux, No More Mr. Nice Guy, ZFS Again,
ZFS. Canned Platypus (http://pl.atyp.us/wordpress/?p=1009)
Dave Hitz, James Lau, and Michael Malcolm (Network Appliance). File System
Design for an NFS File Server Appliance.
Sun Microsystems. ZFS Administration Guide, March 2006.
Sun Microsystems. ZFS On-Disk Specification (Draft 12/9/2005).
Eric Schrock. Ztest on Linux. Eric Schrock's Weblog
(http://blogs.sun.com/roller/page/eschrock?entry=ztest_on_linux)
Thank you