Professional Documents
Culture Documents
AGENDA
Quick orientation
Columns
Projections
Clustering
Hybrid storage
Special features
JDBC/ODBC drivers
Concepts: column-oriented
Vertica stores data as columns, instead of each row as unit
Allows for efficient data compression
Concepts: column-oriented
ROWS VS. COLUMNS
2014-03-15
23.43
2014-03-15
23.43
2014-03-15
23.97
2014-03-15
23.97
2014-03-15
24.51
2014-03-15
24.51
2014-03-15
25.05
2014-03-15
25.05
2014-03-15
25.59
2014-03-15
25.59
2014-03-16
26.13
2014-03-16
26.13
2014-03-16
26.67
2014-03-16
26.67
2014-03-16
27.21
2014-03-16
27.21
2014-03-16
27.75
2014-03-16
27.75
2014-03-16
28.29
2014-03-16
28.29
Concepts: column-oriented
RUN LENGTH ENCODING
2014-03-15
23.43
2014-03-15
23.43
2014-03-15
23.97
(5 times)
23.97
2014-03-15
24.51
24.51
2014-03-15
25.05
25.05
2014-03-15
25.59
25.59
2014-03-16
26.13
2014-03-16
26.13
2014-03-16
26.67
(5 times)
26.67
2014-03-16
27.21
27.21
2014-03-16
27.75
27.75
2014-03-16
28.29
28.29
Concepts: column-oriented
SKIP UNWANTED COLUMNS
date
value
id
2014-03-15
23.97
2014-03-15
24.51
2014-03-15
25.05
2014-03-15
25.59
2014-03-16
26.13
2014-03-16
26.67
2014-03-16
27.21
2014-03-16
27.75
2014-03-16
28.29
Concepts: projections
Data physically stored in projections
Projections similar to materialized views
Data optimized for querying during insert
Table has one or more projections
Projection contains one or more columns
Data can be duplicated in projections for query efficiency
Concepts: projections
ONE DATA, MANY PROJECTIONS
Sorted by date
Sorted by id
2014-03-15
23.43
2014-03-16
27.21
2014-03-15
23.97
2014-03-15
23.43
2014-03-15
24.51
2014-03-16
27.75
2014-03-15
25.05
2014-03-15
23.97
2014-03-15
25.59
2014-03-16
26.67
2014-03-16
26.13
2014-03-15
25.05
2014-03-16
26.67
2014-03-15
24.51
2014-03-16
27.21
2014-03-15
25.59
2014-03-16
27.75
2014-03-16
26.13
2014-03-16
28.29
2014-03-16
28.29
Concepts: clustering
Parallel processing
Data segments distributed across cluster nodes
Reliability (K-safety)
Tolerates nodes going offline
All nodes can respond to queries queries can be load
balanced between nodes
Concepts: clustering
SEGMENTATION
Node 2
Node 4
SEGMENT2
SEGMENT4
Node 1
Node 3
SEGMENT1
SEGMENT3
Concepts: clustering
K-SAFETY
Node 1
Node 2
Node 4
SEGMENT2
SEGMENT4
SEGMENT3
Node 3
SEGMENT1
SEGMENT3
SEGMENT2
SEGMENT4
SEGMENT1
No encoding or compression
user
action
enter
browse
browse
purchase
enter
browse
enter
browse
purchase
PATTERNS IN DATA
Extending Vertica
www.vertica.com/documentation
db.csail.mit.edu/projects/cstore/
THANKS!
Autom
at
nivety isoi tai
ja mu
ita
kirjoit
Eficode uksia
n blogis
sa.
E
FICODE
.FI/BLO
GI