You are on page 1of 42

<Insert Picture Here>

Extreme Performance with Oracle Data Warehousing

Distributed Data Marts and Servers


Expensive Data Warehouse Architecture

Data Marts

Online Analytics Data Mining

ETL

Consolidate onto a Data Warehouse


Single Source of Truth on Low-Cost Servers and Storage

Data Marts

Online Analytics Data Mining

ETL Oracle Database 11g

Choice of Data Warehouse Solutions


Custom Solutions Reference Configurations Database Machine

Flexibility for the most demanding data warehouse

Best-practice configurations for data warehousing

Complete system, including software, servers, networking and storage

Drastically Simplified Deployments


Database Machine eliminates the complexity of deploying database systems
Months of configuration, troubleshooting, tuning

Database Machine is ready on day one


Pre-built, tested, standard, supportable configuration

Runs existing applications unchanged

Months to Days

Extreme performance out of the box

Best Data Warehouse Machine


Massively parallel high volume hardware to quickly process vast amounts of data
Exadata runs data intensive processing directly in storage

OLAP

Most complete analytic capabilities


ETL
OLAP, Statistics, Spatial, Data Mining, Real-time transactional ETL, Efficient point queries

Data Mining

Powerful warehouse specific optimizations


Flexible Partitioning, Bitmap Indexing, Join indexing, Materialized Views, Result Cache

Why is Exadata Fast?


Exadata off-loads data intensive processing to the storage
Row filtering based on where predicate Column filtering Join filtering Incremental backup filtering Storage Indexing Scans on encrypted data Data Mining model scoring

Database Machine delivers a high speed IO subsystem Exadata delivers smart flash cache for all workloads

Simple Query Example


What were my sales yesterday? Optimizer Chooses Partitions and Indexes to Access

Exadata Storage Grid

Oracle Database Grid


Select sum(sales) where Date=24-Sept

Scan compressed blocks in partitions/indexes Retrieve sales amounts for Sept 24

SUM

10 TB scanned 1 GB returned to servers

Exadata Hybrid Columnar Compression


Data is stored by column and then compressed Query Mode for data warehousing
Optimized for speed 10X compression ratio is typical Scans improve proportionally

Up To

Archival Mode for infrequently accessed data


Optimized to reduce space 15X compression is typical Up to 50X for some data

50X

Exadata Storage Index


Transparent I/O Elimination with No Overhead Table
A B C D 1 3 5 5 8 3

Index

Exadata Storage Indexes maintain summary information about table data in memory
Store MIN and MAX values of columns Typically one index entry for every MB of disk

Min B = 1 Max B =5 Eliminates disk I/Os if MIN and MAX can never match where clause of a query Min B = 3 Completely automatic and transparent Max B =8

Select * from Table where B<2 - Only first set of rows can match

Benefits Multiply

10 TB of user data Requires 10 TB of IO

1 TB with compression

100 GB with partition pruning

Subsecond On Database Machine


20 GB with Storage Indexes 5 GB Smart Scan on Memory or Flash

Data is 10x Smaller, Scans are 2000x faster

Flash
60 50

Query Throughput

Query Throughput with Flash

Flash storage more than doubles scan throughput


50 GB/sec

Query Throughput Uncompressed Data

50

40 30
21

20

Combined with Columnar compression


Up to 50 TB of data fits in flash Queries on compressed data run up to 500 GB/sec

11.4

10 0
HITACHI
USP V

7.5

10

TERADATA
2550

NETEZZA SUN ORACLE


TwinFin 12 Database Machine

Database Machine Success


Database Machine is succeeding in all geographies and industries against every competitor

Oracle Database Machine


What Customers Think

Please see notes!

Sun Oracle Database Machine


Extreme performance for data warehousing, powered by Sun Oracle Exadata Storage Servers

Extreme Performance

Pre-Installed and Pre-Configured

Enterprise ready for rapid customer success and high return on investment

Complete Functionality

Complete functionality, with in-database ETL, analytics (OLAP, Data Mining) and more

Oracle Data Integration


Covers the Entire Data Integration Spectrum SOA Oriented Workloads
Data as a Service Event Based Processing

Real-Time and Operational Workloads


Heterogeneous Change Data Capture Heterogeneous Real-Time Data Processing

High-Volume Batch Workloads


Extreme Performance through ELT Architecture Optimized for Loading Oracle Data Warehouses

Heterogeneous Data Quality Solutions

Oracle GoldenGate
Change Data Capture, Replication and Real-Time
Zero Downtime Upgrade & Migration New Application/ Database/Hardware Applications Disaster Recovery, Query Offloading

Database

Oracle GoldenGate

Standby/Replica

Data Synchronization across the Enterprise Distributed Systems Real-time BI, Operational Reporting ODS/Data Warehouse

Legacy

Oracle Data Integration


Pervasive Data Integration Embed Data Integration within Oracle Database
Integrated, optimized and Best for Oracle Database Easiest way to load external information into Oracle Database

Provide Comprehensive Data Integration


Comprehensive Heterogeneous Technology Foundation Integrated Runtime, Data Management Tools and Administration Best of Breed: significant architectural differentiators vs. competitors Broad support of sources & packaged applications

Pre-Integrate Solutions for Oracle Portfolio


Make data integration pervasive with Lower Cost & Complexity

Data Integration Platform


Leveraging in-database processing for ETL, Oracle outperforms best-of-breed tools

Best in-class Performance

Cover Full Data Integration Spectrum

Oracle offers a comprehensive portfolio to solve all enterprise data integration problems, including SOA, real-time and batch ETL

Fully Integrated

ETL and Data Quality Integrated with Oracle Database, Oracle Business Intelligence and Performance Management Solutions

Oracle Runs the Largest Databases

Website Personalization
500,000,000 unique users 200 Terabyte Data Warehouse

Meteorological Research
220+ Terabyte Oracle database Worlds largest database on Linux

Star Query Optimization


Specific Data Warehouse Access Methods
Q: What was the total number of umbrellas sold in Boston during the month of May 2008?

Customers Sales

Times

Products

Channel

Optimization technique that rewrites SQL No application changes required

The Concept of Partitioning


Simple yet powerful
SALES SALES SALES Europe USA Jan Large Table
Difficult to Manage

Feb

Jan

Feb

Partition
Divide and Conquer Easier to Manage Improve Performance

Composite Partition
Higher Performance More flexibility to match business needs

Partition for Performance


Partition Pruning
Q: What was the total sales amount for May 20 and May 21 2009?

Sales Table
5/17 5/18 5/19 5/20 5/21 5/22

Select sum(sales_amount) From SALES Where sales_date between to_date(05/20/2009,MM/DD/YYYY) And to_date(05/22/2009,MM/DD/YYYY);

Only the 2 relevant partitions are read

Partition to Manage Data Growth

SALES TABLE (7 years)

2003

2008

2009 5% Active

95% Less Active

Low End Storage Tier 2-3x less per terabyte

High End Storage Tier

In-Memory Parallel Execution


Challenge Disk speeds are not keeping up with Moores law while CPU and Memory are Parallel Execution in the traditional sense takes advantage of the IO capacity (e.g. disks) of a system Solution Use the faster components of a system for parallel processing:
Harness the memory capacity of the entire system Associate only fragments of an object in the memory of a node in the system

Automated Parallelism
Challenge Balance Flexibility with Simplicity Get Optimal Performance for a Variety of Workloads Never Overload the System Solution: Optimizer automatically derives the DOP from the statement (for Query, DML and DDL) DOP based on resource requirements for all scan operations Statement Queuing to ensure appropriate resources are available

Summary Management
Materialized Views

Separate database object


Stores pre-calculated and aggregated results

Database supports sophisticated transparent query rewrite


Queries don't change run against base tables Join-backs, additional aggregations etc. supported

Database supports incremental fast refresh


Based on the query definition

Summary Management
Refresh time for conventional insert, aggregate MV
120

100

Refresh time new vs. old [%]

80

60

MV refresh 11.2 MV refresh pre 11.2

40

20

0 20000 50000 250000 number of inserted rows 500000 750000

On average 30 40% better refresh performance in 11g Release 2

Summary Management
Cube Organized Materialized Views
SQL Query Region Date

Summaries

Query Rewrite

Product

Channel Automatic Refresh OLAP Cube

Data Warehousing Platform


Oracle runs the largest data warehouses in the world delivering value to thousands of customers

Proven Performance and Scalability

Best in-class Agility

Any workload, any query requirement, any number of users in an ever changing environment

Enterprise Ready

A complete data warehouse platform including advanced security, high availability and superior functionality

In-Database Analytics
Extreme Analytics
Oracle OLAP Multi-dimensional queries Summary management Embedded and integrated Secure Oracle Data Mining Advanced predictive analytics Advanced algorithms Embedded and integrated Secure

Analyze and Summarize

Uncover and Predict

Oracle OLAP
A full-featured multidimensional OLAP server
Excellent query performance for ad-hoc / unpredictable query Enhances the analytic content of Business Intelligence applications Fast, incremental updates of data sets

A summary management solution for SQL-based Business Intelligence applications


An alternative to table-based materialized views Offers improved query performance and fast, incremental update

Embedded in the Oracle Database instance and storage


Safe, secure and manageable Fully compatible with Grid Computing/Real Application Clusters

Analytics in Spreadsheets

Dimensional calculations Bubble-up functions

Hierarchical navigation and aggregation

Trend indicators

Oracle Data Mining


Retail
Customer segmentation Response modeling

Telecom
Customer churn Network intrusion

Banking
Credit scoring Possibility of default

Public Sector

High Education
Alumni donations Student acquisition

Find hidden patterns Discover new insights Make predictions about the future

Tax fraud Crime analysis

Manufacturing
Warranty analysis Reliability analysis

Healthcare
Patient outcome prediction Fraud detection

Chemical
Molecule clustering Yield analysis

In-Database Data Mining


Better Information for OBI EE Reports & Dashboards

ODMs predictions & probabilities are available in the Database for reporting using Oracle BI EE and other tools

In-Database Data Mining


Exadata Smart Scan

Model scoring performed in the Exadata Storage Server Dramatic performance improvements
SELECT cust_id from customers WHERE region = US AND prediction_probability (churnmod, Y, using *) > 0.8;

Additional smart scan capabilities

Data Analytics Platform


In-Database analytics deliver results faster by eliminating data movement and leveraging Oracles scalability

Better Time To Insight

Enterprise Ready

Fully embedded analytics deliver enterprise scalability, security and maintainability

Lower Total Cost of ownership

No duplication of data and open standards hardware and software deliver a lower TCO compared to proprietary solutions

Oracle Retail Data Model


Built using industry standards
3NF logical data model Retail-specific measures & KPIs Pre-built OLAP models Pre-built Data Mining models Sample reports and dashboards

Oracle Retail Data Model Compression Data Mining Partitioning

Quick and easy to deploy


Minimize time-to-value

Oracle Database 11g Oracle Database Machine

Optimized for Oracle


Best-in-class data warehousing technology Leverage all Oracle Exadata functionality

OLAP

RAC

Oracle Data Warehousing


Complete, Open and Integrated
BI Applications BI Tools Data Models Database Operating System Smart Storage

Standard components Comprehensive security Higher availability Easier to manage Lower cost of ownership

Oracle #1 for Data Warehousing

Source: IDC, July 2009 Worldwide Data Warehouse Management Tools 2008 Vendor Shares

Oracle Data Warehousing


What customers think

Please see notes!

For More Information

http://search.oracle.com
data warehouse

or oracle.com/datawarehouse

You might also like