Real-Time Analytics
#vmworldapps
Disclaimer
Technical feasibility and market demand will affect final delivery. Pricing and packaging for any new technologies or features
discussed or presented have not been determined.
Chart: data volume over time. Vertical axis (data volume): Terabyte, Petabyte, Exabyte, Zettabyte (2011). Horizontal axis (time): Mainframe, PC, Internet, Mobile, Machine. Data types shown: Transactions, Interactions, Machine To Machine.
Chart based on IDC and UC Berkeley Data Growth Estimates. Source: IDC & CosmoBC.com: http://techblog.cosmobc.com/2011/08/26/data-storage-infographic/
Velocity: 10s of Billions of Daily Records
Volume: From Terabytes to Petabytes
Variety: Multi-Structured
Value ($): Business Insights
Customer profiles
2. Data engineers/analysts
Need: out-of-the-box + some customization
Designed for: admin + operations
3. Data scientists
Need: power capabilities + heavy customization
Designed for: data scientists
4. IT, Operations
Need: out-of-the-box + some customization
Designed for: IT/admin, ops
Cetas
SQL Interface
Apps
Category: Online App Analytics
Problem:
Provide real-time visibility into business for companies
Insights and interpretation of Big data
Use Cases:
Customer Behavior Analytics and Audience Segmentation
Grow revenue and focus on monetization
Grow traffic, user engagement and user acquisition
Event correlation
Use (graph-based or vector-based) clustering
Causality Analysis
Link analysis
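As a rough illustration of graph-based clustering for event correlation, the sketch below links events into a graph and reports its connected components as clusters. The linking rule (same `key`, timestamps within a window) and the field names `key` and `ts` are assumptions made for this example; the slides do not say how Cetas correlates events.

```python
from collections import defaultdict


def correlate_events(events, window_seconds=60):
    """Group events into clusters via connected components.

    Assumption for this sketch: two events are linked when they share the
    same `key` and their timestamps differ by at most `window_seconds`.
    Returns a list of clusters, each a sorted list of event indices.
    """
    # Build an adjacency list over event indices.
    neighbors = defaultdict(set)
    for i, a in enumerate(events):
        for j in range(i + 1, len(events)):
            b = events[j]
            if a["key"] == b["key"] and abs(a["ts"] - b["ts"]) <= window_seconds:
                neighbors[i].add(j)
                neighbors[j].add(i)

    # Depth-first search collects each connected component once.
    seen, clusters = set(), []
    for start in range(len(events)):
        if start in seen:
            continue
        stack, component = [start], []
        seen.add(start)
        while stack:
            node = stack.pop()
            component.append(node)
            for nxt in neighbors[node]:
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
        clusters.append(sorted(component))
    return clusters
```

Events that chain together (A near B, B near C) land in one cluster even when A and C are farther apart than the window, which is the usual behavior of component-based grouping.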
What-if Analysis, Machine Learning, Statistical/Math Modeling, Support for External Models (R, Mahout)
Static & Dynamic (AutoDerived) Clusters, Product Cohorts, Cluster Churn over time, Market Basket Analysis
Did you know the top 10 countries are
Data source 1
Data source 2
Users whose virality > 1 declined 13% in Dec
The male population aged 18-34 plays Level 3 and buys $18 worth of virtual goods
Pattern Extraction, Autoderived data insights
Interactive Search-based Analytics, Filtering and Drilldowns
Aggregate Metrics standard and custom, MultiDimensional Graphs, TimeSeries, Click-Pathing, Funnel Conversion
Number of invites and number of downloads are both down in Jan
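The funnel conversion metric named above can be sketched in a few lines. This is a simplified illustration, not the product's implementation: the step names, the `user_events` structure, and the rule that a user must complete every earlier step are all assumptions.

```python
def funnel_conversion(steps, user_events):
    """Count users reaching each funnel step and the step-to-step rate.

    `steps` is an ordered list of step names. `user_events` maps a user id
    to the set of steps that user completed. A user counts toward a step
    only if they also completed every earlier step (event ordering in time
    is ignored in this simplified sketch).
    Returns a list of (step, user_count, rate_from_previous_step) tuples;
    the rate for the first step is None.
    """
    counts = []
    remaining = set(user_events)
    for step in steps:
        # Keep only users who completed this step as well.
        remaining = {u for u in remaining if step in user_events[u]}
        counts.append(len(remaining))

    report = []
    previous = None
    for step, count in zip(steps, counts):
        if previous is None:
            rate = None
        else:
            rate = count / previous if previous else 0.0
        report.append((step, count, rate))
        previous = count
    return report
```

With four visitors, two of whom sign up and one of whom then purchases, the report shows a 50% conversion at each of the last two steps.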
Search
Recommendation engine
Predictive analytics
Data modeling
Analytics
Customer deployment
Application
APIs, SDK
JS
Cetas
Seamless ingestion thru app integration
Rapid parsing, processing
Multidimensional analytics
Transactional data
RDBMS
ETL
DW
Short-term data
Historical data
Batch Processing
Discover multiple sources of live data
Navigate data & perform multi-dimensional analysis
Drill into and analyze time-trends
Auto-discover key insights
Use pre-defined measures to measure business metrics & KPIs
Custom define new functions, as needed
Create dynamic dashboards
Take instant action to monetize users/customers optimally
Customer On-Boarding
Step 1: Self-registration & login by customer
Step 2: Setting up of live and batch data feeds
Step 3: Raw dimension and time analytics
Step 4: Development of custom measures
Step 5: Creation of dashboards & taking action
Value proposition
Salesforce data has been trusted to the cloud
Payroll data has been trusted to the cloud
HR data has been trusted to the cloud
In fact, a lot more of your data has been trusted to the cloud
What's next?
Analytics-as-a-service
To provide the right information, at the right level, to the right user, at the right time, to create the right insights and make the right decision.
Challenges
Hypergrowth: data, users, business lines, products
No downtime: there's always someone analyzing something
Self Serve vs. Concierge: different styles in different depts and geographies
Technology/Skill Balance: there's always something new
Apploranges (Apples + Oranges): Data Integration is difficult; different atomic levels of data, different subject areas
Our Business Needs Information that is Cleaner, Cheaper, Faster, and Integrated
POCS
We do 10-12 proof-of-concepts per year now.
Virtualize
We use public, private, and hybrid clouds to bring data together from in-house and SaaS systems.
Compartmentalization
Just as Apple consumers are now accustomed to singularly focused apps, we have started to see report suites and OLAP capabilities that must be specialized for deep reactive and predictive analysis yet remain linked to a larger ecosphere of data.
Just as water sustains life and enables the growth of organisms, data sustains the growth of organizations and enables insights.
Visualize
We have been experimenting with various visualization techniques to turn the data sideways to see if it reveals something different/new.
DMTOF
We have been testing data marts on-the-fly, which let users quickly save subsets of enterprise data (columns, rows, subject areas, time frame) as static and dynamic snapshots in a temporary cloud space for custom analysis.
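One way to picture the difference between static and dynamic snapshots is sketched below. Everything here is hypothetical: the `MartSpec` fields, the `date` column, and the reading of "static" as a one-time copy and "dynamic" as a saved query that is re-run on demand are assumptions, since the slides do not describe the mechanism.

```python
import copy
from dataclasses import dataclass
from typing import Callable


@dataclass
class MartSpec:
    """Hypothetical description of a data-mart subset."""
    columns: list          # column names to keep
    row_filter: Callable   # predicate applied to each row dict
    start: str             # inclusive ISO date (compared as strings)
    end: str               # inclusive ISO date (compared as strings)


def select(rows, spec):
    """Apply the time frame, row filter, and column projection."""
    return [
        {c: row[c] for c in spec.columns}
        for row in rows
        if spec.start <= row["date"] <= spec.end and spec.row_filter(row)
    ]


def static_snapshot(rows, spec):
    """Materialize the subset once; later source changes are not reflected."""
    return copy.deepcopy(select(rows, spec))


def dynamic_snapshot(rows, spec):
    """Return a callable that re-evaluates the subset against current data."""
    return lambda: select(rows, spec)
```

Under these assumptions, a static snapshot keeps its row count after new source rows arrive, while calling the dynamic snapshot again picks them up.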
We have started using replication in lieu of ETL to put the data AS PHYSICALLY CLOSE to the users as possible in order to eliminate single points of failure (SPOF), speed access, and provide autonomy and policing.
Storm Clouds
The use of clouds is accelerating
End-user focused
Interactive, fast, simple
Drill-downs, pre-defined and custom analytics
Self-service (mostly!); no need for specialized skills
www.cetas.net
Product showcase
Data sources
Date fields
Start Dt:
End Dt:
BEFORE
Filter
AFTER
Dynamic Dashboard
Project container
Export
Q&A
APP-CAP2985