Professional Documents
Culture Documents
0: a General Overview
jtrujillo@dlsi.ua.es
Juan Trujillo
Content
Introduction BI & Data Warehouses in a Nutshell Basic Concepts related to BI 2.0 Influence from the Web on BI Technical Challenges of the new BI 2.0 General Overview of Tools Stepping Towards BI 2.0 Conclusions
Juan C. Trujillo Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011 2
Content
Introduction BI & Data Warehouses in a Nutshell Basic Concepts related to BI 2.0 Influence from the Web on BI Technical Challenges of the new BI 2.0 General Overview of Tools Stepping Towards BI 2.0 Conclusions
Juan C. Trujillo Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011 3
Introduction
The use of Business Intelligence solutions has been steadily increasing
In the recession period, the BI market grew 4%
BI allows the business to gain a competitive edge by analyzing the data of the organization
Juan C. Trujillo Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011 4
Introduction
Traditional technologies to support BI processes range from Data warehouses to OLAP and Data mining.
These technologies allow to query the organizations internal data
However, a new trend has emerged: analyzing data from outside the organization.
Juan C. Trujillo Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011 5
Introduction
For example, including information like:
Retail prices of products sold by competitors Opinions from customers
Result: Richer analysis and better support for the decision-making process
Juan C. Trujillo Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Introduction
However this trend is bidirectional
As BI applications include information from the Web, These applications have also been evolving towards web technologies.
Juan C. Trujillo Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Introduction
Evolution driven by technologies appeared in the Web 2.0:
Social Networks (e.g. Facebook) Graph and linked data Interactive Web applications Cloud computing Collaborative Networks Process Intelligence Software as a Service
Juan C. Trujillo Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011 8
Introduction
Some authors call it BI 2.0, others BI 3.0
Which are the common aspects that define the new BI?
How is the web affecting BI and which new features are being included from this influence?
Juan C. Trujillo Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011 9
Introduction
Which technical challenges must be overcomed?
Which are already solved, which require further research?
Juan C. Trujillo 10 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Introduction
A first look into a BI 2.0 architecture
Juan C. Trujillo 11 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Content
Introduction BI & Data Warehouses in a Nutshell Basic Concepts related to BI 2.0 Influence from the Web on BI Technical Challenges of the new BI 2.0 General Overview of Tools Stepping Towards BI 2.0 Conclusions
Juan C. Trujillo 12 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Result:
Vendors implement the logical models Star Schema, Snowflake, Fact Constellation,
[Kimball, 96]
Juan C. Trujillo 14 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Composite Key
Juan C. Trujillo 15 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 16 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 17 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Fact
Juan C. Trujillo 19 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
However, the RE stage allows us to identify and guarantee that the analysts needs are met
Juan C. Trujillo 22 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 23 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Considers both user requirements and data sources Hybrid approaches allow us to identify problems in early stages
Juan C. Trujillo 24 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 25 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 26 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Content
Introduction BI & Data Warehouses in a Nutshell Basic Concepts related to BI 2.0 Influence from the Web on BI Technical Challenges of the new BI 2.0 General Overview of Tools Stepping Towards BI 2.0 Conclusions
Juan C. Trujillo 29 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
All the information used must be fresh and up-to-date Exceptional situations previously unknown
E.g. Sales data without the list of products
Juan C. Trujillo 30 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 31 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Software is now consumed as a remote service Use of Service Oriented Architecture (SOA) and SOA Protocol (SOAP) for interoperability Recently applied to BI solutions, resulting in BI as a service
Juan C. Trujillo 32 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Integration of several, heterogeneous elements into a network Middleware provides homogeneous interface
Services provided consumed through SaaS
Salesman Problem
Each individual contributes with a little effort to the global goal Depending on how the crowd is organized, the collective intelligence can achieve better solutions than a single expert
Juan C. Trujillo 35 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
The most relevant data are the contributions from the participants and the relationships between them
Juan C. Trujillo 36 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Knowing the relationships between each piece of data and the rest In order to be able to reason and infere knowledge, the relationships must be semantically tagged Allows to obtain knowledge automatically
Juan C. Trujillo 37 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Describing the general feelings of a group of people towards a certain element Requires to analyze unstructured data, understand its content and obtain a conclusion Highly relevant to identify how customers perceive products
Juan C. Trujillo 38 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Point of view focused on Business processes and their logic Tries to relate the stored data to the process performance
Extensions of BPMN 2.0
Content
Introduction BI & Data Warehouses in a Nutshell Basic Concepts related to BI 2.0 Influence from the Web on BI Technical Challenges of the new BI 2.0 General Overview of Tools Stepping Towards BI 2.0 Conclusions
Juan C. Trujillo 40 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 45 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Traditionally, decisions would be taken by executives in an isolated manner However, it has been proposed that it is better to take decisions using collective intelligence or even crowdsourcing
Often, employees have relevant knowledge regarding a specific problem
Juan C. Trujillo 49 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Users should be able to make annotations and enrich the data with relevant information
Juan C. Trujillo 51 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
However, decision-makers wish to use the data to identify which strategies are working and which ones are not
Juan C. Trujillo 52 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Other proposals relate data directly to business goal models or to business process models
Juan C. Trujillo 53 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 54 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Content
Introduction BI & Data Warehouses in a Nutshell Basic Concepts related to BI 2.0 Influence from the Web on BI Technical Challenges of the new BI 2.0 General Overview of Tools Stepping Towards BI 2.0 Conclusions
Juan C. Trujillo 56 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 58 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 59 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 60 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Solution?
Alter the way of capturing data Obtain the information simultaneously as it is stored in transactional databases (trickle-feed)
Juan C. Trujillo 61 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
The ETL process is transformed into a modeled parallel flow of data towards the DW
Information may be incomplete at certain points Important to model unexpected flows (exceptions)
Juan C. Trujillo 62 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 63 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 66 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Two approaches:
Public Clouds (i.e. Amazon, Azure Cloud, iCloud) Private Clouds (with your own high-end servers!)
Juan C. Trujillo 68 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 69 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Unstructured data
NLP Is Hadoop the right solution ?
Juan C. Trujillo 70 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Create a separate DW for Web information Link this information as a detailed view after generating the analysis cube
Juan C. Trujillo 71 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 72 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
answer
Juan C. Trujillo 75 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 76 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 77 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 78 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 79 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
They are the basis for analysis and structuring the information
Juan C. Trujillo 80 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 82 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Content
Introduction BI & Data Warehouses in a Nutshell Basic Concepts related to BI 2.0 Influence from the Web on BI Technical Challenges of the new BI 2.0 General Overview of Tools Stepping Towards BI 2.0 Conclusions
Juan C. Trujillo 84 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 85 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Limitations:
Limited predictive analysis support Interaction and collaboration between users? Integration of business processes?
Juan C. Trujillo 87 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Limitations:
Designing and integrating dashboards requires some effort Interactivity and data enrichment? Integration of business processes?
Juan C. Trujillo 88 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 89 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Limitations:
Predictive analysis support? Business processes?
Juan C. Trujillo 90 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Limitations:
Predictive analysis? Collaborative BI? Business processes?
Juan C. Trujillo 91 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 92 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 93 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 94 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 95 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 97 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Juan C. Trujillo 98 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Content
Introduction BI & Data Warehouses in a Nutshell Basic Concepts related to BI 2.0 Influence from the Web on BI Technical Challenges of the new BI 2.0 General Overview of Tools Stepping Towards BI 2.0 Conclusions
Juan C. Trujillo 100 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Conclusions
BI 2.0 has to deal with several aspects:
Real-time analysis Intuitive and interactive analysis from anywhere Collaboration between decision-makers Linking and enriching data Focusing on the immediate future
Juan C. Trujillo 101 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Conclusions
Further research needs to be done in:
Predictive algorithms with strong time restrictions Identify the most effective way of presenting the data Develop a series of best practices when taking decisions in a collaborative manner Process Intelligence and Process Mining
Juan C. Trujillo 102 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Conclusions
Further research needs to be done in BI:
Juan C. Trujillo 103 Alejandro Mat First European Business Intelligence Summer School (eBISS), Ec. Centrale, Paris, 2011
Conclusions
In our Lucentia Research Group: Process Intelligence
Mining processes Linking processes to data from BPMN 2.0
Traceability of users requirements BI 2.0 security and quality Web Warehouses Advanced visualization techniques More:
http://www.lucentia.es Recent publications on DB&LP
104
Conclusions
Lucentia BI Tool To be extended + Users requirements for BI 2.0 + Automatic transformations adapted + Reverse engineering adapted to new BI 2.0 sources (e.g. docs,) + PIM, PSM adapted + New traces
105
Juan Trujillo