Professional Documents
Culture Documents
Goals:
1. Explain potential advantages and use cases of BI&A (Business Intelligence and Analytics)
a. Need ability to make clear cut decision
b. BI&A is about reporting for the masses, advanced BI frontends, real-time access time,
planning capabilities, closed loop performance management
2. Describe the basic concepts of decisional making and decisional guidance
a. Decision making is the process of sufficiently reducing uncertainty and doubt about
alternatives to allow a reasonable choice to be made from among them Four steps:
i. Intelligence: search for conditions that call for decisions
ii. Design: invent, develop, and analyze possible alternative solutions
iii. Choice: select one of the solutions
iv. Implementation: adapt the selected course of action to the decision situation
3. Characterize the purpose of decision support systems and business intelligence systems
a. Decision Support System is a computer-based information system that supports DM
activities Pro: Speedy computations, improved communication/collaboration, improved
data management, quality and agility support, overcoming cognitive limits
b. Business Intelligence and Analytics refers to techniques, technologies, systems, etc.
that analyze critical business data to help an enterprise better understand its business
and market and make timely business decisions
4. List the most important components of BI&A systems
a. Slide 42?
b. Data Warehouse Environment (Technical Staff)
c. Business Analytics Environment (Business users) with user interface
d. Performance and Strategy (Managers/executives) with user interface
e. Features provided: reporting, dashboard, office integration, search-based BI, mobile BI,
OLAP, interactive visualization, scorecards, metadata management, collaboration, etc..
5. Characterize the BI software market and list major players in it
a. Complete BI platforms:
i. IBM, SAP, SAS, Oracle, MS, MicroStrategy
b. Focus on selected BI strategies:
i. InfoZoom, tableau, QlikView, MIS, .
c. Landscape continuously increases, but only few visionaries
Lecture 2:
1. Understand the basic definitions and concepts of data warehousing
a. A data warehouse is a pool of data produced to support decision making Data are
usually structured to be available in a form ready for analytical processing.
b. Characteristics: Subject oriented, integrated, time variant, nonvolatile,
relational/multidimensional, client/server, include metadata
c.
d. Direct Benefits:
i. allows end users to perform extensive analysis
ii. consolidated view of corporate data
iii. better and more timely information access
iv. enhanced system performance
e. Indirect benefits:
i. Enhance business knowledge
ii. Create competitive advantage
iii. Enhance customer satisfaction
iv. Facilitate DM
v. Optimizing business processes
2. Understand data warehousing architectures
a. Elements:
i. Data warehouse with data and associated software
ii. Data acquisition (back-end that extracts data from ERP systems and external
sources and summarizes them
iii. Client (front-end) software that allows access and analyzes data
b. Concepts:
i. Data Sources: contains data to be loaded into DW (ERP, Web Services)
ii. Enterprise DW is a centralized repository for the entire enterprise
iii. Data Mart is a departmental DW that stores only relevant data
c. Factors that can affect the architecture:
i. Information interdependence between units, management needs, urgency of need
for DW, constraints on resources, compatibility, technical issues, social factors
d. Architecture selection: Kimball vs Inmon
i. Kimball views data warehousing as a constituency of data marts. Data marts are
focused on delivering business objectives for departments in the organization. And
the data warehouse is a conformed dimension of the data marts. Hence a unified
view of the enterprise can be obtained from the dimension modeling on a local
departmental level.
ii. Inmon beliefs in creating a data warehouse on a subject-by-subject area basis.
Hence the development of the data warehouse can start with data from the online
store. Other subject areas can be added to the data warehouse as their needs
arise. Point-of-sale (POS) data can be added later if management decides it is
necessary.
iii.
Overall
approach
Architectur
e structure
Complexity
Developm
ent
methodolo
gy
High
Iterative
EDW Approach
Inmon
Top-Down
Low
Step-wise
4. Explain the extraction, transformation, and load (ETL) processes Data provision process
a. Extraction (reading data from database)
i. Synchronous access / asynchronous access
ii. File based extraction / stream based extraction
iii. Full extraction / delta extraction
iv. Usage of filters / no filters
v. Standard extractors / custom extractors
b. Transformation (conversion of data into needed form)
i. Filtering (e.g. filter all deleted orders)
ii. Harmonization (e.g. resolve master data inconsistencies)
iii. Enrichment (e.g. calculate new facts from existing ones)
iv. Aggregation (e.g. by minimizing a dimension)
c. Load (place data in DW)
i. During this phase data is updated
ii. Alternatives: Full vs Delta load, daily/weekly/
iii. Has to be customized to the chosen model
iv. Data quality mechanisms are often implemented to be triggered during the load.
d. Usually a triggered (automated) process, logs and monitoring helps to find errors
automation is very important and can require large portions of the project effort
5. Metadata Management
a. Metadata is information about data (tables, columns)
b. Master data is the opposite of transactional data, one entry per legal entity
c. Transactional data is the opposite of master data, one entry per transfer
ii. Technical: technical description of data assets (tables, data stores, physical
attributes)
iii. Operational: monitor job execution (statistics about processes)
f. Benefits:
i. Build common understanding of data
ii. Facilitate the quest for data quality
iii. Support discovery and reuse of data
iv. Analyze dependencies
v. Facilitate future changes
vi. Monitor Usage
Lecture 3,4:
1. Understand the concept of Big Data and some typical applications
a. BD is about leveraging the extended capabilities to analyze information and emphasize
diverse data sources and formats, volume only secondary role atm.
b. Four dimensions of BD that have to be differentiated:
i. Volume: Data at large scale
ii. Variety: Data in many forms (structured, unstructured, text)
iii. Velocity: Data in motion (analysis of streaming data to enable decisions in
fractions of a second)
iv. Veracity: Data uncertainty (managing the reliability and predictability of inherently
imprecise data types
2. Explain the technological changes which foster Big Data Analysis
a. Increasing data storage capacity from analog to digital
b. Increasing computation capacity
c. Increasing data collection
d. New, emerging data sources such as social networks
e. Digitization and connection of traditionally physical devices
f. Technological changes and changes in the use of existing technology processing 32
64bit, multicore, disk> in-memory storage, data organization: row column,
vectors dictionary encoding very important because data is only stored
once and not every time a new item is created
g. Slide 25 as example?
3. Differentiate Big Data technologies from traditional BI approaches
a. Traditional vs. Big Data approach (structured, analytical, logical vs. creative, holistic
thought, intuition)
Business Users determine what question to
ask
IT structures the data to answer that question
Traditional Computing
Historical fact finding, find stored information,
query data results
Problems of traditional DW
Data might be outdated before users are able
to analyze it, Data rates and volumes are too
big for storing and subsequent analysis
Slide 53?
4. Depict the implications of this technology and its application
a. Basics:
i. Same data can be represented using different data models pick the one that
supports the app
ii. Data model defines a way of representing data in a db and how an app accesses
the data (relational data model is most common)
iii. Two data model design: denormalized (embedded data models) and normalized
Denormalized data model
(allows storage of related pieces of data
information in the same database record)
d.
Business reporting
Analytical reporting
iii. Prescriptive Analytics: Recommend one or more causes of action and show mostlikely outcome of the action actionable data and feedback required in order to
learn continuously
d. Definition: Text mining is the application of data mining to non-structured or
less structured text files. It entails the generation of meaningful numerical
indices from the unstructured text and then processing these indices using
various data mining algorithms large databases.
i. Example: spam recognition, help desks, analysis of related scientific publications
e. Definition: Web mining is the discovery and analysis of interesting and useful
information from the Web, about the Web, and usually through Web-based
tools.
d. Business Intelligence
i. BI in classical form is high-level oriented
ii. Process Intelligence focuses on operational performance to be transparent at all
times
iii. BPI comprises a large range of application areas spanning from process
monitoring and analysis to process discovery, conformance checking,
prediction and optimization.
iv. Challenge: Multiple actors need real-time information about business processes
that is tailored to their needs approach: integrating all data in an in-memory
database
Lecture 8, 9:
1. Explain different elements of a BI strategy
a. Organizational Adoption (process) involves all actions of individuals in an organization
that deal with creating awareness, selecting, evaluating, initiating and deciding for the
implementation of new ES technology.
b. Conversion (process) involves all actions of individuals (in an organization or across
organizations) that deal with developing and implementing a new ES technology.
c. Use (process) involves all actions of individuals in an organization that deal with using
and changing ES technology or the respective work system to realize intended business
value.
2. Characterize different types of key performance indicators and understand the importance of
alignment
a. KPI (Key Performance Indicator) Quarter/Year Motivation Index
b. OPI (Operational Performance Indicator) Weeks/Months Health rate
c. PPI (Process Performance Indicator) Hours/days Accidents
d. Pros of BI strategy: help align with business partners, formalize business needs, create
prioritized roadmaps with strategic business goals to deliver measurable results
e. Alignment is a continuous process
f. Dispositive (the data contained in data warehouses) Data Architecture: high level
reference model for all BI projects of an enterprise Starting from this high level model,
individual projects develop a concrete dispositive data model gap detection through
mapping to existing data sources
g. BI Governance: Data Monitoring Adapt entire BI technology stack (software, hardware,
networks) to technological change and increasing data volumes
h. Governance: Development Model
i. REALLY BAD s. 16 - 29
3. Understand the basics of BI implementation and the importance of BI Strategy for BI
Implementation
a. Disparate business data must be integrated need for information consolidation
alignment across organizations regarding master data and KPIs, new technology changes
a lot
i. Organizational Issues
ii. Project Issues
iii. Technical Issues
iv. User participation in the development is crucial
b. Common failure factors in BI projects
i. Unclear business / information objectives
6. Characterize the individual stages and associated steps of the methodology in more detail slide
52
Will der wirkliche 16 verdammte Phasen einen BI Integration hren? Mir reichen die 6
Stages..alles andere wrde eher zu IS540 passen!
Lecture 10:
1. Explain the importance of post-adoption BI system use for the return on BI technology
investments
a. Returns on IS investments are mainly gained within the post-adoption stages!
c.
Guest Lecture:
Really?