Professional Documents
Culture Documents
Walter Kriha
Business Driver: Customer Relationship
Management (CRM)
Web Servers
SAP
RDBMS
PeopleSoft
Mining Ware
Off-line tools house
Personalized
Rule Operational
information
and offerings Engine DB
Integ
Navigation,
Transactions, Log ration
Messages
Framewk Web stats
Messages: 3 new
From advisor: about X inv. Common: Banner about X
Web Mining
• Adds text-mining, ontologies and things like xml
to the above
Data Mining Methods
Data mining
Data retained
Data Distilled
CBR. K-nearest n.
Use a special log framework! Make sure there are meta-data for
the content available (e.g. dynamically generated page content)
Data Analysis
User A
Session flow
User B
Meta-Information
Conceptual
Learning Concept User Profile
Algorithm
The Text-Warehouse: Information Extraction
Financial Research
User profile Autom. IE Documents
With interests Database Tool (pdf, html, doc,xml)
Software
build, extracts Ontologies/Vocabularies
new Humans
Ontologies XML Schemas/RDF define
(e.g. meta-data
Ontobroker) and use
XML Syntax them
AI on Topic Maps?
Associations
Topics
Occurrences
Dep.
Dep.BB Dep.
Dep.AA
Wrapper Induction
discovers facts
XML Editor Schema translation,
Warehouse
Meta- semantic
Data consistency checks
Topic e.g.
Maps Result DBs recommendations
Internal Information
Model Distribution users
Deployment
Mining Ware
Off-line tools house
On-line
The Main Problems for the “Web-house”