Professional Documents
Culture Documents
CONTENTS :
Types of Databases
1. 2. 3. 4.
to the task of managing the organizations data resources to meet the information needs of their business stakeholders
Database
A database is an integrated collection of logically related data elements. The data stored in a database are independent of the application programs using them & of the type of storage devices on which they are stored.
Operational Databases
These databases store detailed data needed to support business processes & operations of a company.
Other names for this are : Subject Area Databases (SADB) Transaction Databases Production Databases Examples: HR Database Customer Database etc
Operational Databases
HR Database
Payroll Benefits
Employee Record 1
Employee Record 2
Employee Code
Employee Name
Salary
Employee Code
Employee Name
Salary
1001
James
$3500
1002
Martha
$2750
Multimedia databases
Multimedia databases. Multimedia databases can be defined as the database systems that can store, manipulate and query information presented in more than one format such as text, audio, video, graphics, and images. The multimedia databases are of prominence in the world of computers today and more so for the flexibility and convenience they offer in representing various forms of objects that we come across in our everyday lives
Multimedia databases
Thishasnecessitatedutilizationofdifferentformsforsoring multimediaandtheyinclude: ImageData:Imagesareverycommonlyfoundinmultimedia databasesandtheirapplicationscoversimplefigures,icons,medical imageslikeXraysetc. VideoFiles:Thesehavebecomeveryimportantwiththeadventof technologieslikedistributionofvideoetc.Itisnowmoreconvenient thanevertostoreahomevideoonapersonalcomputer. Audiofiles:Thesefilesarebeingusedextensivelytostoreaswellas distributemusicandareeveneing sharedovertheinternet! DocumentData:Thesearethetraditionaltextfileswhere informationisstoredintheformoftext.Thesefilesarestill inuseand havechangedintermsofthecapabilityofstoragesize.
Distributed Databases
Many Organizations replicate and distribute copies of parts of databases to network servers at a variety of sites. These Distributed databases can reside on network servers on the WWW on corporate intranets or extranets etc.
Distributed Databases
Partitioned Databases
Duplicate Databases
Distributed Databases
Distributed Databases
Fragmentation
Horizontal fragmentation: each tuple of r is assigned to one or more fragments Vertical fragmentation: the schema for relation r is split into several smaller schemas Example : relation account with following schema Account-schema = (branch-name, accountnumber, balance)
account1=branch-name=Hillside(account) branch-name Valleyview Valleyview Valleyview Valleyview account-number A-177 A-402 A-408 A-639 balance 205 10000 1123 750
account2=branch-name=Valleyview(account)
Lowman Hillside Camp Hillside Camp Valleyview Kahn Valleyview Kahn Hillside Kahn Valleyview Green Valleyview deposit1=branch-name, customer-name, tuple-id(employee-info) account number balance
tuple-id 1 2 3 4 5 6 7
500 A-305 336 A-226 205 A-177 10000 A-402 62 A-155 1123 A-408 750 A-639 deposit2=account-number, balance, tuple-id(employee-info)
Advantages of Fragmentation
Horizontal:
allows parallel processing on fragments of a relation allows a relation to be split so that tuples are located where they are most frequently accessed
Vertical:
allows tuples to be split so that each part of the tuple is stored where it is most frequently accessed tuple-id attribute allows efficient joining of vertical fragments allows parallel processing on a relation
Data warehousing
A data warehouse stores data that have been extracted from the various operational, external & other databases of an organization.
It is a central source of data that has been cleaned, transformed & catalogued so that they can be used by managers & other business professionals for data mining, online analytical processing & other forms of business analysis, market research & decision support.
Data warehousing
Various units worldwide each having their own IT resources (collecting & processing local data) The fuel company wanted to combine data from its ERP Financial applications with data from its various systems to process information on how much GAS & OIL the company finds & collects? Steve Much (Data warehouse team leader SHELL Scotland) faced major problems as each system had their set of codes. The option of going back cleansing & integrating data in host system wasn't an option which was feasible ???
No single Business Unit lost control of its data. Hence all Business Heads contribute to a greater understanding of information for the company as a whole. After this success Mutch faced pressure from TOP Executives to integrate data from other applications also. Hence the TRUE POTENTIAL of a Data warehouse is realized at SHELL
A Database Management System (DBMS) is a collection of interrelated data (database) and a set of programs to access those data. A Database Management System (DBMS) is a software that: Defines a database Stores the data Supports a Query Language Produces Reports Creates data entry screens
Report Writer
Generator Generator
Application
Security
Experian Inc a unit of a LONDON based company. It runs one of the largest credit reporting agencies in the US. Experian wanted to go beyond credit checks for automotive loans. Experian wanted to collect vehicle data from various Motor vehicle departments in the US, & blend it with other data such as Change of Address records & then it could sell the enhanced data. To offer these services, Experian first needed a way to extract, transfer and load data from 50 different US State Department of Motor Vehicles (DMV) systems into a single database.
This was difficult as per Ken Kauppila, VP(IT) at Experian Automotive (California) As Each DMV had their own format for entering data Kaupilla decided to use ETL( Extracting, Transforming & Loading) tools to combine very large data repositories. Using ETL EXTRACT by Evolutionary Technologies, Experian created database that can incorporate vehicle information within 48hrs of its entry into any States DMV computer. Experian automotive database is the 10th largest automobile database in the world (16 billion rows of data)
Database Concepts
1:1
Employee
1:many Employee 1
Sales 2 Sales 3
Customer A Customer B
Employee 2 Many:Many
Employee 3
Relational Database Model -- need to include a common, unique field between tables in order to link or "relate" the different tables.
Basic Definitions
Primary
key
field (column) that is unique for table (a data item cannot be repeated anywhere in the field -- product number, employee id., etc.)
Foreign
key
A primary key in another table that is used to join (connect) two tables.
Relational Databases
Tables
Rows Columns Primary
Customer Table
Phone 312-555-1234 502-555-8876 602-555-9987 612-555-4325 Name Jones Smith Juarez Olsen Address 123 Main 456 Oak 887 Ribera 465 Thor City Chicago Glasgow Phoenix Minneapolis
keys
Orders Table
Customer 502-555-8876 602-555-9987 612-555-4325 502-555-8876 Date 3/3/04 4/4/04 4/9/04 5/7/04 Salesperson 2223 8876 8876 3345 Total_sale 157.92 295.53 132.94 183.67
Data types
Text Dates