Introducing the database and the DBMS Database: shared, integrated computer structure that stores a collection of data Metadata: data about data Complements and expands value of data database management system (DBMS): collection of programs - Manage structure and control access to data Role and Advantages of a DBMS: - Improved data sharing - Better data integration - Minimized data inconsistency - Increased end-user productivity DBMS is the intermediary between the user and the database database
Introducing the database and the DBMS Database: shared, integrated computer structure that stores a collection of data Metadata: data about data Complements and expands value of data database management system (DBMS): collection of programs - Manage structure and control access to data Role and Advantages of a DBMS: - Improved data sharing - Better data integration - Minimized data inconsistency - Increased end-user productivity DBMS is the intermediary between the user and the database database
Introducing the database and the DBMS Database: shared, integrated computer structure that stores a collection of data Metadata: data about data Complements and expands value of data database management system (DBMS): collection of programs - Manage structure and control access to data Role and Advantages of a DBMS: - Improved data sharing - Better data integration - Minimized data inconsistency - Increased end-user productivity DBMS is the intermediary between the user and the database database
Database Systems: – End-user data: raw facts of interest to the
Design, Implementation, and Management end user
Eighth Edition – Metadata: data about data • Metadata provides description of data Chapter 1 characteristics and relationships in data Database Systems – Complements and expands value of data • Database management system (DBMS): Data vs. Information collection of programs • Data are raw facts – Manage structure and control access to • Information is the result of processing raw data data to reveal meaning • Information requires context to reveal meaning Role and Advantages of the DBMS • Raw data must be formatted for storage, • DBMS is the intermediary between the user and processing, and presentation the database • Data are the foundation of information, which is • Database structure stored as file collection the bedrock of knowledge • Access database through the DBMS • Data: building blocks of information • DBMS enables data to be shared • Information produced by processing data • DBMS integrates many users’ views of the data • Information used to reveal meaning in data • Accurate, relevant, timely information is the key • Advantages of a DBMS: to good decision making – Improved data sharing • Good decision making is the key to – Improved data security organizational survival – Better data integration – Minimized data inconsistency Introducing the Database and the DBMS – Improved data access • Database: shared, integrated computer structure – Improved decision making that stores a collection of data – Increased end-user productivity – Structure applied based on type of Types of Databases processing to be performed • Databases can be classified according to: • Semistructured data have been processed to – Number of users some extent – Database location(s) • Extensible Markup Language (XML) – Expected type and extent of use represents data elements in textual format • Single-user database supports only one user • XML database supports semistructured XML at a time data – Desktop database – single-user, runs on PC Why Database Design is Important? • Multiuser database supports multiple users at • Database design focuses on design of the same time database structure used for end-user data – Workgroup database supports a small – Designer must identify database’s expected number use – Enterprise database supports a large • Well-designed database: number – Facilitates data management • Centralized database: data located at a single – Generates accurate and valuable site information • Distributed database: data distributed across • Poorly designed database: several different sites – Causes difficult-to-trace errors • Operational database: supports a company’s day-to-day operations Historical Roots: Files and File Systems – Transactional or production database • Reasons for studying file systems: • Data warehouse: stores data used for tactical – Complexity of database design easier to or strategic decisions understand • Unstructured data exist in their original state – Understanding file system problems helps • Structured data result from formatting to avoid problems with DBMS systems – Knowledge of file system useful for • File system an improvement over manual system converting file system to database system – File systems used for more than two • File systems typically composed of collection of decades file folders, each tagged and kept in cabinet – Understanding the shortcomings of file – Organized by expected use systems aids in development of modern • Contents of each file folder logically related databases • Manual system served as a data repository for – Many problems not unique to file systems small data collections • Even simple file system retrieval task required – Cumbersome for large collections extensive programming • Data processing (DP) specialist converted – Ad hoc queries impossible computer file structure from manual system – Changing existing structure difficult – Wrote software that managed the data • Security features difficult to program – Designed the application programs – Often omitted in file system environments • Initially, computer files systems resembled • Summary of file system limitations: manual systems – Requires extensive programming • As number of files increased, file systems – Can not perform ad hoc queries evolved – System administration complex and difficult – Each file used its own application program – Difficult to make changes to existing to store, retrieve, modify data structures – Each file owned by individual or – Security features likely to be inadequate department that commissioned its creation • Data processing (DP) manager supervised Structural and Data Dependence the DP department • Structural dependence: access to a file • DP department’s primary activity remained dependent on its own structure programming – All file system programs must be modified to conform to a new file structure Problems with File System Data Management • Structural independence: change file – Some software places restrictions on length structure without affecting data access of field names • Data dependence: data access changes when • Each record should have unique identifier data storage characteristics change • Data independence: data storage Data Redundancy characteristics do not affect data access • File system structure makes it difficult to • Practical significance of data dependence is combine data from multiple sources difference between logical and physical format – Vulnerable to security breaches • Logical data format: how human views the • Organizational structure promotes storage of data same data in different locations • Physical data format: how computer must – Islands of information work with data • Data stored in different locations unlikely to be • Each program must contain: updated consistently – Lines specifying opening of specific file • Data redundancy: same data stored type unnecessarily in different places – Record specification • Data inconsistency: different and conflicting – Field definitions versions of same data occur at different places • Data anomalies: abnormalities when all Field Definitions and Naming Conventions changes in redundant data not made correctly • Storing customer name as single field is a liability – Update anomalies – Better record definition breaks fields into – Insertion anomalies component parts – Deletion anomalies • Selecting proper field names important; field names are descriptive Database Systems – With proper naming conventions, file • Database system consists of logically related structure becomes self-documenting data stored in a single logical data repository – May be physically distributed among • Procedures: instructions and rules that govern multiple storage facilities the design and use of the database system • DBMS eliminates most of file system’s problems • Data: the collection of facts stored in the • Current generation stores data structures, database relationships between structures, access paths • Database systems created and managed at – Takes care of defining, storing, managing different levels of complexity all access paths and components • Database solutions must be cost-effective as well as tactically and strategically effective The Database System Environment • Database technology already in use affects • Database system: defines and regulates the selection of a database system collection, storage, management, use of data • Five major parts of a database system: DBMS Functions – Hardware • Most functions transparent to end users – Software – Can only be achieved through the DBMS – People • Data dictionary management – Procedures – DBMS stores definitions of data elements – Data and relationships (metadata) in a data • Hardware: all the system’s physical devices dictionary • Software: three types of software required: – DBMS looks up required data component – Operating system software structures and relationships – DBMS software – Changes automatically recorded in the – Application programs and utility software dictionary • People: all users of the database system: – DBMS provides data abstraction, removes – System and database administrators structural and data dependency – Database designers • Data storage management – Systems analysts and programmers – DBMS creates and manages complex – End users structures required for data storage – Also stores related data entry forms, • Critical to preserving database’s screen definitions, report definitions, etc. integrity – Performance tuning: activities that make • Data integrity management the database perform more efficiently – DBMS promotes and enforces integrity – DBMS stores the database in multiple rules physical data files • Minimizes redundancy • Data transformation and presentation • Maximizes consistency – DBMS transforms data entered to conform – Data relationships stored in data dictionary to required data structures used to enforce data integrity – DBMS transforms physically retrieved data – Integrity especially important in to conform to user’s logical expectations transaction-oriented database systems • Security management • Database access languages and application – DBMS creates a security system that programming interfaces enforces user security and data privacy – DBMS provides access through a query – Security rules determine which users can language access the database, which items can be – Query language is a nonprocedural accessed, etc. language • Multiuser access control – Structured Query Language (SQL) is – DBMS uses sophisticated algorithms to the de facto query language ensure concurrent access does not affect • Standard supported by majority of integrity DBMS vendors • Backup and recovery management • Database communication interfaces – DBMS provides backup and data recovery – Current DBMSs accept end-user requests to ensure data safety and integrity via multiple different network environments – Recovery management deals with recovery – Communications accomplished in several of database after a failure ways: • End users generate answers to • Data are raw facts queries by filling in screen forms • Information is the result of processing data to through Web browser reveal its meaning • DBMS automatically publishes • Accurate, relevant, timely information is the key predefined reports on a Web site to good decision making • DBMS connects to third-party • Data usually stored in a database systems to distribute information via • DBMS implements a database and manages its e-mail contents • Metadata is data about data Managing the Database System: • Database design defines the database structure A Shift in Focus • Well-designed database facilitates data • Database system provides a framework in which management, generates valuable strict procedures and standards enforced information – Role of human changes from programming • Poorly-designed database leads to bad to managing organization’s resources decision making, organizational failure • Database system enables more sophisticated use • Databases evolved from manual and of the data computerized file systems • Data structures created within the database and • In a file system, data stored in their relationships determine effectiveness independent files • Disadvantages of database systems: • Each requires its own management – Increased costs program – Management complexity • Some limitations of file system data – Maintaining currency management: – Vendor dependence • Requires extensive programming – Frequent upgrade/replacement cycles • System administration complex and difficult • Changing existing structures difficult SUMMARY • Security features likely inadequate • Independent files tend to contain redundant data • Structural and data dependency problems