Sei sulla pagina 1di 30

Information and Databases

Udayan Shinde 2011

Data
Is a basic unit of information in a raw and unorganized format Bits
On /Off 0 or 1

Text, Images, Audio ,Video


Character ASCII Binary A 65 0100 0001 z 122 01111010 00111001 9

Udayan Shinde 2011

Data
Image

Udayan Shinde 2011

Table
NO Firstname Lastname Birthdate Address Mobile Picture Position Salary 1Amit Joshi Depar tment 1/1/1980121, 9898967676 Marke Akruti,SB ting Road,Pune -001 2/1/1975878/1, 8808811111 Accou Galaxy nts Complex, Flat No. 301, MG Rd, Pune004 13/11/198 77/1, Air 9823046789 HR 2 Palace, Wakad, Pune-033
Udayan Shinde 2011

Executive

15000

2Amit

Raj

Manager

22000

3Madhuri Agarwal

Director

30000

Table
A table is a set of data elements (values) that is organized using a model of vertical columns (which are identified by their name) and horizontal rows. Rows Columns Fields
Refered to the data in row/column

Records
Contains several data fields
Udayan Shinde 2011

Database
Several data fields form a record Several Records form a data file/table Several Data Files form a database Collection of data in an organized format Structured collection of electronically stored data , which is controlled and accessed through computers based on predefined relationships between predefined types of data items, related to a specific business, situation or a problem
Udayan Shinde 2011

Database Management Systems


A set of programs to define, update and control, maintain the Database Store data in an efficient and compact manner Data Access to external applications Typically supports query language

Udayan Shinde 2011

Data Tables

Udayan Shinde 2011

File system/DB Concerns


Data size increases to huge extent
Lacks organization Redundancy Repeating groups Subject to modification anomalies

Difficult to Manage Not Efficient Security


Udayan Shinde 2011

Database Structure
Hierarchical Structure Network Structure Relational Structure Multidimensional Structure

Udayan Shinde 2011

Types of Database
OLTP Vs OLAP Relational DBs Multidimensional DBs Data Warehouses Geographic Information Systems Text and Image DBs Hypermedia DB
Udayan Shinde 2011

Relational Database Management Systems

Udayan Shinde 2011

RDBMS
Normalization Organize and store data in a relational model for efficient Data management Entity Relations

Udayan Shinde 2011

Normalization
Normalization
Efficiently organize data Eliminate redundancy Storing only related data Expandable and easier to extend Make data model more informative

Udayan Shinde 2011

Normalization
First Normal Form (1NF)
Eliminate duplicative columns from the same table. Create separate tables for each group of related data and identify each row with a unique column or set of columns (the primary key)

Udayan Shinde 2011

Normalization
Second Normal Form (2NF)
Meet all the requirements of the first normal form. Remove subsets of data that apply to multiple rows of a table and place them in separate tables. Create relationships between these new tables and their predecessors through the use of foreign keys.

Udayan Shinde 2011

Normalization
Third normal form (3NF)
Meet all the requirements of the second normal form. Remove columns that are not dependent upon the primary key.

Other Normal Forms


Boyce-Codd Normal Form [BCNF] Fourth Normal Form 4NF
Udayan Shinde 2011

Entity Relations
Analyze and describe data requirements and assumptions Used in design of Databases Logical View
Helps the user understand the Data Hides the details

Physical View
Techincal aspect of how and what the computer should do to find, retrieve the data
Udayan Shinde 2011

Entity Relationship Diagrams


To show how information is, or should be stored and used within a business system Success of any IS depends on efficient flow and processing of the information Identifying Entities and their relationship

Udayan Shinde 2011

Entity Relationship Diagrams


Entities are the "things" about which we seek information. Attributes are the data we collect about the entities. Relationships provide the structure needed to draw information from multiple entities.

~ERwin, MS Visio, Oracle Designer, Toad Data Modeler


Udayan Shinde 2011

Relationship types
One-to-Many

One-to-One

Many-to-Many

Udayan Shinde 2011

Entity Relationship Diagrams

Udayan Shinde 2011

Types of Database
Relational DBs Multidimensional DBs Data Warehouses Geographic Information Systems Text and Image DBs Hypermedia DB
Udayan Shinde 2011

Multidimensional Database
Variation of relational structure Uses multidimensional structure to organize data and express relationships between data Cubes of data Each side is considered a dimension Each cell contains aggregated data related to elements among each dimension
Udayan Shinde 2011

Types of Database
OLTP Vs OLAP Relational DBs Multidimensional DBs Data Warehouses Geographic Information Systems Text and Image DBs Hypermedia DB
Udayan Shinde 2011

Data Warehouse

Udayan Shinde 2011

Data Warehouse
Storing company wide data together Repository for historical data Organized data vital for creating reports and monitoring an organizations status Backbone of Business Intelligence

Udayan Shinde 2011

Data Warehouse
Combination of DB and Software to support business analysis and decision making Design steps
Extraction, Consolidation, Filtering, Cleansing, transformation, aggregation and updating

Data Mart: Subset of larger data warehouse devoted to a particular business function or department
Udayan Shinde 2011

Data Mining
Analysis and Summarizing Discover patterns-Classes, Clusters, Associations, Sequential Patterns To help determine relationships among internal factors Data Mining is the discovery of useful patterns in data. And is used for prediction analysis and classification e.g. what is the likelihood that a customer will migrate to a competitor.

~IBM Intelligent Miner, Cognos Scenario, SAS Enterprise Miner


Udayan Shinde 2011

Data as a resource
Extremely Vital Resource Having quality data about internal and external environment is a key success factor Information Quality
Accuracy,Precision and Completeness, Source

Information Accessibility
Availability

Information Presentation
Easily available summarized data in a presentable format
Udayan Shinde 2011

Potrebbero piacerti anche