ARCHITECTURES
Basic information system architectures
• Client server
• n-tier
• One tier
• Two tier
• Three tier
Disadvantage of 2-tier
performance problems
• Parallel processing/partitioning?
• Information interdependence
• Upper management’s information needs
• Urgency of need for a data warehouse
• Nature of end-user tasks
• Constraints on resources
Selection Decision continued…
• Strategic view of the data warehouse prior to implementation
• Technical issues
Quiz
A star schema has what type of relationship between a dimension and
fact table?
1. Many-to-many
2. One-to-one
3. One-to-many
4. All of the above
What is data scrubbing?
1. A process to reject data from the data warehouse and to create the
necessary indexes
2. A process to load the data in the data warehouse and to create the
necessary indexes
3. A process to upgrade the quality of data after it is moved into a data
warehouse
4. A process to upgrade the quality of data before it is moved into a data
warehouse
Fact tables are
1. Completely denormalized
2. Partially denormalized
3. Completely normalized
4. Partially normalized
An attempt to find a function that models the data with the least error is
known as
1. Clustering
2. Regression
3. Association rule
4. Clustering
The active data warehouse architecture includes which of the
following?
- Both a and b
- OLAP
- Dashboard
- Warehouse
Which of the following BI techniques can predict a value for a specific data item
attribute?
Options
- Classification
- Clustering
- Regression
• Data access
• Data federation
• Change capture
Challenges
• Huge volume of data to load in a short time
• Recovery in case of failure
Types of loading
• Initial load
• Incremental load
• Full refresh
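The three loading types can be sketched with Python's built-in sqlite3 module. This is a minimal illustration, not a production ETL job: the `sales` table, its columns, and the `last_loaded` timestamp used to detect changed rows are all assumptions made for the example.

```python
import sqlite3

def initial_load(conn, rows):
    """Initial load: create the empty target table and populate it for the first time."""
    conn.execute(
        "CREATE TABLE sales (sale_id INTEGER PRIMARY KEY, amount REAL, updated_at TEXT)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)

def incremental_load(conn, rows, last_loaded):
    """Incremental load: apply only the rows changed since the previous load."""
    # ISO-formatted dates compare correctly as plain strings.
    changed = [r for r in rows if r[2] > last_loaded]
    conn.executemany("INSERT OR REPLACE INTO sales VALUES (?, ?, ?)", changed)
    return len(changed)

def full_refresh(conn, rows):
    """Full refresh: erase the table contents and reload everything."""
    conn.execute("DELETE FROM sales")
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)

conn = sqlite3.connect(":memory:")
source = [(1, 100.0, "2024-01-01"), (2, 250.0, "2024-01-01")]
initial_load(conn, source)

# The (hypothetical) source system later updates sale 2 and adds sale 3.
source = [(1, 100.0, "2024-01-01"), (2, 300.0, "2024-01-05"), (3, 75.0, "2024-01-06")]
applied = incremental_load(conn, source, last_loaded="2024-01-01")
sale_2_amount = conn.execute("SELECT amount FROM sales WHERE sale_id = 2").fetchone()[0]

full_refresh(conn, source)
total = conn.execute("SELECT COUNT(*), SUM(amount) FROM sales").fetchone()
```

The incremental load touches only the two changed rows, while the full refresh rewrites the whole table. That difference is why full refreshes become costly as data volumes grow.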
Load verification
• Ensure that the key field data is neither missing nor null.
• Data checks in dimension table as well as history table.
• Check calculated measures.
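Two of these checks, null or missing keys and calculated measures, can be sketched as a small Python routine. The field names (`product_key`, `date_key`, `quantity`, `unit_price`, `revenue`) and the rule that revenue equals quantity times unit price are assumptions for illustration. The dimension and history table checks are not covered here.

```python
def verify_load(fact_rows, key_fields, measure_check):
    """Return a list of (row_index, problem) tuples; an empty list means the load passed."""
    problems = []
    for i, row in enumerate(fact_rows):
        # Key fields must be present and non-null.
        for key in key_fields:
            if row.get(key) is None or row.get(key) == "":
                problems.append((i, f"missing key: {key}"))
        # Calculated measures must match a recomputation from their inputs.
        if not measure_check(row):
            problems.append((i, "calculated measure mismatch"))
    return problems

def revenue_is_consistent(row, tolerance=1e-9):
    """Assumed business rule: revenue = quantity * unit_price."""
    expected = row["quantity"] * row["unit_price"]
    return abs(row["revenue"] - expected) <= tolerance

rows = [
    {"product_key": 1, "date_key": 20240101, "quantity": 2, "unit_price": 5.0, "revenue": 10.0},
    {"product_key": None, "date_key": 20240101, "quantity": 1, "unit_price": 4.0, "revenue": 4.0},
    {"product_key": 3, "date_key": 20240102, "quantity": 3, "unit_price": 2.0, "revenue": 7.0},
]
problems = verify_load(rows, ["product_key", "date_key"], revenue_is_consistent)
```

Here the second row is flagged for a null key and the third for a revenue value that does not match its inputs.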
VARIATIONS OF OLAP
• ROLAP - Relational Online Analytical Processing.
• MOLAP - Multidimensional OLAP
• HOLAP - Hybrid OLAP
ROLAP - Relational Online Analytical Processing
• Data is stored and fetched from the main data warehouse.
• Data is stored in the form of relational tables.
• Large data volumes.
• Uses complex SQL queries to fetch data from the main warehouse.
• ROLAP creates a multidimensional view of data dynamically.
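The idea of building a multidimensional view on the fly from relational tables can be illustrated with a small star schema in sqlite3. The table names (`fact_sales`, `dim_product`, `dim_date`) and the sample data are hypothetical. A real ROLAP engine generates far more elaborate SQL, but the principle is the same join plus GROUP BY.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_product (product_key INTEGER PRIMARY KEY, category TEXT);
CREATE TABLE dim_date (date_key INTEGER PRIMARY KEY, year INTEGER);
CREATE TABLE fact_sales (product_key INTEGER, date_key INTEGER, amount REAL);
INSERT INTO dim_product VALUES (1, 'Books'), (2, 'Games');
INSERT INTO dim_date VALUES (10, 2023), (11, 2024);
INSERT INTO fact_sales VALUES (1, 10, 20.0), (1, 11, 30.0), (2, 11, 50.0), (2, 11, 5.0);
""")

def cube_view(conn):
    """Aggregate the fact table along two dimensions at query time."""
    query = """
        SELECT p.category, d.year, SUM(f.amount)
        FROM fact_sales AS f
        JOIN dim_product AS p ON p.product_key = f.product_key
        JOIN dim_date AS d ON d.date_key = f.date_key
        GROUP BY p.category, d.year
        ORDER BY p.category, d.year
    """
    # Each (category, year) pair is one cell of the two-dimensional view.
    return {(category, year): total for category, year, total in conn.execute(query)}

cube = cube_view(conn)
```

Nothing is precomputed: each call to `cube_view` reads the relational tables and computes the cells again. This is what lets ROLAP work directly on large warehouse tables, at the cost of heavier queries.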
MOLAP - Multidimensional OLAP
• Believing that data warehousing database design is the same as transactional database
design.
• Choosing a data warehouse manager who is technology oriented rather than user
oriented.
Contd..
• Focusing on traditional internal record-oriented data and ignoring the
value of external data and of text, images, and, perhaps, sound and video.
• Believing that your problems are over when the data warehouse is up and
running.
Real time data warehousing
Why Real time data warehousing?
• Updates in OLTP
• Traditional data warehouses - not business critical
• Updates – weekly
Power users, knowledge workers, internal operational staff, call centers, external users
Issues with real time data warehousing
• Reporting
• Not all field updates
• Enabling Real-time ETL
• No system downtime
Security in data warehouse