Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Data Warehousing - Data Warehousing - Lecture Slide, Slides of Data Warehousing

Some concept of Data Warehousing are Aggregate Functions, Applications and Trends in Data Mining, Classification and Prediction, Cluster Analysis, Data Mining Primitives, Data Warehousing Design. Main points of this lecture are: Data Warehousing Design, Materialize in the Warehouse,Materialize in the Warehouse, Which Source Data, Summary Tables, Which Indices, Both Querying, Maintenance, Trade Storage, Update Time

Typology: Slides

2012/2013

Uploaded on 04/25/2013

khushia
khushia 🇮🇳

4.3

(9)

112 documents

1 / 68

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Data Warehousing/Mining
Comp 150
Additional Information
Docsity.com
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff
pf12
pf13
pf14
pf15
pf16
pf17
pf18
pf19
pf1a
pf1b
pf1c
pf1d
pf1e
pf1f
pf20
pf21
pf22
pf23
pf24
pf25
pf26
pf27
pf28
pf29
pf2a
pf2b
pf2c
pf2d
pf2e
pf2f
pf30
pf31
pf32
pf33
pf34
pf35
pf36
pf37
pf38
pf39
pf3a
pf3b
pf3c
pf3d
pf3e
pf3f
pf40
pf41
pf42
pf43
pf44

Partial preview of the text

Download Data Warehousing - Data Warehousing - Lecture Slide and more Slides Data Warehousing in PDF only on Docsity!

Data Warehousing/Mining

Comp 150

Additional Information

The Need for Data Analysis

 Constant pressure from external and internal forces

requires prompt tactical and strategic decisions.

 The decision-making cycle time is reduced, while

problems are increasingly complex with a growing number of internal and external variables.

 Managers need support systems for facilitating quick

decision making in a complex environment.

 Decision support systems (DSS).

Decision Support Systems

 Decision Support is a methodology (or a series of methodologies) designed to extract information from data and to use such information as a basis for decision making.

 A decision support system (DSS) is an arrangement of computerized tools used to assist managerial decision making within a business.

  • A DSS usually requires extensive data “massaging” to produce information.
  • The DSS is used at all levels within an organization and is often tailored to focus on specific business areas or problems.
  • The DSS is interactive and provides ad hoc query tools to retrieve data and to display data in different formats.

Decision Support Systems

 Four Components of a DSS

  • The data store component is basically a DSS database.
  • The data extraction and filtering component is used to extract and validate the data taken from the operational database and the external data sources.
  • The end user query tool is used by the data analyst to create the queries that access the database.
  • The end user presentation tool is used by the data analyst to organize and present the data.

Decision Support Systems

 Operational Data vs. Decision Support Data

  • Most operational data are stored in a relational database in which the structures tend to be highly normalized.
  • The operational data storage is optimized to support transactions that represent daily operations.
  • Whereas operational data capture daily business transactions, DSS data give tactical and strategic business meaning to the operational data.

Decision Support Systems

 Three Main Areas in Which DSS Data Differ from Operational Data

  • Time spanOperational data represent current (atomic) transactions.DSS data tend to cover a longer time frame.
  • GranularityOperational data represent specific transactions that occur at a given time.DSS data must be presented at different levels of aggregation.
  • DimensionalityOperational data focus on representing atomic transactions.DSS data can be analyzed from multiple dimensions. Docsity.com

Table 13.2 Contrasting Operational And DSS Data Characteristics

Decision Support Systems

 The DSS Database Requirements

  • Database SchemaThe DSS database schema must support complex (non- normalized) data representations.The queries must be able to extract multidimensional time slices.

Yearly Sales Summaries, Two Stores and Two Departments Per Store, Millions Of Dollars

Table 13.

Decision Support Systems

  • Data Extraction and LoadingThe DBMS must support advanced data extracting and filtering tools.The data extraction capabilities should support different data sources and multiple vendors.Data filtering capabilities must include the ability to check for inconsistent data or data validation rules.The DBMS must support advanced data integration, aggregation, and classification capabilities.

Decision Support Systems

  • End-User Analytical InterfaceThe DSS DBMS must support advanced data modeling and data presentation tools, data analysis tools, and query generation and optimization components.The end user analytical interface is one of the most critical components.
  • Database Size RequirementsDSS databases tend to be very large.The DBMS must be capable of supporting very large databases (VLDB).The DBMS may be required to use advanced hardware, such as multiple disk arrays and multiple-processor technologies. Docsity.com

The Data Warehouse

 The Data Warehouse is an integrated, subject-

oriented, time-variant, non-volatile database

that provides support for decision making.

  • IntegratedThe Data Warehouse is a centralized, consolidated database that integrates data retrieved from the entire organization.
  • Subject-OrientedThe Data Warehouse data is arranged and optimized to provide answers to questions coming from diverse functional areas within a company.

Table 13.6A Comparison Of Data Warehouse And Operational Database Characteristics

Creating A Data Warehouse

Figure 13.