Preview

Data Warehouse

Powerful Essays
Open Document
Open Document
4241 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Data Warehouse
Data Warehouse Concepts and Design
Contents
Data Warehouse Concepts and Design 1
Abstract 2
Abbreviations 2
Keywords 3
Introduction 3
Jarir Bookstore – Applying the Kimball Method 3
Summary from the available literature and Follow a Proven Methodology: Lifecycle Steps and Tracks 4
Issues and Process involved in Implementation of DW/BI system 5
Data Model Design 6
Star Schema Model 7
Fact Table 10
Dimension Table: 11
Design Feature: 12
Identifying the fields from facts/dimensions: MS: 12
Advanced business analytics techniques -- optimization, data mining and regression 14
Optimization: Creating Distributed Tables 15
The Data Mining Process 16
Data Mining Tasks 16
Analysis of the given data in the excel 17
a. Optimization 17
b. Data mining 19
C. Regression Analysis: 21
References: 27
1. Degenerate dimension and how is it used. Available from:: http://www.information-management.com/news/7844-1.html 27
7. Role-Playing Dimensions. Available from: http://pic.dhe.MS.com/infocenter/cbi/v10r1m1/index.jsp?topic=%2Fcom.MS.swg.ba.cognos.ug_fm.10.1.1.doc%2Fc_bp-multiplerelationships.html 27

Abstract
This analysis report explains how the Kimball method to architecting and building data warehouses using an Organization’s and how the system works for a real time multinational organization with MS Parallel Data (PDW) Warehouse. This report also shows as to how one would incorporate this MS data warehouse product as a tool for the business analytics and data warehousing solution. The system makes use of the Kimball’s method. Herein, an introductory overview of the method is given and the key principles have been discussed and applied for the organization. Three different dataset are taken and the discussion is based on how a hypothetical organization – a bookstore, for instance would use such data. The case study is presented as to how a hypothetical bookstore will utilize the data warehouse.
The Parallel Data Warehouse (PDW) system architecture has been explored and discussed

You May Also Find These Documents Helpful

  • Satisfactory Essays

    This document is a proposal for building a data warehouse architecture that will consolidate and transform data into useful information for the purpose of decision-making and for establishing a new function that offers a broad array of decision support services to all units at ABC Retail Chain Corporation. Executives and decision-makers often need information to analyze the past, describe current circumstances, and anticipate the future. Presently, decision-makers across the Institute rely on hard copy reports or Excel Sheets to provide information. Typically, any request for information is forwarded to the operational areas of the Organization, which provide hard copy reports reflecting the data gathered in their functional area. To analyze and transform data into useful information, decision-makers and their staff have to manually re-enter the non-integrated data into their own mini-systems. This type of operation hinders the ability of decision making and the executives are either drowning in too much data with no option to analyze it or too little data, which means they are back to square one and must request additional information. Often executives receive multiple, conflicting information or information that is based on incomplete assumptions about the types of analysis required.…

    • 641 Words
    • 3 Pages
    Satisfactory Essays
  • Good Essays

    Data Base

    • 2312 Words
    • 10 Pages

    * Each value manipulated by an Oracle database possesses a data type. The data type of a value links a selection of attributes to the value. These attributes of the value differentiate one data type from the others. Oracle treats certain data types in a distinct way. For instance, one can add values of NUMBER data type, but not values of RAW data type. When one builds a table or a cluster, one must assign data types for all its columns. In Oracle, the arguments of a procedure or stored function also need to be allocated data types. The data types specify the domain of values which…

    • 2312 Words
    • 10 Pages
    Good Essays
  • Good Essays

    The most important aspect of having data warehousing is the fact that it allows for data storage and presentation of this data enabling executives to make sound decisions. Another important use of data warehousing is it takes the separate areas the company is divided up in and takes it all and lumps it in to one single entity. One great benefit of data warehousing is that Huffman will be able to handle server task connected to all queries which is not commonly found in all systems. “Another powerful benefit of data warehouses is that they allow companies to use data modeling for querying tasks that are quite difficult for transaction processing” (Exforsys, 2007). Huffman trucking is already successful but by implementing a success data warehousing system they would be able to understand and analyze all data coming in and leaving the system better and at a more efficient rate. Attached to this report is a…

    • 891 Words
    • 3 Pages
    Good Essays
  • Satisfactory Essays

    Part B) What was their advice on how to get a start in the industry?…

    • 482 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    | * The data warehouse of St George bank supports the integrated data among different departments * Data from different departments can be accessed freely * Integrated data from the data warehouse is more beneficial and creates more opportunities and BI for all departments (1+1=3) * “Most departments extract what they need from the warehouse using customer relationship management and BI applications without intervention.” * “They have access to all the data, can create their own filters, their own campaigns.”…

    • 341 Words
    • 1 Page
    Good Essays
  • Good Essays

    An active data warehousing, or ADW, is a data warehouse implementation that supports near-time or near-real-time decision making. It is featured by event-driven actions that are triggered by a continuous stream of queries that are generated by people or applications regarding an organization or company against a broad, deep granular set of enterprise data. Continental uses active data warehousing to keep track of their company’s daily progress and performance. Continental’s management team holds an operations meeting every morning to discuss how their company is performing in regards to the data collected by their active data warehousing program. The management team believes, “you can’t manage what you can’t measure,” so they use active data warehousing to keep track of their customers experience while using Continental Airlines. The information that the management team uses to analyze their company in regards to customer relationship is on-time arrival, on-time departures, baggage handling, and other key performance indicators. Continental also uses active data warehousing for revenue management, revenue accounting, flight operations, fraud detection and airline security. Continental restructured their goals to try to become customers “favorite” airline to use. They use their active data warehousing to gain as much information about the company’s performance as well as the customers experience. They use this real-time warehousing program to interpret information that is provided and make changes that will better improve their customers experience and help Continental better suit their business in regards to their customers’ needs.…

    • 1485 Words
    • 6 Pages
    Good Essays
  • Good Essays

    Database is designed to make transactional systems that run efficiently. Characteristically, this is type of database that is an online transaction processing database. An electronic strength record system is a big example of a submission that runs on an OLTP database. An OLTP database is typically controlled to a single application. The significant fact is that a transactional database does not lend itself to analytics. To effectively achievement analytics, you require a data warehouse. A data warehouse is a database of a diverse kind of an online analytical processing database (In Yang, In Everson & in Yin, 2004). A data warehouse survives as a layer on top of another OLTP databases. The data warehouse obtains the data from all these databases and builds a layer optimized for and dedicated to analytics. A database designed is used to handle transactions designed analytics. It is not structured to do analytics well. A data warehouse is structured to make analytics fast and easy.…

    • 628 Words
    • 3 Pages
    Good Essays
  • Good Essays

    If I were to design Ben & Jerry's data warehouse I would use several dimensions of information. The first dimension would consist of the company's products; ice cream, frozen yogurt or merchandise. The marketing department has to know which products are selling, if Ben & Jerry's didn't know that their T-shirts are selling out as soon as they hit the stores, then they wouldn't be able to take advantage of the opportunity to sell the shirts. The second dimension would consist of the different areas of sales; US, Canada, Mexico, or Europe. I am not sure if they sell their ice cream in Mexico, but with data collection they can find out if their ice cream would be a better seller in the hot climate, rather than pushing for greater distribution in Canada. The third dimension would consist of the "specifics"; where the sale was made, when the sale was made, and who purchased the product. This information can help in the design of the product to focus on the buyer; it can tailor flavors to seasons, and packaging to buyer who looks for the better-looking product. If Ben & Jerry's could know when a season was coming to an end in a specific area, then they could forecast the need or the decline in need and speed up, or slow down distribution to those areas. The focus of the information is that it needs to be useful, and almost any information is useful.…

    • 605 Words
    • 3 Pages
    Good Essays
  • Satisfactory Essays

    Digital Forensic Evidence

    • 592 Words
    • 3 Pages

    The future research in this work will involve the implementation of the model in a real world data organization to help to define the functionality of the…

    • 592 Words
    • 3 Pages
    Satisfactory Essays
  • Good Essays

    A data warehouse is a database that stores current and historical data of potential interest to decision makers throughout the company.[1] In the Terrorist Watch List Database case, the information about suspected terrorists are consolidated and standardized from multiple government agencies so that the information can be centralized into a single list, from which different agencies can communicate and share information with each other. This centralized database is a specific example of data warehouse. In this case, the data warehouse containing the relevant information of individuals from each agency’s list enhancing effectiveness of communication between agencies as well as increase the consistency of information from separate databases.…

    • 860 Words
    • 4 Pages
    Good Essays
  • Satisfactory Essays

    Data Base

    • 250 Words
    • 1 Page

    Review and describe the most important criteria for selecting internetworking devices at the core, access, and distribution layer in a computer network…

    • 250 Words
    • 1 Page
    Satisfactory Essays
  • Powerful Essays

    Chapter 11 Enterprise Resource Planning Systems 1. Closed database architecture is a. a control technique intended to prevent unauthorized access from trading partners. b. a limitation inherent in traditional information systems that prevents data sharing. c. a data warehouse control that prevents unclean data from entering the warehouse. d. a technique used to restrict access to data marts. e. a database structure that many of the leading ERPs use to support OLTP applications. 2. Each of the following is a necessary element for the successful warehousing of data EXCEPT a. cleansing extracted data. b. transforming data. c. modeling data. d. loading data. e. all of the above are necessary. 3. Which of the following is typically NOT part of an ERP’s OLAP applications? a. decision support systems b. information retrieval c. ad hoc reporting/analysis d. logistics e. what-if analysis 4. There are a number of risks that may be associated with ERP implementation. Which of the following was NOT stated as a risk in the chapter? a. A drop in firm performance after implementation because the firm looks and works differently than it did while using a legacy system. b. Implementing companies have found that staff members, employed by ERP consulting firms, do not have sufficient experience in implementing new systems. c. Implementing firms fail to select systems that properly support their business activities. d. The selected system does not adequately meet the adopting firm’s economic growth. e. ERP’s are too large, complex, and generic for them to be well integrated into most company cultures. 5. Which statement is NOT true? a. In a typical two-tier client-server architecture, the server handles both application and database duties. b. Client computers are responsible for presenting data to the user and passing user input back to the server. c. Two-tier architecture is for local area network (LAN) applications where the demand on the server is restricted to a relatively small…

    • 2756 Words
    • 12 Pages
    Powerful Essays
  • Satisfactory Essays

    Mr Raza

    • 789 Words
    • 4 Pages

    4. The table in the data warehouse that contains foreign keys and quantitative/qualitative information is called the…

    • 789 Words
    • 4 Pages
    Satisfactory Essays
  • Satisfactory Essays

    Specialized Database Presentation Specialized Database Presentation Team B: Chappell Grant, John Hainline Linda Hannigan DBM/384 Special Purpose Databases Brando Sumayao Specialized Database Presentation • • • • • • • • • • Executive Overview Strategic Goal Proposal Comparison of different database and purposes SQL concepts relative to spatial and temporal databases Uses of databases in the business environment Description of the information retrieval process in relations to the specialized databases Differences between Online Transaction Processing (OLTP) and Online Analytical Processing (OLAP) Define knowledge management and how it’s used within our organization Conclusion Executive Overview…

    • 656 Words
    • 4 Pages
    Satisfactory Essays
  • Good Essays

    Analytical Crm

    • 814 Words
    • 4 Pages

    Initially a connection is established. To perform the task of performing data mining through excel first a connection needs to be established to sql server. Server used is infodata.tamu.edu.…

    • 814 Words
    • 4 Pages
    Good Essays