what is the difference between a data warehouse and a data mart

Upload: venkata-ravi

Post on 08-Apr-2018

262 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/7/2019 What is the difference between a data warehouse and a data mart

    1/5

    What is the difference between a data warehouse and a

    data mart?

    This is a heavily debated issue. There are inherent similarities between the basic constructs usedto design a data warehouse and a data mart. In general a Data Warehouse is used on an

    enterprise level, while Data Marts is used on a business division/department level. A data martonly contains the required subject specific data for local analysis.Back to top of file

    What is the difference between a W/H and an OLTP

    application?

    Typical relational databases are designed for on-line transactional processing (OLTP) and do notmeet the requirements for effective on-line analytical processing (OLAP). As a result, datawarehouses are designed differently than traditional relational databases.

    Warehouses are Time Referenced, Subject-Oriented, Non-volatile (read only) and Integrated.

    OLTP databases are designed to maintain atomicity, consistency and integrity (the "ACID" tests).Since a data warehouse is not updated, these constraints are relaxed.

    Back to top of file

    What is the difference between OLAP, ROLAP, MOLAP

    and HOLAP?

    ROLAP, MOLAP and HOLAP are specialized OLAP (Online Analytical Analysis) applications.

    ROLAP stands for Relational OLAP. Users see their data organized in cubes with dimensions,but the data is really stored in a Relational Database (RDBMS) like Oracle. The RDBMS will storedata at a fine grain level, response times are usually slow.

    MOLAP stands for Multidimensional OLAP. Users see their data organized in cubes withdimensions, but the data is store in a Multi-dimensional database (MDBMS) like Oracle ExpressServer. In a MOLAP system lot of queries have a finite answer and performance is usually criticaland fast.

    HOLAP stands for Hybrid OLAP, it is a combination of both worlds. Seagate Software's Holos isan example HOLAP environment. In a HOLAP system one will find queries on aggregated data

    as well as on detailed data.

    Back to top of file

    What is the difference between an ODS and a W/H?

    http://www.orafaq.com/faqwh.htm#TopOfFile%23TopOfFilehttp://www.orafaq.com/faqwh.htm#TopOfFile%23TopOfFilehttp://www.orafaq.com/faqwh.htm#TopOfFile%23TopOfFilehttp://www.seagatesoftware.com/http://www.orafaq.com/faqwh.htm#TopOfFile%23TopOfFilehttp://www.orafaq.com/faqwh.htm#TopOfFile%23TopOfFilehttp://www.seagatesoftware.com/http://www.orafaq.com/faqwh.htm#TopOfFile%23TopOfFilehttp://www.orafaq.com/faqwh.htm#TopOfFile%23TopOfFile
  • 8/7/2019 What is the difference between a data warehouse and a data mart

    2/5

    An ODS (Operational Data Store) is an integrated database of operational data. Its sourcesinclude legacy systems and it contains current or near term data. An ODS may contain 30 to 90days of information.

    A warehouse typically contains years of data (Time Referenced). Data warehouses group data bysubject rather than by activity (subject-oriented). Other properties are: Non-volatile (read only)

    and Integrated.

    Back to top of file

    What is the difference between Oracle Express and

    Oracle Discoverer?

    Express is an MD database and development environment. Discoverer is an ad-hoc end-userquery tool.

    Back to top of file

    Q. Why should the OLTP database different from data warehouse database?

    OLTP and data warehousing require two very differently configured systems

    Isolation of Production System from Business Intelligence System

    Significant and highly variable resource demands of the data warehouse

    Cost of disk space no longer a concern

    Production systems not designed for query processing

    Data warehouse usually contains historical data that is derived from transaction data, but it can

    include data from other sources. Having separate databases will separate analysis workload fromtransaction workload and enables an organization to consolidate data from several sources.

    Q. What is the main difference between Data Warehousing and Business Intelligence?

    The differentials are:

    DW - is a way of storing data and creating information through leveraging data marts. DM's aresegments or categories of information and/or data that are grouped together to provide'information' into that segment or category. DW does not require BI to work. Reporting tools cangenerate reports from the DW.

    BI - is the leveraging of DW to help make business decisions and recommendations. Information

    and data rules engines are leveraged here to help make these decisions along with statisticalanalysis tools and data mining tools.

    Q. What is the difference between OLTP and OLAP?

    OLAP - Online Analytical processing, mainly required for DSS, data is in denormalized mannerand mainly used for non volatile data, highly indexed, improve query response time

    http://www.orafaq.com/faqwh.htm#TopOfFile%23TopOfFilehttp://www.orafaq.com/faqwh.htm#TopOfFile%23TopOfFilehttp://www.orafaq.com/faqwh.htm#TopOfFile%23TopOfFilehttp://www.orafaq.com/faqwh.htm#TopOfFile%23TopOfFile
  • 8/7/2019 What is the difference between a data warehouse and a data mart

    3/5

    OLTP - Transactional Processing - DML, highly normalized to reduce deadlock & increaseconcurrency

    Q. What is the difference between OLTP and data warehouse?

    Operational System Data Warehouse

    Transaction Processing Query ProcessingTime Sensitive History Oriented

    Operator View Managerial View

    Organized by transactions (Order,Input, Inventory)

    Organized by subject (Customer,Product)

    Relatively smaller database Large database size

    Many concurrent users Relatively few concurrent users

    Volatile Data Non Volatile Data

    Stores all data Stores relevant data

    Not FlexibleFlexible

    What is the difference between sequential file and a dataset? When to usethe copy stage?

    Sequentiial Stage stores small amount of the data with any extension in order to

    acces the file where as DataSet is used to store Huge amount of the data and itopens only with an extension (.ds ) .The Copy stage copies a single input data set to

    a number of output datasets. Each record of the input data set is copied to everyoutput data set.Records can be copied without modification or you can drop or

    change theorder of columns.

    What is the exact difference betwwen Join,Merge and Lookup Stage??

    The exact difference between Join,Merge and lookup is

    The three stages differ mainly in the memory they use

    DataStage doesn't know how large your data is, so cannot make an informed choice

    whether to combine data using a join stage or a lookup stage. Here's how to decidewhich to use:

    if the reference datasets are big enough to cause trouble, use a join. A join does a

    high-speed sort on the driving and reference datasets. This can involve I/O if thedata is big enough, but the I/O is all highly optimized and sequential. Once the sort

    is over the join processing is very fast and never involves paging or other I/O

    Unlike Join stages and Lookup stages, the Merge stage allows you to specify severalreject links as many as input links.

    What is the difference between Datastage and Datastage TX?Its a critical question to answer, but one thing i can tell u that Datastage Tx is not a

    ETL tool & this is not a new version of Datastage 7.5.

  • 8/7/2019 What is the difference between a data warehouse and a data mart

    4/5

    Tx is used for ODS source ,this much i know

    Difference between Hashfile and Sequential File?

    Hash file stores the data based on hash algorithm and on a key value. A sequential

    file is just a file with no key column. Hash file used as a reference for look up.Sequential file cannot

    Difference between Hashfile and sequential file is , searching a record is too fast in

    hash file based on the hashkey, we can get the address of record directly in hashfilebased on the hashkey, and in sequential file it should search record sequential mode

    only, it has to search for record by record, and we can remove duplicate records

    based on the hash key in hashfile, we cannot in sequential file

    What are Static Hash files and Dynamic Hash files?

    The hashed files have the default size established by their modulus and separation

    when you create them, and this can be static or dynamic.

    Overflow space is only used when data grows over the reserved size for someone ofthe groups (sectors) within the file. There are many groups as the specified by the

    modulus.

    There are two types of hashfiles are there1.Static

    2.dynamic

    dynamic will use only when we dont know howmuch data will coming from thesource side, this will allow data loading grow automatically,

    only we use static when we know the fixed amount of data we are trying to load inthe target DB,this is the scenario for use both types

    2)Differences between star and snowflake schemas?The star schema is created when all the dimension tables directly link to the fact table. Sincethe graphical representation resembles a star it is called a star schema. It must be noted thatthe foreign keys in the fact table link to the primary key of the dimension table. This sampleprovides the star schema for a sales_ fact for the year 1998. The dimensions created are

    Store, Customer, Product_class and time_by_day. The Product table links to the product_classtable through the primary key and indirectly to the fact table. The fact table contains foreignkeys that link to the dimension tables.

    What is the difference between view and materialized view?View - store the SQL statement in the database and let you use it as a table. Everytime youaccess the view,? the SQL statement executes.materialized view - stores the results of the SQL in table form in the database. SQL statementonly executes once and after that everytime you run the query, the stored result set is used.Pros include quick query results.

    Differences between star and snowflake schemas?

  • 8/7/2019 What is the difference between a data warehouse and a data mart

    5/5

    Star schema - all dimensions will be linked directly with a fat table.Snow schema - dimensions maybe interlinked or may have one-to-many relationship withother tables.