copy (2) of dwh file1

Upload: kamran

Post on 04-Jun-2018

225 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/13/2019 Copy (2) of DWH File1

    1/81

    1. What is Data warehouse? According to Bill Inmon, known as father of Data warehousing. A Data warehouse is asubject oriented, integrated ,time ariant, non olatile collection of data in su!!ort ofmanagement"s decision making !rocess#.$. What are the t%!es of data warehouses?&here are three t%!es of data warehouses'nter!rise Data Warehouse(D) *o!erational data store+Data art-. What is Data mart? A data mart is a subset of data warehouse that is designed for a !articular line ofbusiness, such as sales, marketing, or finance. In a de!endent data mart, data can be deriedfrom an enter!rise wide data warehouse. In an inde!endent data mart can be collected directl%from sources.. What is star schema? A star schema is the sim!lest form of data warehouse schema that consists of one ormore dimensional and fact tables./. What is snow flake schema? A )nowflake schema is nothing but one 0act table which is connected to a number of

    dimension tables, &he snowflake and star schema are methods of storing data which aremultidimensional in nature.

    (eriewDataware 2ousing Informatica Interiew 3uestions and )olutions

    What is the difference between static cache and d%namic cache?Ans In case of d%namic cache, when we are inserting a new row it checks the looku! cache tosee if it e4ists, if not inserts it into the target as well asthe cache but in case of static cache the new row is written onl% in the target and not the looku!cache.&he looku! cache remains static and does not change during the session but incase of d%namiccache the serer inserts, u!dates in the cache during session.

    2ow do we join two tables without joiner or )35 oerride?Ans We can join the tables using looku! tansformation and making a cartesian !roduct.

    What are !re6session and !ost6session o!tions?Ans &hese are shell commands that Informatica serer !erforms before running the session. Weshould hae !ermission and !riileges to run the o!tions in Informatica.

    2ow do we co!% ma!!ing?Ans &he ste!s to co!% ma!!ing are mentioned below 1.+ 0irst, o!en naigator window $.+ (!enthe ma!!ing in re!ositor% -.+ 7lick co!%

    What are the tasks done b% Informatica serer?

    Ans &asks done b% Informatica serer are a.+ anaging the sessions b.+ 8unning the workflowsc.+ &uning !erformance d.+ )cheduling of sessions and workflows

    What is the difference between source 9ualifier and joiner transformation?Ans We need matching ke%s in source 9ualifier which we do not need in joiner transformation.

    What are ma!!ing !arameters and ma!!ing ariables?Ans a!!ing !arameters are !arameters which are constant through out the session. &he% ares!ecified before running the session. :nlike !arameters, ariables change through out the

  • 8/13/2019 Copy (2) of DWH File1

    2/81

    session.

    Wh% do we use !artitioning the session in Informatica?Ans ;artitioning achiees the session !erformance b% reducing the time !eriod of reading thesource and loading the data into target.

    &o achiee the session !artition what are the necessar% tasks %ou hae to do?Ans 7onfigure the session to !artition source data.Install the Informatica serer on a machine with multi!le 7;:"s.

    7an %ou co!% the batches?Ans ersionnumber 0lag alue, 'ffectie date 8ange

    7an %ou start a session within a batch indiiduall%?Ans 5et me tell %ou that one can start the sessions in this wa% in the case of se9uential batchesonl%. In case of the concurrent batches it is irtuall% im!ossible.6 )ee more at htt!www.aired.in$@1@@$informatica6etl6interiew69uestions.htmlsthash.m-gA)gcB.d!uf

    7an %ou co!% the session to a different folder or re!ositor%?es. B% using co!% session wi=ard, %ou can co!% a session in a different folder or re!ositor%.

    What is a command that used to run a batch?!mcmd is used to start a batch.

    What are the t%!es of ma!!ing in Cetting )tarted Wi=ard?)im!le ;ass through ma!!ing)lowl% growing target ma!!ing

    What are the different t%!es of &%!e$ slowl% changing dimensions?

  • 8/13/2019 Copy (2) of DWH File1

    3/81

    &here are three t%!es of slowl% changing dimensions)7D with ersioning)7D with flags)7D with Date

    What are the different threads in D& !rocess?aster threada!!ing thread8eader threadWriter thread;re and !ost session threads

    What are the actie and !assie transformations?An actie transformation can change the number of rows that !ass through it.A !assie transformation does not change the number of rows that !ass through it.

    What are the scheduling o!tions to run a session?Different o!tions of scheduling are8un onl% on demand Informatica serer runs the session onl% when user starts session e4!licitl%8un once Informatica serer runs the session onl% once at a s!ecified date and time.8un eer% Informatica serer runs the session at regular interals as %ou configured.7ustomi=ed re!eat Informatica serer runs the session at the date and time s!ecified in there!eat dialog bo4.

    2ow can we store !reious session logs?ust run the session in time stam! mode then automaticall% session log will not oerwrite currentsession log.

    What is the difference between a!!ing and a!!let?a!!ing is the collection of )ource Definition, &arget Definition, &ransformation*s+ andora!!let.

    a!!let is a set of reusable transformations for a!!l%ing same business logic.

    What is the use of 8e!ositor% anager?8e!ositor% anager is used to manage folders, iew )ource&arget de!endencies, a!!ingDe!endencies, adding and remoing re!ositories, >ersioning etc.

    What are the arious &asks in Informatica Workflow anager?

  • 8/13/2019 Copy (2) of DWH File1

    4/81

    &he arious &asks in Informatica are

    Assignment &ask7ommand &ask

    7ontrol &askDecision &ask'6mail &ask'ent 8aise &ask'ent Wait &ask)ession &ask&imer &ask5ink &ask

    What is the use of )ource Anal%=er?)ource Anal%=er is used to create source definition.

    What is the use of &arget Designer?&arget Designer is used to create &arget Definition.

    What is &arget Designer called !rior to Informatica E.F?Warehouse Designer

    7an we use 8eusable )e9uence Cenerator transformation in a!!let?

  • 8/13/2019 Copy (2) of DWH File1

    5/81

  • 8/13/2019 Copy (2) of DWH File1

    6/81

    What is )taging Area?)taging Area is indeed a database where data from different source s%stems are brought togetherand this database acts as an in!ut to Data 7leansing.

    What are the t%!es of joins in Informatica and in (racle?&here are four t%!es of joins in oraclee9ui joinnon e9ui joinself joinouter joinoins in informatica

    master join *right outer join+detailed join *left outer join+normal join

    What is the file e4tension or format of files for the Informatica (bjects like sessions, ma!!ingsetc. in 8e!ositor%?&he format of files for Informatica (bjects in 8e!ositor% is K5

    Where can we find >ersioning in Informatica? What ha!!ens if >ersioning is turned off?In Informatica, we can find >ersioning in 8e!ositor% anager. If >ersioning is turned off, we will

    not be able to track the changes for the res!ectie )essionsa!!ingsWorkflows.

    What is tracing leel? What are the t%!es of tracing leel?&racing leel is the amount of information that Informatica serer writes into a log file.&%!es of tracing leelerbose init>erbose data

    In joiner transformation, we take the table with lesser number of rows as master while the morenumber of rows as detailed. Wh%?In joiner, each and eer% row of the master is com!ared with eer% row of the detailed and so, the

    less number of rows in master, the less is the number of iterations and so better is the!erformance of the s%stem.

    What are all the databases the Informatica serer on windows can connect to?

    Informatica serer on windows can connect to)35 serer database(racle)%base

  • 8/13/2019 Copy (2) of DWH File1

    7/81

    &eradata) Access) '4celInformi4DB$

    What are the databases the Informatica serer on :

  • 8/13/2019 Copy (2) of DWH File1

    8/81

    2ow does serer recogni=e the source and target databases?B% using (DB7 if the% are relational, 0&; if the% are flat files.

    What is rank inde4 in a grou!?;ower 7enter Designer automaticall% creates a 8A

  • 8/13/2019 Copy (2) of DWH File1

    9/81

    What is !age code com!atibilit%?It is nothing but com!atibilit% of code for maintaining data accurac%. It comes into !icture whendata is in different languages.

    What are )%non%ms?)%non%ms are alternatie names for database objects such as tables, iews, stored !rocedures

    etc.)%nta47reate Qre!laceR s%non%m for Qschema.RobjectOname

    &%!es of 5ooku! 7ache?)tatic cacheD%namic cache;ersistent cache8ecache from database)hared cache

    What are arious stages of )D57?8e9uirements Cathering, Anal%sis, Design, Im!lementation and &esting and aintenance.

    What is )tar )chema?)tar )chema is a sim!lest form of schema which has one fact table and at least one dimensiontable. &he dimensions here are denormali=ed.

    What is 0act table?It is a centrali=ed table in )tar )chema. 0act table has two t%!es of columns. 0irst t%!e ismeasures and second t%!e is the foreign ke%s for the dimension tables.

    What is )now0lake )chema?In )now0lake )chema, the dimensions are further diided into sub dimensions. &he dimensionshere are normali=ed.

    What is Dimension table?A dimension table is one that describes the business entities of an enter!rise.

    What is o!erational data store *(D)+?(!erational data store is defined to be structure that is)ubject6orientedIntegrated>olatileand current data that is a da% or !erha!s a month old.

    What is 5ogical Data odeling?5ogical Data odeling is a t%!e of data modeling which re!resents business re9uirements of anorgani=ation.

    What is ;h%sical Data odeling?;h%sical Data odeling is a t%!e of data modeling which includes all re9uired tables, columns,and relationshi!s for the !h%sical im!lementation of a database.

    Difference between ma!!let and reusable transformation?:sing ma!!let we can make set of transformations reusable whereas in reusable transformation,we can make onl% one transformation as reusable.

  • 8/13/2019 Copy (2) of DWH File1

    10/81

    what is Data cleansing?It is the !rocess of conerting data from different format of files or databases to single re9uiredformat.

    What are the t%!es of files created b% Informatica serer during the session running?

    &%!es of files created are7ache file)ession log fileInformatica serer log file(ut!ut file8eject file

    What are the t%!es of re!ositories created b% Informatica re!ositor% manager?0our t%!es of re!ositories are created using re!ositor% manager)tandalone re!ositor%Clobal re!ositor%5ocal re!ositor%>ersioned re!ositor%

    What are the two t%!es of !rocesses that run the session?&he two t%!es of !rocesses that run the session are5oad anagerD& !rocesses *Data &ransformation anager+

    What is the difference between :nion and 5ooku! transformation?:nion transformation is Actie while 5ooku! transformation is ;assie.0or :nion transformation, the source tables or data should hae similar structure while its not thecase with the 5ooku! transformation.5ooku! transformation can run on source or target tables while :nion tables work onl% on sourcetables.

    What is through!ut in Informatica?&hrough!ut is nothing but the rate at which Informatica serer reads the data from sources andwrites them successfull% to the target.

    Where can we find the through!ut o!tion in Informatica?We can iew this in workflow monitorIn workflow monitor, right click on session, then click on Cet 8un ;ro!erties and under)ource&arget statistics we can find through!ut o!tion

    What is code !age?7ode !age consists of encoding to s!ecif% characters in set of one or more languages and areselected based on source language.

    What are t%!es of loading in Informatica?&he two t%!es of loading aailable in Informatica areBulk 5oading

  • 8/13/2019 Copy (2) of DWH File1

    11/81

    What is 7om!le4 a!!ing?7om!le4 a!!ing will hae the following featuresDificult re9uirementsan% number of transformations7om!le4 business logic

    2ow man% wa%s %ou can add !orts?&wo wa%s0rom other transformation7lick on add !ort button

    2ow man% number of sessions can %ou can %ou grou! in batches?An% number of sessions but the lesser the number of sessions in a batch, the easier themigration.

    What is the difference between Aggregator transformation and '4!ression &ransformation?Aggregator transformation use aggregator functions and !erforms calculation on entire grou!whereas in '4!ression transformation !erforms calculation on row b% row basis.

    What is the difference between 0ilter transformation and 8outer transformation?0ilter transformation dro!s the data that do not meet the condition whereas 8outer transformationca!tures the data een though the condition is not met and saes it in Default out!ut grou!.

    0ilter transformation works on single condition onl% while 8outer transformation works on multi!leconditions as well.

    0ilter transformation gies onl% one out!ut. 8outer transformation can gie more than one out!ut.

    What is the similarit% between 8outer and 0ilter transformations?8outer transformation and 0ilter transformations are used to filter the data based on condition.

    Both 0ilter and 8outer transformation are Actie transformations.

    Both 0ilter and 8outer transformation are connected.

    What is the difference between )ource 3ualifier transformation and oiner transformation?)ource 3ualifier transformation is used to join the data from homogeneous sources while oinertransformation is used to join data from heterogeneous sources as well as homogenous sourcesfrom different schemas.

    We need matching ke%s to join two relational sources in )ource 3ualifier transformation and isnot the case with joiner transformation.

    What is ;A8A file?

    ;aram file is an ordinar% te4t file where we can define the alue for the !arameter which isdefined in session.

    &hese !arameter files are defined in session !ro!erties.

    Which transformation should we use to normali=e the 7(B(5 and relational sources?We need to make use of

  • 8/13/2019 Copy (2) of DWH File1

    12/81

    &ransformations which we cannot use in a!!let are

  • 8/13/2019 Copy (2) of DWH File1

    13/81

    es

    What is the transformation used in loading flat files of similar structure to a single target?We can make use of :nion transformation

    Difference between direct and indirect loading o!tions in sessions?

    Direct loading can be used on single transformations while indirect loading can be used onmulti!le transformations

    In direct loading, we can !erform recoer% !rocess while in indirect loading, we cannot !erformrecoer% !rocess.

    What are the arious techni9ues for im!lementing a dimensional model?)tar schema)nowflake schema

    What are the t%!es of dimensions?&here are three t%!es of dimensions)lowl% changing dimensions

    7onfirmed dimensions7asual dimensions

    What is )35 oerride?It is nothing but oerriding )35 in source 9ualifier or looku! for additional logic.

    What are the default alues for ariables?

    &he default ariables forariableS

  • 8/13/2019 Copy (2) of DWH File1

    14/81

    )ource Definition&arget Definition5ink&ransformationa!!let

    What is the default source o!tion for :!date )trateg%?Data Drien

    What is the use of :!date )trateg% transformation?:!date )trateg% is used to !erform D5 o!erations like Insert, :!date, Delete and 8eject onalread% !o!ulated targets.

    What is the default join that the oiner transformation !roides?

  • 8/13/2019 Copy (2) of DWH File1

    15/81

    8owId)%sdate:ser7urral>A8IAarchar datat%!e has a si=e of $@@@ b%tes

    whereas archar$ has a si=e of @@@ b%tes.>archar is in ascii whereas archar$ is in unicode.

    What is

  • 8/13/2019 Copy (2) of DWH File1

    16/81

    are !ut to a se!arate table along with de!ended !rimar% ke% . &his should also satisf% the 1nf. 0ore4am!le, if a table has attributes like !artid, cit%, state and countr% and !artid, countr% for acom!osite !rimar% ke% and cit%, state de!end on countr% then the table is se!eated as twodifferent tables as attributes haing !artid, countr% and countr%, state, cit%.&hird normal form &hird normal form states that if a de!endenc% e4ist between non ke%attributes, then these attributes are isolated to a different table. &his should also satisf% the 1nf T$nf.

    SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS

    &agsInformatica 3uestions Answers.3uestions asked in Infos%s, &7), 7&), WI;8(, Accenture,(8A75', 7)7, 'D), IB, 5U&

    When to use Abort, Decode functions?Ans Abort can be used to Abort sto! the session on an error condition. If the !rimar% ke%column contains

  • 8/13/2019 Copy (2) of DWH File1

    17/81

    What is the use of tracing leels in transformation?Ans &racing leels store information about ma!!ing and transformations.

    What are the basic needs to join two sources in a source 9ualifier?Ans &wo sources should hae !rimar% and foreign ke% relationshi!s. &wo sources should hae

    matching data t%!es.

    What is u!date strateg% transformation?Ans &his transformation is used to maintain the histor% data or just most recent changes in totarget table.

    Describe two leels in which u!date strateg% transformation sets?Ans Within a session. When %ou configure a session, %ou can instruct the Informatica )erer toeither treat all records in the same wa% *for e4am!le, treat all records as inserts+, or useinstructions coded into the session ma!!ing to flag records for different database o!erations.Within a ma!!ing. Within a ma!!ing, %ou use the :!date )trateg% transformation to flag recordsfor insert, delete, u!date, or reject.

    What are the arious t%!es of transformation?Ans >arious t%!es of transformation are Aggregator &ransformation, '4!ression &ransformation,0ilter &ransformation, oiner &ransformation, 5ooku!&ransformation,

  • 8/13/2019 Copy (2) of DWH File1

    18/81

    integrated.

  • 8/13/2019 Copy (2) of DWH File1

    19/81

    Detailed data)ummari=ed data8e!etitie access

    Ad6hoc access/0ew 8ecords accessed at a time *tens+, sim!le 9uer%5arge olumes accessed at a time*millions+, com!le4 9uer%F)mall database5arge DatabaseL7urrent data2istorical dataE7lerical :serNnowledge :serM8ow b% 8ow 5oadingBulk 5oading

    1@&ime inariant&ime ariant11

  • 8/13/2019 Copy (2) of DWH File1

    20/81

    (5&; '&5 Data mart DW2 (5A;&o! down(5&; '&5 DW2 Data mart (5A;&o! downV 7ost of initial !lanning U design is highV &akes longer duration of more than an %earBottom u!V ;lanning U Designing the Data arts without waiting for the Clobal warehouse designV Immediate results from the data martsV &ends to take less time to im!lementV 'rrors in critical modules are detected earlier.V Benefits are reali=ed in the earl% !hases.V It is a Best A!!roachData odeling &%!esV 7once!tual Data odelingV 5ogical Data odelingV ;h%sical Data odelingV Dimensional Data odeling1. 7once!tual Data odelingV 7once!tual data model includes all major entities and relationshi!s and does not contain

    much detailed leel of information about attributes and is often used in the I

  • 8/13/2019 Copy (2) of DWH File1

    21/81

    Definition7ommentDimensional Data odelingV Dimension model consists of fact and dimension tablesV It is an a!!roach to deelo! the schema DB designs&%!es of Dimensional modelingV )tar schemaV )now flake schemaV )tar flake schema *or+ 2%brid schemaV ulti star schemaWhat is )tar )chema?V &he )tar )chema 5ogical database design which contains a centrall% located fact tablesurrounded b% at least one or more dimension tablesV )ince the database design looks like a star, hence it is called star schema dbV &he Dimension table contains ;rimar% ke%s and the te4tual descri!tionsV It contain de6normali=ed business informationV A 0act table contains a com!osite ke% and measuresV &he measure are of t%!es of ke% !erformance indicators which are used to ealuate theenter!rise !erformance in the form of success and failureV 'g &otal reenue , ;roduct sale , Discount gien, no of customers

    V &o generate meaningful re!ort the re!ort should contain at least one dimension and one facttable&he adantage of star schemaV 5ess number of joinsV Im!roe 9uer% !erformanceV )licing downV 'as% understanding of data.DisadantageV 8e9uire more storage s!ace

    '4am!le of )tar )chema)nowflake )chemaV In star schema, If the dimension tables are s!itted into one or more dimension tables

    V &he de6normali=ed dimension tables are s!itted into a normali=ed dimension table'4am!le of )nowflake )chema

    V In )nowflake schema, the e4am!le diagram shown below has dimension tables, looku!tables and 1 fact table. &he reason is that hierarchies *categor%, branch, state, and month+ arebeing broken out of the dimension tables *;8(D:7&, (8CA

  • 8/13/2019 Copy (2) of DWH File1

    22/81

    V ulti!le fact tables sharing a set of dimension tables

    V 7onfirmed Dimensions are nothing but 8eusable Dimensions.V &he dimensions which u r using multi!le times or in multi!le data marts.V &hose are common in different data martseasure &%!es *or+ &%!es of 0acts

    Additie 6 easures that can be summed u! across all dimensions.'4 )ales 8eenue)emi Additie 6 easures that can be summed u! across few dimensions and not with others'4 7urrent Balance

  • 8/13/2019 Copy (2) of DWH File1

    23/81

    !erformance. )( here the column, billOno is a degenerate dimension or line item dimension.Informatica Architecture

    &he ;ower 7enter domain It is a !rimar% unit of the Administration.

    7an hae single and multi!le domains. It is a collection of nodes and serices.

  • 8/13/2019 Copy (2) of DWH File1

    24/81

    3. What is a source 9ualifier? What is meant b% 3uer% (erride?A. )ource 3ualifier re!resents the rows that the ;ower7enter )erer reads from a relational orflat file source when it runs a session. When a relational or a flat file source definition is added toa ma!!ing, it is connected to a )ource 3ualifier transformation.;ower7enter )erer generates a 9uer% for each )ource 3ualifier &ransformation wheneer itruns the session. &he default 9uer% is )'5'& statement containing all the source columns.)ource 3ualifier has ca!abilit% to oerride this default 9uer% b% changing the default settings ofthe transformation !ro!erties. &he list of selected !orts or the order the% a!!ear in the default9uer% should not be changed in oerridden 9uer%.3. What is aggregator transformation?

    A. &he Aggregator transformation allows !erforming aggregate calculations, such as aeragesand sums. :nlike '4!ression &ransformation, the Aggregator transformation can onl% be used to!erform calculations on grou!s. &he '4!ression transformation !ermits calculations on a rowb%6row basis onl%.

    Aggregator &ransformation contains grou! b% !orts that indicate how to grou! the data. Whilegrou!ing the data, the aggregator transformation out!uts the last row of each grou! unlessotherwise s!ecified in the transformation !ro!erties.>arious grou! b% functions aailable in Informatica are A>C, 7(:

  • 8/13/2019 Copy (2) of DWH File1

    25/81

    Z DDO8''7& If this is used the :!date )trateg% flags the row for rejection. '9uialent numericalue of DDO8''7& is -.3. What are the t%!es of loading in Informatica?&here are two t%!es of loading, 1. ersioned 8e!ositor% &his can either be local or global re!ositor% but it allows ersion controlfor the re!ositor%. A ersioned re!ositor% can store multi!le co!ies, or ersions of an object. &hisfeature allows efficientl% deelo!ing, testing and de!lo%ing metadata in the !roductionenironment.3. What is a code !age?

    A. A code !age contains encoding to s!ecif% characters in a set of one or more languages. &hecode !age is selected based on source of the data. 0or e4am!le if source contains a!anese te4tthen the code !age should be selected to su!!ort a!anese te4t.When a code !age is chosen, the !rogram or a!!lication for which the code !age is set, refers to

    a s!ecific set of data that describes the characters the a!!lication recogni=es. &his influences thewa% that a!!lication stores, receies, and sends character data.3. Which all databases ;ower7enter )erer on Windows can connect to?

    A. ;ower7enter )erer on Windows can connect to following databasesZ IB DB$Z Informi4Z icrosoft AccessZ icrosoft '4celZ icrosoft )35 )ererZ (racleZ )%baseZ &eradata3. Which all databases ;ower7enter )erer on :

  • 8/13/2019 Copy (2) of DWH File1

    26/81

    3. What is Data Drien?&he informatica serer follows instructions coded into u!date strateg% transformations within thesession ma!!ing which determine how to flag records for insert, u!date, delete or reject. If we donot choose data drien o!tion setting, the informatica serer ignores all u!date strateg%transformations in the ma!!ing.3. What are the t%!es of ma!!ing wi=ards that are !roided in Informatica?&he designer !roide two ma!!ing wi=ard.1. Cetting )tarted Wi=ard 6 7reates ma!!ing to load static facts and dimension tables as well asslowl% growing dimension tables.$. )lowl% 7hanging Dimensions Wi=ard 6 7reates ma!!ings to load slowl% changing dimensiontables based on the amount of historical dimension data we want to kee! and the method wechoose to handle historical dimension data.3. What is 5oad anager?

    A. While running a Workflow, the ;ower7enter )erer uses the 5oad anager!rocess and the Data &ransformation anager ;rocess *D&+ to run the workflow and carr% outworkflow tasks. When the ;ower7enter )erer runs a workflow, the 5oad anager !erforms thefollowing tasks1. 5ocks the workflow and reads workflow !ro!erties.$. 8eads the !arameter file and e4!ands workflow ariables.-. 7reates the workflow log file.

    . 8uns workflow tasks./. Distributes sessions to worker serers.F. )tarts the D& to run sessions.L. 8uns sessions from master serers.E. )ends !ost6session email if the D& terminates abnormall%.When the ;ower7enter )erer runs a session, the D& !erforms the following tasks1. 0etches session and ma!!ing metadata from the re!ositor%.$. 7reates and e4!ands session ariables.-. 7reates the session log file.. >alidates session code !ages if data code !age alidation is enabled. 7hecks3uer% conersions if data code !age alidation is disabled./. >erifies connection object !ermissions.F. 8uns !re6session shell commands.

    L. 8uns !re6session stored !rocedures and )35.E. 7reates and runs ma!!ings, reader, writer, and transformation threads to e4tract,transform, and load data.M. 8uns !ost6session stored !rocedures and )35.1@. 8uns !ost6session shell commands.11. )ends !ost6session email.3. What is Data &ransformation anager?

    A. After the load manager !erforms alidations for the session, it creates the D&!rocess. &he D& !rocess is the second !rocess associated with the session run. &he!rimar% !ur!ose of the D& !rocess is to create and manage threads that carr% outthe session tasks.Z &he D& allocates !rocess memor% for the session and diide it into buffers. &hisis also known as buffer memor%. It creates the main thread, which is called the

    master thread. &he master thread creates and manages all other threads.Z If we !artition a session, the D& creates a set of threads for each !artition toallow concurrent !rocessing.. When Informatica serer writes messages to thesession log it includes thread t%!e and thread ID.0ollowing are the t%!es of threads that D& createsaster &hread 6 ain thread of the D& !rocess. 7reates and manages all otherthreads.a!!ing &hread 6 (ne &hread to 'ach )ession. 0etches )ession and a!!ingInformation.;re and ;ost )ession &hread 6 (ne &hread each to ;erform ;re and ;ost )ession

  • 8/13/2019 Copy (2) of DWH File1

    27/81

    (!erations.8eader &hread 6 (ne &hread for 'ach ;artition for 'ach )ource ;i!eline.Writer &hread 6 (ne &hread for 'ach ;artition if target e4ist in the source !i!elinewrite to the target.&ransformation &hread 6 (ne or ore &ransformation &hread 0or 'ach ;artition.3. What is )ession and Batches?)ession 6 A )ession Is A set of instructions that tells the Informatica )erer 2ow

    And When &o oe Data 0rom )ources &o &argets. After creating the session, wecan use either the serer manager or the command line !rogram !mcmd to startor sto! the session.Batches 6 It ;roides A Wa% to Crou! )essions 0or 'ither )erial (r ;arallel '4ecution B% &heInformatica )erer. &here Are &wo &%!es (f Batches1. )e9uential 6 8un )ession (ne after the (ther.$. 7oncurrent 6 8un )ession At &he )ame &ime.3. 2ow man% wa%s %ou can u!date a relational source definition and whatare the%?

    A. &wo wa%s1. 'dit the definition$. 8eim!ort the definition3. What is a transformation?

    A. It is a re!ositor% object that generates, modifies or !asses data.3. What are the designer tools for creating transformations?

    A. a!!ing designer&ransformation deelo!era!!let designer3. In how man% wa%s can %ou create !orts?

    A. &wo wa%s1. Drag the !ort from another transformation$. 7lick the add button on the !orts tab.3. What are reusable transformations?

    A. A transformation that can be reused is called a reusable transformation&he% can be created using two methods1. :sing transformation deelo!er

    $. 7reate normal one and !romote it to reusable3. Is aggregate cache in aggregator transformation?

    A. &he aggregator stores data in the aggregate cache until it com!letes aggregate calculations.When u run a session that uses an aggregator transformation, the Informatica serer createsinde4 and data caches in memor% to !rocess the transformation. If the Informatica serer re9uiresmore s!ace, it stores oerflow alues in cache files.3. What r the settings that u use to configure the joiner transformation?[ aster and detail source[ &%!e of join[ 7ondition of the join3. What are the join t%!es in joiner transformation?

    A.

  • 8/13/2019 Copy (2) of DWH File1

    28/81

    cache for each row that comes into the transformation. When the looku! condition is true, theInformatica serer does not u!date the cache while it !rocesses the looku! transformation.[ D%namic cache If %ou want to cache the target table and insert new rows into cache andthe target, %ou can create a look u! transformation to use d%namic cache. &he Informatica sererd%namicall% inserts data to the target table.[ ;ersistent cache ou can sae the looku! cache files and reuse them the ne4t time theInformatica serer !rocesses a looku! transformation configured to use the cache.[ 8ecache from database If the !ersistent cache is not s%nchroni=ed with the looku! table,%ou can configure the looku! transformation to rebuild the looku! cache.[ )hared cache ou can share the looku! cache between multi!le transactions. ou canshare unnamed cache between transformations in the same ma!!ing.3. What is &ransformation?

    A &ransformation is a re!ositor% object that generates, modifies, or !asses data.&ransformation !erforms s!ecific function. &he% are two t%!es of transformations1. Actie8ows, which are affected during the transformation or can change the no of rows that !assthrough it. 'g Aggregator, 0ilter, oiner,

  • 8/13/2019 Copy (2) of DWH File1

    29/81

    the informatica serer.3. &;') (0 DI'

  • 8/13/2019 Copy (2) of DWH File1

    30/81

  • 8/13/2019 Copy (2) of DWH File1

    31/81

    on !rimar% and foreign ke%s constraints.)et the o!tion is Double click the session7onfigure (bject P\ check the 7onstraint Based 5oading3. What is the status code in stored !rocedure transformation?)tatus code !roides error handling for the informatica serer during the session. &he stored!rocedure issues a status code that notifies whether or not stored !rocedure com!letedsuccessfull%. &his alue cannot see b% the user. It onl% used b% the informatica serer todetermine whether to continue running the session or sto!.3. Define Informatica 8e!ositor%?&he Informatica re!ositor% is a relational database that stores information, or metadata, used b%the Informatica )erer and 7lient tools. etadata can include information such as ma!!ingsdescribing how to transform source data, sessions indicating when %ou want the Informatica)erer to !erform the transformations, and connect strings for sources and targets.&he re!ositor% also stores administratie information such as usernames and !asswords,!ermissions and !riileges, and !roduct ersion.:se re!ositor% manager to create the re!ositor%. &he 8e!ositor% anager connects to there!ositor% database and runs the code needed to create the re!ositor% tables. &hese tablesstores metadata in s!ecific format the informatica serer, client tools use.3. What is a metadata?Designing a data mart inoles writing and storing a com!le4 set of instructions. ou need to

    know where to get data *sources+, how to change it, and where to write the information *targets+.;owerart and ;ower7enter call this set of instructions metadata. 'ach !iece of metadata *fore4am!le, the descri!tion of a source table in an o!erational database+ can contain commentsabout it.In summar%, etadata can include information such as ma!!ings describing how to transformsource data, sessions indicating when %ou want the Informatica )erer to !erform thetransformations, and connect strings for sources and targets.3. What is metadata re!orter?It is a web based a!!lication that enables %ou to run re!orts against re!ositor% metadata. With aeta data re!orter %ou can access information about %our re!ositor% without haing knowledge ofs9l, transformation language or underl%ing tables in the re!ositor%.

    3. What are the t%!es of metadata that stores in re!ositor%?)ource definitions. Definitions of database objects *tables, iews, s%non%ms+ or files that !roidesource data.&arget definitions. Definitions of database objects or files that contain the target data. ulti6dimensional metadata. &arget definitions that are configured as cubes and dimensions.a!!ings. A set of source and target definitions along with transformations containing businesslogic that %ou build into the transformation. &hese are the instructions that the Informatica )ereruses to transform and moe data.8eusable transformations. &ransformations that %ou can use in multi!le ma!!ings.a!!lets. A set of transformations that %ou can use in multi!le ma!!ings.)essions and workflows. )essions and workflows store information about how and when theInformatica )erer moes data. A workflow is a set of instructions that describes how and when torun tasks related to e4tracting, transforming, and loading data. A session is a t%!e of task that %ou

    can !ut in a workflow. 'ach session corres!onds to a single ma!!ing.0ollowing are the t%!es of metadata that stores in the re!ositor%[ Database 7onnections[ Clobal (bjects[ ultidimensional etadata[ 8eusable &ransformations[ )hort cuts[ &ransformations3. 2ow can we store !reious session logs?Co to )ession ;ro!erties P\ 7onfig (bject P\ 5og (!tions

  • 8/13/2019 Copy (2) of DWH File1

    32/81

    )elect the !ro!erties as follows].)ae session log b% P\ )ession8uns)ae session log for these runs P\ 7hange the number that %ou want to sae the number of logfiles *Default is @+If %ou want to sae all of the logfiles created b% eer% run, and then select the o!tion)ae session log for these runs P\ )ession &ime)tam!ou can find these !ro!erties in the sessionworkflow ;ro!erties.3. What is 7hanged Data 7a!ture?7hanged Data 7a!ture *7D7+ hel!s identif% the data in the source s%stem that has changedsince the last e4traction. With 7D7 data e4traction takes !lace at the same time the insert u!dateor delete o!erations occur in the source tables and the change data is stored inside the databasein change tables.&he change data thus ca!tured is then made aailable to the target s%stems in a controlledmanner.3. What is an indicator file? and how it can be used?Indicator file is used for 'ent Based )cheduling when %ou don"t know when the )ource Data isaailable. A shell command, scri!t or a batch file creates and send this indicator file to thedirector% local to the Informatica )erer. )erer waits for the indicator file to a!!ear beforerunning the session.3. What is audit table? and What are the columns in it?

    Audit &able is nothing but the table which contains about %our workflow names and sessionnames. It contains information about workflow and session status and their details.[ WN05O8:

  • 8/13/2019 Copy (2) of DWH File1

    33/81

    3. What are the Different methods of loading Dimension tables?7onentional 5oad 6 Before loading the data, all the &able constraints will be checked against thedata.Direct load *0aster 5oading+ 6 All the 7onstraints will be disabled. Data will be loaded directl%.5ater the data will be checked against the table constraints and the bad data won"t be inde4ed.

    3. What are the different t%!es of 7ommit interals?&he different commit interals are[ )ource6based commit. &he Informatica )erer commits data based on the number ofsource rows. &he commit !oint is the commit interal %ou configure in the session !ro!erties.[ &arget6based commit. &he Informatica )erer commits data based on the number of targetrows and the ke% constraints on the target table. &he commit !oint also de!ends on the bufferblock si=e and the commit interal.3. 2ow to add source flat file header into target file?'dit &ask66\a!!ing66\&arget66\2eader (!tions66\ (ut!ut field names3. 2ow to load name of the file into relation target?)ource Definition66\;ro!erties66\Add currentl% !rocessed file name !ort3. 2ow to return multi!le columns through un6connect looku!?)u!!ose %our look table has fOname,mOname,lOname and %ou are using unconnected looku!. Inoerride )35 of looku! use fOname^^_^^mOname^^_^^lOname %ou can easil% get this alue usingunconnected looku! in e4!ression. :se substring function in e4!ression transformation tose!arate these three columns and make then indiidual !ort for downstream transformation&arget.66666666666666666666666666666666666666666666666666666666666666666666666666666666666666666

    3. What is 0actless fact table? In which !ur!ose we are using this in our DW2 !rojects? ;l= gieme the !ro!er answer?It is a fact table which does not contain an% measurable data.'K )tudent attendance fact *it contains onl% Boolean alues, whether student attended class ornot ? es or

  • 8/13/2019 Copy (2) of DWH File1

    34/81

    eg in case of filter we hae the data likeid name de!t rowOnum1 aa 1$ bb - $- cc -and we hae a filter condition like de!tS then the o! wldb likeid name de!t rowOnum1 aa 1- cc $)o row num changed and it is an actie transformation$. or the order of the row changeseg when :nion transformation !ulls in data, su!!ose we hae$ sourcessources1id name de!t rowOnum1 aa 1

    $ bb - $- cc -source$id name de!t rowOnum aaa / bbb - /F ccc Fit neer restricts the data from an% source so the data cancome in an% mannerid name de!t rowOnum old rowOnum1 aa 1 1

    aaa $ / bbb - - /$ bb - $- cc / -F ccc F Fso the rowOnum are changing . &hus we sa% that union is an actie transformation3. What is use of batch file in informatica? 2ow man% t%!es of batch file in informatica?With the batch file, we can run sessions either in se9uential or in concurrentl%.Crou!ing of )essions is known as Batch.&wo t%!es of batches1+)e9uential 8uns )essions one after another.$+7oncurrent 8un the )essions at the same time.

    If u hae sessions with source6target de!endencies u hae to go for se9uential batch to start thesessions one after another. If u hae seeral inde!endent sessions u can use concurrent batchesWhich run all the sessions at the same time3. What is joiner cache?When we use the joiner transformation an integration serice maintains the cache, all the recordsare stored in joiner cache. oiner caches hae $ t%!es of cache 1.Inde4 cache $. oiner cache.Inde4 cache stores all the !ort alues which are !artici!ated in the join condition and data cache

  • 8/13/2019 Copy (2) of DWH File1

    35/81

    hae stored all !orts which are not !artici!ated in the join condition.3. What is the location of !arameter file in Informatica?G;BW;A8A3. 2ow can %ou dis!la% onl% hidden files in :

  • 8/13/2019 Copy (2) of DWH File1

    36/81

    for this t%!e of loading to control the e4traction of data from source b% source 9ualifier we usetarget load !lan.3. What is meant b% data drien.. in which scenario we use that..?Data drien is aailable at session leel. it sa%s that when we r using u!date strateg% tr ,how theintegration serice fetches the data and how to u!dateinsert row in the database log.Data drien is nothing but instruct the source rows that should take action on targeti.e*u!date,delete,reject,insert+. If we use the u!date strateg% transformation in a ma!!ing then willselect the data drien o!tion in session.3. 2ow to run workflow in uni4?)%nta4 !mcmd startworkflow 6s serice name\ 6d domain name\ 6u user name\ 6!!assword\ 6f folder name\ workflow name\'4am!le;mcmd start workflow PsericeGI

  • 8/13/2019 Copy (2) of DWH File1

    37/81

    As !er 8al!h Nimbal, first we need to build u! the Data arts then we need to build u! theDatawarehouse..this a!!roach is most useful in real time while creating the Data warehouse.

    3. What are the different caches used in informatica?[ )tatic cache[ D%namic cache[ )hared cache[ ;ersistent cache3. What is the command to get the list of files in a director% in uni4?Gls 6lrt3. 2ow to im!ort multi!le flat files in to single target where there is no common column in the flatfilesin workflow session !ro!erties in a!!ing tab in !ro!erties choose )ource filet%!e 6 IndirectCie the )ource filename fileO!ath\&his fileO!ath\ file should contain all the multi!le files which %ou want to 5oad

    3. 2ow to connect two or more table with single source 9ualifier?7reate a (racle source with how much eer column %ou want and write the join 9uer% in )359uer% oerride. But the column order and data t%!e should be same as in the )35 9uer%.3. 2ow to call unconnected looku! in e4!ression transformation?5N;.5N;O

  • 8/13/2019 Copy (2) of DWH File1

    38/81

    . 8ound robin/. Database3. Which transformation uses cache?1. 5ooku! transformation$. Aggregator transformation-. 8ank transformation. )orter transformation/. oiner transformation3. '4!lain about union transformation?

    A union transformation is a multi!le in!ut grou! transformation, which is used to merge the datafrom multi!le sources similar to :

  • 8/13/2019 Copy (2) of DWH File1

    39/81

    /. )%stem3. What is incremental aggregation?When the aggregator transformation e4ecutes all the out!ut data will get stored in the tem!orar%location called aggregator cache. When the ne4t time the ma!!ing runs the aggregatortransformation runs for the new records loaded after the first run. &hese out!ut alues will getincremented with the alues in the aggregator cache. &his is called incremental aggregation. B%this wa% we can im!roe !erformance...666666666666666666666666666Incremental aggregation means a!!l%ing onl% the ca!tured changes in the source to aggregatecalculations in a session. When the source changes onl% incrementall% and if we can ca!ture those changes, then wecan configure the session to !rocess onl% those changes. &his allows informatica serer tou!date target table incrementall%, rather than forcing it to !rocess the entire source andrecalculate the same calculations each time %ou run the session. B% doing this obiousl% thesession !erformance increases.3. 2ow can i e4!lain m% !roject architecture in interiew..? &ell me %our !roject flow from sourceto target..?

    ;roject architecture is like1. )ource )%stems 5ike ainframe,(racle,;eo!le soft,DB$.$. 5anding tables &hese are tables act like source. :sed for eas% to access, for backu! !ur!ose,as reusable for other ma!!ings.-. )taging tables 0rom landing tables we e4tract the data into staging tables after all alidationsdone on the data.. Dimension0acts &hese are the tables those are used for anal%sis and make decisions b%anal%=ing the data./. Aggregation tables &hese tables hae summari=ed data useful for managers who wants toiew monthl% wise sales, %ear wise sales etc.F. 8e!orting la%er and / !hases are useful for re!orting deelo!ers to generate re!orts. I ho!ethis answer hel!s %ou.

    3. What t%!e of transformation is not su!!orted b% ma!!lets?[

  • 8/13/2019 Copy (2) of DWH File1

    40/81

    3. What is ;re )9l and ;ost )9l?;re )35 means that the integration serice runs )35 commands against the source databasebefore it reads the data from source.;ost )35 means integration serice runs )35 commands against target database after it writesto the target.

    3. Insert else u!date o!tion in which situation we will use?if the source table contain multi!le records .if the record s!ecified in the associated !ort to insertinto looku! cache. it does not find a record in the looku! cache when it is used find the !articularrecord U change the data in the associated !ort.6666666666666666666666We set this !ro!ert% when the looku! &80 uses d%namic cache and the session !ro!ert%&8'A& )(:87' 8(W) A) XInsertX has been set.66666666666666666666

    &his o!tion we use when we want to maintain the histor%.If records are not aailable in target table then it inserts the records in to target and records areaailable in target table then it u!dates the records.3. What is an incremental loading? in which situations we will use incremental loading?Incremental 5oading is an a!!roach. 5et su!!ose %ou a ma!!ing for load the data from em!lo%eetable to a em!lo%eeOtarget table on the hire date basis. Again let su!!ose %ou alread% moe theem!lo%ee data from source to target u! to the em!lo%ees hire date -161$6$@@M.our organi=ationnow want to load data on em!lo%eeOtarget toda%. our target alread% hae the data of thatem!lo%ees haing hire date u! to -161$6$@@M.so %ou now !icku! the source data which are hiringfrom 1616$@1@ to till date. &hats wh% %ou neednt take the data before than that date, if %ou do

    that wrongl% it is oerhead for loading data again in target which is alread% e4ists. )o in source9ualifier %ou filter the records as !er hire date and %ou can also !arameteri=ed the hire date thathel! from which date %ou want to load data u!on target.&his is the conce!t of Incremental loading.3. What is target u!date oerride?B% Default the integration serice u!dates the target based on ke% columns. But we might want tou!date non6ke% columns also, at that !oint of time we can oerride the:;DA&' statement for each target in the ma!!ing. &he target oerride affects onl% when thesource rows are marked as u!date b% an u!date strateg% in the ma!!ing.

    3. What is the a!!ing !arameter and a!!ing ariable?

    a!!ing !arameter a!!ing !arameter is constant alues that can be defined before ma!!ingrun. A ma!!ing !arameter reuses the ma!!ing for arious constant alues.

    a!!ing ariable a!!ing ariable is re!resent a alue that can be change during the ma!!ingrun that can be stored in re!ositor% the integration serice retriee that alue from re!ositor% andincremental alue for ne4t run.3. What is rank and dense rank in informatica with an% e4am!les and gie s9l 9uer% for this bothranksfor eg the file contains the records with column

  • 8/13/2019 Copy (2) of DWH File1

    41/81

    1@@$@@*re!eated rows+$@@-@@@@/@@the rank function gies out!ut as1$$/Fand dense rank gies1$$-/

    for eg the file contains the records with columnem!no sal1@@ 1@@@$@@*re!eated rows+ $@@@$@@ -@@@-@@ @@@@@ /@@@/@@ F@@@

    8ank select rank*+ oer *!artition b% em!no order b% sal+ from em!

    1$$/FDense 8ankselect denseOrank*+ oer *!artition b% em!no order b% sal+ from em!and dense rank gies1

    $$-/3. What is the incremental aggregation?&he first time %ou run an u!graded session using incremental aggregation, the Integration )ericeu!grades the inde4 and data cache files. If %ou want to !artition a session using a ma!!ing withincremental aggregation, the Integration )erice realigns the inde4 and data cache files.

  • 8/13/2019 Copy (2) of DWH File1

    42/81

    3. What is session !arameter?;arameter file is a te4t file where we can define the alues to the !arameters .session !arametersare used for assign the database connection alues3. What is ma!!ing !arameter?

    A ma!!ing !arameter re!resents a constant alue that can be defined before ma!!ing run. Ama!!ing !arameter defines a !arameter file which is saed with an e4tension.!rm a ma!!ing!arameter reuse the arious constant alues.3. What is !arameter file?

    A !arameter file can be a te4t file. ;arameter file is to define the alues for !arameters andariables used in a session. A !arameter file is a file created b% te4t editor such as word !ad ornote!ad. ou can define the following alues in !arameter file[ a!!ing !arameters[ a!!ing ariables[ )ession !arameters3. What is session oerride?)ession oerride is an o!tion in informatica at session leel. 2ere we can manuall% gie a s9l

    9uer% which is issued to the database when the session runs. It is nothing but oer riding thedefault s9l which is generated b% a !articular transformation at ma!!ing leel.3. What are the diff. bw informatica ersions E.1.1 and E.F.1?5ittle change in the Administrator 7onsole. In E.1.1 we can do all the creation of I) and re!ositor%)erice, web serice, Domain, node, grid * if we hae licensed ersion+,In E.F.1 the Informatica

    Admin console we can manage both Domain !age and securit% !age. Domain ;age means allthe aboe like creation of I) and re!ositor% )erice, web serice, Domain, node, grid * if we haelicensed ersion+ etc. )ecurit% !age means creation of users, !riileges, 5DA; configuration,'4!ort Im!ort user and ;riileges etc.3. What are the uses of a ;arameter file?;arameter file is one which contains the alues of ma!!ing ariables.t%!e this in note!ad.sae it .

    foldername.sessionnameGGin!utalue1S666666666666666666666666666666666;arameter files are created with an e4tension of .;8&hese are created to !ass alues those can be changed for a!!ing ;arameter and )ession;arameter during ma!!ing run.

    a!!ing ;arametersA ;arameter is defined in a !arameter file for which a ;arameter is create alread% in the a!!ingwith Data &%!e , ;recision and scale.&he a!!ing !arameter file s%nta4 *4444.!rm+.

    Q0older

  • 8/13/2019 Copy (2) of DWH File1

    43/81

    GIn!ut0ile>alue1S;ath of the source 0lat file

    After that we hae to select the !ro!erties &ab of )ession and )et ;arameter file name including!h%sical !ath of this %%%%.!rm file.Do following changes in a!!ing &ab of )ource 3ualifiers;ro!erties section Attributes alues

    )ource file &%!e 666666666\ Direct)ource 0ile Director% 66666666\ 'm!t%)ource 0ile alue13. What is the default data drien o!eration in informatica?&his is default o!tion for u!date strateg% transformation.&he integration serice follows instructions coded in u!date strateg% within session ma!!ingdetermine how to flag records for insert,delete,u!date,reject. If %ou do not data drien o!tionsetting, the integration serice ignores u!date strateg% transformations in the ma!!ing.3. What is threshold error in informatica?When the target is used b% the u!date strateg% DDO8''7&,DDO:;DA&' and some limited

    count, then if it the number of rejected records e4ceed the count then thesession ends with failed status. &his error is called &hreshold 'rror.

    3. )( man% times i saw XG; !arser error X. What is meant b% ;?; ;(W'8 A8&1+ ;arsing error will come for the in!ut !arameter to the looku!.$+ Informatica is not able to resole the in!ut !arameter 75A)) for %our looku!.-+ 7heck the ;ort 75A)) e4ists as either in!ut !ort or a ariable !ort in %our e4!ression.+ 7heck data t%!e of 75A)) and the data t%!e of in!ut !arameter for %our looku!.3. What is a candidate ke%?

    A candidate ke% is a combination of attributes that can be uni9uel% used to identif% a database

    record without an% e4traneous data *uni9ue+. 'ach table ma% hae one or more candidate ke%s.(ne of these candidate ke%s is selected as the table !rimar% ke% else are called Alternate Ne%.3. What is the difference between Bitma! and Btree inde4?Bitma! inde4 is used for re!eating alues.e4 Cender malefemale Account statusActieInactieBtree inde4 is used for uni9ue alues.e4 em!id.3. What is &hrough;ut in Informatica?&hought!ut is the rate at which !ower centre serer read the rows in b%tes from source or writesthe rows in b%tes into the target !er second.

    ou can find this o!tion in workflow monitor. 8ight click on session choose !ro!erties and)ource&arget )tatictics tab %ou can find thought!ut details for each instance of source andtarget.3. What are set o!erators in (racle:

  • 8/13/2019 Copy (2) of DWH File1

    44/81

    3. 2ow i can )chedule the Informatica job in X:ni4 7ron scheduling toolX?7rontab&he crontab *cron deries from chronos, Creek for timeJ tab stands for table+ command, found in:ni4 and :ni46like o!erating s%stems, is used to schedule commands to be e4ecuted !eriodicall%.&o see what crontabs are currentl% running on %our s%stem, %ou can o!en a terminal and runsudo crontab 6l&o edit the list of cronjobs %ou can runsudo crontab 6e&his will o!en a the default editor *could be i or !ico, if %ou want %ou can change the defaulteditor+ to let us mani!ulate the crontab. If %ou sae and e4it the editor, all %our cronjobs are saedinto crontab. 7ronjobs are written in the following format bine4ecutethisscri!t.sh)cheduling e4!lained

    As %ou can see there are / stars. &he stars re!resent different date !arts in the following order1. minute *from @ to /M+$. hour *from @ to $-+-. da% of month *from 1 to -1+. month *from 1 to 1$+/. da% of week *from @ to F+ *@S)unda%+

    '4ecute eer% minuteIf %ou leae the star, or asterisk, it means eer%. a%bethats a bit unclear. 5ets use the the !reious e4am!leagain bine4ecutethisscri!t.sh&he% are all still asterisks )o this meanse4ecute bine4ecutethisscri!t.sh1. eer% minute$. of eer% hour-. of eer% da% of the month. of eer% month/. and eer% da% in the week.In short &his scri!t is being e4ecuted eer% minute.

    Without e4ce!tion.'4ecute eer% 0rida% 1A)o if we want to schedule the scri!t to run at 1A eer%0rida%, we would need the following cronjob@ 1 / bine4ecutethisscri!t.shCet it? &he scri!t is now being e4ecuted when the s%stemclock hits1. minute @$. of hour 1-. of da% of month *eer% da% of month+. of month *eer% month+/. and weekda% / *S0rida%+'4ecute on weekda%s 1A

    )o if we want to schedule the scri!t to run at 1A eer% 0rida%, we would need the followingcronjob@ 1 16/ bine4ecutethisscri!t.shCet it? &he scri!t is now being e4ecuted when the s%stemclock hits1. minute @$. of hour 1-. of da% of month *eer% da% of month+. of month *eer% month+/. and weekda% 16/ *Sonda% til 0rida%+

  • 8/13/2019 Copy (2) of DWH File1

    45/81

    '4ecute 1@ !ast after eer% hour on the 1st of eer% month2eres another one, just for !racticing1@ 1 bine4ecutethisscri!t.sh0air enough, it takes some getting used to, but it offers great fle4ibilit%.

    3. 7an an%one tell me the difference between !ersistence and d%namic caches? (n whichconditions we are using these caches?D%namic661+When %ou use a d%namic cache, the Informatica )erer u!dates the looku! cache as it !assesrows to the target.$+In D%namic, we can u!date catch will

  • 8/13/2019 Copy (2) of DWH File1

    46/81

    )tored ;rocedure &ransformationK5 )ource 3ualifier &ransformation'4ternal ;rocedure &ransformation3. 'liminating of du!licate records without using d%namic looku!s?2i : can eliminate du!licate records b% an sim!le one line )35 3uer%.)elect id, count *+ from se91 grou! b% id haing count *+\1JBelow are the wa%s to eliminate the du!licate records1. B% enabling the o!tion in )ource 3ualifier transformation as select distinct.$. B% enabling the o!tion in sorter transformation as select distinct.-. B% enabling all the alues as grou! b% in Aggregator transformation.3. 7an an%one gie idea on how do we !erform test load in informatica? What do we test as !artof test load in informatica?With a test load, the Informatica )erer reads and transforms data without writing to targets. &heInformatica )erer does eer%thing, as if running the full session. &he Informatica )erer writesdata to relational targets, but rolls back the data when the session com!letes. )o, %ou can enablecollect !erformance details !ro!ert% and anal%=e the how efficient %our ma!!ing is. If the sessionis running for a long time, %ou ma% like to find out the bottlenecks that are e4isting. It ma% bebottleneck of t%!e target, source, ma!!ing etc.

    &he basic idea behind test load is to see the behaior of Informatica )erer with %our session.3. What is (D) *(!erational Data )tore+?

    A collection of o!eration or bases data that is e4tracted from o!eration databases andstandardi=ed, cleansed, consolidated, transformed, and loaded into enter!rise data architecture.An (D) is used to su!!ort data mining of o!erational data, or as the store for base data that issummari=ed for a data warehouse.&he (D) ma% also be used to audit the data warehouse to assure summari=ed and deried datais calculated !ro!erl%. &he (D) ma% further become the enter!rise shared o!erational database,allowing o!erational s%stems that are being reengineered to use the (D) as there o!erationdatabases.3. 2ow man% tasks are there in informatica?[ )ession &ask

    [ 'mail &ask[ 7ommand &ask[ Assignment &ask[ 7ontrol &ask[ Decision &ask[ 'ent68aise[ 'ent6 Wait[ &imer &ask[ 5ink &ask

    3. What are business com!onents in Informatica?

    [ Domains[ '8)I(

  • 8/13/2019 Copy (2) of DWH File1

    47/81

    ma!!ingworkflow.-. >ersion histor% It will show %ou all the changes made and who made it.3. Diff between GGGsessstarttime and sessstarttime?GGG)ess)tart&ime 6 8eturns session start time as a string alue *)tring datat%!e+)')))&A8&&I' 6 8eturns the date along with date timestam! *Date datat%!e+3. Difference between G,GG,GGG in Informatica?1. G 8efers&hese are the s%stem ariables)ession ;arameters like GBad file,Gin!utfile, Gout!ut file, GDB connection,Gsource,Gtarget etc..$. GG 8efers:ser defined ariablesa!!ing ;arameters like GG)tate,GG&ime, GG'ntit%, GGBusinessODate, GG)87,etc.-. GGG 8efers)%stem ;arameters like GGG)ess)tart&imeGGG)ess)tart&ime returns the session start time as a string alue. &he format of the stringde!ends on the database %ou are using.GGG)ess)tart&ime returns the session start time as a string alue 66\ &he format of the stringde!ends on the database %ou are using.3. 0inding Du!licate 8ows based on ulti!le 7olumns?

    )'5'7& firstname, 7(:

  • 8/13/2019 Copy (2) of DWH File1

    48/81

    An inline iew is term gien to sub 9uer% in 08( clause of 9uer% which can be used as table.Inline iew effectiel% is a named sub 9uer%'4 )elect &ab1.col1,&ab1.col.$,Iniew.col1,Iniew.7ol$0rom &ab1, *)elect statement+ IniewWhere &ab1.col1SIniew.col1

    )'5'7& D

  • 8/13/2019 Copy (2) of DWH File1

    49/81

    3. W2A& I) DI00'8'

  • 8/13/2019 Copy (2) of DWH File1

    50/81

    D'5'&' from tableOname where rowid not in *select ma4*rowid+ from m%Otable grou! b%ke%Oalue +3. 3:'8 &( 8'&8I'>'

  • 8/13/2019 Copy (2) of DWH File1

    51/81

    3. 2ow do send the session re!ort *.t4t+ to manager after session is com!leted?'mail ariable P a *attach the file+ g P attach session log file3. 2ow to check all the running !rocesses in :

  • 8/13/2019 Copy (2) of DWH File1

    52/81

    '4am!le 6 gre! com!u details.t4tDis!la% the whole line,in which line com!u string is found.

    0ind is used to find the file or director% in gien !ath,)%nta4 6 find filename\'4am!le 6 find com!uDis!la% all file names starting with com!uter

    3. W2A& A8' &2' DI00'8'';(I

  • 8/13/2019 Copy (2) of DWH File1

    53/81

    3. What is &rigger?A trigger is a )35 !rocedure that initiates an action when an eent *Iiew?

    A sim!le iew can be thought of as a subset of a table. It can be used for retrieing data, as wellas u!dating or deleting rows. 8ows u!dated or deleted in the iew are u!dated or deleted in thetable the iew was created with. It should also be noted that as data in the original table changes,so does data in the iew, as iews are the wa% to look at !art of the original table. &he results ofusing a iew are not !ermanentl% stored in the database. &he data accessed through a iew is

    actuall% constructed using standard &6)35 select command and can come from one to man%different base tables or een other iews.3. What is Inde4?

    An inde4 is a !h%sical structure containing !ointers to the data. Indices are created in an e4istingtable to locate rows more 9uickl% and efficientl%. It is !ossible to create an inde4 on one or morecolumns of a table, and each inde4 is gien a name. &he users cannot see the inde4esJ the% are

    just used to s!eed u! 9ueries. 'ffectie inde4es are one of the best wa%s to im!roe !erformancein a database a!!lication. A table scan ha!!ens when there is no inde4 aailable to hel! a 9uer%.In a table scan )35 )erer e4amines eer% row in the table to satisf% the 9uer% results. &ablescans are sometimes unaoidable, but on large tables, scans hae a terrific im!act on!erformance. 7lustered inde4es define the !h%sical sorting of a database table"s rows in thestorage media. 0or this reason, each database table ma%hae onl% one clustered inde4. IIII

  • 8/13/2019 Copy (2) of DWH File1

    54/81

    -. 2aing 7lause is basicall% used onl% with the C8(:; B function in a 9uer%. W2'8' 7lauseis a!!lied to each row before the% are !art of the C8(:; B function in a 9uer%.

    8A

  • 8/13/2019 Copy (2) of DWH File1

    55/81

    )tores data for the columns used in the looku! condition.$. 5ooku! Data 7ache[ 0or a connected 5ooku! transformation, stores data for the connected out!ut !orts, notincluding !orts used in the looku! condition.[ 0or an unconnected 5ooku! transformation, stores data from the return !ort.

    (5&; and (5A;5ogical Data odeling >s ;h%sical Data odeling

    8outer &ransformation And 0ilter &ransformation)ource 3ualifier And 5ooku! &ransformationa!!ing And a!!letoiner &ransformation And 5ooku! &ransformationDimension &able and 0act &able

    7onnected 5ooku! and :nconnected 5ooku!7onnected 5ooku!:nconnected 5ooku!8eceies in!ut alues directl% from the !i!eline.

    8eceies in!ut alues from the result of a 5N; e4!ression in another transformation.We can use a d%namic or static cache.We can use a static cache.7ache includes all looku! columns used in the ma!!ing.7ache includes all looku!out!ut !orts in the looku! condition and the looku!return !ort.If there is no match for the looku! condition, the ;ower 7enter )erer returns the default alue forall out!ut !orts.If there is no match for the looku! condition, the ;ower 7enter )erer returns iew)tar )chema And )now 0lake )chema

    Informatica 6 &ransformations

  • 8/13/2019 Copy (2) of DWH File1

    56/81

    In Informatica, &ransformations hel! to transform the source data according to the re9uirementsof target s%stem and it ensures the 9ualit% of the data being loaded into target.&ransformations are of two t%!es Actie and ;assie.

    Actie &ransformationAn actie transformation can change the number of rows that !ass through it from source to

    target. *i.e+ It eliminates rows that do not meet the condition in transformation.

    ;assie &ransformationA !assie transformation does not change the number of rows that !ass through it *i.e+ It !assesall rows through the transformation.

    &ransformations can be 7onnected or :nconnected.

    7onnected &ransformation7onnected transformation is connected to other transformations or directl% to target table in thema!!ing.

    :nconnected &ransformation

    An unconnected transformation is not connected to other transformations in the ma!!ing. It iscalled within another transformation, and returns a alue to that transformation.

    0ollowing are the list of &ransformations aailable in InformaticaAggregator &ransformation'4!ression &ransformation0ilter &ransformationoiner &ransformation5ooku! &ransformation

  • 8/13/2019 Copy (2) of DWH File1

    57/81

    '4!ression transformation is a ;assie and 7onnected transformation.

    &his can be used to calculate alues in a single row before writing to the target.

    0or e4am!le, to calculate discount of each !roduct

    or to concatenate first and last names

    or to conert date to a string field.SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS0ilter &ransformation

    0ilter transformation is an Actie and 7onnected transformation.

    &his can be used to filter rows in a ma!!ing that do not meet the condition.

    0or e4am!le,

    &o know all the em!lo%ees who are working in De!artment 1@ or

    &o find out the !roducts that falls between the rate categor% G/@@ and G1@@@.SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSoiner &ransformation

    oiner &ransformation is an Actie and 7onnected transformation. &his can be used to join twosources coming from two different locations or from same location. 0or e4am!le, to join a flat fileand a relational source or to join two flat files or to join a relational source and a K5 source.

    In order to join two sources, there must be at least one matching !ort. While joining two sources itis a must to s!ecif% one source as master and the other as detail.

    &he oiner transformation su!!orts the following t%!es of joins

    1+

  • 8/13/2019 Copy (2) of DWH File1

    58/81

    0ull outer join kee!s all rows of data from both the master and detail sources.SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS5ooku! transformation5ooku! transformation is ;assie and it can be both 7onnected and :n7onnected as well. It is

    used to look u! data in a relational table, iew, or s%non%m. 5ooku! definition can be im!ortedeither from source or from target tables.

    0or e4am!le, if we want to retriee all the sales of a !roduct with an ID 1@ and assume that thesales data resides in another table. 2ere instead of using the sales table as one more source,use 5ooku! transformation to looku! the data for the !roduct, with ID 1@ in sales table.

    7onnected looku! receies in!ut alues directl% from ma!!ing !i!eline whereas

    :nconnected looku! receies alues from 5N; e4!ression from another transformation.

    7onnected looku! returns multi!le columns from the same row whereas

    :nconnected looku! has one return !ort and returns one column from each row.

    7onnected looku! su!!orts user6defined default alues whereas

    :nconnected looku! does not su!!ort user defined alues.SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS

  • 8/13/2019 Copy (2) of DWH File1

    59/81

    8outer &ransformation

    8outer is an Actie and 7onnected transformation. It is similar to filter transformation.

    &he onl% difference is, filter transformation dro!s the data that do not meet the condition whereasrouter has an o!tion to ca!ture the data that do not meet the condition. It is useful to test multi!le

    conditions.

    It has in!ut, out!ut and default grou!s.

    0or e4am!le, if we want to filter data like where )tateSichigan, )tateS7alifornia, )tateS

  • 8/13/2019 Copy (2) of DWH File1

    60/81

    )tored ;rocedure transformation is a ;assie and 7onnected U :nconnected transformation. It isuseful to automate time6consuming tasks and it is also used in error handling, to dro! andrecreate inde4es and to determine the s!ace in database, a s!eciali=ed calculation etc.

    &he stored !rocedure must e4ist in the database before creating a )tored ;rocedure

    transformation, and the stored !rocedure can e4ist in a source, target, or an% database with aalid connection to the Informatica )erer. )tored ;rocedure is an e4ecutable scri!t with )35statements and control statements, user6defined ariables and conditional statements.SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS:!date )trateg% &ransformation

    :!date strateg% transformation is an Actie and 7onnected transformation.

    It is used to u!date data in target table, either to maintain histor% of data or recent changes.

    ou can s!ecif% how to treat source rows in table, insert, u!date, delete or data drien.SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS

    SSSSSK5 )ource 3ualifier &ransformation

    K5 )ource 3ualifier is a ;assie and 7onnected transformation.

    K5 )ource 3ualifier is used onl% with an K5 source definition.

    It re!resents the data elements that the Informatica )erer reads when it e4ecutes a session withK5 sources.SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS

    7onstraint6Based 5oadingIn the Workflow anager, %ou can s!ecif% constraint6based loading for a session. When %ouselect this o!tion, the Integration )erice orders the target load on a row6b%6row basis. 0or eer%row generated b% an actie source, the Integration )erice loads the corres!onding transformedrow first to the !rimar% ke% table, then to an% foreign ke% tables. 7onstraint6based loadingde!ends on the following re9uirements

    Actie source 8elated target tables must hae the same actie source.Ne% relationshi!s &arget tables must hae ke% relationshi!s.&arget connection grou!s &argets must be in one target connection grou!.&reat rows as insert. :se this o!tion when %ou insert into the target. ou cannot use u!dates withconstraint based loading.

    Actie )ourceWhen target tables receie rows from different actie sources, the Integration )erice reerts tonormal loading for those tables, but loads all other targets in the session using constraint6basedloading when !ossible. 0or e4am!le, a ma!!ing contains three distinct !i!elines. &he first twocontain a source, source 9ualifier, and target. )ince these two targets receie data from differentactie sources, the Integration )erice reerts to normal loading for both targets. &he third!i!eline contains a source,

  • 8/13/2019 Copy (2) of DWH File1

    61/81

    )imilarl%, when target tables hae circular ke% relationshi!s, the Integration )erice reerts to anormal load. 0or e4am!le, %ou hae one target containing a !rimar% ke% and a foreign ke% relatedto the !rimar% ke% in a second target. &he second target also contains a foreign ke% thatreferences the !rimar% ke% in the first target. &he Integration )erice cannot enforce constraint6based loading for these tables. It reerts to a normal load.&arget 7onnection Crou!s&he Integration )erice enforces constraint6based loading for targets in the same targetconnection grou!. If %ou want to s!ecif% constraint6based loading for multi!le targets that receiedata from the same actie source, %ou must erif% the tables are in the same target connectiongrou!. If the tables with the !rimar% ke%6foreign ke% relationshi! are in different target connectiongrou!s, the Integration )erice cannot enforce constraint6based loading when %ou run theworkflow. &o erif% that all targets are in the same target connection grou!, com!lete the followingtasks[ >erif% all targets are in the same target load order grou! and receie data from the sameactie source.[ :se the default !artition !ro!erties and do not add !artitions or !artition !oints.[ Define the same target t%!e for all targets in the session !ro!erties.[ Define the same database connection name for all targets in the session !ro!erties.[ 7hoose normal mode for the target load t%!e for all targets in the session !ro!erties.&reat 8ows as Insert

    :se constraint6based loading when the session o!tion &reat )ource 8ows As is set to insert. oumight get inconsistent data if %ou select a different &reat )ource 8ows As o!tion and %ouconfigure the session for constraint6based loading.When the ma!!ing contains :!date )trateg% transformations and %ou need to load data to a!rimar% ke% table first, s!lit the ma!!ing using one of the following o!tions[ 5oad !rimar% ke% table in one ma!!ing and de!endent tables in another ma!!ing. :seconstraint6based loading to load the !rimar% table.[ ;erform inserts in one ma!!ing and u!dates in another ma!!ing.7onstraint6based loading does not affect the target load ordering of the ma!!ing. &arget loadordering defines the order the Integration )erice reads the sources in each target load ordergrou! in the ma!!ing. A target load order grou! is a collection of source 9ualifiers,transformations, and targets linked together in a ma!!ing. 7onstraint based loading establishesthe order in which the Integration )erice loads indiidual targets within a set of targets receiing

    data from a single source 9ualifier.

    '4am!le&he following ma!!ing is configured to !erform constraint6based loadingIn the first !i!eline, target &O1 has a !rimar% ke%, &O$ and &O- contain foreign ke%s referencingthe &1 !rimar% ke%. &O- has a !rimar% ke% that &O references as a foreign ke%.)ince these tables receie records from a single actie source, )3OA, the Integration )ericeloads rows to the target in the following order1. &O1$. &O$ and &O- *in no !articular order+-. &O&he Integration )erice loads &O1 first because it has no foreign ke% de!endencies and containsa !rimar% ke% referenced b% &O$ and &O-. &he Integration )erice then loads &O$ and &O-, but

    since &O$ and &O- hae no de!endencies, the% are not loaded in an% !articular order. &heIntegration )erice loads &O last, because it has a foreign ke% that references a !rimar% ke% in&O-.After loading the first set of targets, the Integration )erice begins reading source B. If thereare no ke% relationshi!s between &O/ and &OF, the Integration )erice reerts to a normal loadfor both targets.If &OF has a foreign ke% that references a !rimar% ke% in &O/, since &O/ and &OF receie datafrom a single actie source, the Aggregator ACC&8A

  • 8/13/2019 Copy (2) of DWH File1

    62/81

    &O1, &O$, &O-, and &O are in one target connection grou! if %ou use the same databaseconnection for each target, and %ou use the default !artition !ro!erties. &O/ and &OF are inanother target connection grou! together if %ou use the same database connection for eachtarget and %ou use the default !artition !ro!erties. &he Integration )erice includes &O/ and &OFin a different target connection grou! because the% are in a different target load order grou! fromthe first four targets.'nabling 7onstraint6Based 5oadingWhen %ou enable constraint6based loading, the Integration )erice orders the target load on arow6b%6row basis. &o enable constraint6based loading1. In the Ceneral (!tions settings of the ;ro!erties tab, choose Insert for the &reat )ource 8ows

    As !ro!ert%.$. 7lick the 7onfig (bject tab. In the Adanced settings, select 7onstraint Based 5oad (rdering.-. 7lick (N.

    &arget 5oad ;lanWhen %ou use a ma!!let in a ma!!ing, the a!!ing Designer lets %ou set the target load !lan forsources within the ma!!let.)etting the &arget 5oad (rderou can configure the target load order for a ma!!ing containing an% t%!e of target definition. Inthe Designer, %ou can set the order in which the Integration )erice sends rows to targets indifferent target load order grou!s in a ma!!ing. A target load order grou! is the collection ofsource 9ualifiers, transformations, and targets linked together in a ma!!ing. ou can set thetarget load order if %ou want to maintain referential integrit% when inserting, deleting, or u!datingtables that hae the !rimar% ke% and foreign ke% constraints.&he Integration )erice reads sources in a target load order grou! concurrentl%, and it !rocessestarget load order grou!s se9uentiall%.&o s!ecif% the order in which the Integration )erice sends data to targets, create one source9ualifier for each target within a ma!!ing. &o set the target load order, %ou then determine inwhich order the Integration )erice reads each source in the ma!!ing.&he following figure shows two target load order grou!s in one ma!!ingIn this ma!!ing, the first target load order grou! includes I&'), )3OI&'), and &OI&'). &hesecond target load order grou! includes all other objects in the ma!!ing, including the&(&A5O(8D'8) target. &he Integration )erice !rocesses the first target load order grou!, and

    then the second target load order grou!.When it !rocesses the second target load order grou!, it reads data from both sources at thesame time.&o set the target load order[ 7reate a ma!!ing that contains multi!le target load order grou!s.[ 7lick a!!ings \ &arget 5oad ;lan.[ &he &arget 5oad ;lan dialog bo4 lists all )ource 3ualifier transformations in the ma!!ingand the targets that receie data from each source 9ualifier.[ )elect a source 9ualifier from the list.[ 7lick the :! and Down buttons to moe the source 9ualifier within the load order.[ 8e!eat ste!s - to for other source 9ualifiers %ou want to reorder. 7lick (N.

    a!!ing ;arameters U >ariables

    a!!ing !arameters and ariables re!resent alues in ma!!ings and ma!!lets.When we use a ma!!ing !arameter or ariable in a ma!!ing, first we declare the ma!!ing!arameter or ariable for use in each ma!!let or ma!!ing. &hen, we define a alue for thema!!ing !arameter or ariable before we run the session.a!!ing ;arameters

    A ma!!ing !arameter re!resents a constant alue that we can define before running a session.A ma!!ing !arameter retains the same alue throughout the entire session.'4am!le When we want to e4tract records of a !articular month during '&5 !rocess, we willcreate a a!!ing ;arameter of data t%!e and use it in 9uer% to com!are it with the timestam!

  • 8/13/2019 Copy (2) of DWH File1

    63/81

    field in )35 oerride.After we create a !arameter, it a!!ears in the '4!ression 'ditor.We can then use the !arameter in an% e4!ression in the ma!!let or ma!!ing.We can also use !arameters in a source 9ualifier filter, user6defined join, or e4tract oerride, andin the '4!ression 'ditor of reusable transformations.a!!ing >ariables:nlike ma!!ing !arameters, ma!!ing ariables are alues that can change between sessions.[ &he Integration )erice saes the latest alue of a ma!!ing ariable to the re!ositor% at theend of each successful session.[ We can oerride a saed alue with the !arameter file.[ We can also clear all saed alues for the session in the Workflow anager.We might use a ma!!ing ariable to !erform an incremental read of the source. 0or e4am!le, wehae a source table containing time stam!ed transactions and we want to ealuate thetransactions on a dail% basis. Instead of manuall% entering a session oerride to filter source dataeach time we run the session, we can create a ma!!ing ariable, GGIncludeDate&ime. In thesource 9ualifier, create a filter to read onl% rows whose transaction date e9uals GGIncludeDate&ime, such as&I')&A; S GGIncludeDate&imeIn the ma!!ing, use a ariable function to set the ariable alue to increment one da% each timethe session runs. If we set the initial alue of GGIncludeDate&ime to E1$@@, the first time the

    Integration )erice runs the session, it reads onl% rows dated E1$@@. During the session, theIntegration )erice sets GGIncludeDate&ime to E$$@@. It saes E$$@@ to the re!ositor% at theend of the session. &he ne4t time it runs the session, it reads onl% rows from August $, $@@.:sed in following transformations'4!ression0ilter8outer:!date )trateg%Initial and Default >alueWhen we declare a ma!!ing !arameter or ariable in a ma!!ing or a ma!!let, we can enter aninitial alue. When the Integration )erice needs an initial alue, and we did not declare an initialalue for the !arameter or ariable, the Integration )erice uses a default alue based on thedata t%!e of the !arameter or ariable.

    Data 6\Default >alue

  • 8/13/2019 Copy (2) of DWH File1

    64/81

    7ount Integer and small integer data t%!es are alid onl%.a4 All transformation data t%!es e4ce!t binar% data t%!e are alid.in All transformation data t%!es e4ce!t binar% data t%!e are alid.>ariable 0unctions>ariable functions determine how the Integration )erice calculates the current alue of ama!!ing ariable in a !i!eline.)eta4>ariable )ets the ariable to the ma4imum alue of a grou! of alues. It ignores rowsmarked for u!date, delete, or reject. Aggregation t%!e set to a4.)etin>ariable )ets the ariable to the minimum alue of a grou! of alues. It ignores rowsmarked for u!date, delete, or reject. Aggregation t%!e set to in.)et7ount>ariable Increments the ariable alue b% one. It adds one to the ariable alue when arow is marked for insertion, and subtracts one when the row is arked for deletion. It ignoresrows marked for u!date or reject. Aggregation t%!e set to 7ount.)et>ariable )ets the ariable to the configured alue. At the end of a session, it com!ares thefinal current alue of the ariable to the start alue of the ariable. Based on the aggregate t%!e ofthe ariable, it saes a final alue to the re!ositor%.7reating a!!ing ;arameters and >ariables[ (!en the folder where we want to create !arameter or ariable.[ In the a!!ing Designer, click a!!ings \ ;arameters and >ariables. 6or6 In the a!!letDesigner, click a!!let \ ;arameters and >ariables.

    [ 7lick the add button.[ 'nter name. Do not remoe GG from name.[ )elect &%!e and Data t%!e. )elect Aggregation t%!e for ma!!ing ariables.[ Cie Initial >alue. 7lick ok.[ '4am!le :se of a!!ing of a!!ing ;arameters and >ariables[ '; will be source table.7reate a target table ;O>O'KA;5' haing columns ';A8.&(&A5O)A5 S )A5T 7( T GGB(A8.1-. (!en e4!ression editor for &(&A5O)A5. Do the same as we did earlier for )A5T 7(. &oadd GGB(A8IAB5' from left side !ane. )electGGarOma4 from ariable tab and )A5 from !orts tab as shown below. )'&AK>A8IAB5'*GGarOma4,)A5+1L. (!en '4!ression editor for outOminOar and write the following e4!ression)'&IA8IAB5'*GGarOmin,)A5+. >alidate the e4!ression.

  • 8/13/2019 Copy (2) of DWH File1

    65/81

    1E. (!en '4!ression editor for outOcountOar and write the following e4!ression)'&7(:A8IAB5'*GGarOcount+. >alidate the e4!ression.1M. (!en '4!ression editor for outOsetOar and write the following e4!ression)'&>A8IAB5'*GGarOset,ADDO&(ODA&'*2I8'DA&',,1++. >alidate.$@. 7lick (N. '4!ression &ransformation below$1. 5ink all !orts from e4!ression to target and >alidate a!!ing and )ae it.$$. )ee ma!!ing !icture on ne4t !age.

    ;A8A'&'8 0I5'A !arameter file is a list of !arameters and associated alues for a workflow, worklet, or session.;arameter files !roide fle4ibilit% to change these ariables each time we run a workflow orsession.We can create multi!le !arameter files and change the file we use for a session or workflow. Wecan create a !arameter file using a te4t editor such as Word;ad or O'4am!leRGGBonusS1@@@GGarOma4S/@@GGarOminS1$@@GGarOcountS@7(

  • 8/13/2019 Copy (2) of DWH File1

    66/81

    It contains a set of transformations and lets us reuse that transformation logic in multi!lema!!ings.7reated in a!!let Designer in Designer &ool.We need to use same set of / transformations in sa% 1@ ma!!ings. )o instead of making /transformations in eer% 1@ ma!!ing, we create a ma!!let of these / transformations.

  • 8/13/2019 Copy (2) of DWH File1

    67/81

    a!!ing(!en folder where we want to create the ma!!ing.[ 7lick &ools 6\ a!!ing Designer.[ 7lick a!!ing6\ 7reate6\ Cie name. '4 mOm!ltOe4am!le1[ Drag m!ltO'4am!le1 and target table.[ &ransformation 6\ 7reate 6\ )elect 0ilter for list 6\ 7reate 6\ Done.[ Drag all !orts from m!ltOe4am!le1 to filter and gie filter condition.[ 7onnect all !orts from filter to target. We can add more transformations after filter ifneeded.[ >alidate ma!!ing and )ae it.ake session and workflow.Cie connection information for ma!!let source tables.Cie connection information for target table.8un workflow and see result.

    Indirect 5oading 0or 0lat 0iles)u!!ose, %ou hae 1@ flat files of same structure. All the flat files hae same number of columnsand data t%!e.

  • 8/13/2019 Copy (2) of DWH File1

    68/81

    calculations in a session. If the source changes incrementall% and %ou can ca!ture changes, %oucan configure the session to !rocess those changes. &his allows the Integration )erice tou!date the target incrementall%, rather than forcing it to !rocess the entire source and recalculatethe same data each time %ou run the session.0or e4am!le, %ou might hae a session using a source that receies new data eer% da%. ou canca!ture those incremental changes because %ou hae added a filter condition to the ma!!ing thatremoes !re6e4isting data from the flow of data. ou then enable incremental aggregation.When the session runs with incremental aggregation enabled for the first time on arch 1, %ouuse the entire source. &his allows the Integration )erice to read and store the necessar%aggregate data. (n arch $, when %ou run the session again, %ou filter out all the records e4ce!tthose time6stam!ed arch $. &he Integration )erice then !rocesses the new data and u!datesthe target accordingl%. 7onsider using incremental aggregation in the following circumstancesou can ca!ture new source data. :se incremental aggregation when %ou can ca!ture newsource data each time %ou run the session. :se a )tored ;rocedure or 0ilter transformation to!rocess new data.Incremental changes do not significantl% change the target. :se incremental aggregation whenthe changes do not significantl% change the target. If !rocessing the incrementall% changedsource alters more than half the e4isting target, the session ma% not benefit from usingincremental aggregation. In this case, dro! the table and recreate the target with com!lete sourcedata.

  • 8/13/2019 Copy (2) of DWH File1

    69/81

    [ When the Integration )erice rebuilds incremental aggregation files, the data in the!reious files is lost.erif% the location where %ou want to store the aggregate files.&he inde4 and data files grow in !ro!ortion to the source data. Be sure the cache director% hasenough disk s!ace to store historical data for the session.When %ou run multi!le sessions with incremental aggregation, decide where %ou want the files

    stored. &hen, enter the a!!ro!riate director% for the !rocess ariable, G;7acheDir, in theWorkflow anager. ou can enter session6s!ecific directories for the inde4 and data files.2oweer, b% using the !rocess ariable for all sessions using incremental aggregation, %ou caneasil% change the cache director% when necessar% b% changing G;7acheDir.7hanging the cache director% without moing the files causes the Integration )erice toreinitiali=e the aggregate cache and gather new aggregate data.In a grid, Integration )erices rebuild incremental aggregation files the% cannot find. When anIntegration )erice rebuilds incremental aggregation files, it loses aggregate histor%.*ii+ >erif% the incr