000 303
http://www.ipass4sure.com
000-303
IBM
DISCLAIMERS
===========
Whilst every care has been taken by iPass4Sure to ensure that the information contained
in this publication is correct and complete, it is possible that this is not the case.
iPass4Sure provides the publication "as is", without any warranty for its soundness,
suitability for a different purpose or otherwise. iPass4Sure is not liable for any damage
which has occurred or may occur as a result of or in any respect related to the use of
this publication. iPass4Sure may change or terminate this publication at any time without
further notice and shall not be responsible for any consequence(s) arising therefrom.
Subject to this disclaimer, iPass4Sure is not responsible for any contributions by third
parties to this publication.
The copyright holder, "ipass4sure.com", does not allow copying, distribution, reselling, or
reproduction of all or any part of this document. Strict action, including legal action,
will be taken against violations. Please report violations to copyrights@ipass4sure.com.
All other trademarks are the property of their respective owners.
All rights reserved.
QUESTION: 1
In your ETL application design you have found several areas of common processing
requirements in the mapping specifications. These common logic areas include
code validation lookups and name formatting. The common logic areas have the same
logic, but the jobs using them would have different column metadata. Which
action gives you the most reusable design for implementing these common logic
areas in your ETL application?
A. Create parallel routines for each of the common logic areas and for each of the
unique column metadata formats.
B. Create separate jobs for each layout and choose the appropriate job to run within a
job sequencer.
C. Create parallel shared containers and define columns combining all data formats.
D. Create parallel shared containers with Runtime Column Propagation (RCP) ON
and define only necessary common columns needed for the
logic.
Answer: D
QUESTION: 2
When optimizing a job, Balanced Optimization will NOT search the job for what
pattern?
A. Links
B. Stages
C. Sequencers
D. Property Settings
Answer: C
QUESTION: 3
You are asked to optimize the fork join job design in the exhibit. This job uses the
sort aggregator and a left outer join on the ZIP code column. Currently all partitioning
is set to "Auto" and automatic sort insertion is allowed.
Which change will reduce the cost of partitioning that occurs in this job?
A. Use Entire partitioning on the input links to the Aggregator and Join stages.
B. Hash partition and sort on ZIP code column on the input links to the Aggregator
and Join stages.
C. Hash partition and sort on ZIP code column prior to the Copy stage and use entire
partitioning on the Aggregator and Join stages.
D. Hash partition and sort on ZIP code column prior to the Copy stage, and use same
partitioning on the Aggregator and Join stages.
Answer: D
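To illustrate why partitioning once before the Copy stage can avoid extra repartitioning, the following Python sketch hash-partitions and sorts rows on a ZIP code column a single time, then checks that every ZIP code sits in exactly one partition. The function name `hash_partition`, the sample rows, and the use of CRC32 as the hash are assumptions made for illustration; they are not DataStage internals.

```python
import zlib
from collections import defaultdict


def hash_partition(rows, key, num_partitions):
    """Assign each row to a partition using a stable hash of its key column."""
    partitions = defaultdict(list)
    for row in rows:
        # CRC32 is used only because it is deterministic across runs (assumption).
        part = zlib.crc32(str(row[key]).encode("utf-8")) % num_partitions
        partitions[part].append(row)
    # Sort inside each partition on the same key, as a sort on ZIP code would.
    for part_rows in partitions.values():
        part_rows.sort(key=lambda r: r[key])
    return dict(partitions)


# Hypothetical sample data.
rows = [
    {"zip": "10001", "amount": 5},
    {"zip": "94105", "amount": 7},
    {"zip": "10001", "amount": 2},
    {"zip": "60601", "amount": 9},
    {"zip": "94105", "amount": 1},
]
partitions = hash_partition(rows, "zip", 4)

# Record which partitions each ZIP code ended up in.
zip_to_partitions = defaultdict(set)
for part, part_rows in partitions.items():
    for row in part_rows:
        zip_to_partitions[row["zip"]].add(part)

for zip_code, parts in sorted(zip_to_partitions.items()):
    print(zip_code, "is held by", len(parts), "partition(s)")
```

Because each key is already confined to one sorted partition, key-based stages after the fork can keep the existing layout (Same) instead of partitioning and sorting again on each branch.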
QUESTION: 4
You have a parallel job that, based on operational recoverability requirements, needs to
be broken up into two separate parallel jobs. You have decided to use the Data Set
stage to support this job design change. What two characteristics of Data Sets make
them a good design consideration in your job design change? (Choose two.)
QUESTION: 5
What two binding types are supported by Information Services Director (ISD) for a
parallel job that is designed to be used as a service? (Choose two.)
A. EJB
B. SQL
C. HDFS
D. SOAP
E. STREAMS
Answer: A, D
QUESTION: 6
You are assigned to correct a job from another developer. The job contains 20 stages
sourcing data from two Data Sets and many sequential files. The annotation in the job
indicates who wrote the job and when, not the objective of the job. All link and stage
names use the default names. One of the output columns has an incorrect value which
should have been obtained using a lookup. What could the original developer have
done to make this task easier for maintenance purposes?
Answer: B
QUESTION: 7
You are asked by management to document all jobs written to make future
maintenance easier. Which statement is true about annotations?
A. The short job description can be identified within the Description Annotation
stage.
B. The Description Annotation stage contains both the short and full descriptions for
the job.
C. The background for the Description Annotation stage can be changed for each
unique stage.
D. The Description Annotation stage can be added several times at different locations
to identify business logic.
Answer: A
QUESTION: 8
A job design consists of an input Row Generator stage, a Filter stage, followed by a
Transformer stage and an output Sequential File stage. The job is run on an SMP
machine with a configuration file defined with three nodes. The
$APT_DISABLE_COMBINATION variable is set to True. How many player
processes will this job generate?
A. 8
B. 10
C. 12
D. 16
Answer: A
QUESTION: 9
Which partitioning method requires a key?
A. Same
B. Modulus
C. Sort Merge
D. Round Robin
Answer: B
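The difference between a keyed and a keyless method can be sketched in a few lines of Python. Modulus partitioning needs an integer key column because the partition number is computed from the key value, while round robin only looks at the row's position. The function names and the `account` column are hypothetical and used only for illustration.

```python
def modulus_partition(rows, key, num_partitions):
    """Keyed method: partition number = key value modulo the partition count."""
    partitions = [[] for _ in range(num_partitions)]
    for row in rows:
        partitions[row[key] % num_partitions].append(row)
    return partitions


def round_robin_partition(rows, num_partitions):
    """Keyless method: rows are dealt out in turn, ignoring their content."""
    partitions = [[] for _ in range(num_partitions)]
    for position, row in enumerate(rows):
        partitions[position % num_partitions].append(row)
    return partitions


# Hypothetical sample data with an integer key column.
rows = [{"account": n} for n in (10, 11, 12, 13, 14, 22)]

by_modulus = modulus_partition(rows, "account", 4)
by_round_robin = round_robin_partition(rows, 4)

# Accounts 10, 14 and 22 all leave remainder 2, so they share partition 2.
print([r["account"] for r in by_modulus[2]])   # [10, 14, 22]
print([len(p) for p in by_round_robin])        # [2, 2, 1, 1]
```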
QUESTION: 10
A job design consists of an input Row Generator stage, a Sort stage, followed by a
Transformer stage and an output Data Set stage. The job is run on an SMP machine
with a configuration file defined with four nodes. The
$APT_DISABLE_COMBINATION variable is set to True. How many player
processes will this job generate?
A. 7
B. 16
C. 13
D. 16
Answer: C
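The arithmetic behind the player-process counts in Questions 8 and 10 can be sketched as follows. The sketch assumes that, with operator combination disabled, each stage runs as its own operator, that a sequential stage contributes one player and a parallel stage one player per node, and that the Row Generator and Sequential File stages run sequentially by default while the other stages run in parallel. It also assumes no extra operators (such as inserted sorts) are added to the score. The helper name `count_players` is hypothetical.

```python
def count_players(stages, num_nodes):
    """Sum player processes: 1 per sequential stage, num_nodes per parallel stage."""
    total = 0
    for _name, mode in stages:
        total += 1 if mode == "sequential" else num_nodes
    return total


# Question 8: three-node configuration file.
question_8 = [
    ("Row Generator", "sequential"),
    ("Filter", "parallel"),
    ("Transformer", "parallel"),
    ("Sequential File", "sequential"),
]

# Question 10: four-node configuration file.
question_10 = [
    ("Row Generator", "sequential"),
    ("Sort", "parallel"),
    ("Transformer", "parallel"),
    ("Data Set", "parallel"),
]

print(count_players(question_8, 3))    # 1 + 3 + 3 + 1 = 8
print(count_players(question_10, 4))   # 1 + 4 + 4 + 4 = 13
```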
QUESTION: 11
The data going into the target Sequential File stage is range-partitioned and sorted.
Which technique would be the most efficient to create a globally sorted target
sequential file?
Answer: B
QUESTION: 12
In the exhibit, a Funnel stage has two input links. Input 1 (Seq_File) comes from a
Sequential File stage with "Readers per Node" set to "2". Input 2 (Dataset) comes
from a dataset created with 3 partitions. In the Funnel stage, the funnel type is set to
"Sequence".
The parallel configuration file contains 4 nodes. How many instances of the Funnel
stage run in parallel?
A. 1
B. 2
C. 4
D. 6
Answer: C
QUESTION: 13
Your job sequence must be restartable. It runs Job1, Job2, and Job3 serially. It has
been compiled with "Add checkpoints so sequence is restartable". Job1 must execute
every run even after a failure. Which two properties must be selected to ensure that
Job1 is run each time, even after a failure? (Choose two.)
Answer: A, D
QUESTION: 14
Which two actions are available when editing a message handler? (Choose two.)
A. Abort job
B. Demote to warning
C. Suppress from job log
D. Demote to informational
E. Suppress from the project
Answer: C, D
QUESTION: 15
What is the result of running the following command?
dsjob -report DSProject ProcData
A. Generates a report about the ProcData job, including information about its stages
and links.
B. Returns a report of the last run of the ProcData job in a DataStage project named
DSProject.
C. Runs the DataStage job named ProcData and returns performance information,
including the number of rows processed.
D. Runs the DataStage job named ProcData and returns job status information,
including whether the job aborted or ran without warnings.
Answer: B
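As a small illustration of the command in this question, the Python sketch below assembles the same `dsjob -report` invocation as an argument list. The helper name `build_report_command` is hypothetical, and the sketch deliberately does not execute anything, since running it would need a DataStage engine with `dsjob` on the PATH, which is an assumption about the environment.

```python
import shlex


def build_report_command(project, job):
    """Return the argument list for: dsjob -report <project> <job>."""
    # Argument order follows the command quoted in the question:
    # the project name comes first, then the job name.
    return ["dsjob", "-report", project, job]


command = build_report_command("DSProject", "ProcData")
printable = shlex.join(command)
print(printable)  # dsjob -report DSProject ProcData
```

On a machine where `dsjob` is available, the list could be handed to `subprocess.run(command, capture_output=True, text=True)` to capture the report text.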
QUESTION: 16
You would like to pass values into parameters that will be used in a variety of
downstream activity stages within a job sequence. What are two valid ways to do
this? (Choose two.)
Answer: A, E
QUESTION: 17
On the DataStage development server, you have been making enhancements to a
copy of a DataStage job running on the production server. You have been asked to
document the changes you have made to the job. What tool in DataStage Designer
would you use?
A. Compare Against
B. diffapicmdline.exe
C. DSMakeJobReport
D. Cross Project Compare
Answer: D
QUESTION: 18
You are working on a project that contains a large number of jobs contained in many
folders. You would like to review the jobs created by a former developer of the
project. How can you find these jobs?
Answer: B
QUESTION: 19
Your customer is using Source Code Control Integration for Information Server and
has tagged artifacts for version 1. You must create a deployment package from
version 1. Before you create the package you will have to ensure the project is up to
date with version 1. What two things must you do to update the metadata repository
with the artifacts tagged as version 1? (Choose two.)
Answer: D, E
QUESTION: 20
You want to find out which table definitions have been loaded into a job, and
specifically which stages of the job they have been loaded into. How will you
determine this?
A. Select the job, right-click, then click the Find where used (deep) command.
B. Select the job, right-click, then click the Find dependencies (deep) command.
C. Select the job, right-click, then click the Find where used command. Then right-click and select "Show the dependency path from the job".
D. Select the job, right-click, then click the Find dependencies command. Then right-click and select "Show the dependency path from the job".
Answer: D
QUESTION: 21
You are responsible for deploying objects into your customer's production
environment. To ensure the stability of the production system, the customer does not
permit compilers on production machines. They have also protected the project and
only development machines have the required compiler. What option will enable jobs
with a parallel transformer to execute on the customer's production machines?
A. Add $APT_COMPILE_OPT=-portable
B. Set $APT_COPY_TRANSFORM_OPERATOR
C. Use protected projects in the production environment.
D. Create a package with Information Server Manager and select the option to include
executables.
Answer: D
QUESTION: 22
What two features distinguish the Operations Console from the Director job log?
(Choose two.)
A. Jobs can be started and stopped in Director, but not in the Operations Console.
B. The Operations Console can monitor jobs running on only one DataStage engine.
C. Workload management is supported within Director, but not in the Operations
Console.
D. The Operations Console can monitor jobs running on more than one DataStage
engine.
E. The Operations Console can run on systems where the DataStage clients are not
installed.
Answer: D, E
QUESTION: 23
The Score is divided into which two sections? (Choose two.)
A. Stages
B. File sets
C. Schemas
D. Data sets
E. Operators
Answer: D, E
QUESTION: 24
Which two environment variables add additional reporting information in the job log
for DataStage jobs? (Choose two.)
A. $APT_IO_MAP
B. $OSH_EXPLAIN
C. $APT_STARTUP_STATUS
D. $APT_EXPORT_FLUSH_COUNT
E. $APT_PM_STARTUP_CONCURRENCY
Answer: B, C
QUESTION: 25
A job validates account numbers with a reference file using a Join stage, which is
hash partitioned by account number. Runtime monitoring reveals that some partitions
process many more rows than others. Assuming adequate hardware resources, which
action can be used to improve the performance of the job?
Answer: B
QUESTION: 26
You are asked by your customer to collect partition level runtime metadata for
DataStage parallel jobs. You must collect this data after each job completes. What
two options allow you to automatically save row counts and CPU time for each
instance of an operator? (Choose two.)
A. $APT_CPU_ROWCOUNT
B. $APT_PERFORMANCE_DATA
C. Enable the job property "Record job performance data".
D. Open up the job in Metadata Workbench and select the "Data Lineage" option.
E. Click the Performance Analysis icon in the toolbar to open the Performance
Analyzer utility.
Answer: B, C
QUESTION: 27
Which option is required to identify a particular job's player processes?
Answer: B
QUESTION: 28
How is DataStage table metadata shared among DataStage projects?
A. Import another copy of the table metadata into the project where it is required.
B. Use the "Shared Table Creation Wizard" to place the table in the shared repository.
C. Export DataStage table definitions from one project and import them into
another project.
D. Use the InfoSphere Metadata Asset Manager (IMAM) to move the DataStage table
definition to the projects where it is needed.
Answer: B
QUESTION: 29
Which two parallel job stages allow you to use partial schemas? (Choose two.)
A. Peek stage
B. File Set stage
C. Data Set stage
D. Column Export stage
E. External Target stage
Answer: B, E
QUESTION: 30
In addition to the table and schema names, what two element names must be specified
when you create a shared table definition in DataStage Designer? (Choose two.)
A. Database
B. Host system
C. Project name
D. Database instance
E. DataStage server system name
Answer: A, B
QUESTION: 31
When using Runtime Column Propagation, which two stages require a schema file?
(Choose two.)
A. Peek stage
B. Pivot stage
C. Column Import stage
D. DB2 Connector stage
E. Sequential File stage
Answer: C, E
QUESTION: 32
What are the two Transfer Protocol Transfer Mode property options for the FTP
Enterprise stage? (Choose two.)
A. FTP
B. EFTP
C. TFTP
D. SFTP
E. RFTP
Answer: A, D
QUESTION: 33
Your job will write its output to a fixed length data file. When configuring the
Sequential File stage as a target, what format and column tab properties need to be
considered for this type of file output?
A. On the Output Link format tab, change the 'Delimiter' property to whitespace.
B. On the Output Link format tab, add the 'Record Type' property to the tree and set
its value to be 'F'.
C. On the Output Link column tab, ensure that all the defined column data types are
fixed length types.
D. On the Output Link column tab, specify the record size total based on all of the
columns defined.
Answer: C
QUESTION: 34
Identify the two statements that are true about the functionality of the XML Pack 3.0.
(Choose two.)
Answer: C, E
QUESTION: 35
Identify the two delimiter areas available to be configured in the Sequential File
format tab properties? (Choose two.)
A. File delimiter
B. Null delimiter
C. Final delimiter
D. Field delimiter
E. End of group delimiter
Answer: C, D
QUESTION: 36
When using a Sequential File stage as a source what are the two reject mode property
options? (Choose two.)
A. Set
B. Fail
C. Save
D. Convert
E. Continue
Answer: B, E
QUESTION: 37
Which two statements are true about Data Sets? (Choose two.)
Answer: B, D
QUESTION: 38
What is the correct method to process a file containing multiple record types using a
Complex Flat File stage?
Answer: D
QUESTION: 39
When using the Column Export stage, what are two export column type property
values allowed for the combined single output column result? (Choose two.)
A. Vector
B. Binary
C. Integer
D. Decimal
E. VarChar
Answer: B, E
QUESTION: 40
Which two file stages allow you to configure rejecting data to a reject link? (Choose
two.)
Answer: C, E
QUESTION: 41
Identify two items that are created as a result of running a Balanced Optimization on
a job that accesses a Hadoop distributed file system as a source? (Choose two.)
Answer: A, C
QUESTION: 42
A customer must compare a date column with a job parameter date to determine
which output links the row belongs on. What stage should be used for this
requirement?
A. Filter stage
B. Switch stage
C. Compare stage
D. Transformer stage
Answer: D
QUESTION: 43
Rows of data going into a Transformer stage are sorted and hash partitioned by the
Input.Product column. Using stage variables, how can you determine when a new
row is the first of a new group of Product rows?
D. Create a stage variable named sv_Product and follow it by a second stage variable
named sv_IsNewProduct. Map the Input.Product column to
sv_Product. The derivation for sv_IsNewProduct is: IF Input.Product <> sv_Product
THEN "YES" ELSE "NO".
Answer: B
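Stage variables in a Transformer are evaluated from top to bottom for each row, so the order in which the two variables are declared decides whether the comparison sees the previous row's value or the current one. The Python sketch below simulates both orders on sorted input. It is an illustration of the evaluation order only: the function names are hypothetical, and the text of the correct option is not reproduced in this document.

```python
def flag_new_groups(products):
    """Compare first, then store: sv_Product still holds the previous row's value."""
    sv_product = None  # initial value before the first row
    flags = []
    for product in products:
        # Evaluated first: compare the incoming value with the saved one.
        sv_is_new_product = "YES" if product != sv_product else "NO"
        # Evaluated second: remember the current value for the next row.
        sv_product = product
        flags.append(sv_is_new_product)
    return flags


def flag_new_groups_store_first(products):
    """Order described in option D: the value is stored before it is compared."""
    sv_product = None
    flags = []
    for product in products:
        sv_product = product
        # The comparison now always sees the current row's own value.
        flags.append("YES" if product != sv_product else "NO")
    return flags


products = ["A", "A", "B", "B", "B", "C"]
print(flag_new_groups(products))              # ['YES', 'NO', 'YES', 'NO', 'NO', 'YES']
print(flag_new_groups_store_first(products))  # ['NO', 'NO', 'NO', 'NO', 'NO', 'NO']
```

With the store-first order the comparison can never detect a change, which is why the ordering in option D does not work.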
QUESTION: 44
Which statement describes what happens when Runtime Column Propagation is
disabled for a parallel job?
A. An input column value flows into a target column only if it matches it by name.
B. An input column value flows into a target column only if it is explicitly mapped to
it.
C. You must set APT_AUTO_MAP project environment to true to allow output link
mapping to occur.
D. An input column value flows into a target column based on its position in the input
row. For example, first column in the input row goes into the first target column.
Answer: B
QUESTION: 45
Which statement is true when using the SaveInputRecord() function in a Transformer
stage?
A. You can only use the SaveInputRecord() function in Loop variable derivations.
B. You can access the saved queue records using Vector referencing in Stage variable
derivations.
C. You must retrieve all saved queue records using the GetSavedInputRecord()
function within Loop variable derivations.
D. You must retrieve all saved queue records using the GetSavedInputRecord()
function within Stage variable derivations.
Answer: C
QUESTION: 46
In the Slowly Changing Dimension stage, a dimension column's Purpose code
property can trigger which two actions? (Choose two.)
Answer: B, C
QUESTION: 47
Which derivations are executed first in the Transformer stage?
Answer: C
QUESTION: 48
Answer: D, E
QUESTION: 49
Identify two different types of custom stages you can create to extend the Parallel job
syntax? (Choose two.)
A. Input stage
B. Basic stage
C. Group stage
D. Custom stage
E. Wrapped stage
Answer: D, E
QUESTION: 50
Which two statements are true about stage variables in a Transformer Stage?
(Choose two.)
Answer: A, C
QUESTION: 51
What is the purpose of the APT_DUMP_SCORE environment variable?
Answer: D
QUESTION: 52
Suppose a user ID has been created with DataStage and QualityStage component
authorization. Which client application would be used to give that user ID DataStage
Developer permission?
Answer: C
QUESTION: 53
Which two data repositories can be used for user authentication within the
Information Server Suite? (Choose two.)
Answer: C, E
QUESTION: 54
Which two statements are true about the use of named node pools? (Choose two.)
A. Grid environments must have named node pools for data processing.
B. Named node pools can allow separation of buffering from sorting disks.
C. When named node pools are used, DataStage uses named pipes between stages.
D. Named node pools limit the total number of partitions that can be specified in the
configuration file.
E. Named node pool constraints will limit stages to be executed only on the nodes
defined in the node pools.
Answer: B, E
QUESTION: 55
Which step is required to change from a normal lookup to a sparse lookup in an
ODBC Connector stage?
QUESTION: 56
Which method is used to specify when to stop a job because of too many rejected
rows with an ODBC Connector?
Answer: B
QUESTION: 57
Which two pieces of information are required to be specified for the input link on a
Netezza Connector stage? (Choose two.)
A. Partitioning
B. Server name
C. Table definitions
D. Buffering settings
E. Error log directory
Answer: A, D
QUESTION: 58
Which requirement must be met to read from a database in parallel using the ODBC
connector?
Answer: B
QUESTION: 59
Which two statements are true about the Additional Connections Options property in
the Teradata Connector stage, which is used to specify details about the number of
connections to Teradata? (Choose two.)
Answer: B, D
QUESTION: 60
Configuring the weighting column of an Aggregator stage affects which two options?
(Choose two.)
A. Sum
B. Maximum Value
C. Average of Weights
D. Coefficient of Variation
E. Uncorrected Sum of Squares
Answer: A, E
QUESTION: 61
The parallel framework was extended for real-time applications. Identify two aspects
of this extension. (Choose two.)
A. XML stage.
B. End-of-wave.
C. Real-time stage types that re-run jobs.
D. Real-time stage types that keep jobs always up and running.
E. Support for transactions within source database connector stages.
Answer: B, D
QUESTION: 62
How must the input data set be organized for input into the Join stage? (Choose two.)
A. Unsorted
B. Key partitioned
C. Hash partitioned
D. Entire partitioned
E. Sorted by Join key
Answer: B, E
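The reason a join wants its inputs key partitioned and sorted on the join key can be seen in a merge-style join, which walks both inputs once in key order. The Python sketch below is a generic inner merge join over two lists that are already sorted on the key. It is not the Join stage's implementation, and the function name and sample columns are hypothetical.

```python
from itertools import groupby
from operator import itemgetter


def merge_inner_join(left, right, key):
    """Inner-join two lists of dicts that are both already sorted by `key`."""
    left_groups = groupby(left, key=itemgetter(key))
    right_groups = groupby(right, key=itemgetter(key))
    end = (None, None)
    left_key, left_rows = next(left_groups, end)
    right_key, right_rows = next(right_groups, end)
    joined = []
    while left_rows is not None and right_rows is not None:
        if left_key < right_key:
            # No match possible for this left key; move the left side forward.
            left_key, left_rows = next(left_groups, end)
        elif left_key > right_key:
            right_key, right_rows = next(right_groups, end)
        else:
            # Same key on both sides: emit every left/right combination.
            right_list = list(right_rows)
            for left_row in left_rows:
                for right_row in right_list:
                    joined.append({**left_row, **right_row})
            left_key, left_rows = next(left_groups, end)
            right_key, right_rows = next(right_groups, end)
    return joined


# Hypothetical inputs, each sorted on the join key "zip".
left = [
    {"zip": "10001", "name": "a"},
    {"zip": "10002", "name": "b"},
    {"zip": "10002", "name": "c"},
    {"zip": "10004", "name": "d"},
]
right = [
    {"zip": "10002", "city": "X"},
    {"zip": "10003", "city": "Y"},
    {"zip": "10004", "city": "Z"},
]

joined = merge_inner_join(left, right, "zip")
print([(row["name"], row["city"]) for row in joined])
# [('b', 'X'), ('c', 'X'), ('d', 'Z')]
```

In a parallel run the same walk happens independently in each partition, which only gives correct results if all rows for a given key are in the same partition on both inputs.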
QUESTION: 63
The Change Apply stage produces a change Data Set with a new column representing
the code for the type of change. What are two change values identified by these code
values? (Choose two.)
A. Edit
B. Final
C. Copy
D. Deleted
E. Remove Duplicates
Answer: C, D
QUESTION: 64
What stage allows for more than one reject link?
A. Join stage
B. Merge stage
C. Lookup stage
D. Funnel stage
Answer: B
QUESTION: 65
A 100MB input dataset has even distribution across 400 unique key values. When
you run with a 4-node configuration file, which two changes could improve sort
performance in this scenario? (Choose two.)
Answer: B, E
QUESTION: 66
Which statement is correct about the Data Rules stage?
A. The Data Rules stage works with rule definitions only; not executable rules.
B. As a best practice, you should create and publish new rules from the Data Rules
stage.
C. If you have Rule Creator role in InfoSphere Information Analyzer, you can create
and publish rule definitions and rule set definitions directly
from the stage itself.
D. When a job that uses the Data Rules stage runs, the output of the stage is passed to
the downstream stages and results are stored in the
Analysis Results database (IADB).
Answer: A
QUESTION: 67
Which job design technique can be used to give unique names to sequential output
files that are used in multi-instance jobs?
Answer: A
QUESTION: 68
The ODBC stage can handle which two SQL Server data types? (Choose two.)
A. Date
B. Time
C. GUID
D. Datetime
E. SmallDateTime
Answer: C, E
QUESTION: 69
Which DB2 to InfoSphere DataStage data type conversion is correct when reading
data with the DB2 Connector stage?
A. XML to SQL_WVARCHAR
B. BIGINT to SQL_BIGINT (INT32)
C. VARCHAR, 32768 to SQL_VARCHAR
D. CHAR FOR BIT DATA to SQL_VARBINARY
Answer: A
QUESTION: 70
Which Oracle data type conversion is correct?
Answer: B
QUESTION: 71
Which two statements about using a Load write method in an Oracle Connector stage
to tables that have indexes on them are true? (Choose two.)
Answer: C, E
QUESTION: 72
Which Oracle Connector stage property can be set to tune job performance?
A. Array size
B. Memory size
C. Partition size
D. Transaction size
Answer: A
QUESTION: 73
In a Transformer expression for a stage variable, there is a nullable input column.
Assume the legacy NULL processing option is turned off. What happens when a row
is processed that contains NULL in that input column?
Answer: C
QUESTION: 74
Identify two different types of custom stages you can create to extend the Parallel job
syntax? (Choose two.)
A. Input stage
B. Basic stage
C. Group stage
D. Custom stage
E. Wrapped stage
Answer: D, E
QUESTION: 75
When using the loop functionality in a Transformer, which statement is true regarding
Transformer processing?
Answer: A
QUESTION: 76
In your job you are using a Sequential File stage as a source. You have decided to use
a schema file as an alternative way of specifying column definitions and record
format properties. What two items are true in configuring this stage in your parallel
job? (Choose two.)
QUESTION: 77
Which stage classifies data rows from a single input into groups and computes totals?
A. Modify stage
B. Compare stage
C. Aggregator stage
D. Transformer stage
Answer: C
QUESTION: 78
Which statement describes an SCD Type One update in the Slowly Changing
Dimension stage?
Answer: D
QUESTION: 79
Which derivations are executed last in the Transformer stage?
Answer: C
QUESTION: 80
You are processing groups of rows in a Transformer. The first row in each group
contains "1" in the Flag column and "0" in the remaining rows of the group. At the
end of each group you want to sum and output the QTY column values. Which
technique will enable you to retrieve the sum of the last group?
A. Output the sum that you generated each time you process a row for which the
LastRow() function returns True.
B. Output the sum that you generated up to the previous row each time you process a
row with a "1" in the Flag column.
C. Output a running total for each group for each row. Follow the Transformer stage
by an Aggregator stage. Take the MAX of the QTY column for each group.
D. Output the sum that you generated up to the previous row each time you process a
row with a "1" in the Flag column. Use the FirstRow() function to determine when
the group is done.
Answer: C
QUESTION: 81
The derivation for a stage variable is:
Upcase(input_column1) : ' ' : Upcase(input_column2)
Suppose that input_column1 contains a NULL value. Assume the legacy NULL
processing option is turned off. Which behavior is expected?
Answer: B
QUESTION: 82
Which statement is true about table definitions created in DataStage Designer?
Answer: D
QUESTION: 83
What are two advantages of using Runtime Column Propagation (RCP)? (Choose
two.)
Answer: B, E
QUESTION: 84
In a file defining a partial schema, which two properties define the record? (Choose
two.)
A. intact
B. delim
C. position
D. length_of_record
E. record_delim_string
Answer: A, E
QUESTION: 85
What role must a user have to delete shared metadata assets from the repository?
A. DataStage Administrator
B. Business Glossary Administrator
C. Common Metadata Administrator
Answer: C
QUESTION: 86
Your job is using the Sequential File stage as a source. Identify two stage property
configurations that will allow you to accomplish parallel reads? (Choose two.)
A. Read Method property set to 'File Pattern' and the environment variable
$APT_IMPORT_PATTERN_USES_FILESET is set to 'FALSE'.
B. Read Method property set to 'File Pattern' and the 'Readers per Node' property
value set to greater than 1.
C. Read Method property is set to 'Parallel' with the Key property set to an Input
column that has the file names.
D. Read Method property set to 'Specific File(s)' with only one file specified and the
'Readers Per Node' property value set to greater than 1.
E. Read Method property set to 'Specific File(s)' with more than one file specified and
each file specified has the same format/column properties.
Answer: D, E
QUESTION: 87
Identify the two statements that are true about the functionality of the XML Pack 3.0.
(Choose two.)
Answer: C, E
QUESTION: 88
When using the Sequential File stage as a source, what two property options allow
you to add extra columns about the file(s) you are reading onto the output link?
(Choose two.)
Answer: A, C
QUESTION: 89
The parallel framework supports standard and complex data types in the SQL type
column tab property. Identify the two complex data types? (Choose two.)
A. Vector
B. VarChar
C. Linked list
D. Subrecord
E. Binary trees
Answer: A, D
QUESTION: 90
How does the Complex Flat File stage (CFF) support the use of OCCURS clauses
within COBOL files?
Answer: D
QUESTION: 91
Identify two areas that DataStage can integrate with a Hadoop environment. (Choose
two.)
A. Use the Pig Activity stage in a sequencer to invoke a Pig MapReduce program.
B. Use the Hive Activity stage in a sequencer to invoke a Hive MapReduce program.
C. Use the Big Data File stage to access files on the Hadoop Distributed File System.
D. Use the Oozie Workflow Activity stage in a sequencer job to invoke Oozie
workflows.
E. Use the MapReduce Transformer stage to convert Hadoop Distributed File System
data.
Answer: C, D
QUESTION: 92
You are using the Complex Flat File stage as a source in your job. What are two types
of data specifically supported by the Complex Flat File stage for your job? (Choose
two.)
A. XML files.
B. Mainframe data sets with VSAM files.
C. Mainframe data files with DB2 tables.
D. Data from files that contain Hadoop record types.
E. Data from flat files that contain multiple record types.
Answer: B, E
QUESTION: 93
The number of File Set data files created depends upon what two items? (Choose
two.)
A. Amount of memory.
B. Schema definition of the file.
C. Operating system limitations.
D. Number of logical processing nodes.
E. Number of disks in the export or default disk pool connected to each processing
node in the default node pool.
Answer: D, E
QUESTION: 94
What are the two Transfer Protocol Transfer Mode property options for the FTP
Enterprise stage? (Choose two.)
A. FTP
B. EFTP
C. TFTP
D. SFTP
E. RFTP
Answer: A, D
QUESTION: 95
Identify two restructure stages that allow you to create or organize vectors in the
output link results? (Choose two.)
A. Split Vector
B. Column Import
C. Merge records
D. Split Subrecord
E. Make Subrecord
Answer: D, E
QUESTION: 96
You are using the Change Capture stage in your job design to identify changes made
to the input link (before image) to obtain the results in the output link. This job will
be using a multi-node configuration file when executed. What are two requirements
on the input link data for the Change Capture stage? (Choose two.)
A. sorted
B. merged links
C. parallel mode
D. key partitioned
E. sequential mode
Answer: A, D
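The reason sorting and key partitioning go together can be illustrated with a small sketch. This is plain Python, not DataStage code: the function name `partition_by_key`, the sample rows, and the checksum-modulo scheme are assumptions made for illustration only. It shows that partitioning on the key sends every row with the same key value to the same partition, so the before and after images of a record can be compared within a single node.

```python
import zlib


def partition_by_key(rows, key, num_partitions):
    """Assign each row to a partition from a checksum of its key value,
    then sort every partition by that key (hypothetical helper)."""
    partitions = [[] for _ in range(num_partitions)]
    for row in rows:
        # crc32 is deterministic across runs, unlike Python's built-in hash()
        index = zlib.crc32(str(row[key]).encode("utf-8")) % num_partitions
        partitions[index].append(row)
    for part in partitions:
        part.sort(key=lambda r: r[key])
    return partitions


# Illustrative before and after images, arriving in different orders
before = [{"id": 3, "v": "c"}, {"id": 1, "v": "a"}, {"id": 2, "v": "b"}]
after = [{"id": 2, "v": "B"}, {"id": 3, "v": "c"}, {"id": 1, "v": "a"}]

before_parts = partition_by_key(before, "id", 2)
after_parts = partition_by_key(after, "id", 2)

# The same key always lands in the same partition on both links
for b_part, a_part in zip(before_parts, after_parts):
    assert [r["id"] for r in b_part] == [r["id"] for r in a_part]
```

With any other partitioning scheme, the two images of one record could end up on different nodes and would never be compared.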
QUESTION: 97
What are two statistics or attributes that can be added to the output of a Data Rule
stage? (Choose two.)
A. Record ID
B. System timestamp
C. Number of violations
D. Job execution owner
E. Percentage of violations
Answer: A, B
QUESTION: 98
What are two properties dependent on both Column for Calculation and Summary
Column for Recalculation? (Choose two.)
A. Summary
B. Missing Value
C. Sum of Squares
D. Mean Deviation
E. Coefficient of Variation
Answer: A, B
QUESTION: 99
A DataStage job uses an Inner Join to combine data from two source parallel datasets
that were written to disk in sort order based on the join key columns. Which two
methods could be used to dramatically improve performance of this job? (Choose
two.)
Answer: B, E
QUESTION: 100
Indicate two possible job design topologies that Information Services Director (ISD)
parallel jobs can conform to. (Choose two.)
Answer: B, C
QUESTION: 101
How must the input data set be organized for input into the Join stage? (Choose two.)
A. Unsorted
B. Key partitioned
C. Hash partitioned
D. Entire partitioned
E. Sorted by Join key
Answer: B, E
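A rough sketch of why a join of this kind wants its inputs sorted on the join key: when both inputs are ordered, matching rows can be found in a single forward pass. The code below is plain Python, not the Join stage itself; `merge_inner_join` and the sample rows are hypothetical names and data chosen for illustration.

```python
def merge_inner_join(left, right, key):
    """Inner join two lists of dicts that are ALREADY sorted by `key`."""
    result = []
    i = j = 0
    while i < len(left) and j < len(right):
        lk, rk = left[i][key], right[j][key]
        if lk < rk:
            i += 1
        elif lk > rk:
            j += 1
        else:
            # Find the run of right rows that share this key value
            j_end = j
            while j_end < len(right) and right[j_end][key] == lk:
                j_end += 1
            # Pair every left row holding this key with that run
            while i < len(left) and left[i][key] == lk:
                for r in right[j:j_end]:
                    result.append({**r, **left[i]})
                i += 1
            j = j_end
    return result


left = [{"id": 1, "name": "a"}, {"id": 2, "name": "b"}, {"id": 4, "name": "d"}]
right = [{"id": 2, "amt": 10}, {"id": 3, "amt": 5}, {"id": 4, "amt": 7}]
joined = merge_inner_join(left, right, "id")
```

If either input were unsorted, the single pass would skip matches, which is why the sketch states the sort order as a precondition rather than checking it.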
QUESTION: 102
What are the valid join operations for the Join stage? (Choose two.)
A. Inner join
B. Where join
C. Top outer join
D. Right outer join
E. Bottom inner join
Answer: A, D
QUESTION: 103
Which statement is true about creating DataStage projects?
Answer: C
QUESTION: 104
You are editing the configuration file. Which statement is true about file system
efficiency?
Answer: C
QUESTION: 105
Which statement is true about Information Server logging?
Answer: B
QUESTION: 106
What is used to configure the DataStage QualityStage Operations Console?
A. DataStage Administrator
B. The DSODBCConfig.cfg file
C. The DSAppWatcher.sh script
D. Information Server Web Console
Answer: B
QUESTION: 107
What two project environment variables can be considered in your parallel jobs to
support your optimization strategy of partitioning and sorting? (Choose two.)
A. $APT_NO_PART_INSERTION
B. $APT_OPT_SORT_INSERTION
C. $APT_RESTRICT_SORT_USAGE
D. $APT_PARTITION_FLUSH_COUNT
E. $APT_TSORT_STRESS_BLOCKSIZE
Answer: A, E
QUESTION: 108
The effective use of naming conventions means that objects need to be spaced
appropriately on the DataStage Designer canvas. For stages with multiple
links, expanding the icon border can significantly improve readability. This approach
takes extra effort at first, so a pattern of work needs to be identified and adopted to
help development. Which feature of Designer can improve development speed?
A. Palette Layout
B. Repository Layout
C. Snap to Grid Feature
D. Show Performance Statistics Feature
Answer: C
QUESTION: 109
You have created three parallel jobs (Job A, B and C) in which the output of one job
is the input to the other job. You are required to create processing that manages this
data relationship of the jobs and provides job-level restartability. What two tasks will
accomplish these objectives? (Choose two.)
A. Enable runtime column propagation for the job sequence running the jobs.
B. Set the 'Add checkpoints so sequence is restartable' option in the Sequencer job.
C. Enable the 'Add checkpoints so container is restartable' option for each of the
parallel shared containers.
D. Create two parallel shared containers that have the logic of Job B and Job C and
include those shared containers in the correct flow order within the design of Job A.
E. Create a Sequencer job that has triggered events configured allowing Job A to run
first, then Job B to run when A completes successfully, and then Job C to run when
Job B completes successfully.
Answer: B, E
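The combination of chained triggers and checkpoints can be pictured with a short sketch. This is plain Python under stated assumptions, not how a DataStage job sequence is implemented: `run_sequence`, the `checkpoint` set, and the three stand-in job functions are all hypothetical. Each job runs only if the previous one succeeded, and a rerun skips jobs that already finished successfully.

```python
def run_sequence(jobs, checkpoint):
    """Run (name, callable) pairs in order and stop at the first failure.

    `checkpoint` is a set of job names that already completed successfully;
    those jobs are skipped when the sequence is run again.
    """
    for name, job in jobs:
        if name in checkpoint:
            continue  # finished in an earlier run, do not repeat it
        if not job():
            return False  # later jobs are not started
        checkpoint.add(name)
    return True


calls = []
attempts = {"B": 0}


def job_a():
    calls.append("A")
    return True


def job_b():
    attempts["B"] += 1
    calls.append("B")
    return attempts["B"] > 1  # simulate a failure on the first attempt


def job_c():
    calls.append("C")
    return True


jobs = [("A", job_a), ("B", job_b), ("C", job_c)]
done = set()
first = run_sequence(jobs, done)   # A succeeds, B fails, C is not started
second = run_sequence(jobs, done)  # A is skipped, B reruns, then C runs
```

After both runs, Job A has executed once and Job B twice, which is the job-level restart behavior the question describes.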
QUESTION: 110
Which statement is true about improving job performance when using Balanced
Optimization?
A. Convert a job to use bulk staging tables for Big Data File stages.
B. Balanced Optimization attempts to balance the work between the source server,
target server, and the job.
C. If the job contains an Aggregator stage, data reduction stages will be pushed into a
target data server by default.
D. To ensure that a particular stage can only be pushed into a source or target
connector, you can set the Stage Affinity property to source or target.
Answer: D
QUESTION: 111
In your parallel job design you have selected a parallel shared container to be
included. Which area of your job design is required to be configured to use the
parallel shared container?
D. Configure the number of input and/or output links to support the parallel shared
container.
Answer: D
QUESTION: 112
A job design reads from a complex flat file, performs some transformations on the
data, and outputs the results to an ISD output stage. How can parameter values
be passed to this job at run time?
Answer: C
QUESTION: 113
What two binding types are supported by Information Services Director (ISD) for a
parallel job that is designed to be used as a service? (Choose two.)
A. EJB
B. SQL
C. HDFS
D. SOAP
E. STREAMS
Answer: A, D
QUESTION: 114
Modular development techniques should be used to maximize re-use of DataStage
jobs and components. Which statement represents Modular Development techniques?
D. Job parameterization allows a single job design to process similar logic instead of
creating multiple copies of the same job.
Answer: D
QUESTION: 115
Click on the Exhibit button.
Provided you have enough system resources, what is the maximum number of jobs
that could be running concurrently in this image?
A. 2
B. 3
C. 4
D. 5
Answer: B
QUESTION: 116
You would like to run a particular processing job within a job sequence for each
weekday. What two methods could be used? (Choose two.)
Answer: C, E
QUESTION: 117
When you run a parallel job, any error messages and warnings are written to the job
log and can be viewed from the Director client. What message handler level is
available?
A. stage level
B. record level
C. project level
D. parameter level
Answer: C
QUESTION: 118
Which two commands start a DataStage job named procRows in a project named
DSProject and pass the value 200 to the numRows parameter? (Choose two.)
Answer: B, E
QUESTION: 119
What two repositories does the Source Code Control Integration for DataStage and
QualityStage serve? (Choose two.)
A. WebSphere Repository
B. Source Code Repository
C. Universe Project Repository
D. Information Analyzer Repository
E. Information Server metadata Repository
Answer: B, E
QUESTION: 120
You have finished changes to many jobs and shared containers. You must export all
of your changes and integrate them into a test project with other objects. What is a
way to select the objects you changed for the export?
Answer: D
QUESTION: 121
You have made a copy of your job and made major changes to a job in your project.
You now want to identify all the changes that have been made. What task will allow
you to identify these changes?
Answer: C
QUESTION: 122
What two Information Server tools can be used to create deployment packages for
DataStage jobs? (Choose two.)
A. istool
B. Information Server Console
C. Information Server Manager
D. InfoSphere Metadata Asset Manager
E. Information Server Web Console - Administration tab
Answer: A, C
QUESTION: 123
Two new columns have been deleted from a sequential file that a number of
DataStage jobs write to. You have imported a table definition for the changed file.
How can you determine what jobs might be affected by this change?
A. Select the new table definition, right-click, then click on the Where used
command.
B. Select the original table definition, right-click, then click on the Where used
command.
C. Select the original table definition, right-click, then click on the Find dependencies
command.
D. Select the new table definition, right-click, click on the Compare against
command, then select original table definition.
Answer: B
QUESTION: 124
A job using a one-node configuration file writes to a target Sequential File stage. The
target Sequential File stage has been set to write to three different sequential files.
How many instances of the Sequential File stage will run in parallel?
A. One.
B. Two.
C. Three.
D. The job will abort, because a configuration file with at least three nodes is
required.
Answer: C
QUESTION: 125
The data going into the target Sequential File stage is sorted in each partition by the
Date field. You want the data in the sequential file to be sorted by Date. Which
collection algorithm should you choose in the Sequential File stage?
A. Auto
B. Ordered
C. Sort Merge
D. Round Robin
Answer: C
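The difference between collection methods can be shown with a simplified model. The sketch below is plain Python, not DataStage internals: the partition contents and the helper names `sort_merge_collect` and `round_robin_collect` are assumptions for illustration. A sort-merge style collector repeatedly takes the smallest next row across partitions, so per-partition order becomes a global order; a round-robin style collector takes one row from each partition in turn and loses that order.

```python
import heapq
from itertools import zip_longest

# Three partitions, each already sorted by date (illustrative data)
partitions = [
    ["2024-01-02", "2024-01-05"],
    ["2024-01-01", "2024-01-04"],
    ["2024-01-03", "2024-01-06"],
]


def sort_merge_collect(parts):
    """Merge sorted partitions into one globally sorted list."""
    return list(heapq.merge(*parts))


def round_robin_collect(parts):
    """Take one row from each partition in turn until all are empty."""
    missing = object()  # sentinel for partitions that ran out of rows
    return [
        row
        for group in zip_longest(*parts, fillvalue=missing)
        for row in group
        if row is not missing
    ]


merged = sort_merge_collect(partitions)
interleaved = round_robin_collect(partitions)
```

Here `merged` comes out in date order, while `interleaved` begins with 2024-01-02 followed by 2024-01-01, so the output file would not be sorted.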
QUESTION: 126
A job design consists of an input Sequential File stage, a Sort stage, followed by a
Transformer stage and an output Sequential File stage. In the input Sequential File
stage, the "Number of readers per node" property has been set to 3. The job is run on
an SMP machine with a configuration file defined with 3 nodes. The
$APT_DISABLE_COMBINATION variable is set to True. How many player
processes will this job generate?
A. 6
B. 8
C. 10
D. 12
Answer: B
QUESTION: 127
A job using a three-node configuration file writes to a target Sequential File stage.
The target Sequential File stage has been set to write to two different sequential files.
How many instances of the Sequential File stage will run?
A. One instance.
B. Two instances.
C. Three instances.
D. Four instances.
Answer: B
QUESTION: 128
Each row contains the CustID, ProductID, ProductType and TotalAmount columns.
You want to retain the record with the largest value in the TotalAmount column for
each CustID and ProductType using a RemoveDuplicates stage. How can this be
accomplished?
Answer: A
QUESTION: 129
What two computer system resources on the DataStage engine are monitored in the
Operations Console? (Choose two.)
A. Disk
B. CPU
C. Memory
D. Scratch disk
E. XMETA database
Answer: B, C
QUESTION: 130
You want to better understand the tsort operators that are inserted into the job flow at
runtime. Which environment variable can provide this information?
A. $OSH_DUMP
B. $APT_DUMP_SCORE
C. $APT_STARTUP_STATUS
D. $APT_PM_PLAYER_MEMORY
Answer: B
QUESTION: 131
Which job log message heading identifies the score in the job log?
Answer: D
QUESTION: 132
You are experiencing performance issues for a given job. You are assigned the task of
understanding what is happening at run time for that job. What step should you take
to understand the job performance issues?
Answer: C
QUESTION: 133
The job in the exhibit runs to successful completion with no warnings. However, a
validation test identifies that the CUSTOMER_MASTER table is empty. You
suspect the cause to be related to the NEW_CUSTOMERS table or a defect in the
job. Examining runtime metadata may help you track down the problem. Which two
product features will help you isolate the cause of the problem? (Choose two.)
Answer: A, C
QUESTION: 134
Which option is required to identify a particular job's player processes?
Answer: B