IBM InfoSphere



Call Us: 01225 339705 Address: Verhoef Training, 11 Kingsmead Square, Bath, BA1 2AB



Audience

Project administrators and ETL developers responsible for data extraction and transformation using DataStage.

Dates (Bath):

30 Jan 2018

20 Mar 2018

9 Jul 2018

1 Oct 2018


Also available on your site; please call for details.

Prerequisites

You should have:

  • Basic knowledge of the Windows operating system
  • Familiarity with database access techniques


Duration

4 days.


Objectives

This course enables project administrators and ETL developers to acquire the skills necessary to develop parallel jobs in DataStage. The emphasis is on developers: only the administrative functions relevant to DataStage developers are fully discussed. Students will learn to create parallel jobs that access sequential and relational data, and that combine and transform the data using functions and other job components.

Course objectives include:

  • Describe the uses of DataStage and the DataStage workflow
  • Describe the Information Server architecture and how DataStage fits within it
  • Describe the Information Server and DataStage deployment options
  • Use the Information Server Web Console and the DataStage Administrator client to create DataStage users and to configure the DataStage environment
  • Import and export DataStage objects to a file
  • Import table definitions for sequential files and relational tables
  • Design, compile, run, and monitor DataStage parallel jobs
  • Design jobs that read from and write to sequential files
  • Describe the DataStage parallel processing architecture
  • Design jobs that combine data using joins and lookups
  • Design jobs that sort and aggregate data
  • Implement complex business logic using the DataStage Transformer stage
  • Debug DataStage jobs using the DataStage PX Debugger


Topics

Introduction to DataStage

Deployment

DataStage Administration

Work with Metadata

Create Parallel Jobs

Access Sequential Data

Partitioning and Collecting Algorithms

Combine Data

Group Processing Stages

Transformer Stage

Repository Functions

Work with Relational Data

Control Jobs
