Banesco Basico DataStage DIA 01-1 chapter 1pdf

ssuser6df8c1 2 views 149 slides Sep 16, 2025
Slide 1
Slide 1 of 149
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33
Slide 34
34
Slide 35
35
Slide 36
36
Slide 37
37
Slide 38
38
Slide 39
39
Slide 40
40
Slide 41
41
Slide 42
42
Slide 43
43
Slide 44
44
Slide 45
45
Slide 46
46
Slide 47
47
Slide 48
48
Slide 49
49
Slide 50
50
Slide 51
51
Slide 52
52
Slide 53
53
Slide 54
54
Slide 55
55
Slide 56
56
Slide 57
57
Slide 58
58
Slide 59
59
Slide 60
60
Slide 61
61
Slide 62
62
Slide 63
63
Slide 64
64
Slide 65
65
Slide 66
66
Slide 67
67
Slide 68
68
Slide 69
69
Slide 70
70
Slide 71
71
Slide 72
72
Slide 73
73
Slide 74
74
Slide 75
75
Slide 76
76
Slide 77
77
Slide 78
78
Slide 79
79
Slide 80
80
Slide 81
81
Slide 82
82
Slide 83
83
Slide 84
84
Slide 85
85
Slide 86
86
Slide 87
87
Slide 88
88
Slide 89
89
Slide 90
90
Slide 91
91
Slide 92
92
Slide 93
93
Slide 94
94
Slide 95
95
Slide 96
96
Slide 97
97
Slide 98
98
Slide 99
99
Slide 100
100
Slide 101
101
Slide 102
102
Slide 103
103
Slide 104
104
Slide 105
105
Slide 106
106
Slide 107
107
Slide 108
108
Slide 109
109
Slide 110
110
Slide 111
111
Slide 112
112
Slide 113
113
Slide 114
114
Slide 115
115
Slide 116
116
Slide 117
117
Slide 118
118
Slide 119
119
Slide 120
120
Slide 121
121
Slide 122
122
Slide 123
123
Slide 124
124
Slide 125
125
Slide 126
126
Slide 127
127
Slide 128
128
Slide 129
129
Slide 130
130
Slide 131
131
Slide 132
132
Slide 133
133
Slide 134
134
Slide 135
135
Slide 136
136
Slide 137
137
Slide 138
138
Slide 139
139
Slide 140
140
Slide 141
141
Slide 142
142
Slide 143
143
Slide 144
144
Slide 145
145
Slide 146
146
Slide 147
147
Slide 148
148
Slide 149
149

About This Presentation

chap[ter 1


Slide Content

© 2019 IBM Corporation
BÁSICO DE PARALLEL JOBS
DIA 01

© 2019 IBM Corporation2
Agenda DataStageDia 01
•Introduccióna Information Server y DataStage.
•Topologías.
•Arquitecturadel Job Parallel
•Fuentes de consultas: Knowledge Center y Manuales.
•Introducciónal DataStage Designer
•CreaciónParallel Jobs
•Stages: RowGenerator, ColumnGenerator, Peek, Head y Tail.
Parte 2

© 2019 IBM Corporation
IBMINFOSPHERE
INFORMATIONSERVER

© 2019 IBM Corporation4

© 2019 IBM Corporation
What is IBM InfoSphere DataStage?

© 2019 IBM Corporation
What is IBM InfoSphere DataStage? Features

© 2019 IBM Corporation
What is IBM InfoSphere DataStage? Features

© 2019 IBM Corporation
What is IBM InfoSphere DataStage? Pre Build Stages

© 2019 IBM Corporation
What is IBM InfoSphere DataStage? Pre Build Stages

© 2019 IBM Corporation
What is IBM InfoSphere DataStage? Components

© 2019 IBM Corporation
What is IBM InfoSphere DataStage? Types Jobs

© 2019 IBM Corporation
Installation Overview

© 2019 IBM Corporation
Installation Overview -Tiers

© 2019 IBM Corporation
Installation Overview –Client Tier

© 2019 IBM Corporation
Installation Overview –Services Tier

© 2019 IBM Corporation
Installation Overview –MetaData Tier

© 2019 IBM Corporation17
ARQUITECTURA PARALLEL JOBS

© 2019 IBM Corporation18
Arquitectura en Paralelo

© 2019 IBM Corporation19
DataStage documentación job tipo Parallel

© 2019 IBM Corporation20
Conceptos claves de paralelismo

© 2019 IBM Corporation21
Scalables Ambientes de Hardware

© 2019 IBM Corporation22
Procesamiento tradicional en Batch

© 2019 IBM Corporation23
Paralelismo de Tuberias

© 2019 IBM Corporation24
Paralelismo de Particionamiento

© 2019 IBM Corporation25
Ilustración de Particionamiento

© 2019 IBM Corporation26
DataStage Combina Paralelismo y Pipelining

© 2019 IBM Corporation27
Diseño de Jobs versus Ejecución

© 2019 IBM Corporation28
Definiendo el Paralelismo

© 2019 IBM Corporation29
Archivo de Configuracion

© 2019 IBM Corporation30
Archivo de Configuracion mostrado en el job log

© 2019 IBM Corporation
KNOWLEDGE CENTER

© 2019 IBM Corporation32
https://www.ibm.com/support/knowledgecenter/en/SSZJPZ_11.3.0/com.ibm.swg.im.iis.produ
ctization.iisinfsv.relinfo.doc/topics/iisihrinfo_infsv_rnote_v113.html

© 2019 IBM Corporation33

© 2019 IBM Corporation34

© 2019 IBM Corporation35
Manuales del Information Server
https://www.ibm.com/support/pages/infosphere-information-server-version-113-product-documentation

© 2019 IBM Corporation36

© 2019 IBM Corporation37
INTRODUCCION AL
DATASTAGEDESIGNER

© 2019 IBM Corporation38
Creación Parallel Job

© 2019 IBM Corporation39
Arrastrar Stages y Links

© 2019 IBM Corporation40
Renombrar Stages y Links

© 2019 IBM Corporation41
Vista desde el Director

© 2019 IBM Corporation42
Job Log visto desde el Designer

© 2019 IBM Corporation43
CREACIONDE PARALLELJOBS

© 2019 IBM Corporation44
¿Qué es un Parallel Job?

© 2019 IBM Corporation45
Stage Row Generator

© 2019 IBM Corporation46
Stage Column Generator
•Generatemockdata forspecificcolumnsforeachdata row
processed
•Single input link ; Single Output link
•OnPropertiesTab, specifyhowtheColumnGeneratorstage
operates: Explicit, SchemaFile.
•Algorithmsforintegertype
✓Random; Seed, Limit
✓Cycle: Initalvalue, Increment
•AlgorithmsforStringtype: Cycle, alphabet
•AlgorithmsforDate type: Random, cycle

© 2019 IBM Corporation47
Stage Peek

© 2019 IBM Corporation48
Stage Head
•Selects the first Nrows from each partition of an input data sets
and copies the selected rows to an output data set
•Single input link ; Single Output link
•OnPropertiesTab, specify:
✓The number of rows to copy
✓The partition from which the rows are copied
✓The location of the rows to copy
✓The number of rows to skip before the copying operation begins
•This stage is helpful in testing and debugging applications with
large data sets. For example,
✓The Partition property lets you see data from a single partition to
determine if the data is being partitioned as you want it to be.
✓The Skip property lets you access a certain portion of a data set.

© 2019 IBM Corporation49
Stage Tail
•The Tail Stage selects the last N records from each partition of an
input data set and copies the selected records to an output data
set
•Single input link ; Single Output link
•OnPropertiesTab, specify:
✓The number of records to copy
✓The partition from which the records are copied
•This stage is helpful in testing and debugging applications with
large data sets. For example,
✓The Partition property lets you see data from a single partition to
determine if the data is being partitioned as you want it to be.
✓The Skip property lets you access a certain portion of a data set.

© 2019 IBM Corporation50
Agenda DataStage
BÁSICO DE PARALLEL JOBS
•Importaciónde Metadata
•Accediendo data Secuencial.
•Stages: Secuencial File, Data Set, Copy.
•Accediendo data No Estructurada.
•Stage: UnstructuredData.
•Particionamiento, colección y archivo de Configuración.
•Combinando Datos
•Stages: Lookup, Merge, Join, Funnel
Parte 1

© 2019 IBM Corporation
IMPORTACION DE METADATA

© 2019 IBM Corporation52
SourceandTargetMetadata

© 2019 IBM Corporation53
Sequencialfile importprocedure

© 2019 IBM Corporation54
Importingsequencialmetadata

© 2019 IBM Corporation55
Sequencialimportwindow

© 2019 IBM Corporation56
Specifyformat

© 2019 IBM Corporation57
Editcolumnsnamesandtypes

© 2019 IBM Corporation58
ExtendedPropertiesWindows

© 2019 IBM Corporation59
Tabledefinitionintherepository

© 2019 IBM Corporation
ACCEDIENDO DATA SECUENCIAL
SEQUENTIALFILE STAGE

© 2019 IBM Corporation61
Cómo se maneja la data sequencial

© 2019 IBM Corporation62
Caracteristicas del Stage Sequential File

© 2019 IBM Corporation63
Ejemplo de Formato Sequential File

© 2019 IBM Corporation64
Diseño de Job con stage Sequential File

© 2019 IBM Corporation65
Propiedades Stage Sequential File

© 2019 IBM Corporation66
Tab Format

© 2019 IBM Corporation67
Tab Columns

© 2019 IBM Corporation68
Múltiples Lectores

© 2019 IBM Corporation69
Escribiendo en un Sequential File

© 2019 IBM Corporation70
Links de Rechazos (reject)

© 2019 IBM Corporation71
Links de Rechazos

© 2019 IBM Corporation72
Fuentes y Destinos links de Rechazdos

© 2019 IBM Corporation73
Ajuste Propiedad Modo de Rechazo

© 2019 IBM Corporation74
ACCEDIENDO DATA SECUENCIAL
Copystage

© 2019 IBM Corporation75
Copy Stage

© 2019 IBM Corporation76
Ejemplo Copy Stage

© 2019 IBM Corporation77
Tab Mapping

© 2019 IBM Corporation78
ACCEDIENDO DATA SECUENCIAL
Data Set stage

© 2019 IBM Corporation79
Data Set Stage

© 2019 IBM Corporation80
Job con un Data Set de destino

© 2019 IBM Corporation81
Utility Data Set Management

© 2019 IBM Corporation82
Visualizar data y Schema

© 2019 IBM Corporation
ACCEDIENDO DATA NO ESTRUCTURADA
UNSTRUCTURED DATA STAGE

© 2019 IBM Corporation84
UnStructured File Stage
•Unstructureddataisinformationthatdoesnothaveapredefined
datamodelordoesnotfitwellintorelationaltables.Unstructured
datacanbetextfrombooks,journals,metadata,audio,video
files,thebodyofwordprocessordocuments,webpages,and
presentationcharts
•Oneinput link
•MultipleOutput link (largeamountofmemory)
✓Onlysupportformat(.xlsx)
✓MaximunnumberofExcel recordsis1.048.576
•Currentlysupportsonlyreadand writestoExcel sheets:
✓Excel 97-2003 (.xls)
✓Excel 2007-2010 (.xlsx)
•Supports password-encrypted files
•Doesn´t support Excel files created on Mac

© 2019 IBM Corporation85
Visualizar data y Schema, Lectura hoja Excel

© 2019 IBM Corporation86
Primera Hoja del Libro , no se procesa

© 2019 IBM Corporation87
Segunda Hoja a Procesar

© 2019 IBM Corporation88
Propiedades

© 2019 IBM Corporation89
Configurar

© 2019 IBM Corporation90
Resultado de la Extracción

© 2019 IBM Corporation91
Grabación data y Schema, grabación en hoja Excel

© 2019 IBM Corporation92
Visualizar data de entrada

© 2019 IBM Corporation93
Visualizar columnas

© 2019 IBM Corporation94
Propiedades

© 2019 IBM Corporation95
Configurar

© 2019 IBM Corporation96
Visualizar

© 2019 IBM Corporation
PARTICIONAMIENTO,
COLECCIÓN Y
ARCHIVO DE CONFIGURACIÓN

© 2019 IBM Corporation98
Paralelismo de Partición

© 2019 IBM Corporation99
Stage partitioning

© 2019 IBM Corporation100
Algoritmos de Particionamiento

© 2019 IBM Corporation101
Collecting

© 2019 IBM Corporation102
Collecting

© 2019 IBM Corporation103
Algoritmos de Coleccionamiento

© 2019 IBM Corporation104
Algoritmos de Particionamiento

© 2019 IBM Corporation105
KEYLESS vs KEYED Algoritmos de Particionamiento

© 2019 IBM Corporation106
Particionamiento Round Robin y Random

© 2019 IBM Corporation107
Particionamiento Completo (KEYLESS)

© 2019 IBM Corporation108
Particionamiento HASH (KEYED)

© 2019 IBM Corporation109
Particionamiento HASH (KEYED)

© 2019 IBM Corporation110
Particionamiento MODULUS (KEYED)

© 2019 IBM Corporation111
Particionamiento Auto

© 2019 IBM Corporation112
Iconos Partitioning / Collecting

© 2019 IBM Corporation113
Otros Iconos Partitioning / Collecting

© 2019 IBM Corporation114
Archivo de Configuración

© 2019 IBM Corporation115
Uso del Archivo de Configuración

© 2019 IBM Corporation116
Ejemplo Archivo de Configuración

© 2019 IBM Corporation117
Editando Archivos de Configuración

© 2019 IBM Corporation
COMBINANDO DATOS
STAGELOOKUP

© 2019 IBM Corporation119
Combinando datos

© 2019 IBM Corporation120
Características Stage Lookup

© 2019 IBM Corporation121
Ejemplo Stage Lookup búsqueda exacta

© 2019 IBM Corporation122
Stage Lookup con búsqueda exacta

© 2019 IBM Corporation123
Definiendo clave para la búsqueda (Lookup)

© 2019 IBM Corporation124
Especificando columnas de salida

© 2019 IBM Corporation125
Acciones en falla de busqueda

© 2019 IBM Corporation126
Especificando acciones en falla de busqueda

© 2019 IBM Corporation127
Búsqueda Con Link De Rechazo

© 2019 IBM Corporation128
Funcionamiento Stage Lookup

© 2019 IBM Corporation129
Fallas Stage Lookup (ejemplos)

© 2019 IBM Corporation
COMBINANDO DATOS
STAGEJOIN

© 2019 IBM Corporation131
Stage Join

© 2019 IBM Corporation132
Stage Join ejemplo

© 2019 IBM Corporation133
Propiedades Stage Join

© 2019 IBM Corporation134
Tab Mapping Salidas

© 2019 IBM Corporation135
Funcionamiento Stage Lookup

© 2019 IBM Corporation136
Salida Inner Join

© 2019 IBM Corporation137
Salida Left Outer Join

© 2019 IBM Corporation138
Salida Right Outer Join

© 2019 IBM Corporation139
Salida Full Outer Jooin

© 2019 IBM Corporation
COMBINANDO DATOS
STAGEMERGE

© 2019 IBM Corporation141
Stage Merge

© 2019 IBM Corporation142
Ejemplo Stage Merge

© 2019 IBM Corporation143
Propiedades Stage Merge

© 2019 IBM Corporation144
Comparasion entre Join, Lookup y Merge

© 2019 IBM Corporation
COMBINANDO DATOS
STAGEFUNNEL

© 2019 IBM Corporation146
¿Qué es un stage Funnel ?

© 2019 IBM Corporation147
Ejemplo Stage Funnel

© 2019 IBM Corporation148
Propiedades Funnel Stage

© 2019 IBM Corporation149
Gracias.!
Tags