Talend DI User Guide 5.2.2 (English).pdf
Updated 2023-03-16
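Talend DI User Guide 5.2.2, English edition: an essential manual for learning Talend. Talend is an ETL tool used to extract data from different databases, transform it, and load it into a common database. (DI stands for data integration, not data mining.)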
Talend Open Studio
for Data Integration
User Guide
5.2.2
Talend Open Studio for Data Integration
Adapted for Talend Open Studio for Data Integration 5.2.2. Supersedes previous User Guide releases.
Copyleft
This documentation is provided under the terms of the Creative Commons Public License (CCPL).
For more information about what you can and cannot do with this documentation in accordance with the CCPL,
please read: http://creativecommons.org/licenses/by-nc-sa/2.0/
Notices
All brands, product names, company names, trademarks and service marks are the properties of their respective
owners.
Table of Contents
Preface ............................................... vii
1. General information ......................... vii
1.1. Purpose .............................. vii
1.2. Audience ............................ vii
1.3. Typographical conventions .......... vii
2. Feedback and Support ...................... vii
Chapter 1. Data integration and Talend Studio .......... 1
1.1. Data analytics ............................... 2
1.2. Operational integration ..................... 2
Chapter 2. Getting started with Talend Studio .......... 5
2.1. Important concepts in Talend Open Studio for Data Integration .......... 6
2.2. Launching Talend Open Studio for Data Integration .......... 6
2.2.1. How to launch the Studio for the first time .......... 6
2.2.2. How to set up a project ............ 10
2.3. Working with different workspace directories .......... 10
2.3.1. How to create a new workspace directory .......... 11
2.4. Working with projects ..................... 11
2.4.1. How to create a project ............ 12
2.4.2. How to import the demo project .......... 14
2.4.3. How to import projects ............ 15
2.4.4. How to open a project ............. 17
2.4.5. How to delete a project ............ 17
2.4.6. How to export a project ........... 18
2.4.7. Migration tasks .................... 19
2.5. Setting Talend Open Studio for Data Integration preferences .......... 20
2.5.1. Java Interpreter path (Talend) ..... 20
2.5.2. Designer preferences (Talend > Appearance) .......... 21
2.5.3. BPM Runtime preferences (Talend > BPM Runtime Configuration) .......... 22
2.5.4. External or User components (Talend > Components) .......... 23
2.5.5. Documentation preferences (Talend > Documentation) .......... 24
2.5.6. Exchange preferences (Talend > Exchange) .......... 25
2.5.7. Adding code by default (Talend > Import/Export) .......... 25
2.5.8. Language preferences (Talend > Internationalization) .......... 26
2.5.9. Performance preferences (Talend > Performance) .......... 26
2.5.10. Debug and Job execution preferences (Talend > Run/Debug) .......... 28
2.5.11. Displaying special characters for schema columns (Talend > Specific settings) .......... 29
2.5.12. Schema preferences (Talend > Specific Settings) .......... 29
2.5.13. Libraries preferences (Talend > Specific Settings) .......... 30
2.5.14. Type conversion (Talend > Specific Settings) .......... 31
2.5.15. SQL Builder preferences (Talend > Specific Settings) .......... 31
2.5.16. Usage Data Collector preferences (Talend > Usage Data Collector) .......... 32
2.6. Customizing project settings .............. 33
2.6.1. Palette Settings .................... 34
2.6.2. Version management .............. 35
2.6.3. Status management ................ 37
2.6.4. Job Settings ....................... 38
2.6.5. Stats & Logs ...................... 39
2.6.6. Context settings ................... 40
2.6.7. Project Settings use ............... 41
2.6.8. Status settings ..................... 42
2.6.9. Security settings ................... 44
2.7. Filtering entries listed in the Repository tree view .......... 44
2.7.1. How to filter by Job name ......... 44
2.7.2. How to filter by user .............. 46
2.7.3. How to filter by job status ......... 48
2.7.4. How to choose what repository nodes to display .......... 48
Chapter 3. Designing a Business Model .......... 51
3.1. What is a Business Model ................. 52
3.2. Opening or creating a Business Model .......... 52
3.2.1. How to open a Business Model .......... 53
3.2.2. How to create a Business Model .......... 53
3.3. Modeling a Business Model ............... 54
3.3.1. Shapes ............................. 54
3.3.2. Connecting shapes ................ 55
3.3.3. How to comment and arrange a model .......... 57
3.3.4. Business Models .................. 59
3.4. Assigning repository elements to a Business Model .......... 61
3.5. Editing a Business Model .................. 62
3.5.1. How to rename a Business Model .......... 62
3.5.2. How to copy and paste a Business Model .......... 62
3.5.3. How to move a Business Model .......... 62
3.5.4. How to delete a Business Model .......... 62
3.6. Saving a Business Model .................. 62
Chapter 4. Designing a data integration Job .......... 65
4.1. What is a Job design ...................... 66
4.2. Getting started with a basic Job design .......... 66
4.2.1. How to create a Job ............... 66
4.2.2. How to drop components to the workspace .......... 69
4.2.3. How to search components in the Palette .......... 71
4.2.4. How to connect components together .......... 71
4.2.5. How to drop components in the middle of a Row link .......... 72
4.2.6. How to define component properties .......... 73
4.2.7. How to run a Job .................. 79
4.2.8. How to customize your workspace .......... 85
4.3. Using connections .......................... 89
4.3.1. Connection types .................. 90
4.3.2. How to define connection settings .......... 94
4.4. Using the Metadata Manager ............. 96
4.4.1. How to centralize the Metadata items .......... 96
4.4.2. How to centralize contexts and variables .......... 97
4.4.3. How to use the SQL Templates .......... 107
4.5. Handling Jobs: advanced subjects ....... 107
4.5.1. How to map data flows .......... 107
4.5.2. How to create queries using the SQLBuilder .......... 108
4.5.3. How to download/upload Talend Community components .......... 111
4.5.4. How to install external modules .......... 118
4.5.5. How to launch a Job periodically (feature deprecated) .......... 120
4.5.6. How to use the tPrejob and tPostjob components .......... 122
4.5.7. How to use the Use Output Stream feature .......... 123
4.6. Handling Jobs: miscellaneous subjects .......... 124
4.6.1. How to share a database connection .......... 124
4.6.2. How to define the Start component .......... 125
4.6.3. How to handle error icons on components or Jobs .......... 126
4.6.4. How to add notes to a Job design .......... 128
4.6.5. How to display the code or the outline of your Job .......... 129
4.6.6. How to manage the subjob display .......... 130
4.6.7. How to define options on the Job view .......... 132
4.6.8. How to find components in Jobs .......... 133
4.6.9. How to set default values in the schema of a component .......... 135
Chapter 5. Managing data integration Jobs .......... 137
5.1. Activating/Deactivating a Job or a sub-job .......... 138
5.1.1. How to disable a Start component .......... 138
5.1.2. How to disable a non-Start component .......... 138
5.2. Importing/exporting items or Jobs ....... 139
5.2.1. How to import items ............. 139
5.2.2. How to export Jobs .............. 141
5.2.3. How to export items ............. 152
5.2.4. How to change context parameters in Jobs .......... 154
5.3. Managing repository items ............... 155
5.3.1. How to handle updates in repository items .......... 155
5.4. Searching a Job in the repository ........ 157
5.5. Managing Job versions ................... 159
5.6. Documenting a Job ....................... 160
5.6.1. How to generate HTML documentation .......... 160
5.6.2. How to update the documentation on the spot .......... 161
5.7. Handling Job execution .................. 161
5.7.1. How to deploy a Job on SpagoBI server .......... 161
Chapter 6. Mapping data flows ............ 165
6.1. tMap and tXMLMap interfaces ......... 166
6.2. tMap operation ........................... 167
6.2.1. Setting the input flow in the Map Editor .......... 168
6.2.2. Mapping variables ............... 175
6.2.3. Using the expression editor ...... 176
6.2.4. Mapping the Output setting ...... 180
6.2.5. Setting schemas in the Map Editor .......... 185
6.2.6. Solving memory limitation issues in tMap use .......... 188
6.2.7. Handling Lookups ............... 190
6.3. tXMLMap operation ..................... 191
6.3.1. Using the document type to create the XML tree .......... 192
6.3.2. Defining the output mode ........ 202
6.3.3. Editing the XML tree schema .... 207
Chapter 7. Managing Metadata ............ 209
7.1. Objectives ................................. 210
7.2. Setting up a DB connection .............. 211
7.2.1. Step 1: General properties ........ 211
7.2.2. Step 2: Connection ............... 212
7.2.3. Step 3: Table upload ............. 213
7.2.4. Step 4: Schema definition ........ 216
7.3. Setting up a Hive connection ............. 217
7.3.1. Prerequisites ..................... 217
7.3.2. Step 1: General properties ........ 217
7.3.3. Step 2: Connection ............... 217
7.4. Setting up a JDBC schema ............... 219
7.4.1. Step 1: General properties ........ 219
7.4.2. Step 2: Connection ............... 219
7.4.3. Step 3: Table upload ............. 221
7.4.4. Step 4: Schema definition ........ 222
7.5. Setting up a SAS connection ............. 222
7.5.1. Prerequisites ..................... 222
7.5.2. Step 1: General properties ........ 222
7.5.3. Step 2: Connection ............... 222
7.6. Setting up a File Delimited schema ...... 224
7.6.1. Step 1: General properties ........ 224
7.6.2. Step 2: File upload ............... 225
7.6.3. Step 3: Schema definition ........ 225
7.6.4. Step 4: Final schema ............. 227
7.7. Setting up a File Positional schema ...... 228
7.7.1. Step 1: General properties ........ 229
7.7.2. Step 2: Connection and file upload .......... 229
7.7.3. Step 3: Schema refining .......... 230
7.7.4. Step 4: Finalizing the end schema .......... 230
7.8. Setting up a File Regex schema .......... 231
7.8.1. Step 1: General properties ........ 231
7.8.2. Step 2: File upload ............... 231
7.8.3. Step 3: Schema definition ........ 232
7.8.4. Step 4: Finalizing the end schema .......... 232
7.9. Setting up an XML file schema .......... 232
7.9.1. Setting up an XML schema for an input file .......... 233
7.9.2. Setting up an XML schema for an output file .......... 240
7.10. Setting up a File Excel schema ......... 249
7.10.1. Step 1: General properties ...... 250
7.10.2. Step 2: File upload .............. 250
7.10.3. Step 3: Schema refining ........ 251
7.10.4. Step 4: Finalizing the end schema .......... 252
7.11. Setting up a File LDIF schema ......... 253
7.11.1. Step 1: General properties ...... 253
7.11.2. Step 2: File upload .............. 253
7.11.3. Step 3: Schema definition ....... 254
7.11.4. Step 4: Finalizing the end schema .......... 255
7.12. Setting up an LDAP schema ............ 255
7.12.1. Step 1: General properties ...... 256
7.12.2. Step 2: Server connection ....... 256
7.12.3. Step 3: Authentication and DN fetching .......... 256
7.12.4. Step 4: Schema definition ....... 258
7.12.5. Step 5: Finalizing the end schema .......... 258
7.13. Setting up a Salesforce connection ...... 259
7.13.1. Step 1: General properties ...... 260
7.13.2. Step 2: Connection to a Salesforce account .......... 260
7.13.3. Step 3: Retrieving Salesforce modules .......... 260
7.13.4. Step 4: Retrieving Salesforce schemas .......... 262
7.13.5. Step 5: Finalizing the end schema .......... 263
7.14. Setting up a Generic schema ............ 265
7.14.1. Setting up a Generic schema from scratch .......... 265
7.14.2. Setting up a Generic schema from a source xml file .......... 267
7.15. Setting up an MDM connection ........ 269
7.15.1. Step 1: Setting up the connection .......... 269
7.15.2. Step 2: Defining MDM schema .......... 271
7.16. Setting up a Web Service schema ....... 285
7.16.1. Setting up a simple schema ..... 285
7.17. Setting up an FTP connection .......... 288
7.17.1. Step 1: General properties ...... 288
7.17.2. Step 2: Connection ............. 289
7.18. Exporting Metadata as context ......... 291
Chapter 8. Managing routines .............. 293
8.1. What are routines ........................ 294
8.2. Accessing the System Routines ........... 294
8.3. Customizing the system routines ......... 295
8.4. Managing user routines .................. 296
8.4.1. How to create user routines ...... 296
8.4.2. How to edit user routines ........ 298
8.4.3. How to edit user routine libraries .......... 298
8.5. Calling a routine from a Job ............. 300
8.6. Use case: Creating a file for the current date .......... 300
Chapter 9. Using SQL templates ........... 303
9.1. What is ELT .............................. 304
9.2. Introducing Talend SQL templates ...... 304
9.3. Managing Talend SQL templates ........ 304
9.3.1. Types of system SQL templates .......... 305
9.3.2. How to access a system SQL template .......... 305
9.3.3. How to create user-defined SQL templates .......... 307
9.3.4. A use case of system SQL templates .......... 309
Appendix A. GUI ............................... 313
A.1. Main window ............................. 314
A.2. Menu bar and Toolbar .................... 315
A.2.1. Menu bar of Talend Open Studio for Data Integration .......... 315
A.2.2. Toolbar of Talend Open Studio for Data Integration .......... 316
A.3. Repository tree view ...................... 317
A.4. Design workspace ......................... 318
A.5. Palette ..................................... 319
A.6. Configuration tabs ......................... 319
A.7. Outline and code summary panel .......... 321
A.8. Shortcuts and aliases ...................... 321
Appendix B. Theory into practice: Job examples .......... 323
B.1. tMap Job example ......................... 324
B.1.1. Introducing the scenario ......... 324
B.1.2. Translating the scenario into a Job .......... 325
B.2. Using the output stream feature ........... 333
B.2.1. Introducing the scenario ......... 333
B.2.2. Translating the scenario into a Job .......... 334
Appendix C. System routines ............... 341
C.1. Numeric Routines ......................... 342
C.1.1. How to create a Sequence ....... 342
C.1.2. How to convert an Implied Decimal .......... 342
C.2. Relational Routines ........................ 342
C.3. StringHandling Routines .................. 343
C.3.1. How to store a string in alphabetical order .......... 344
C.3.2. How to check whether a string is alphabetical .......... 344
C.3.3. How to replace an element in a string .......... 344
C.3.4. How to check the position of a specific character or substring, within a string .......... 345
C.3.5. How to calculate the length of a string .......... 345
C.3.6. How to delete blank characters .......... 345
C.4. TalendDataGenerator Routines ............ 345
C.4.1. How to generate fictitious data .......... 346
C.5. TalendDate Routines ...................... 346
C.5.1. How to format a Date ........... 347
C.5.2. How to check a Date ............ 348
C.5.3. How to compare Dates .......... 348
C.5.4. How to configure a Date ......... 348
C.5.5. How to parse a Date ............. 349
C.5.6. How to retrieve part of a Date ... 349
C.5.7. How to format the Current Date .......... 349
C.6. TalendString Routines ..................... 350
C.6.1. How to format an XML string ... 350
C.6.2. How to trim a string ............. 351
C.6.3. How to remove accents from a string .......... 351
Appendix D. SQL template writing rules .......... 353
D.1. SQL statements ........................... 354
D.2. Comment lines ............................ 354
D.3. The <%...%> syntax .................... 354
D.4. The <%=...%> syntax ................... 355
D.5. The </.../> syntax .................... 355
D.6. Code to access the component schema elements .......... 356
D.7. Code to access the component matrix properties .......... 356