Talend Exchange is the place where the Talend community can share items related to Talend open source products, such as Data Integration, Data Quality and Master Data Management. Contribution is open to any user; no specific validation is needed. As soon as you have your forum account, you automatically get a Talend Exchange account.



Statistics

  • 575 extensions
  • 1000 revisions
  • 275 contributors
  • 141378 downloads
 


Version Author Released on Rating Downloads
Component

tFileExcelWorkbookSave

5.1 jlolling 2014-04-16
625

This component is designed to work with tFileExcelWorkbookOpen and tFileExcelSheetOutput.
This component writes a workbook into an output file.
This happens after tFileExcelSheetOutput has finished writing to several sheets.
Can delete sheets (typically ones previously used as templates).
Option to (re)evaluate all formulas added
Translation in German and French added
*NEW*: The output file name can be set without file extension. The correct extension will be added (or even fixed) automatically. The final file name can be retrieved from the return value FILENAME.

Component

tFileExcelSheetInput

5.1 jlolling 2014-04-11
1087

This component works in conjunction with tFileExcelWorkbookOpen.
This component reads a sheet identified by name or index.
Unlike the tFileInputExcel component, you can specify which columns you want to read and leave out what is not needed.
The fields can be conveniently configured by Excel column name or index.
It will always use the current Apache POI 3.10 libs (also for the older xls format).
The sheet name can be retrieved from the output of the new component tFileExcelSheetList.
Can read the cell comment (as an alternative to the value).
Can use the non-empty value of the previous row to fill empty values in the current row.
Can ignore cell format errors if wanted.
*NEW*: The number format can be changed for different languages/countries in the advanced options.
*NEW*: Column positions can automatically be adjusted by header line
*NEW*: Can read hyperlinks (title and/or URL) configurable in the advanced settings
*NEW*: Can limit the number of rows read.
*NEW*: Can stop reading if a row is empty
*NEW*: Column position in the header row can be found with regular expressions
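
As a rough illustration of the option listed above that fills empty values from the previous row, here is a minimal Python sketch. It is not the component's code: the function name fill_down and the representation of rows as dictionaries are assumptions made for this example, and both None and the empty string are treated as "empty".

```python
def fill_down(rows):
    """Replace empty cells with the last non-empty value seen in the same column."""
    last_seen = {}      # column name -> most recent non-empty value
    filled_rows = []
    for row in rows:
        filled = {}
        for column, value in row.items():
            if value is None or value == "":
                # Empty cell: reuse the value carried over from earlier rows.
                value = last_seen.get(column)
            else:
                last_seen[column] = value
            filled[column] = value
        filled_rows.append(filled)
    return filled_rows


rows = [
    {"region": "North", "city": "Oslo"},
    {"region": "", "city": "Bergen"},
    {"region": None, "city": "Tromso"},
]
for row in fill_down(rows):
    print(row["region"], row["city"])
```

This pattern is typical for sheets where a group label is written only in the first row of each group.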

To use this component, you have to update tFileExcelWorkbookOpen first!

For everyone who has problems installing this component: in 99.9% of cases, installation problems have nothing to do with the component. This component is tested with all releases. If you have trouble installing it, please contact me directly instead of giving the component a bad rating for problems it did not cause.
email: jan.lolling@gmail.com

Component

tFileExcelSheetOutput

5.1 jlolling 2014-04-11
892

This component is designed to work with the components tFileExcelWorkbookOpen and tFileExcelWorkbookSave. This component uses the Apache POI library version 3.10.
The goal of this component is to write or create a sheet with minimal impact on the structure of the spreadsheet (it keeps references, macros, graphics and so on as well as it can).
Many reports are specially designed Excel documents with lots of formulas and references, and the automated report creation process has to write into a well-defined area and keep the rest unchanged.

This component can also write formulas: define a String column and write the formula in English, starting with =. To track the row index in the formula, write {row} into your formula; it will be replaced with the current row index.
Example for a formula string: =A{row}+G{row}
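
The {row} substitution described above can be pictured with a small Python sketch. This is an illustration only, not the component's implementation: the function name expand_formula is hypothetical, and using the 1-based Excel row number as the row index is an assumption.

```python
def expand_formula(template: str, row_index: int) -> str:
    """Replace every {row} placeholder with the given row index."""
    return template.replace("{row}", str(row_index))


# Expand the example formula for three consecutive rows.
for row_index in (2, 3, 4):
    print(expand_formula("=A{row}+G{row}", row_index))
```

With this scheme a single formula string in the input column yields a row-specific formula in every written row.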

For every column, you can define the target column in Excel by its Excel column name, set the target column to auto-size and define the format for Date and Number columns.

There is also an option to write datasets into columns: every new row creates a new column, as if you rotated your data by 90 degrees.
This component can be used in sub jobs in iterations to create or write into many sheets.

*NEW*: Sheets can be created as a clone of another sheet.
*NEW*: Freeze option added
*NEW*: Option to allow overwriting existing cells with null (default = false)

*NEW*: Can reuse the styles of the first row or the first two rows of the template file for all data rows.
To use this feature, simply define the styles in a template sheet in a file, read this file with tFileExcelWorkbookOpen and use it as the template.
*NEW*: Can remove all surplus rows

*NEW*: Can reuse conditional formats
*NEW*: improves the handling of styles and avoids overwriting styles related also to other columns
*NEW*: Can reuse the row height from the first data row for all new rows.

Please take a look at the new usage guide linked on this component's detail page!

Please always update the component tFileExcelWorkbookOpen to get the currently required library.

Component

tFileExcelWorkbookOpen

5.1 jlolling 2014-04-11
1710

This component is the base component for all other tFileExcel-components.
This component uses the Apache POI library version 3.10.
This component reads a spreadsheet into a workbook or creates a new empty workbook in memory.
This workbook can be filled (with sheets) by the component tFileExcelSheetOutput or can be read by tFileExcelSheetList/Input components.
Finally, the component tFileExcelWorkbookSave persists the workbook to the same file that was read or to a new file.
The component recognizes the file format by the filename extension; it is not necessary to configure it (except if you create a new workbook).
*NEW*: can read and write xlsm files.
Bug fixed: reading rows stops if a row is empty or does not exist
*NEW*: Memory saving mode for XLSX type
*NEW*: Can read password protected files
*NEW*: support for IFERROR function added
*NEW*: Support for reading hyperlinks added to the library

Component

tNotesRunAgent

5.4.3 tuanport 2014-04-10
1

Component

tNotesOutput

4.5.3 tuanport 2014-04-10
0

Component

tNotesInput

5.4.3 tuanport 2014-04-10
0

Component

tUniservRTIdentitySearch

2.3.0 dqsh_uniserv 2014-04-08
82

tUniservRTIdentitySearch allows Talend jobs to search in a DQ identity RT search index for customer data records taking into account misspellings, phonetic similarities, synonyms or missing data.

To be able to use tUniservRTIdentitySearch, the search index has to be built using the tUniservRTIdentityBulk component.

To be able to use the tUniservRTIdentitySearch component, the Uniserv DQ identity RT software must be installed.

For more information on DQ identity RT, visit http://www.uniserv.com/en/products/uniserv-data-quality-functions/identity-resolution.

Component

tUniservRTIdentityOutput

2.3.0 dqsh_uniserv 2014-04-08
133

tUniservRTIdentityOutput allows Talend jobs to update an existing DQ identity RT search index that has previously been built using tUniservRTIdentityBulk by inserting data for new customers, deleting existing customer entries or modifying the attributes for existing customers.

Updates the index pool which is used for duplicate search.

To be able to use the tUniservRTIdentityOutput component, the Uniserv DQ identity RT software must be installed.

For more information on DQ identity RT, visit http://www.uniserv.com/en/products/uniserv-data-quality-functions/identity-resolution.

Component

tUniservRTIdentityBulk

2.3.0 dqsh_uniserv 2014-04-08
88

tUniservRTIdentityBulk allows Talend jobs to build a search index over a database of customer data for later use by tUniservRTIdentitySearch to efficiently and effectively retrieve addresses and identify duplicates even with misspelled or incomplete data.

Prepares the index pool for the search for duplicates.

To be able to use the tUniservRTIdentityBulk component, the Uniserv DQ identity RT software must be installed.

For more information on DQ identity RT, visit http://www.uniserv.com/en/products/uniserv-data-quality-functions/identity-resolution.


Statistics

  • 141 extensions
  • 174 revisions
  • 36 contributors
  • 15674 downloads
 


ParserRule

Tweets

1.0 scorreia 2013-11-20
18

Gets information from tweets.
Extracts the date/time, user, hashtags, referenced users and URLs from Twitter messages.
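
To give an idea of this kind of extraction, here is a minimal Python sketch using regular expressions. It is not the parser rule published on Talend Exchange: the patterns and the name parse_tweet are assumptions for illustration, and the sketch covers only hashtags, referenced users and URLs, because the date/time and user fields depend on the input format.

```python
import re

# Simplified, assumed patterns; real tweets allow more characters than \w.
HASHTAG = re.compile(r"#(\w+)")
MENTION = re.compile(r"@(\w+)")
URL = re.compile(r"https?://\S+")


def parse_tweet(text):
    """Return the hashtags, referenced users and URLs found in a message."""
    return {
        "hashtags": HASHTAG.findall(text),
        "mentions": MENTION.findall(text),
        "urls": URL.findall(text),
    }


print(parse_tweet("Loving #Talend with @scorreia http://example.com/x"))
```

A URL that itself contains # or @ would also be picked up by the hashtag or mention pattern, so a production rule would need to remove URLs first.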

Regex

Only alphabetical characters not empty

1.0 dcortinovis 2013-06-19
62

Matches only alphabetical characters.
At least one character is required (empty values are not allowed).
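
A pattern with this behaviour could look like the following Python sketch. The actual expression shipped with this extension is not shown on this page, so the pattern [A-Za-z]+ (unaccented Latin letters only) and the name is_alphabetic are assumptions.

```python
import re

# One or more unaccented Latin letters; "+" is what forbids the empty string.
ALPHA_ONLY = re.compile(r"[A-Za-z]+")


def is_alphabetic(value: str) -> bool:
    """True if the whole value consists of at least one alphabetical character."""
    return ALPHA_ONLY.fullmatch(value) is not None


for sample in ("Talend", "", "abc1"):
    print(repr(sample), is_alphabetic(sample))
```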

Indicator

EMail validation via mail server

5.4/5.3 mzhao 2013-06-03
517

This Java UDI checks email addresses by sending an SMTP request to the mail server. The code sample can be found at: http://www.rgagnon.com/javadetails/java-0452.html

Indicator

Frequency table of hours

2.0 scorreia 2013-04-25
355

This indicator helps to analyze the most frequent hours of the day that appear in date/time columns.
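
The idea can be sketched in a few lines of Python. This is only an illustration of an hour frequency table, not the indicator's definition: the name hour_frequencies is hypothetical, and skipping null values is an assumption.

```python
from collections import Counter
from datetime import datetime


def hour_frequencies(timestamps):
    """Count how often each hour of the day (0-23) occurs, ignoring nulls."""
    return Counter(ts.hour for ts in timestamps if ts is not None)


timestamps = [
    datetime(2013, 4, 25, 9, 15),
    datetime(2013, 4, 25, 9, 45),
    datetime(2013, 4, 26, 17, 0),
    None,
]
# most_common() lists the hours from most to least frequent.
print(hour_frequencies(timestamps).most_common())
```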

Indicator

Sample Standard Deviation

1.1 scorreia 2013-04-25
268

This indicator computes the sample standard deviation of any numerical column
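
For reference, the sample standard deviation divides the sum of squared deviations by n - 1 rather than n. A minimal Python sketch of that formula follows; the function name sample_std_dev is hypothetical and the code is not taken from the indicator itself.

```python
import math


def sample_std_dev(values):
    """Sample standard deviation: sqrt(sum((x - mean)^2) / (n - 1))."""
    n = len(values)
    if n < 2:
        raise ValueError("at least two values are required")
    mean = sum(values) / n
    squared_deviations = sum((x - mean) ** 2 for x in values)
    return math.sqrt(squared_deviations / (n - 1))


print(sample_std_dev([1, 2, 3, 4, 5]))
```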

Indicator

Variance

1.1 scorreia 2013-04-25
249

This indicator computes the variance of numeric columns

Indicator

Trimmed

1.0 scorreia 2013-04-25
60

Evaluates the number of values that are correctly trimmed.

Indicator

Week Frequency

2.0 scorreia 2013-04-25
270

Aggregates Date fields into weeks.

Indicator

Duplicate Rows

2.0 scorreia 2013-04-25
774

This indicator counts the number of duplicate rows.
It's different from the system indicator called "duplicate count" because it counts the number of duplicate rows, not the number of duplicate values.
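
The difference between the two counts can be shown with a short Python sketch. It is an illustration only: the function names are hypothetical, and counting each distinct row (or value) that occurs more than once exactly once is an assumption about how the indicators count.

```python
from collections import Counter


def count_duplicate_rows(rows):
    """Number of distinct whole rows that occur more than once."""
    counts = Counter(tuple(row) for row in rows)
    return sum(1 for n in counts.values() if n > 1)


def count_duplicate_values(rows, column):
    """Number of distinct values in one column that occur more than once."""
    counts = Counter(row[column] for row in rows)
    return sum(1 for n in counts.values() if n > 1)


rows = [("a", 1), ("a", 2), ("a", 1), ("b", 2)]
print(count_duplicate_rows(rows))       # whole-row comparison
print(count_duplicate_values(rows, 1))  # single-column comparison
```

In this sample only the row ("a", 1) is repeated as a whole, while the second column on its own contains two repeated values.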

Indicator

Length Range Frequency

1.1 scorreia 2013-04-25
122

Gets the length ranges of data.

Groups data according to their length range.
Ranges are the following:
data of length < 10
data of length < 20
data of length < 30
data of length >= 30
null data
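
The grouping above can be sketched in Python as follows. This is not the indicator's code: the names length_range and length_range_frequencies are hypothetical, and the sketch assumes each value falls into the first range it satisfies.

```python
from collections import Counter


def length_range(value):
    """Return the label of the length range a value belongs to."""
    if value is None:
        return "null"
    length = len(value)
    if length < 10:
        return "< 10"
    if length < 20:
        return "< 20"
    if length < 30:
        return "< 30"
    return ">= 30"


def length_range_frequencies(values):
    """Count how many values fall into each length range."""
    return Counter(length_range(v) for v in values)


print(length_range_frequencies(["Talend", None, "x" * 25, "x" * 40]))
```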


Statistics

  • 5 extensions
  • 7 revisions
  • 4 contributors
  • 3864 downloads
 


Export

Product Demo

3.0 ctoum 2012-05-31
559

Product & families, with Cafepress pictures.

Data-Model

Clinical Trials: Janus Model Basics

1.0 jaymce 2010-11-22
376

This is a model of the basics of the Janus Clinical Data Repository.
http://www.fda.gov/ForIndustry/DataStandards/StudyDataStandards/ucm155327.htm

Data-Model

D* Demo Model

1.0 ctoum 2010-08-13
698

Model used in the D* Demo.

Export

Talendshop Demo

1.0 ctoum 2010-08-04
1109

Talendshop Demo (Demo Project)

