#1 2012-04-17 15:35:19

sset
New member
Registered: 2012-04-17
Posts: 5

Talend data quality

Hello,

We understand data integration is more related to adapters to acquire feed for instance  FTP, JMS.

1.  How does Talend Parallelization work?
     Assume we need to acquire some 3-5TB of data, using TALEND  - what kind of parallel features be adopted?
     Optimizations? Concurrency?

2.  Data integration studio is related to adapters
     Data quality studio is related to transformation,validation....

     How do we integrate both - output of adapters should go validation,tranformation....
     Any chaining?

3.  How do we integrate rules engine (jboss rules) with Talend?

Thanks

Offline

#2 2012-04-23 17:44:30

sizhaoliu
Talend Team
Registered: 1970-01-01
Posts: 49
Website

Re: Talend data quality

Hi,
For Question 2, we have more than 40 integration components to resolve DQ relevant tasks such as matching, validation, consolidation, deduplication and standardization. Some tasks can be directly generated from the data analysis in the DQ profiler perspective. Besides, a DI process can send data to Data Stewardship Console to introduce human intervention. Users can also create jobs to generate DQ report.

For Question 3, about the integration of business rules, we currently offer 2 components: tBRMS and tRules.
They are included in TIS Professinal Edition.
tBRMS works with rule packages created in Drools Guvnor, while tRules works with local DRL rules.
Scenarios can be found in the component documentation.

Regards,

Offline

Board footer

Powered by FluxBB