[2012-05-16] Talend Blog: The New Use Cases of Big Data: Marketing

In The New Use Cases of Big Data series, we are reviewing some of the use cases enabled by new technology for big data.

Leveraging of big data for marketing can be real-time, on the fly behavioral analysis for cross promotions, or batch analysis and segmentation for targeting and nurturing.

In the first case – real-time – the goal is to influence consumer behavior at the point of sale or on the site. The most common example is the recommendation engine, but check out couponing also belongs to this category. For years, large organizations such as Amazon or eBay have used recommendation engines to match and recommend products, people and advertisements to users based on analysis of user profile and behavioral data. These problems were actually some of the first tackled by big data and have helped develop the technology into what it is today. Now, much smaller organizations can benefit from the processing power of Hadoop to process these large data sets and provide real-time recommendations.

In the second case – batch segmentation – massive amounts of data are analyzed, which that was just not possible or practical with traditional relational solutions.  Organizations are now able to better identify a target audience and identify the right person for the right offerings. Big data allows marketing teams to evaluate large volumes from new data sources, like click-stream data and call detail records, to increase the accuracy of analysis. Indeed, the more information made available to a marketer the more granular targets can be identified and messaged – but the traditional issue was that too much data killed data. No longer!

Starting with marketing in The New Use Cases of Big Data may seem far-fetched. But that’s where it all started. We owe big data technologies to the marketing needs of large internet organizations.  And not to worry, we’ll dabble into other domains in the next posts.

Yves

 

[2012-05-15] Talend Blog: Version 5.1 is Ready, and it’s Big

Shortly after the GA open source versions were uploaded to the download area of the website, we officially announced version 5.1 today.

Among a wealth of enhancements to Talend’s next-generation integration platform, there are certain aspects of this new release that are worth highlighting:

  • Version 5.1 includes the official GA version of Talend Open Studio for Big Data, now available under the Apache license. It was possible since March to download a technology preview of the product, and we have had lots of very positive feedback on it. Now the real thing is there… download it now!
  • After introducing BPM as a piece of the holistic integration puzzle with version 5.0, this version reinforces the links between BPM and ESB, with a new connector that provides point-and-click connectivity between a business process and web service running on the ESB.
  • This version marks the first synchronized release of all the products: data integration, data quality, MDM, ESB, BPM and big data – all leveraging the Talend Unified Platform technology foundation.

All products from the Talend Open Studio family are available for immediate download. Try them out now!

Yves

 

[2012-05-14] Talend Blog: The New Use Cases of Big Data

In The Big Use Cases of Big Data, we have discussed how large and wealthy organizations have been leveraging their big data for a long time. In a series of short blog posts, we’ll be reviewing some of the less mainstream, or more “democratic” uses cases that new technology for big data is enabling.

Like many other open source vendors, Talend was founded on the promise to democratize integration. The means to achieve this democratization is through open source that accelerates development cycles, promotes adoption, and enables companies of all sizes to use enterprise-grade technology without mortgaging their future.

Hadoop, its Apache ecosystem, and a number of tools such as Talend Open Studio for Big Data are making this democratization possible. The same way open source has democratized several IT domains that nobody wanted to predict would open to such technologies – the MDM market is a good example – open source is the key to big data democratization, or Big Data for the Masses (a great Talend whitepaper, registration required).

Stay tuned for some new use cases of big data…

Yves

 

[2012-05-11] Talend Blog: Open Source Companies Are Doing Well, Thank You Very Much

Channel-focused blog The VAR Guy recently posted an update to its Open Source 50, an “annual look at open source in the IT channel” it had launched in 2008. The purpose of the update is to analyze where each member of the original 50 is now.  And the results are quite interesting!

While not all open source companies have been striving like Red Hat (becoming the first open source vendor to reach $1 billion in revenue), or MySQL (sold to Sun for $1 billion), or Talend (100%+ growth sustained for several years), the results are quite encouraging:

  • Only 2 of the 50 companies have either closed or are on life support.  4% failure rate: I’d venture to say it’s not bad if you compare it to the overall IT industry.
  • 6 companies have been acquired, or 12% – and I would say that at least 3 or 4 of these are striving in their new homes. I have no idea how that ratio compares to industry average?
  • 8 companies downplay open source in their positioning or have forsaken open source all together. I would venture that these companies never understood what it meant to be open source, they only wanted the marketing benefits (real or supposed) to be open source but were not ready to make a commitment to the cause. I have written about this in the past, and all I’ll say is: good for them if they were able to reposition successfully.

Overall, not a bad track record.  Open source companies are doing well, thanks for asking!

Yves

 

[2012-05-09] Talend Forum Announcement: Talend Unified Platform 5.1.0 is Available

Dear Community members,

We are proud to announce that Talend's 5.1.0 Unified Platform release is available.
Download Talend's Unified Platform General Availability version.

Compared to version 5.0.0, many new features have been added. Following are links to release notes for Talend Open Studio for Data Integration, Talend Open Studio for Big Data, Talend Open Studio for Data Quality, Talend Open Studio for MDM and Talend Open Studio for ESB.

Talend Open Studio for Data Integration

http://www.talend.com/docs/community/di … notes.html

Talend Open Studio for Big Data

http://www.talend.com/docs/community/bd … notes.html

Talend Open Studio for Data Quality

http://www.talend.com/docs/community/dq … notes.html

Talend Open Studio for MDM

http://www.talend.com/docs/community/md … notes.html

Talend Open Studio for ESB

http://www.talend.com/docs/community/es … notes.html

For more information on fixed bugs and new features, go to the TalendForge Bugtracker.

Thanks for being a part of our community,
The Talend Team.

[2012-05-09] Talend Blog: The Big Use Cases of Big Data

As discussed in a previous post, What is Big Data, Anyway?, big data is nothing new. What’s new(er) are the tools are technologies that are contributing to the democratization of big data (see Tools and Technologies of Big Data for some technology basics).

While most organizations have been amassing big data for years, it is worth noting that some of them have been “doing” big data for years. By “doing”, I mean processing, leveraging, analyzing, mining – anything else than just storing it.  It’s one thing for Wal-Mart to process in excess of one million transactions per hour and store this historical data, or for the US Census Bureau to collect demographics on 300 million Americans – and it’s another thing for these parties to actually process it, massage it, and extract actionable information.

As we have all experienced first-hand, or heard of, a number of real-life (or rumored) big data use cases have been in existence for years.  Among some obvious and famed “big” use cases:

  • Credit card fraud detection: anyone who travels a lot for business is often led to some unusual spending patterns.  Like buying French train tickets online from a US IP address, minutes after ordering a Kindle book and just before paying for an intercontinental flight. So when my credit card company calls me to check, I can hardly complain (and I am actually glad they call).
  • Retail: the decades-old beer and diapers mining story (allegedly a rumored one), recently supplanted by the teen pregnancy one (it will probably take time before we know if this one is real or not!). Less prone to urban legend, the long string of coupons that print out at the register reminding you that you need dip for these chips, or conditioner to go with this shampoo, or mulch to help your flowers grow, are the result of big data analysis.
  • Yield management: with airfares that vary on an hourly basis and hotel room prices continuously adjusted, the travel & hospitality industry has been a master of shifting through massive historical data to get the best possible price out of a finite inventory of airplane seats or hotel rooms – without leaving any unsold.

What are the commonalities between these “big” use cases? Credit card companies, retail chains, airlines & hotel chains are for the most part large and wealthy organizations. They did not wait for the recent big data frenzy to “do” big data. They invested a lot of money into hardware and software, hired the best talents, and built their infrastructure and algorithms, by trial and error.  And for the most part, they have reaped the rewards of this investment.

What’s new is the democratization of big data. In future posts, we’ll be reviewing some of the less “big” use cases of big data, and how big data can help smaller, less wealthy organizations.

Yves

 

[2012-05-09] Talend Forum Announcement: Talend Tech video tutorials on the Talend Channel

Hi Community,

At Talend, we contribute to open source communities, because they challenge us with new ideas and keep us sharp. Funneling those ideas into our products, we have the aim to make open source software intuitive and easy to use through better tooling and the adoption of open standards.

With the Talend Tech video tutorials, our R&D teams put together some quick, "how-to" videos, which hopefully better enables users to get to grips with new features and product changes that we are making. We hope you find these useful, we do ourselves by the way!

Watch our videos on the Talend Channel on Youtube: http://www.youtube.com/user/TalendChannel
Video tutorials can be found on the Talend Tech playlist: http://www.youtube.com/playlist?list=PLF582198EE7B0E862

Best,
Pcoffre.

[2012-05-07] Talend Blog: Tools and Technologies of Big Data

Big data is first and foremost data.  It is not tools and technologies.  There is however a number of tools and technologies for big data that have seen the light of the day in the recent years, that are used to store, process, analyze and otherwise harvest big data.  Vendor marketing being what it is (and I am as guilty as anyone else), these are often referred to as “big data tools” or “big data technologies”.

In this post I will try to clarify some of the basic concepts and tools for big data.

 

MapReduce

At the foundation of these technologies is a concept called MapReduce.  It provides a massively parallel environment for executing computationally advanced functions in very little time, on a grid of commodity hardware.  Exit, the Crays and other massively parallel, helium-cooled supercomputers.

MapReduce allows a programmer to express a transformation of data that can be executed on a cluster that may include thousands of computers operating in parallel.  At its core, it uses a series of “maps” to divide a problem across multiple parallel servers and then uses a “reduce” to consolidate responses from each map and identify an answer to the original problem.

MapReduce enables technology for big data such as Hadoop to function.

 

Hadoop

Started at Yahoo! as an implementation of MapReduce in 2005 and released as an open source project in 2007, Apache Hadoop has the basic constructs needed to perform computing: a file system, a language to write programs, a way of managing the distribution of those programs over a distributed cluster, and a way of accepting the results of those programs. Ultimately the goal is to create a single result set.

With Hadoop, big data is distributed into pieces that are spread over a series of nodes running on commodity hardware.  In this structure the data is also replicated several times on different nodes to secure against node failure. The data is not organized into the relational rows and columns as expected in traditional persistence.  This lends to the ability to store structured, semi-structured and unstructured content.

Hadoop is an Apache top level project.

 

The Hadoop Ecosystem

A number of projects have seen the light of day around Hadoop, aiming at providing additional features. The main ones for processing big data include:

  • Pig, to write complex MapReduce transformations using a scripting language, Pig Latin, which defines a set of transformations on a data set such as aggregate, join and sort.
  • Hive, a data warehouse infrastructure built on top of Hadoop for providing data summarization, ad-hoc query, and analysis of large datasets.
  • HBase, a non-relational columnar database, which provides fault-tolerant storage and quick access to large quantities of sparse data.
  • HCatalog, a table and storage management service for data created using Apache Hadoop that provides a table abstraction to where and how data is stored.
  • Sqoop, a set of tools that allow Hadoop to interact with traditional relational databases and data warehouses.

 

This post is meant only as a high level overview of tools for big data. For more detailed information on these technologies (and more), I would recommend a very good Talend white paper: Big Data for the Masses (registration required).

Yves

 

[2012-05-04] Talend Blog: What is Big Data, Anyway?

Big data is nothing new.  Humans, helped by their computers, have been collecting massive data sets, since the dawn of times.  Weather patterns, harvest yields, stock prices, geo-localization, searches, web browsing – the types of data are as broad as the needs of the organizations, large and small, in all industries, which collect mind-boggling amounts of data.

A clear definition of big data is also difficult to pin down.  What is big to one organization may not big to the next.  As the name implies, big data is characterized by the size or volume of data records, but other attributes need to be considered, such as velocity, variety, or complexity.  Most scholars of big data today tend to converge on the “3Vs” definition (Volume, Velocity, Variety), some will agree that these need to be augmented with the “C” of Complexity.

  • V for Volume: this one seems pretty obvious. But it isn’t. Take Wal-Mart, whose stores routinely handle 1 million customer transactions per hour. Most organizations’ IT systems wouldn’t be able to cope, but Wal-Mart has been doing this for decades. What’s big for Wal-Mart and what’s big for – say – Consolidated Autodealers of Middle America, Inc. just isn’t the same. But it’s equally critical.
  • V for Velocity: also called latency. How often do we need to refresh the data stores? How long are we willing to wait for new transactions to become available for analysis?  Velocity has a huge impact. Updating 1 million records every minute is a lot more difficult than updating 100 million records every day.
  • V for Variety: unstructured and semi-structured (or poly-structured) content appears in all kinds of places; for example web content, Twitter/Facebook posts, customer comments, etc. And even with structured data, the type of sources can quickly become a challenge: all kinds of databases, SaaS applications, files, Cloud systems, etc.
  • C for Complexity: let’s put it this way – if it was simple, if we knew how to do it with traditional technologies, why invent new paradigms? Big data opens new doors to what we can learn, extract from it. Translating this into computer programs is where the complexity resides.

One thing is certain: big data is not defined by a set of technologies; rather it defines a set of techniques and technologies that are used to make sense out of it.

In a future post, we will be looking at the tools and technologies for big data.

Yves

 

[2012-05-03] Talend Document: Talend Enterprise ESB Technical Overview

Talend Enterprise ESB is the first to combine application integration with data management to allow businesses to cope with the volume of data in an increasingly connected world. Talend Enterprise ESB enables developers to easily build reliable, scalable and secure REST, Web and data services to quickly integrate heterogeneous IT environments, both on-premises and in the cloud.

Download this Talend White Paper to find out more about the key features that make Talend Enterprise ESB a reliable and scalable enterprise service bus (ESB) that allows development teams to holistically address integration projects – combining application integration and data management for complex, heterogeneous IT environments.”

[2012-04-27] Talend Forum Announcement: Talend Connect in Boston on June 19: an Event for the Community

Talend Customers, Community Members and Partners…
Reserve your spot at Talend Connect in Boston today!

http://info.talend.com/TalendConnect_Bo … 20619.html

Talend is bringing together its entire community, including our users, customers and partners for a full day of valuable insight, information sharing and networking. Don’t miss Talend Connect where you’ll learn firsthand from Talend experts and industry luminaries how organizations are successfully tackling the integration demands of today’s data-driven business.

WHO
Talend Customers, Users and Partners

DATE
Tuesday, June 19, 2012

TIME
9:00 am – 6:00 pm

LOCATION
Millennium Bostonian Hotel

Please register today to reserve your space. Space is limited:

http://info.talend.com/TalendConnect_Bo … 20619.html

[2012-04-26] Talend Document: TDWI - Next Generation Master Data Management

While Master Data Management (MDM) has become less avant-garde and more mainstream, we still may be wondering – what is its future?

This report, written by Philip Russom and published by TDWI, tracks trends of companies that are planning to successfully modernize their data management best practices.

It looks at the cutting-edge processes and technical functions for MDM, and it uses survey data to predict which MDM feature sets will be most important for the future.

[2012-04-17] Talend Forum Announcement: For test only, Talend's 5.1.0RC1 Unified Platform release is available

Dear Community,

We are pleased to announce that Talend's 5.1.0RC1 Unified Platform release is available, for testing only. This release candidate contains many bug fixes and new features, and is recommended for experienced users who need an early preview of upcoming 5.1 release.

Download Talend's second Unified Platform release candidate here.

Talend Open Studio for Data Integration:

You will find in this list features and bug fixes for the Data Integration Studio and Components.

Studio : 
New Features :
TDI-20019           Allow to set a loop element to be optional
TDI-19671           UI part: Add the possibility to set multiple loop elements
TDI-7284              Suggestions to improve use of tAggregateRow

Bugs :
TDI-19359           The job Can't save
TDI-20633           "item ...not found..." shell appeares when Import WSDL Schemas for the second services in V5.1.0NB_r81649 version studio
TDI-20617           Fail to create a services use "Import existing WSDL" mode
TDI-20615           Fail to import items with "Select root directory" type
TDI-20578           TOS_BD shows metadata and business model
TDI-20565           Fail to create a new routine
TDI-20526           Windows->Preferences->Talend->Export/Import :error message
TDI-20524           <Copybook>:Not support change "Property Type" from "Repository" to "Build-in"
TDI-20483           error shows in the error log when update the repository for metadata
TDI-20480           Export items from the toolbar can be a little slow...
TDI-20479           Exception with TalendSelectionManager
TDI-20440           Gabage string instead of "Creaet job" icon on the product menu
TDI-20438           Unable to enable multi-loop on the second tXMLMap
TDI-20385           Studio Repo View : Generic metadata contextual menu in recycle bin is wrong
TDI-20336           Press F5 will make a NPE error in Job Setting->Stats&Logs
TDI-20335           user can't open sqlbuilder in Job Setting
TDI-20318           an error happened when drag a copybook to job
TDI-20317           in configure contexts, new a context, close directly will add a null-name one
TDI-20302           tXMLMap: Deleting the schemas and creating new - Nullpointer exception when trying to create new Mapping - job model unrepairable afterwards
TDI-20296           software update functionality does not work in Studio
TDI-20266           tXMLMap is edit able from main project side
TDI-20246           dirfferent warining about check expired day,
TDI-20238           tXMLMap:if shema transfer from input component, the root element can not be set as a loop element automatic
TDI-20234           tXMLMap:Faile to use xml from repository
TDI-20223           The content of the tMap display incompletely in component basic settings
TDI-20208           Button "Deselect All" doesn't work
TDI-20204           creating XML metadata from XSD file
TDI-20199           Setting a repository schema in a tMap with multiple output has unexpected consequences
TDI-20192           MDMInput - Unable to retrieve MDM Entities
TDI-20118           Fail to use Copybook in a job
TDI-20063           DB connection_Some garbage strings on File XML retrieve schema page.
TDI-20018           Create a oracle DB connection, Retrieve schema, select 'all synonyms' then 'select all' will show errors in error log.
TDI-19980           Job duplicated and KO after import JobScript with version >0.1
TDI-19971           Can not use context variables of type "Document"
TDI-19869           Job designs_Export job, the path are not perserverd and if you export another job the default path is not correct and can not export job derectly.
TDI-19770           PostgresqlInput Database Schema value not used
TDI-19555           Problems of generation of query when the db connection doesn't hold any schema info
TDI-19535           <Import>:one job with different version can appear at repository at the same time
TDI-19352           Fail to use "tAmazonOralecInput" in a job
TDI-19257           Accents in the components labels are lost when switching from "Jobscript" to "Designer" view
TDI-18849           Fail to configure AMC with informix
TDI-17122           some about access db
TDI-16706           Repository filter not support HL7 and MDM
TDI-13613           Guess Schema button should be disabled in schema based on file where the example file is no more available.
TDI-7196              checked salesforce socks proxy ,make "http proxy" and direct connection fail.

Components :
New Features :
TDI-20124           Upgrade Marketo from 1.5 to 1.7
TDI-19628           Manage choice and substitutionGroup XSD element
TDI-20389           Add "pending" status in the list of statuses in tStewardshipTaskInput
TDI-20355           The value being read from a column shows up a different value than the actual value when using the tFileInputEBCDIC component
TDI-20071           Support MS CRM 2011
TDI-19786           EBCDIC component: Create new component tExtractEBCDICFields
TDI-19781           EBCDIC component: Review implementation of the component to improve the speed when read files.
TDI-19689           Homogenization of the technologies used in the tWebservice and tESBConsumer components
TDI-18873           Talend converts a string that has a date value automatically to a different date format.


Talend Open Studio for Big Data:

New features:
TDI-20467           Add the possibility to set the row format for a HCatalog creation
TDI-20301           Support Hive with HDP

Bug :
TDI-20085           HTTP Method "DELETE" of the tREST component is an unreadable code If Local language is chinese
TDI-20471           tHttpRequest does not send Response.Content on Get request with Write to File unchecked
TDI-20394           tAdvancedFileOutputXML in append mode corrupt output data
TDI-20328           NPE in tOracleSCD when updating a BigDecimal from non-null to null
TDI-20323           These components lost their Help files in the Studio
TDI-20251           tVerticaOutput - Unexpected query, COPY is in progress
TDI-20144           Rejects flow in tOracleSCD component
TDI-19997           tTeradataOutput : reject causes lack of rows to be inserted
TDI-17674           diffDate() gives unexpected results on DST change
TDI-20529           ORA-01727: numeric precision specifier is out of range 1 to 38
TDI-20285           The encoding shouldn't be set by default.
TDI-20453           Fix somes issues on tHCatalogOperation
TDI-19709           NoClassDefFoundError in tHBaseOutput component
TDI-20298           tHCatalogOperation:Hide STDERR and STDOUT


Talend Open Studio for Data Quality:

Bug fixes:
6 bugs >= major fixed
2 bugs <= minor fixed


Talend Open Studio for MDM:

Web Features:
TMDM-3522, 3523: set the name + "catalog" (folder) of a picture. Catalog browser to browse & pick pictures.

Studio Features:
TMDM-3363: Choose to share password or not in Server explorer (also in SE but this is only useful when repository hosted on SVN)
TMDM-1816: Complete/fix a record in a beforeSaving process before it is committed in the DB

Bugs:
44 bugs >= major fixed
38 bugs <= minor fixed


Talend Open Studio for ESB:

In the ESB tooling, especially in the Route Builder, we like to highlight the following new features in 5.1.0RC1:

- Improvement in REST Service development (HTTPS, Basic Auth)
- Updated Rent-a-car example
- Routes are exported as kar file now to improve the dependency management
- SAM Server on Talend Runtime works now with all supported databases.
- ESB Consumer configuration at runtime (SSL, proxy, etc )

Within the Tooling and Runtime we switched now to the SNAPSHOT versions of the following underlying Apache Projects:
-    Apache CXF: 2.6.0-SNAPSHOT
-    Apache Camel: 2.9.2-SNAPSHOT
-    Apache Karaf: 2.2.6 (Release)

The Release Notes report can be found here:
http://jira.talendforge.org/secure/Conf … sion=12613


For more information on fixed bugs and new features, go to the TalendForge Bugtracker.

Thanks for being a part of our community,
The Talend Team.

[2012-04-17] Talend Document: Big Data for the Masses

Big data represents the significant shift in the way that companies receive and used data to fuel business. As the volume, velocity and variety of data increases, we must ready ourselves for managing it with big data strategies.

Download this white paper to read more about the current state of big data and what we can anticipate as it begins to gain mass appeal.

[2012-03-29] Talend Blog: The Second Billion Dollar Open Source Company

Yesterday, Red Hat reported its FY 2012 results (their Fiscal Year ended Feb 29), and, in line with expectations and predictions, it broke the billion dollar in revenue at $1.13b.

That makes Red Hat the second billion dollar open source company.

Who was the first billion dollar open source company? Am I such a dinosaur in the open source industry that nobody else remembers the headlines when MySQL was bought by Sun?

Funny also that, in 2008, Infoworld called MySQL a Billion Dollar Baby, and that yesterday, just before the Red Hat results were released, Wired titled Red Hat Becomes Open Source’s First $1 Billion Baby.

Of course, you can’t compare a billion dollar valuation with a billion dollar revenue. Even I, in marketing, know that. And I don’t intend to downplay the tremendous work that the Red Hat teams have been doing, building a large, profitable and sustainable open source business. Red Hat is clearly the poster child of commercial open source, and we are all proud to have them lead the pack.

Still, the parallel is hard to miss. 4 years ago, a billion dollar in open source could only mean one thing: an exit, over a very hot technology or trend. Today, a billion dollar in open source is not different from a billion dollar in traditional software.

All it took was four years. (Red) hats off.

Yves

 

[2012-03-28] Talend Forum Announcement: For test only, Talend's 5.1.0M2 Unified Platform release is available

Dear Community,

We are pleased to announce that Talend's 5.1.0M2 Unified Platform release is available, for testing only.  This milestone contains many bugs fixes and new features, and is recommended for experienced users who need an early preview of upcoming 5.1 release.

Download Talend's second Unified Platform milestone here.

Talend Open Studio for Data Integration:

You will find in this list features for the Data Integration Studio and Components.

tXMLMap Multiple loop:
Studio : http://jira.talendforge.org/browse/TDI-19671
- Available in M2, but lots of known remaining task on UI. In particular, User interface will be greatly enhanced in RC1
Components :
http://jira.talendforge.org/browse/TDI-18966  Support optional elements as loop element (Component)
http://jira.talendforge.org/browse/TDI-8399 Support multiple loop element in the XMLMap (Component)

EBCDIC Enhancements :
Studio : http://jira.talendforge.org/browse/TDI-19724 (global feature)
- Already Done: Simplify column namings + redefine + xc2j on svn (occurs should still work... to be tested)
- Planned for RC1 : real size of fields (TDI-17612)
Component :
http://jira.talendforge.org/browse/TDI-19784 : Add trim possibilities
http://jira.talendforge.org/browse/TDI-19782 : Add some setup of codepage in the component
- A few other component enhancement are planned for RC1.

Repository view :
http://jira.talendforge.org/browse/TDI-19373
- New view available for testing (but not shown by default)

Import update record for MDM repository connection:
http://jira.talendforge.org/browse/TDI-19826

Bonita Connectors :
- http://jira.talendforge.org/browse/TDI-19652 : Bonita connector to call a TAC Webservices

Talend Open Studio for Big Data:

Studio : http://jira.talendforge.org/browse/TDI-19731
- Repository without Business model / metadata / with only hive sql template
- Refactor LGPL to start now

Components :
http://jira.talendforge.org/browse/TDI-20076 : Support HDP v1.0 in all the BigData components
http://jira.talendforge.org/browse/TDI-20072 : Support HCatalog in Talend
http://jira.talendforge.org/browse/TDI-19870 : Add the possibility to overwrite/remove/append in the hadoop components

Talend Open Studio for Data Quality:

New features:
http://jira.talendforge.org/secure/Issu … C+key+DESC

There were 40 bug fixes.

Talend Open Studio for MDM:

New features:
http://jira.talendforge.org/secure/Issu … C+key+DESC

There were 53 bug fixes.

Talend Open Studio for ESB:

In the ESB tooling, especially in the Route Builder, we like to highlight the following new features in 5.1.0M2:

- New cHTTP component. (a few improvements will follow with RC1)
- New cErrorHandler component to provide much easier error handling within routes
- Additional to the shared connection pool for JMS Connections we now introduce the transaction support for JMS Connections (see the new ‘Use Transaction’ flag)
- Improvements related to tXMLMap (please see: TOS for DI release notes)

Within the Tooling and Runtime we switched now to the SNAPSHOT versions of the following underlying Apache Projects:
-    Apache CXF: 2.6.0-SNAPSHOT
-    Apache Camel: 2.9.2-SNAPSHOT
-    Apache Karaf: 2.2.6-SNAPSHOT
-    Apache Cellar: 2.2.4-SNAPSHOT

The Release Notes report can be found here:
http://jira.talendforge.org/secure/Rele … sion=12585


For more information on fixed bugs and new features, go to the TalendForge Bugtracker.

Thanks for being a part of our community,
The Talend Team.

[2012-03-26] Talend Document: Survey : Who Controls Your Data

Discovering a few incorrect, duplicate, or inconsistent records might seem to be a small or insignificant problem, but in reality it might indicate something more serious – a problem that will only get bigger and harder to manage the longer it is left unresolved.

An exclusive survey by Computing suggests that, in many organizations, incomplete, inconsistent, duplicate or non-standardized data is a widespread problem:

  • Only 34% of organizations say that their core systems ensure that data is consistent and in a standardized format
  • 52% responded that data was only managed in this way “to some extent”

We invite you to download the survey today and learn how to regain the control of your data.

[2012-03-23] Talend Blog: MapR Certification Provides More Big Data Options

This week at GigaOm’s structure:data conference, we announced jointly with MapR the certification of Talend Open Studio for Big Data with MapR’s Hadoop Distribution. Following our embedding in the Hortonworks Data Platform, and our longstanding partnership with Cloudera, it means that Talend is now working with all three leading commercial Hadoop distributions (in addition to supporting the native Apache Hadoop project).

Not only working just as in “we trade logos for our sites” (even though we do that too), but working as in “we have tested, and proven that our solutions work well together”. In the case of MapR, our colleagues from R&D spent lots of time in the lab, testing all the connectors against a number of configurations.

For MapR’s distribution, we provide the HDFS, HBase, Hive, Pig & Sqoop connectors (like for the other distros), But MapR also includes a unique feature: Direct Access NFS, which enables high throughput streaming to HDFS. Combined with Talend’s real-time capabilities such as Change Data Capture (CDC) or even Talend’s ESB, it enables real-time streaming of pretty much any data to Hadoop, instead of batch loading. This can find some unique applications for real-time analytics, and Talend is very happy to also include NFS support in the set of MapR connectivity.

Another side benefit of this certification is that it “carries over” to EMC’s Greenplum MR: Greenplum’s Hadoop platform is indeed based on MapR’s distro. Of course, Greenplum has been a Talend partner for a long time, and we already provide support for Greenplum HD. The addition of Greenplum MR to our portfolio of technology makes our partnership even more valuable for our joint customers.

With this new certification, we are very excited to be providing more options for our clients embarking on big data projects.

Yves

 

[2012-03-23] Talend Blog: French Big Data Deserved Better

This week, the first French Big Data Mess took place at the Cité Universitaire in Paris. While I applaud the (oh so original) idea to run a big data conference in Paris, the result turned out to be very disappointing. And frankly, the organizers are guilty.

Probably trying to make a quick euro, Corp Events (virtually unknown in our space) decided to jump on the big data bandwagon. After all, big data is kind of a hot topic today, isn’t it? And it’s not hard to do a conference, right?  Sell sponsorships to vendors, find a suitable venue, sell sponsorships to vendors, round up some experts, sell more sponsorships to vendors, fill the rest of the agenda with sponsor panels, sell more sponsorships to vendors – done.

What worked? The selling of sponsorships to vendors did. Pretty much all companies that deal with data (big or small), and have a French office, paid their due to Corp Events (Talend included). Of course, no vendor can conceive a big data event where they would not be visible!  Problem is that, a big data conference without Cloudera, Hortonworks, MapR… it’s kind of Database World in the 90’s without Oracle, Sybase, Ingres and Informix!  Sure, they don’t have French offices, and it would probably have been more difficult to convince them to come. But did Corp Events even try?

The filling of (the rest of) the agenda with sponsor panels also worked. It was funny how vendors were trying very hard to be polite to their colleagues, while scornfully dismissing each other performance claims.

The rest was a miserable failure.

Most non-vendor speakers were utterly irrelevant, or just ignorant. Tell me why the “sociological impact of big data” matters to IT professionals.  Vendors landscapes “updated every other week” could maybe include relevant vendors (heard of Google search?). Examples of open data such as (real-time) train station locations or Orange selling traffic conditions (that’s open, right?) are simply missing the point. I could go on (take a look at my tweets if you are curious, but be warned, they are not very kind).

Logistic was a disaster. Paris is not known for its congress infrastructure, but I can think of a dozen other locations better suited for a conference this size than the Cité Universitaire. Expo space was neither attractive nor inductive to visitor traffic (see for example: peak hour at Big Data Paris). Coffee and food would run out fast. The free Wifi network crashed, probably after the tenth attendee tried to get on it. And there wasn’t even an official #hashtag for Tweets!

I have been to many conferences (probably too many!) in my professional life. Not all were stellar. But even many of the ones put together by volunteers or non-pros were a lot better than this one.

The problem is that such an event will leave a bittersweet taste for attendees and sponsors. Next time a real event company tries to run a big data conference, it will be an uphill battle for them to fight. The big data community deserved better.

Yves

 

[2012-03-14] Talend Document: Forrester ETL Wave

Talend Named a Leader in The Forrester Wave for Enterprise ETL

Talend was one of 10 top vendors that Forrester invited to participate in this rigorous evaluation to assess the state of the extract, transform and load (ETL) market.

Forrester developed a comprehensive set of 60 criteria grouped into three high-level categories: Current Offering, Strategy and Market Presence. In addition to being named a “Leader”, Talend – a first-time entrant in the Forrester Wave and the only open source ETL vendor Forrester evaluated – received one of the top three scores in the “Strategy” category.

“Talend, a new entrant in the Forrester Wave, has done extremely well to gain a spot in the Leaders category, offering strong support for deployment options, integration techniques, and connectivity to various data sources.”

[2012-03-14] Talend Document: Talend Selected in “Who’s Who in Open Source Data Quality”

Talend has been recognized by Gartner, Inc. for data quality in their recent “Who’s Who in Open Source Data Quality” report.

To access this report, please fill out the form. You will then receive the report via email.

[2012-03-13] Talend Blog: Talend Big (Data) Coverage

Two weeks ago, at Strata Conference, we announced our big data strategy and technology, along with products and a strategic partnership with Hortonworks. The news received a lot of interest, and conversely of coverage by media and bloggers, and I wanted to highlight a few of the key pickups and comments.

  • In Network World, Alan Shimel commented on how Talend is contributing to the democratization of big data: “There are many folks hoping that Talend can do for Big Data and Hadoop what it has done for data integration in general. A reoccurring theme I have heard recently around Hadoop is that it is too hard and not mature enough for many organizations.”
  • SearchDataManagement’s Marc Brunelli also picked up the complexity of big data, having me commenting that: “Our vision is that it shouldn’t require a Ph.D in MapReduce to implement your transformations.
  • DJ Walker Morgan preferred to lead with the licensing angle in The H Open, discussing the importance of “Talend’s Open Studio for Big Data moving to Apache licensing” and thus becoming a closer part of the Hadoop ecosystem.
  • Christopher Tozzi wrote for The VAR Guy that “The Hortonworks partnership introduces perhaps the most complete open source solution for Big Data management yet.
  • I also had the privilege to be interviewed by Dave Vellante and John Furrier on theCube, Silicon Angle’s TV channel broadcasting live from Strata Conf.
  • And last – but not least – our good friends at the BeyeNETWORK invited me to a Q&A Spotlight on Big Data Integration with Hadoop. Definitely worth the read.

This is by no means exhaustive coverage, just a few highlights on different angles of these announcements.

Yves

 

[2012-03-07] Talend Forum Announcement: For test only, Talend's 5.1.0M1 Unified Platform release is available

We are pleased to announce that Talend's 5.1.0M1 Unified Platform release is available, for testing only.  This milestone contains many bugs fixes and new features, and is recommended for experienced users who need an early preview of upcoming 5.1 release.

Download Talend's first Unified Platform milestone here.

Talend Open Studio for Data Integration:

Features for Components:

Hadoop
- Support of Hadoop 1.0
- Support of MapR

XMLMap
- Support of loop on root element
- Support of loop on an optional element.

Support of Bonita 5.6.1 for components tBonitaInstantiateProcess et tBonitaDeploy
- Support of RFH2 and SSL on Mom components.
- tSetKerberosConfiguration usable in tSoap component.
- More components support Document type
- Salesforce scomponents now supports version 24

Features for the Data Integration Studio:

- XMLMap : Subtitutions Management (Step one)
- Start to refactor Repository view

Talend Open Studio for Data Quality:

View this link for new features:
http://jira.talendforge.org/secure/Issu … C+key+DESC

Bug fixes:
http://jira.talendforge.org/secure/Issu … C+key+DESC

Talend Open Studio for MDM:

Note:
- when feature status <> "QA done" or "Closed", the feature may still be in an early stage (but you can still give it a try)
- when bug fix status <> "QA done" or "Closed", the fix is implemented but not validated yet (there is still a good chance it works though)

New features:
http://jira.talendforge.org/secure/Issu … C+key+DESC

Bug fixes:
http://jira.talendforge.org/secure/Issu … C+key+DESC

Talend Open Studio for ESB:

In the ESB tooling, especially in the Route Builder, we like to highlight the following new features in 5.1.0M1:

- New cLog and cMail component in Route builder
- Shared JMS connection management and JMS connection pooling in Route Builder
- SL Meta data now also supported with  cCXF

For Runtime we like to mention that 5.1.0M1 still includes the CXF (2.5.0) and Camel (2.8.2) version as in included in the ESB 5.0.1 Release
With the next Milestone 2 Runtime and Tooling we will switch to the related newer snapshot versions.

The Release Notes report can be found here:
http://jira.talendforge.org/secure/Rele … sion=12584


For more information on fixed bugs and new features, go to the TalendForge Bugtracker.

Thanks for being a part of our community,
The Talend Team.

[2012-03-06] Talend Forum Announcement: Maintenance on Talend Bug Tracker - March 10, 2012

Dear Community,

This Saturday, Saturday 10th, at Noon CET / 6:00 AM ET, our operations team will be performing routine maintenance on the systems that power the Talend Bug Tracker (http://jira.talendforge.org). There will be a small amount of downtime during this service window, but we anticipate services to be fully accessible within 1 hour.

Questions or concerns regarding this maintenance window may be directed to the comment section on this thread.

Sincerely,
The Talend Team

[2012-03-05] Talend Blog: Impressions from Strata Conference

Strata Conference (#strataconf) was held this week in cloudy/rainy Santa Clara, Calif. The second West Coast edition (there is also one in New York), it was (like last year) sold out. This year, O’Reilly managed to jam 2000 attendees in the Santa Clara Convention Center. And it was crowded – keynotes and many other sessions were standing room only, lunch took over every single available ballroom, patio, mezzanine both at the Convention Center and the adjoining Hyatt, and getting from point A to point B often meant fighting a sea of attendees.

Online activity was overwhelming too. Keynotes were streamed live, allowing people all over the world to join. When I setup the #strataconf hashtag in Hootsuite, I didn’t realize I would not be able to follow all the tweets. Stop paying attention for 15 minutes, and there was no way to catch up. Almost 600 attendees had declared their Twitter accounts in their profile (see great post by Jason Sundram analyzing these Tweeps and their audience), there were 684 people tweeting on the second day (see awesome graphs of top Twitter influencers by Ali Rebaie, and this network map by Benedikt Koehler), and if anyone can find a way to count the #strataconf hastag… with all these data geeks I am sure someone has already done it. And of course, a lot of attendees are active on the social media scene, so blogging went full steam.

Strata is about Making Data Work. But this year, it was about Making Big Data Work. Sessions covered a very broad array of topics. It went from super technical topics, such as the architecture of Hadoop, how to write R, Javascript or JSON, and more – to enlightening presentations on why medical trial data is flawed, or how to show data using colors and saturation. Another recurring topic was the rise of data journalism – how today’s newsrooms rely on the parsing, analysis and display of (sometimes large) datasets – often publicly available. Data.gov seems to be the new Deep Throat.

The show floor was definitely the most technology driven part of the conference. Apart from outliers such as Facebook whose presence was not clear (selling private member data maybe?), all the key players of big data were there (sorry can’t mention everyone, review the full list here):

I have been to (too) many conferences and conventions in my career. Most of them are either technology, or business case driven. I have to say I am impressed by the mix at Strata. There was content for everyone. From the technologist who was looking for a crash course in scripting, to the senior IT executive who wanted to understand the technology stack and business cases behind big data, to the business user (such as the “data scientist” – more on that character in another post) who was looking for best practices and peer networking.

Of course, there was an important milestone for Talend at Strata since we announced both Talend Open Studio for Big Data, and a strategic partnership with Hortonworks. Interest for these news has been tremendous – more on this later.

Glad I was able to attend. Guess I’ll be back next time.

Yves

 

[2012-03-01] Talend Blog: A Wave that’s Making Waves

I bet some vendors have lots of issues with the Forrester Wave on Enterprise ETL (The Forrester Wave: Enterprise ETL, Q1 2012, February 2012, by Forrester Research, Inc.) that was released this week. And that the analysts who wrote it took considerable heat. I am sure there were a lot of complaining, bitching, and threatening.

Why? Not because of individual positions. We all argue that we are misplaced and that we should be much higher and to the right. That’s part of the process. Some vendors are more gracious than others, some even complain publicly, and of course we try to convince the analysts that “this rating can’t be correct”.

No, the reason why this Wave is making waves, is that, for the first time, an open source player joins the leaders. The establishment of proprietary vendors didn’t like it when Talend entered as a Visionary in Gartner’s Magic Quadrants. But visionary was kind of OK, they could still dismiss Talend as a bunch of crazy developers with a quasi-good idea for small projects but no enterprise scalability.

I haven’t always said kind things about analysts, but I have always considered them to be fair and honest. In the case of Forrester, I spent 4 years complaining that the ETL Wave was out of date (and it was).  So I was thrilled when Noel Yuhanna and Rob Karel initiated the Wave project last summer. And also excited about the transparency of the process. The Forrester Wave is probably the most transparent of all similar reports, with all ratings and weights being published and explained.

At the end of the day, once we were done arguing about ratings (and I don’t necessarily agree with all of them, but as I was saying, that’s how it works!), the calculation placed Talend in the Leader Zone.  Then, as is customary, the draft of the report was sent to the vendors, as a courtesy copy.

This is when it takes balls to be an analyst. When VPs of Marketing, CEOs and Directors of AR start telling you what you need to do. And why there is no way you can have Talend as a leader. First try to convince you, then threaten to cancel their research contracts. I am really happy Noel and Rob stood their ground, and that the Wave was published without changes. It proves that our industry analysts are doing their work.

If you want to check out the Forrester Wave for Enterprise ETL, in which Talend is positioned as a Leader (did I mention this?), you can get it here, courtesy of Talend.

Yves

This post was edited on Mar 2 to comply with Forrester’s Citation Policy. The postscriptum was also removed.

 

[2012-02-29] Talend Forum Announcement: Talend proud to announce Talend Open Studio for Big Data

Based on award-winning open source integration technology, big data integration solution to be released under Apache Software License, bundled in Hortonworks Data Platform

Strata Conference, Santa Clara, Calif. - February 29, 2012 - Talend, a global leader in open source integration software, today announced the availability of Talend Open Studio for Big Data, to be released under the Apache Software License. Talend Open Studio for Big Data is based on the world’s most popular open source integration product, Talend Open Studio, augmented with native support for Apache Hadoop. In addition, Talend Open Studio for Big Data will be bundled in Hortonworks’ leading Apache Hadoop distribution, Hortonworks Data Platform, constituting a key integration component of Hortonworks Data Platform, a massively scalable, 100 percent open source platform for storing, processing and analyzing large volumes of data.

Talend Open Studio for Big Data is a powerful and versatile open source solution for data integration that dramatically improves the efficiency of integration job design through an easy-to-use graphical development environment. Talend Open Studio for Big Data provides native support for Hadoop Distributed File System (HDFS), Pig, HBase, Sqoop and Hive.  By leveraging Hadoop's MapReduce architecture for highly-distributed data processing, Talend generates native Hadoop code and runs data transformations directly inside Hadoop for maximum scalability. This feature enables organizations to easily combine Hadoop-based processing, with traditional data integration processes, either ETL or ELT-based, for superior overall performance.

“By making Talend Open Studio for Big Data a key integration component of the Hortonworks Data Platform, we are providing Hadoop users with the ability to move data in and out of Hadoop without having to write complex code,” said Eric Baldeschwieler, CTO & co-founder of Hortonworks. “Talend provides the most powerful open source integration solution for enterprise data, and we are thrilled to be working with Talend to provide to the Apache Hadoop community such advanced integration capabilities.”

Talend Platform for Big Data

Talend Open Studio for Big Data is a core component of the Talend Platform for Big Data, which enables organizations to increase their productivity by deploying big data solutions in hours instead of weeks or months. The Talend Platform for Big Data easily integrates data of all types – structured, semi-structured and un-structured – and maximizes an organization’s resources by abstracting the technical complexity of big data tools and technologies. The Talend Platform for Big Data is compatible with all Apache Hadoop distributions and has been certified for use with Hortonworks Data Platform.

Talend Platform for Big Data provides:

- Big Data Integration: Loading Big Data in Hadoop via HDFS, HBase, Sqoop or Hive is considered an operational data integration problem. Talend Platform for Big Data provides an intuitive set of graphical components and workspace that allows for interaction with a big data source or target without the need to learn and write complicated code.
- Big Data Quality: Talend Platform for Big Data presents data quality functions that take advantage of the massively parallel environment of Hadoop. It enables developers to take advantage of the high performance processing environment to identify duplicate records across these huge data stores in moments not days. It also extends into profiling big data and other important quality issues as the Talend data quality functions can be employed for big data tasks.
- Project Optimization: With Talend Platform for Big Data, the ability to schedule, monitor and deploy any big data job is included, built on a shared repository, so that data analysts can collaborate and share project metadata and artifacts.[...]

Read the full press release here: http://www.talend.com/press/Talend-Empo … g-Data.php

Know more about Talend Open Studio for Big Data and download a beta on this same page: http://www.talend.com/products-big-data … dio-bd.php

Go to this product's new dedicated forums: http://www.talendforge.org/forum/viewforum.php?id=35 and http://www.talendforge.org/forum/viewforum.php?id=36

Best,
Pcoffre.

[2012-02-29] Talend Blog: Talend Open Studio for Big Data joins the Hortonworks Data Platform

At the same time we announced Talend Open Studio for Big Data, our friends at Hortonworks – leading contributor to Apache Hadoop projects – have announced that they have selected Talend Open Studio for Big Data to be bundled as part of Hortonworks Data Platform, simplifying the movement of data between Apache Hadoop and enterprise data systems and accelerating the adoption of Apache Hadoop.

Hortonworks Data Platform, powered by Apache Hadoop, is a massively scalable 100% open source platform for storing, processing, and analyzing large volumes of data. It is designed to deal with data from many sources and formats in a quick, easy, and cost-effective manner. Hortonworks Data Platform provides an open, stable and highly extensible platform that makes it easier to integrate Apache Hadoop with existing data architectures and maximize the value of the data flowing through the business.

Thanks to Talend Open Studio for Big Data, users of Hortonworks Data Platform will be able to greatly simplify the deployment of Hadoop.  Talend Open Studio for Big Data abstracts the complexity of Hadoop and its “interfaces” (specifically Pig, HBase, Sqoop and Hive) by allowing graphical design of the big data integration jobs, and generating native MapReduce code. It alleviates the need for a deep, technical understanding of MapReduce and the different components of Hadoop. And, equally important, it brings to the table over 450 connectors to “the rest” of the information system – integrating enterprise data into Hadoop.

Hortonworks’ choice of Talend is a clear testament to the traction and recognition that Talend has been getting on the integration market, but also of the technical superiority of the solution. Our engineering teams have been working closely together to optimize the combined stack, and will continue to collaborate on further improvements to the solution.

Stay tuned for more.

Yves

 

[2012-02-29] Talend Blog: A Big Announcement for Big Data

Today at Strata Conference, “the” big data event, we announced Talend Open Studio for Big Data. This new product, based on our award winning Talend Open Studio technology, provides native support for HDFS, Pig, HBase, Sqoop and Hive. By leveraging Hadoop’s MapReduce architecture for highly distributed data processing, Talend Open Studio for Big Data generates native Hadoop code and runs data transformations directly inside Hadoop for maximum scalability.

Big data is getting easier

What does this mean?  Quite simply, that it no longer takes a PhD in massively parallel computing and data science to leverage your big data. Quite the opposite. Big data integration jobs can now be designed through a drag and drop user interface, and Talend Open Studio for Big Data generates the complex MapReduce code it takes to leverage your Hadoop clusters and run these complex transformations.

Big data is getting better

Talend Open Studio for Big Data is also part of the Talend Platform for Big Data, which adds enterprise-grade project management features and – very important – big data quality. Bringing data quality to big data is actually a first. The data quality dimension is not additive to the volume & complexity dimensions – it is multiplicative.  Big data exacerbates the data non-quality problems, to the point of rendering it unusable.

Big data is getting its data integration

Released under the Apache Software License, Talend Open Studio for Big Data is becoming a native part of the Apache Hadoop stack. Compatible with all Hadoop distributions, Talend Open Studio for Big Data is also becoming the key integration component of the Hortonworks Data Platform.  And that’s big, too. It makes Talend Open Studio for Big Data the de facto standard big data integration tool.

You can download now a technology preview of Talend Open Studio for Big Data. Don’t wait!

Yves

 

[2012-02-28] Talend Blog: In France, Triple A is not Lost for Everyone in BI

Last week, French BI community Decideo published the results of their annual Barometer, which provides an unbiased, independent view of the expectations and vision of users of BI solutions and platforms. 30 independent vendors and 7 “full stack” providers were graded by members of the community. Based on the overall ratings, vendors would get an A, a B or a C.

Out of the 37 vendors thus graded, only 3 got an A – and all 3 of them are independent, best of breed vendors. They are QlikView, Tableau Software and Talend. This Triple A is also a set of vendors that represent a newer generation of technology, one that is disruptive to established players, and that have been taking over the market at a very fast speed.  It is worth noting that none of the full stack vendors scored exceptionally well.  For complete results, the full report can be purchased from Decideo.

Unlike finance, when only extreme conservatism will get you the best grades, the BI market rewards innovation and disruption. Kudos to the Decideo members for recognizing this.

Yves

PS: in all fairness, despite lots of noise to the contrary, France did not really “lose its Triple A” – only one of the 3 major credit rating agencies downgraded the French debt. Still, it’s one too many.

 

[2012-02-20] Talend Forum Announcement: Talend Unified Platform 5.0.2 is Available

Dear Community members,

We are proud to announce that Talend's 5.0.2 Unified Platform release is available.
Download Talend's Unified Platform General Availability version.

This release is a bugfixing release. You will find more information on all fixed bugs on our bugtracker, for all versions. Below, you will find a direct link to the bugtracker to view resolved issues for all Talend Open Studio products.

For Talend Open Studio for Data Integration: http://jira.talendforge.org/secure/Issu … stId=12088
For Talend Open Studio for Data Quality: http://jira.talendforge.org/secure/Issu … stId=12084
For Talend Open Studio for MDM: http://jira.talendforge.org/secure/Issu … stId=12087
For Talend Open Studio for ESB: http://jira.talendforge.org/secure/Rele … sion=12413

You can visit the JIRA bugtracker homepage for more information on all products.

Thanks for being a part of our community,
The Talend Team.

[2012-02-15] Talend Document: Forrester White paper: Top 10 Top Technology Trends to Watch

Download your complimentary copy of this whitepaper to learn why:

  • Holistic Integration is one of the hottest trends for the next three years
  • Firms are adopting the approach of holistic integration to address integration challenges
  • Data Services and virtualization are reaching a critical mass.

[2012-02-14] Talend Blog: Comparing Elephants and Kettles

Forrester released recently a pretty unusual report: the first Hadoop Wave. While, like other players in the Big Data market, I applaud every effort made to clarify a market that is moving very fast, I am still baffled by the criteria that were used to select the vendors included in this Wave.

I mean, who do we find there? Hadoop distros. Cloud. Management. Modeling. Data integration. Event streaming. Jim Kobielus, the Wave author, told Information Week that “It’s an apples and oranges collection of vendors”. I would go for a less fruity analogy: it’s more like an elephants and kettles collection (never heard this analogy before? come on…)

A number of players were omitted from this Wave. Microsoft, Oracle, Teradata, Informatica, Talend just to name a few – all have been offering Hadoop solutions for diverse amounts of time, some of them well before the Wave project even started – more than 6 months ago. And that puts us back at the heart of one of the critical issues plaguing analyst reports in general: latency. I have been arguing for a long time that such reports become obsolete before they are released.

The Hadoop Wave proves the point again. In a market that is moving so fast, how can you “freeze” offerings at a set date, and release findings more than 6 months later? I certainly understand that analysts have to juggle with many priorities and far too few resources, like the rest of us. But the system is inherently broken. This Hadoop Wave is already in urgent need of revision. Other markets are more mature – but this one needs real-time research.

Yves

PS: As far as the technical accuracy of the Wave is concerned, Curt Monash did a pretty thorough job of pointing out a number of technical aspects, and Derrick Harris from GigaOM offered an interesting taxonomy of Hadoop – saving me the embarrassment of trying to sound clever.

 

[2012-02-14] Talend Blog: Live Tweeporting from #GartnerMDM: Thursday Sessions

The following is an attempt to report via my tweetstream on 2 sessions held on Thursday morning at the #GartnerMDM Summit in London, as well as the guest keynote.

Taking Your Data Integration Competency To The Next Level In Support of MDM, by Ted Friedman

  • #dataintegration is a critical piece of #MDM – the blood of the enterprise. @ted_friedman #GartnerMDM
  • #dataintegration leverages separately designed data structures toward a common purpose. Good starting place for #MDM, right? #GartnerMDM
  • Always baffled by this 37% of companies that have #dataintegration tools standards, but not enforced. #whatsthepoint #GartnerMDM
  • Few orgs have more than 10 different #dataintegration tools. #relieved #GartnerMDM
  • Canonical integration form is a path to #MDM – or it can be the other way. @ted_friedman #GartnerMDM
  • 3 dims of #dataintegration: latency (real timeliness), granularity (batch vs trickle feed), physicality (moving vs virtual) #GartnerMDM
  • Lots of M&A by big vendors combining app integration & #dataintegration, mixed results in terms of integration. #GartnerMDM
  • Hype cycle highlights #opensource #dataintegration, #dataservices, convergence of #dataquality & data integration as key. #GartnerMDM
  • “I can build best #dataintegration in the world, if I don’t include #dataquality I am just pushing garbage around” @ted_friedman #GartnerMDM
  • Multiple vendors on #dataintegration #MQ include #dataquality as part of the stack. #Talend good example! #GartnerMDM

Applying Data Quality Best Practices To Maximize Trust In Your Master Data, by Ted Friedman

  • So, how do you measure quality of data? @ted_friedman has a measuring tape on his slide. Wish it was that easy. #GartnerMDM
  • Difficult to automate the measurement of data accuracy, but it is an objective assessment! #GartnerMDM
  • #dataquality: Minimize risks, meet regulatory requirements #GartnerMDM
  • #dataquality: Optimize costs, reduce waste #GartnerMDM
  • #dataquality: Improve customer experience, maximize revenue. #GartnerMDM
  • Most orgs have no idea of cost of poor #dataquality. Makes it difficult to justify cost of DQ program. #GartnerMDM
  • #BigData makes #dataquality even more complicated. More sources, complexity, technology issues. #GartnerMDM
  • RT @ted_friedman: #BigData is about more than “more”. Diversity, complexity, pace and other dimensions comprise the challenges. #GartnerMDM
  • “Stay out of jail”: great message to justify #dataquality program! #GartnerMDM
  • #dataquality workflow needed to support steward’s job. Think #BPM. Must be integrated – #holistic #integration. #GartnerMDM
  • New entrant on #MQ: @Talend, coming at #dataquality from #OpenSource angle. Also has #MDM product. #GartnerMDM

Guest Keynote: The Art of Business Influence by Mark Jeffries

  • Keynote by @markjeffries1 just starting: The Art of Business Influence. #GartnerMDM
  • @markjeffries1 looking for sales people in audience. Aside from a few from vendors he won’t find many! #GartnerMDM
  • We are all in sales, even though it’s a dirty word, says @markjeffries1 #GartnerMDM
  • When somebody goes above & beyond expectations, you want to pay them back. Can be monetary – or not. @markjeffries1 #GartnerMDM
  • Interactions always start with carefully balanced scales. Then they start tipping. @markjeffries1 #GartnerMDM
  • LWAR: Listen Watch Anticipate React. In this order. @markjeffries1 #GartnerMDM
  • Great example of a company that’s doing things right on Twitter: #SWA. @markjeffries1 #GartnerMDM
  • And now, the 3 Cs: Consistency Clarity Confidence @markjeffries1 #GartnerMDM
  • 7-8 NPM (nods per minute) are a sign to carry on. Nods stop – change topic. Btw it’s contagious. @markjeffries1 #GartnerMDM
  • Creating jealousy effect is very good way to get people to come around to your thinking. @markjeffries1 #GartnerMDM

#TheEnd. Hope you enjoyed it. Don’t hesitate to leave comments!

Yves (@ydemontcheuil)

 

[2012-02-14] Talend Blog: Live Tweeporting from #GartnerMDM: Wednesday Sessions

The following is an attempt to report via my tweetstream on 2 sessions held on Wednesday afternoon at the #GartnerMDM Summit in London.

Improving Business Intelligence Analysis Results With Integrated, Active MDM by Mark Beyer & Roxane Edjlali

  • #BI & #MDM – the chicken egg question. Roxane Edjlali & Mark Beyer at #GartnerMDM
  • Show of hands: most ppl in room do #BI, some do #MDM, none have combined them. #GartnerMDM
  • In #BI data flow is unidirectional, flows from sources to data warehouse. #MDM features a circular flow – like app integration. #GartnerMDM
  • Addressing issues after the fact – #dataquality impact on #dataintegration, changes to endpoints, etc. – should architect it. #GartnerMDM

Organizational Issues in MDM: Critical Roles and Stakeholder Management, by Ted Friedman & Debra Logan

  • Who pays for #MDM is key issue of sponsorship – sponsor may not have $$ but you still need their support. Don’t scare them! #GartnerMDM
  • Are execs reluctant to support #MDM program because they’re afraid they’ll be asked to pay for it? #GartnerMDM
  • RT @mosesjones: #GartnerMDM @ted_friedman ‘master data by definition is shared’
  • Data stewards must have authority and relevance. Don’t matter if they belong to business or IT. #GartnerMDM
  • @ted_friedman explaining RACI framework: Responsible Accountable Consulted Informed. Alleviates many ongoing debates. #GartnerMDM

#MoreToFollow.

Yves (@ydemontcheuil)

 

[2012-02-13] Talend Blog: Live Tweeporting from #GartnerMDM: Opening Sessions

The following is an attempt to report via my tweetstream on 3 important opening sessions of the #GartnerMDM Summit in London.

MDM Concepts & Introduction, by Bill O’Kane

  • Been attempting to unify/master data for 30 yrs. Guess what? Biz success happened anyway! O’Kane #GartnerMDM
  • #MDM value added: Run the Business < Grow the Business < Transform the Business. #GartnerMDM
  • “Don’t boil the ocean, start small” – very wise advice from O’Kane at #GartnerMDM
  • #MDM often the first foray into #SOA: opportunity for adding real-time. #Holistic integration? #GartnerMDM
  • Exec sponsorship changes often fatal to #MDM. Sponsorship by CFO usually effective – less turnover in position. #GartnerMDM

Opening Keynote

  • 420 attendees from 23 countries at #GartnerMDM. No #dataquality issue in country names this year?
  • Cloud, information, social, mobile: unstoppable forces, or more opportunities? Or both? #GartnerMDM
  • #SocialData & #Mobile: vast amounts of information generated about your company. How do you analyze it? Can’t govern but link. #GartnerMDM
  • Lots of barriers to moving #MDM into the #Cloud. I think it’s more about bringing Cloud/#SaaS data into MDM. #GartnerMDM
  • No high profile story of #MDM failure… But does MDM have the potential to be controversial & raise public interest? #GartnerMDM

Panel

(On the panel, two end users and Andrew White played the “Agree/Disagree” game on some key questions surrounding MDM.)

  • Panel arguing whether #MDM projects can prove hard savings ROI. #GartnerMDM
  • Panel agrees that #MDM is not an IT initiative. Still, need to give a chance to IT to make it work or it’s wishful thinking! #GartnerMDM
  • If you don’t have all your data in the ERP, ERP can’t drive your MDM! #duh #bestofbreed #GartnerMDM
  • Key to resilience of #MDM programs is to demonstrate value, beyond original sponsors. #panel #GartnerMDM
  • #MDM not mature enough to deal with unstructured data. Will come later. #GartnerMDM
  • RT @Paul_MDM: #GartnerMDM, more than 50% of audience think their BI provides bad results! (Obviously depends on job responsibilities)

#MoreToFollow.

Yves (@ydemontcheuil)

 

[2012-02-12] Talend Blog: Live Tweeporting from #GartnerMDM

Last week in London, Gartner (@Gartner_inc) ran two important events on the conference circuit: the EMEA Gartner BI Summit (#GartnerBI) and the EMEA Gartner MDM Summit (#GartnerMDM). I attended the latter of the two, and tracked the first one on Twitter.

A little side note to comment on the value of Twitter for conferences. Twitter provides a simple, interactive way for non-attendants to participate into a conference. Maybe not as deeply as by being there, but if you have a sufficient number of Tweeps in the audience, you can follow pretty well what’s happening, and even engage with attendees and speakers. And for attendees, it provides a good way to follow several tracks at the same time.  I think Gartner gets it – attendees were publicly encouraged to tweet with the #GartnerMDM hashtag, and conference chair Ted Friedman (@ted_friedman) has been an early adopter of Twitter and probably one of the most engaged tweeps of large IT analyst firms.

For the #GartnerBI conference, I relied heavily on Timo Elliot (@timoelliot) and Jean-Michel Franco (@jmichel_franco) for my reports. Andy Bitterer (@bitterer) tweeted a bit also but I guess he was super busy with presenting and meetings. Oh, and since I follow less people in the BI space than in the data management or integration space, I have probably missed several other key Tweeps – my apologies to them.

The best sessions at Gartner conference are the Magic Quadrant Power Session (when there is one, doesn’t happen every time) and the guest keynote. Seems that the Power Session was well attended and generated many comments – of course the one I liked best was this one from @jmichel_franco:

  • MQ for data integration tools: #Talend very close to the leaders quadrant now. Impressive moves since two years! #GartnerBI

At the #GartnerMDM conference, I was among the most prolific Tweeps. It’s also a good way for me to track the sessions. I tried to pick a few of the tweets I like the best, to summarize these presentations in the next couple posts (to avoid a Twitter-Blog-Overload).

#MoreToFollow.

Yves (@ydemontcheuil)

 

[2012-02-01] Talend Blog: Who’s in Who’s Who?

Gartner beat by a day the red version of the Who’s Who – the one where you are asked to write your bio for publication. Their Who’s Who in Open Source Data Quality came out on January 18, whereas Amazon lists January 19 as the publication date of the other Who’s Who, the one from A&C Black.

What are the other differences?

  • Gartner’s Who’s Who is in its 2nd edition, A&C Black just published their 164th edition.
  • Gartner’s Who’s Who has 12 pages (PDF version), the other one has 2608 pages.
  • Gartner’s is blue, not red.

Wait, there is more.

Unlike the “other” one, Gartner’s Who’s Who is not autobiographical. Analyst Andy Bitterer wrote the vendor profiles, and while he was kind enough to ask us to review for factual errors, this was not an opportunity to get him to write stuff he did not believe (and anyone who knows Andy knows that would be a lost battle anyway).  And while I agree with a lot of things he wrote in the report, for example that Talend is the most advanced of all open source data quality solutions, I also disagree with his assessment that most deployments of open-source data quality tools are in support of relatively small projects, or that they do not yet play a significant role in the market. But hey, as I have said before, it’s our job – the vendors – to turn him into a (stronger) believer!

Not all profiles are positive. Unlike the other Who’s Who (the red one) where people write their own profiles and say nice things about themselves, some of the profiles are not super positive for these vendors.  I guess the factual review process must have been interesting.

What should we wish for in the next edition? That other open source data quality solutions mature more, proving that there is not only Talend who is able to deliver an enterprise grade open source data quality solution.  I would love for Andy not to be able to write “Nearly all open-source data quality tools offerings must be considered largely toolboxes for techies, with Talend being a reasonable exception, as the vendor has even been included in Magic Quadrant for Data Quality Tools.” Honestly, the more the merrier.

If you want to check out the report, you can access it here, courtesy of Talend (registration required). You can also read our press release (no registration needed…).

Yves

PS: “Who’s Who in Open-Source Data Quality (2012 Update)” by Andreas Bitterer, January 18, 2012 is copyrighted by Gartner, Inc. and may not be reproduced or distributed in any form without Gartner’s prior written permission.


[2012-01-30] Talend Blog: Another Strong Growth Year for Talend

Talend has just issued its traditional “momentum for 2011” press release, which for non-publicly-traded company, is the closest thing one gets to an earnings report. OK, it contains less financial information than an earnings report. Still, there are interesting nuggets that I would like to highlight, and provide a bit of clarification. Please note that the 2011, the Year that Kept on Giving series on this blog will also provide more detail and insight on some of these accomplishments.

Growth

  • 103% growth in 2011 compared to 2010. Is there more to say? Are there many vendors who register this type of growth, especially in today’s economy?  More importantly, who keep doubling every year, 3 years in a row?
  • Almost doubling of the customer base (from 2,000 to 3,500). The closest vendor in the proprietary field, Informatica, managed to acquire only 4,300 customers in their 20 years of existence. And their new customers only accounts for 10% of their revenue.

Innovation

Readers of this blog are familiar with the sustained pace of innovation that is taking place at Talend. A few reminders of important 2011 milestones:

In October, we also celebrated the fifth birthday of Talend Open Studio!

Contributions

Talend is an open source vendor. Being open source is a complex equation with many variables, and while I won’t pretend we master all of these variables perfectly, a few things we did in 2011 are worth highlighting:

  • Launch of the Talend Community Coders program which showcases members of our engineering team who contribute to open source projects.
  • Contributed more code/commits than any other commercial entity to key Apache projects in the integration space – our engineers are committers to 16 Apache projects, up from only 5 one year ago!

Recognition

Beyond being a Visionary in 3 Gartner Magic Quadrants (Data Integration and Data Quality which were released in 2011, and the SOA one from 2010 in which Sopera was a Visionary before we acquired them), beyond being a Gartner Cool Vendor, beyond receiving a bunch of awards, what else is there to highlight?

Well, you’ll need to stay tuned for this. There are interesting reports that are about to be released, but I can’t make waves before they become public. And they are not dated 2011, so they don’t belong in this post anyway!

Yves

 

[2012-01-26] Talend Blog: Discovering Data Services through Use Cases: Legacy Integration

Talend recently announced the availability of the Talend Platform for Data Services. Built around Talend v5’s data integration and ESB technologies, it lets IT organizations operationalize data through the single development, deployment, runtime and monitoring environment of the Talend Unified Platform.

Through a couple blog posts, we look at typical use cases of the Talend Platform for Data Services. In recent posts, we looked at data virtualization and guaranteed delivery. This post explains how Data Services facilitate integration of legacy applications.

One of the key aspects of integrating legacy systems is the ability to preserve the operation of existing applications while the transition occurs. Even when organizations embrace a modern architecture, based on a Bus or Services architecture, it’s often hard to simply “service enable” running applications, which are often mission critical. In addition, it’s usually impossible to perform a big bang and to change everything at the same time.

An interesting use case would be the integration of legacy manufacturing systems, feeding metrics and operational data via messages to a Hub. Implementing Data Services in Hub mode allows these messages to be consumed centrally, enriched & transformed, and most importantly abstracted, such that other applications can now consume this data without being exposed to internal dependencies on such legacy systems.

Over time as the manufacturing software gets modernized, “intelligent” end-points start to be deployed that participate to a distributed Bus architecture and are able to transform & cleanse data at the source, adhering to a canonical message model and alleviating the need to perform this processing in the ESB Hub. At the same time, the remnants of the legacy architecture continue to function in Hub mode.

This model allows a smooth transition from a legacy Hub architecture to a service-enabled model, relying on a Bus architecture.

Yves

 

[2012-01-25] Talend Blog: Discovering Data Services through Use Cases: Guaranteed Delivery

Talend recently announced the availability of the Talend Platform for Data Services. Built around Talend v5’s data integration and ESB technologies, it lets IT organizations operationalize data through the single development, deployment, runtime and monitoring environment of the Talend Unified Platform.

Through a couple blog posts, we look at typical use cases of the Talend Platform for Data Services. In a recent post, we looked at data virtualization. This post explains the use of ESB as a persistent transport layer for guaranteed message delivery.

Unlike point to point interfacing between source and target, the ESB abstracts the transport layer and handles issues such as latency, reliability, security – to ensure a message is guaranteed to be delivered to its recipient. As an example, we can consider a retailer with many small stores, each of them running no more than a few point of sales (PoS) systems. Each of these PoS systems must feed its transactions in near-real-time to a centralized application. Polling each system from the centralized location would place a huge burden on the central system, and would be highly inefficient in term of performance.

Instead, data integration jobs are developed that, upon the execution of a transaction, creates a message containing the relevant information, and sends this message to the central system via the ESB’s messaging system. The ESB handles latency and throttling, network instabilities, and provides guaranteed delivery of the message. At the other end (on the central system side), data integration jobs are being triggered by incoming ESB messages and populate the consolidated transaction database. Because of the high volumes, these jobs are being managed as a pool of services that can scale elastically based on demand and volumes.

In order to ensure quality of the data coming through these messages, data quality jobs can also be developed, exposed as SOAP or REST services, and used by the ESB to deduplicate, cleanse and otherwise enrich the data prior to its insertion in the consolidated database.

An additional benefit is derived in case of the addition of heterogeneous PoS systems (M&A, new version, etc.). Only the data integration jobs need be modified to extract data and transform it to the pivot message format – no change is required on the central system side. Data Services add a layer of “insulation”, moreover it can support multiple formats and versions of the interface at the same time, whilst still ensure an architectural consistency across the enterprise, meaning that as other applications are integration or new business applications developed, they can be easily added in a very cost effective manner.

Finally, when changes are required to the data integration jobs, the ESB handles their deployment to all the PoS systems, greatly facilitating the management of the integration infrastructure.

Yves

Next post: Legacy Integration

 

[2012-01-25] Talend Forum Announcement: New Product Names for the Talend Products

Dear Community,

With the release of Talend v5, names of Talend’s products have changed. This change was primarily driven by a need to streamline the naming convention and to make it easier to understand for the community, customers, partners, etc. Product capabilities are not affected by the name change.

Open Source/Community Products

The open source/community products are now appropriately named to show that they belong to the “Talend Open Studio family”:
•             Talend Open Studio for Data Integration is the new name of Talend Open Studio.
•             Talend Open Studio for Data Quality is the new name of Talend Open Profiler.
•             Talend Open Studio for MDM is the new name of Talend MDM Community Edition.
•             Talend Open Studio for ESB is the new name of Talend ESB Studio Standard Edition.

Enterprise/Commercial Products

All subscription products are now using the same “Talend Enterprise” naming model:
•             Talend Enterprise Data Integration is the new name of Talend Integration Suite.
•             Talend Enterprise Data Quality is the new name of Talend Data Quality.
•             Talend Enterprise MDM is the new name of Talend MDM Enterprise Edition.
•             Talend Enterprise ESB is the new name of Talend ESB Enterprise Edition.

Talend Enterprise BPM is the latest addition to the product portfolio. Since our BPM technology in Talend v5 is provided through an OEM partnership with BonitaSoft, one can simply download their open source edition, Bonita Open Solution, which is essentially the open source version of Talend Enterprise BPM.

Steve Sarsfield, Product Marketing Manager

[2012-01-23] Talend Blog: Discovering Data Services through Use Cases: Data Virtualization

Talend recently announced the availability of the Talend Platform for Data Services. Built around Talend v5’s data integration and ESB technologies, it lets IT organizations operationalize data through the single development, deployment, runtime and monitoring environment of the Talend Unified Platform.

Through a couple blog posts, we look at typical use cases of the Talend Platform for Data Services. The first one is: data virtualization through web services.

Data virtualization abstracts one or several heterogeneous data sources via a Data Services layer that can be invoked in real time, near-real time or polled, to support multiple applications and processes that require access to underlying data-sources.

For example, let’s consider a media organization that delivers content on demand, 24×7, and is handling hundreds of simultaneous requests during peak load. This organization uses a SaaS CRM such as Salesforce.com, a home grown customer license management database running on Oracle, and SAP for invoicing and accounts receivables. Yet, a complete view of the customer is required by applications such as the customer web portal (where customers can retrieve licenses they have purchased, but only if their account is in good standing), the CRM itself (sales people need to see invoicing and payment history), the business intelligence system, and more.

In short, getting the complete view of a customer requires to access, at the same time, all three systems listed above. How can Data Services help?

  • Data integration jobs provide access to the sources through their native protocols and APIs, without the need to code. Data transformation, reconciliation and enrichment is defined graphically and results delivered in through code generation.
  • These data integration jobs are then exposed as web services, either using SOAP or REST and made available as a common Services Layer throughout the organization via the ESB.
  • Immediately, any application that needs to access a complete view of the customer can invoke the data services and retrieve, upon request and in real-time, the consolidated information required, as simply as if it was residing in a single data source.

Of course, to make this process smooth, complete integration between data integration and ESB is required. This enables automatic deployment, metadata compatibility, and ensures proper version control of the services.

Yves

Next posts: Guaranteed Delivery, Legacy Integration

 

[2012-01-19] Talend Blog: 2011, the Year that Kept on Giving: Partners

This past year has been a game changing year for Talend. In this series of short interviews with key Talend managers, we take a look at several key functions at Talend and try to understand which role they have played in Talend’s success in 2011.

Keith Goldstein joined Talend in the summer of 2011 as Vice President of Channels & Alliances.

Q: Keith, how important are partners to Talend’s success?

Keith: One word: cri-ti-cal. Since inception, Talend has been working very closely with partners – SI partners, technology partners, and even OEMs (one of Talend’s first important announcements was our OEM partnership with Jaspersoft, a partnership that is still strong as ever five years later). Partners help our clients be successful with our solutions.

In average, half of the deals we are closing involve a partner. Some of those are deals that are brought to us by a partner. Some of those are deals in which we ask a partner to join us, because we identified a need for some unique vertical expertise that our partner has, or simply for deployment services.

Talend has a strong services organization, but its role is not to deliver long term engagements, and our consultants are technology experts, not always domain experts. Rather, they support our clients and partners alike to ensure the success of deployments.

Q: Why would a partner OEM Talend’s technology?

Keith: OEM stands for Original Equipment Manufacturer. It essentially means the manufacturer is embedding a third party product in their own product. The whole idea is for each vendor to focus on their core competency and get “the rest” from best of breed vendors.

Under the “Powered by Talend” program, OEM partners are able to leverage all of Talend’s products – not just data integration and data quality as was the case historically. We revamped our OEM program in October, adding technologies such as Big Data Management, Master Data Management (MDM) and ESB to our OEMizable product portfolio.

Q: How is Talend addressing the needs of its SI and consulting partners?

Keith: We are about to announce a rejuvenated Talend Alliance Program that will better cater to the needs of our partners, allow them to do business more efficiently with Talend, and streamline a number of processes. Even though working with partners is part of our DNA, and we have done it from Day One, we felt we needed to address some of the growth pains that we had all experienced lately. Stay tuned for a key announcement soon!

 

[2012-01-17] Talend Blog: 2011, the Year that Kept on Giving: Application Integration

This past year has been a game changing year for Talend. In this series of short interviews with key Talend managers, we take a look at several key functions at Talend and try to understand which role they have played in Talend’s success in 2011.

Scott Devens is the General Manager of Talend’s Application Integration Division. His primary job, in 2011, has been to grow the new Application Integration business while ensuring coordination and integration with all other functions and groups of Talend.

Q: Scott, can you summarize 2011 for Talend’s Application Integration business?

Scott: 2011 was the first full year following Talend’s acquisition of Sopera. Together with our colleagues in all groups, we have worked hard to grow this new business, and the results have been rewarding. For example, we more than doubled the run-rate of the business compared to when Sopera was a standalone company. We also built a team of over 80 people dedicated to the Application Integration products and business.

A sure sign of this success is that over the course of the year, we have successfully closed large ESB deals in all 3 major regions where Talend does business (North America, EMEA, Asia-Pacific), proving the strong synergies with existing Talend’s solutions.  Recent customer win announcements such as IER or BF&M are only the first ones of a long series!

I would also like to mention the fact that Gartner named us a Cool Vendor in Application and Integration Platforms, a first recognition of the power of combining data and application integration.

Q: Which progress was made on the technology front?

Scott: Just 6 months after the Sopera acquisition, we released Talend ESB, the first Talend-branded ESB. It relies of course on Sopera’s technology and expertise but is also deeply integrated into the Talend Unified Platform technology. I wonder how many vendors succeed in integrating acquired technologies in such a short time frame (I mean really integrating, not just changing the name and pricing).

Key to this fast development is our strong team of Apache committers and contributors, showing our strategic leadership in key Apache projects. Publicly-available statistics actually show that our team members have made more commits to these projects than any other vendor team, including companies who have been working on these projects for a lot longer than we have (and who seem to have problems with basic arithmetic).

Of course, the release of Talend v5 in November was the latest milestone in this integration, with all Talend products now being part of a single, holistic integration solution.

Q: What are the next steps?

Scott: Talend has really changed over the past year and is now truly a multi-product company with a global go-to-market strategy. Several functions such as R&D, finance or marketing have already been brought together, and we are in the process of completing this integration process. I am proud to see that the Application Integration business has become part of Talend’s mainstream trade. With the first holistic integration solution on the market, Talend is well armed to compete in the broad integration market!

 

[2012-01-16] Talend Forum Announcement: Talend Unified Platform 4.2.4 is Available

Dear Community members,

We are proud to announce that Talend's 4.2.4 Unified Platform release is available.
Download Talend's Unified Platform General Availability version.

This release is a bugfixing release. You will find more information on all fixed bugs for Talend Open Studio for Data Integration, Talend Open Studio for Data Quality, Talend Open Studio for MDM and Talend Open Studio for ESB on the JIRA bugtracker.

Thanks for being a part of our community,
The Talend Team.

[2012-01-16] Talend Blog: New Product Names for all Talend’s Products

This post originally appeared in the Talend Newsletter #30.

With the release of Talend v5, names of Talend’s products are changing. This change was primarily driven by a need to streamline the naming convention and to make it easier to understand for the community, customers, partners, etc.

Note that this naming change does not alter the product capabilities or feature sets.

Open source/community products

The open source/community products are all being renamed to show their belonging to the “Talend Open Studio family”:

Enterprise/commercial products

All subscription products are now using the same “Talend Enterprise” naming model:

Note: There is no “Talend Open Studio for BPM”. Indeed, since the BPM technology in Talend v5 is provided through an OEM partnership with BonitaSoft, one can simply download their open source edition, Bonita Open Solution, which is essentially the open source version of Talend Enterprise BPM.

Yves

 

[2012-01-13] Talend Blog: 2011, the Year that Kept on Giving: Sales

This past year has been a game changing year for Talend. In this series of short interviews with key Talend managers, we take a look at several key functions at Talend and try to understand which role they have played in Talend’s success in 2011.

François Méro is Talend’s Vice President of Global Sales.

Q: François, how did you see Talend’s perception on the market evolve in 2011?

François: Historically, Talend’s model has been based primarily on adoption: a fraction of the users of our community products “maturing” into sales opportunities. Of course this is still going on, strong as ever.  The unbridled adoption of our solutions, thanks to the open source model and community, remains a key to our commercial success. But another trend started to take more and more significance in 2011: spontaneous requests for information or price quotes, by clients who were not already using our open source products. I am talking about organizations starting an integration project, or a MDM project, etc. looking at vendors and technologies, and including Talend in their short list.

I think several factors have contributed to this increased acknowledgment. The recognition we have gained from leading analysts such as Gartner or Forrester has played a role. Word of mouth is important too – IT leaders of Fortune 1000 listen and respect their peers. And of course our vision for democratization of integration through a unified platform is key here, it resonates really well.

Q: Did the holistic integration strategy already bear fruit last year?

François: Absolutely! We have been promoting this approach since 2010, when we acquired Sopera. And customers were more than ready to listen! IT leaders are tired of being the cash cows of large software vendors who provide only partly integrated products, which end up costing a lot more to make work together than their already sky-high acquisition cost. Talend’s holistic integration strategy brings together all aspects of data, application and process integration within a single solution. The value provided to customers is tremendous.

Q: Is Talend truly a multi-product vendor today?

François: Yes. Pure data integration now accounts for less than half of our total bookings. Data Quality and MDM projects have picked up in the past two years, and since we released Talend ESB last spring we have seen high traction for it, both in our user base (a clear confirmation of the holistic integration strategy), and also in green field opportunities.

 

[2012-01-11] Talend Blog: 2011, the Year that Kept on Giving: Communities

This past year has been a game changing year for Talend. In this series of short interviews with key Talend managers, we take a look at several key functions at Talend and try to understand which role they have played in Talend’s success in 2011.

Ross Turk joined Talend as Senior Director of Community just a year ago. His job is to ensure that Talend has a healthy relationship with the communities where we do business.

Q: Ross, what has changed in the past year in our relationship with the communities?

Ross: Before I came on board, Talend had a long and successful history of working with the communities. What we have done in this past year is spent a lot of time measuring and analyzing the behavior of the communities, and of the Talend employees who interact with the communities. We have become a lot more sophisticated in our measurement, which allows us to identify the areas where we can improve, and provide a better service to our community members.

As a side note, I would like to mention that we have been using Talend Open Studio a lot to extract and transform all this data – so yes, we eat our own dog food (and it’s yummy!).

Q: How well is Talend working with other open source communities?

Ross: A few months ago, we launched the Talend Community Coders program. Its purpose is to recognize the Talend employees who are actively contributing to open source projects, giving the community a single place where to look and making it easy to engage with our teams. The site features project lists, blogs, technical tutorials, etc. and is a great resource for the communities.

Q: Is Talend involved on the social media front?

Ross: Talend was an early adopter of social media! And this past year has seen tremendous traction – measurement and analysis has clearly helped us, but so has the overall increase in traction of these media. For example, the @talend Twitter account now has over 3000 followers – double what it had a year ago. The Talend Channel on YouTube is also continuing to grow and so are our communities and groups on LinkedIn, Viadeo and Xing, and of course our Facebook fan page.

Today, the way we use these channels to disseminate information and interact with the community has become really effective. Social media has become an alternative way through which people can interact with Talend.

Q: Are there also opportunities for the community to meet Talend people physically?

Ross: First, when I look at what my colleagues in marketing do, all the events to which we participate as a company, these are so many opportunities to meet with us. I personally attend a fair number of these, and there are always some technical folks who are happy to chat with users.

We also organized back in October a Community Day in Frankfurt, Germany. This was a fantastic opportunity to network between partners, users, customers, Talend employees, etc. We are still in the planning stages but expect more of these, as well as some regional user community events or user groups.

 

[2012-01-09] Talend Blog: 2011, the Year that Kept on Giving: Services

This past year has been a game changing year for Talend. In this series of short interviews with key Talend managers, we take a look at several key functions at Talend and try to understand which role they have played in Talend’s success in 2011.

As Vice President of Customer Services, Vincent Pineau runs the global services organization: training, consulting, and support.  The goal of his teams: help our customers be successful.

Q: Vincent, how did Talend’s support evolve in 2011?

Vincent: As we ramped up the business and the number of clients, we have also ramped up the support organization. Not only in term of people – the teams have grown significantly – but also in terms of infrastructure. We have updated all our Service Level Agreements and overhauled our back-office systems to better track incidents and provide up to the minute information to our clients. This also allows us to offer 24×7 support using a “follow the sun” approach.

Our case management system is based on OTRS, and of course our bug tracking system is the same as the R&D organization (JIRA). We are now using extensively Internet-based desktop sharing for troubleshooting.

Q: How did the consulting organization change?

Vincent: In 2011, we augmented our teams with a number of very talented consultants coming from Sopera after we acquired the company in late 2010. Since then, we have been relentlessly cross-training the consulting teams to broaden their technology expertise and make them more capable to deal with our clients’ complex integration issues. We have also implemented global methodologies and best practices, and introducing high level strategic consulting for our largest clients, in the form of Technical Account Management.

Q: On the training front, anything new?

Vincent: In 2011 we created a dedicated training practice, distinct from the consulting organization. Our training director is responsible for deploying our training curricula for our customers and partners. His role is also to offer a portfolio of courses that cover extensively our product and technology portfolio – not an easy task with our sustained speed of innovation!  And finally, we are taking a close look at delivery channels for training – stay tuned for more this year.

 

[2012-01-06] Talend Blog: 2011, the Year that Kept on Giving: R&D

This past year has been a game changing year for Talend. In this series of short interviews with key Talend managers, we take a look at several key functions at Talend and try to understand which role they have played in Talend’s success in 2011.

As Talend’s Chief Technical Officer, Cédric Carbone is in charge of the entire R&D organization.

Q: Cédric, what has changed in Talend‘s R&D organization in 2011?

Cédric: The biggest change is that we integrated the ESB development teams within the global R&D organization – not only the former Sopera engineers, but also a large number of new hires who joined us to accelerate the development of ESB technologies and their integration within the Talend Unified Platform. The key benefit of this global, integrated R&D team is our ability to easily leverage and reuse components across the different parts of the platform, and to produce a truly unified integration platform. Key features such as the Studio, the Administration Center, the Runtime, and more, are common to all products.

Q: Have you adopted new tools or methodologies?

Cédric: We retooled significantly our development environment last year, to make it more scalable and more contribution-friendly. On the development side, the team deployed JIRA as our central development and bug tracking tool. The global documentation team migrated to DocBook, a semantic language that enables us to create content in a presentation-neutral form and gives tremendous flexibility for publication in the formats our users need.

Q: How large is the R&D team now?

Cédric: We have grown the global R&D organization to approximately 150 people. More than half of them are located in our Beijing, China development center, but we also have sizable teams of developers and architects in Europe (France & Germany primarily) and in the US.  In total, our engineers work in 7 countries and 6 time zones. This creates unique coordination challenges, but also brings many benefits in terms of cultural diversity, proximity with the community, and time coverage (there are very few time slots when we don’t have engineers online).

Q: Is Talend still active in Apache projects?

Cédric: More than ever! This is an area where we have grown a lot last year. Today, Talend is involved in 16 Apache projects – 11 more than one year ago! We work very closely with the Apache community, via Apache Committers and Project Chairs who are part of the Talend teams. Today, 8 of our colleagues are recognized by the Apache Software Foundation, and we are helping more developers hone their skills to become active participants into these projects.

 

[2012-01-05] Talend Blog: 2011, the Year that Kept on Giving

This past year has been a game changing year for Talend. Growing from a pure-play data management vendor into a global player in the integration field has not been without challenges, but has also been a very rewarding process.

2011 has clearly been a milestone year for Talend. All parts of the organization have changed, and have played a critical role in Talend’s growth.

In a series of short interviews with key Talend managers, we will take a look at several key functions at Talend and try to understand their contribution to Talend’s success in 2011.  As new interviews are being posted, they will be linked below.

Yves

 

[2012-01-05] Talend Blog: Don’t You See the March of MDM Democratization?

In several countries, 2011 has seen dictatorial governments fall, and new democracies emerge. And in most cases, these governments, and political analysts, have been taken by surprise.

The same thing is happening in the MDM space. Exactly two years ago, Talend introduced the first open source MDM solution. It’s been seeding its roots since then. Until it started to really blossom. And yet, this is mostly happening under the radar.

In a recent post MDM in 2012: What Was, What Will Be… And What Won’t Be, Forrester’s Rob Karel addressed many of the changes in the MDM market in 2011 – and he is right on the money on all of them (data governance, multi-domain MDM, BPM & MDM convergence, etc.). But a major piece is missing: the march of democracy.

In the same vein, two Gartner Magic Quadrants (MDM of Customer Data by John Radcliffe, MDM of Product Data by Andrew White) were released late last year. And again, they fail to see the march of democracy.

Now, the goal of this post is not to take shots at Rob, John or Andrew. Primarily because I have already exposed my views about the way these major analyst firms follow, and don’t lead, major market trends, and everyone got a chance to comment on that topic. And also because it would be counterproductive. If political analysts around the world failed to see the uprisings of democracy until revolutions were under way, how can I blame industry analysts for not seeing what is happening under the radar in IT organizations? Frankly, it’s also because we, as a vendor, did not do a good enough job of showing them proof points.

The truth is, enterprise open source has democratized data integration & data quality – nobody is questioning this today (although a few years back, it was an uphill battle!). The march of democracy is under way for MDM as well.

In the past year, we have seen many midsize firms deploy Talend Open Studio for MDM (formerly Talend MDM Community Edition) in less than 3 month for real-life projects. Of course we also have references with larger firms such as The Weather Channel that selected Talend Enterprise MDM for single customer view, or Berlingske Media using Talend Enterprise MDM to get a single version of the truth, or B-process leveraging Talend Enterprise MDM to increase productivity. And these are only a few examples, many more references are under way and/or are available under NDA.

Democracy is coming to MDM. The good news is that nobody will get killed during this revolution.

Since this is the season, I’ll conclude this post with a few predicts:

  • This blog post will create a controversy, just how big I don’t know (.5 probability)
  • Open source MDM will continue to explode in 2012, bringing enterprise grade MDM technology to more and more companies, without forcing them to sell a kidney to buy their Kalashnikov (1.0 probability)
  • Methodology teams at major analyst firms will see the light and change the way they are looking at the market (.01 probability)
  • I am going to get spammed by advice from AR firms on how to leverage references with analysts (.99 probability)

Yves

[2012-01-04] Talend Blog: Great Confirmation Points

In May of last year, Talend has started to broaden its focus, extending from our traditional data management expertise into the fast converging application & data integration markets. This expansion has reached an important milestone with the release in November of Talend v5, the first truly holistic integration platform.

Of course, we did not embark on this strategic shift without serious consideration, and we had received converging advice from industry analysts and many IT leaders. Still, seeing the terrific outpouring of positive feedback when we issued the announcement has been heartwarming – a great confirmation that Talend is indeed leading the market toward unified integration.

  • Gartner analysts Jess Thompson & Ted Friedman, at the Application Architecture, Development & Integration Summit in November, presented a session Application and Data Integration: Converging Disciplines or Not? One of the key takeaways of the session was: “There are huge synergies between these domains from both a technology and practice point of view. Organizations must federate these activities & integrate the technologies to reduce costs and increase business agility.” If you are a Gartner client, several pieces of research have been published by the same analysts on that topic that make for very interesting reading.
  • Forrester analyst Mike Gilpin discussed in a recent webinar how the cloud and real-time business is driving the need for a unified approach to integrating data, applications and business processes. This webinar is available for replay.

Of course, we know what we are doing with our holistic integration strategy. But it’s good to receive such confirmation points.

Yves

[2011-12-30] Talend Blog: Happy New Year from Talend’s Co-Founders!

As a New Year is upon us, we would like to send our best wishes to the entire Talend Community: our customers, users, community members, partners, suppliers, and of course the entire Talend team. May 2012 bring you, and your loved ones, everything you wish, and more.

2011 has certainly been a fulfilling year for Talend. Now is not the time to list all our accomplishments – we know Marketing will be doing that soon enough – but when we look back at all the things we have accomplished, we are very pleased, and proud to be part of this great adventure.

This past year has really been a transforming year for Talend, a year during which we have successfully morphed from a pure play data management vendor into the only – and recognized – provider of holistic, and truly unified, integration solutions.

Of course our open source DNA has been staying strong as ever as we moved along this path, and again we are extremely proud of Talend’s ever renewed investment and contribution to the open source community.

So one more time, thank you to every member of the extensive Talend Community for your contribution to making these successes happen. And all the best for 2012!

Bertrand & Fabrice

[2011-12-22] Talend Document: 10 Best Practices for BPM Implementation

While Business Process Management (BPM) has many immediate and long-term benefits, a poor start can jeopardize the overall success of the mission. The steps and guidelines outlined in this white paper will help ensure the success of your organization's BPM implementation while minimizing the possibility of any potential setbacks.

[2011-12-19] Talend Forum Announcement: Talend Unified Platform 5.0.1 is Available

Dear Community members,

We are proud to announce that Talend's 5.0.1 Unified Platform release is available.
Download Talend's Unified Platform General Availability version.

This release is a bugfixing release. You will find more information on all fixed bugs for Talend Open Studio for Data Integration, Talend Open Studio for Data Quality, Talend Open Studio for MDM and Talend Open Studio for ESB on the JIRA bugtracker.

Thanks for being a part of our community,
The Talend Team.

[2011-12-14] Talend Document: Wayne Eckerson on Creating an Enterprise Data Strategy

An E-Book by Tech Target

A viable enterprise data strategy requires that data be managed and protected just like any other corporate asset. You also need a mix of “soft” skills—to build sustainable strategies and manage change, and “hard” skills—to use the technology to ensure the delivery of consistent, high-quality data.

Download this eBook now for proven data management strategies and techniques for avoiding common pitfalls.

[2011-12-12] Talend Blog: Participate to Ventana’s Research on Information Management

ventanaresearch1.jpgOur friends at Ventana Research are conducting research on how organizations adopt comprehensive information management processes and technologies to ensure the best quality and timely information for business. They would like the benefit of your experience and perspective on the process, issues, technology and best practices related to information management, and have asked us to help promote this survey: Uncover Better Methods for Information Management.

What’s in it for you? Upon completion of the survey, you will receive a copy of Ventana Research’s Business Data in the Cloud research white paper. They’ll also send you a free report on the findings when the research is complete. Please take the survey now.

Yves

[2011-12-07] Talend Press Release: Bi3 Solutions Embeds Talend's Data Integration Technology to Power Cloud-based Social Media Intelligence Applications

Bi3 Solutions Embeds Talend's Data Integration Technology to Power Cloud-based Social Media Intelligence Applications

OEM offering “Powered by Talend” solves critical Big Data integration challenges for customers’ real-time, high-volume social media data

Los Altos, CA - December 8, 2011 - Talend, a global open source software leader, today announced that Bi3 Solutions, the leading provider of Decision Intelligence Solutions, has embedded Talend Integration Suite inside its Software-as-a-Service (SaaS) platform to ensure that their applications can process and extract content from high-volumes of social media data.

Bi3 Solutions’ data analytics applications help improve customer service by quantifying customer feedback and making it easier to understand customer sentiment. Given the explosive growth of social media, Bi3 began deploying applications that extract content from social media channels. These applications filter through real-time media streams from sites like Twitter and Facebook, locating and tagging specific topics of interest to help customers gain a real-time understanding of how their brands are faring in the social media realm at any given moment. Transitioning its focus to social media, however, brought new infrastructure challenges, such as a drastic increase in the quantity of data to manage.

“In the past, we had to manage a limited amount of data flowing through a Customer Relationship Management or other in-house operational system,” said Thomas Groh, co-founder of Bi3 Solutions. “In social media, the volumes being churned out are much larger – almost infinite, creating so called ‘Big Data’ challenges. Developers need to ensure that their applications had the ability to process and make sense of thousands of tweets and posts per second, and for more than one customer at a time.”

Also critical to the success of Bi3’s applications was the ability to support a hybrid and highly distributed architecture. Because Bi3 runs in the Cloud and extracts data from multiple sources, its system features many individual nodes—all of which have a constant flow of data arriving. If a node were to fail for some reason, the application needed a way to hold on to incoming data during downtime and then continue processing as soon as the node was functional again.

Bi3 selects Talend’s open source data integration platform for Big Data

With real-time content analytics as its primary selling point, Bi3 knew that its application infrastructure hinged on a solid data integration platform. From the outset, Bi3 developers were drawn to Talend Integration Suite over alternatives from other providers such as IBM and Informatica. The company opted to partner with Talend through its OEM program, “Powered by Talend,” applying the technology at the core of its newly revamped Integrated Decision Optimization Center (IDOC) platform.

“Talend offered the leading data integration capabilities for our SaaS cloud-based Bi3 Solutions application, was the most mature product in the space, and provided a lot of room for us to grow our solutions over time,” said Eric Miles, CEO of Bi3 Solutions. “In addition, their OEM program is clearly designed for a company like ours, offering us the flexibility we need and a price model that does not destroy our competitiveness.”

Due to its real-time integration and execution capabilities, Talend’s Big Data integration ensures that Bi3’s Decision Intelligence applications have the performance capacity to handle vast, ongoing data streams at high speeds. For example, the Twitter Sentiment Analysis application can comb through every message as it’s generated on Twitter, filter them all based on a customer’s desired search terms, and then tag them.

Scalable Talend platform awaits future growth of Bi3 applications

While Bi3 only recently began rolling out its social media analysis applications, the company has several customers currently engaged and another 40 on the horizon over the next 12 months. In addition, the company plans to extend its applications’ social media coverage to include analysis of YouTube content, which – according to Wikipedia -- is added at the rate of around 48 hours of video every minute.

“We feel very comfortable that as we continue to scale up and add new capabilities, we’ve chosen a platform from Talend that covers us, no matter what’s coming,” added Miles.

About Bi3 Solutions

Bi3 Solutions, a private enterprise software company headquartered in San Mateo, California, provides Decision Intelligence applications as a service from the cloud. The applications empower an organizations decision making processes by relating relevant data, both operational and textual, to a specific situation. A quality decision can be made and appropriate actions executed through facilitated employee collaboration. http://www.bi3solutions.com.

[2011-12-05] Talend Document: Preparing your Organization for Master Data Management

This step-by-step report, created by MDM guru David Loshin and leading research firm TDWI, gives powerful strategies for preparing for your MDM initiative, helping you to anticipate potential roadblocks, improve planning and align your successful data management program across the entire organization

[2011-12-03] Talend Blog: Impressions from Gartner’s AADI Summit

hotel-caesars-palace-na.jpgThis week was a busy one: 3 days in Vegas for the Gartner Application Architecture, Development & Integration (AADI) Summit, and the launch of Talend v5. I have already commented on the latter, but wanted to share a few thoughts and comments about the Summit.

The first fleeting thought is a bit of disappointment on the amount of online social interactions.  There were not nearly enough Tweeps at this event. Few analysts cared to tweet their thoughts – with the notable exception of @ted_friedman and @merv. And even vendors, who are usually all about trying to look good by tweeting bits and pieces, did not seem very engaged. I think that with the few tweets I posted, I got on the big screen too often (see below).

Now, not all good interactions are online, right? Otherwise, what would be the point of traveling to Vegas (apart from the gambling, I mean). And from that standpoint, the conference was great. We had many great discussions with a number of analysts who are covering our space – the broad integration market – and got excellent feedback from all of them on our new holistic integration positioning: bringing together data, application and process integration. I am very grateful to Jess Thompson, Yefim Natis, Anne-Thomas Manes, Massimo Pezzini, Ted Friedman, Benoit Lheureux, Jim Sinur, Dan Sholler, Merv Adrian and others for the time they spent with us and their feedback. That’s a lot of people we met but Talend’s scope is clearly expanding and spanning more and more markets.

The sessions were also great, at least the few I was able to attend between meetings, press briefings on v5, etc. Lots of discussions on XaaS: SaaS, IaaS, PaaS, iPaaS, aPaaS.  These three letters – aaS – are pretty hot (no pun intended… OK maybe a pun was intended… this was in Vegas after all). Cloud is not the hot topic it was, but that’s because Cloud is going mainstream and now the issues are much more refined that Cloud or No Cloud – it’s what kind of Cloud, for which purpose, and how to integrate it with the rest of the information system (hybrid cloud integration).

For a conference targeted on integration, I think that the data/information content was under addressed. Of course, the finest data analysts were there (hmm, now I am going to hear from others who don’t like not being the finest), but there were too few sessions on information. We are still educating the applications crowd on the value or proper information management.

tweetscreen.jpgI am sure everyone will be shocked to hear that my best moment at the conference was when Jess Thompson mentioned, in a session he cohosted with Ted Friedman on the convergence of data integration and application integration, that “Talend has launched their new version this morning, that combines data integration and application integration in the same platform with a common repository.” (I think I even enjoyed this moment better than when I split aces and drew two face cards later that night).

As you would expect the “How to deal with your megavendor” session was also pretty interesting, with tales of clients being milked by their vendor, or else not getting the value they should, and looking for exit strategies. But I have blogged about this not too long ago.

In any case, this was a very interesting conference, full of positive interactions and great feedback. And no, not everything that happens in Vegas stays in Vegas – as proven by this post.

Yves

PS: in case anyone is wondering, Vegas was chilly and cloudy/rainy, and anyway there was no free time to enjoy the great pool at the Caesars Palace that is shown on the photo above. It’s just marketing.

[2011-12-02] Talend Forum Announcement: Talend Version 5 and New Product Names

Dear Community members,

You might have noticed that your favorite Talend product you have just dowloaded has changed names with Version 5. And that the forums have been updated to reflect the change. I had announced it in the 5.0.0 GA availability message. For even more visibility, I am creating this thread and adding a message from Yves de Montcheuil, our VP of Marketing, on the reason why products are being renamed. Yves also lists the modifications for open source products.

New Product Names in Talend v5

From Yves de Montcheuil, on the Talend blog

Version 5 is now out, and with it come some product name changes. Specifically, the open source/community products are all being renamed to show their belonging to the “Talend Open Studio family”.  They are listed here:

- Talend Open Studio for Data Integration is the new name of Talend Open Studio, the most widely used open source data integration/ETL tool in the world.
- Talend Open Studio for Data Quality is the new name of Talend Open Profiler, the only open source enterprise data profiling tool.
- Talend Open Studio for MDM is the new name of Talend MDM Community Edition, the first – and only – open source MDM solution.
- Talend Open Studio for ESB is the new name of Talend ESB Studio Standard Edition, the easy to use open source ESB based on leading Apache Software Foundation integration projects.

There is no “Talend Open Studio for BPM”. Indeed, since the BPM technology in Talend v5 is provided through an OEM partnership with BonitaSoft, one can simply download their open source edition, Bonita Open Solution, which is essentially the open source version of Talend Enterprise BPM.

See the message on the Talend blog: http://www.talend.com/blog/2011/11/30/n … talend-v5/

Best,
Pcoffre.

[2011-12-01] Talend Blog: Talend v5: Democratizing Holistic Integration

It’s been one year since Talend finalized the acquisition of Sopera, starting a deep transformation of our product portfolio – transformation that culminates this week with the release of Talend v5, a holistic integration platform for the enterprise.

What do we mean by “holistic”?  The Merriam Webster dictionary defines the adjective as: “relating to or concerned with wholes or with complete systems rather than with the analysis of, treatment of, or dissection into parts – [for example:] holistic medicine attempts to treat both the mind and the body; holistic ecology views humans and the environment as a single system.”

Frankly, IT often uses loosely some terms from the general corpus. But in this case, holistic does the trick. Holistic integration, as defined here, brings together all enterprise integration needs, including data, applications and business processes, as a complete system as opposed to disjointed technologies.

The promise of Talend v5 is to enable IT organizations to converge traditionally disparate integration efforts and practices through a common set of products, tools and best practices. When an organization deploys Talend v5, it will deploy essentially one platform, regardless of the integration need: data integration, application integration, process integration.

The platform approach does not mean that Talend v5 is bloatware (see, an IT word that does not really exist…). Each product within the platform remains best of breed and will also work independently of the others. For example, customers who only need an ETL for BI, or simply an ESB, will of course not need to deploy the entire platform, but just Talend Integration Suite or Talend ESB (for example).

This deep transformation has of course not altered Talend’s DNA. Talend was founded on the promise to democratize integration: make enterprise grade technologies available to all organizations through an open source development and go-to-market strategy.  This has not changed.  The underlying open source foundation of Talend’s products remains strong as ever, and can be downloaded freely.

Furthermore, the enterprise products, built around the Talend Unified Platform foundation, will continue to be priced very aggressively compared to proprietary vendors, while offering superior feature sets.

This is what we mean when we say we are democratizing holistic integration.

Yves

[2011-11-30] Talend Blog: New Product Names in Talend v5

Version 5 is now out, and with it come some product name changes. Specifically, the open source/community products are all being renamed to show their belonging to the “Talend Open Studio family”.  They are listed here:

talend_products_v5_md.jpg

There is no “Talend Open Studio for BPM”. Indeed, since the BPM technology in Talend v5 is provided through an OEM partnership with BonitaSoft, one can simply download their open source edition, Bonita Open Solution, which is essentially the open source version of Talend Enterprise BPM.

Yves

[2011-11-30] Talend Blog: Welcome to Talend v5!

talend_v5_job.jpgIt has been eighteen months since v4, our latest major platform release that saw the unification of multiple data management technologies. In these eighteen months, a lot has happened and Talend’s product portfolio has been expanding considerably, now providing a holistic platform for integration which includes data management, application integration and process integration.

Version 5 is a key milestone for Talend, a new platform that will allow us to continue to build on our foundation and deliver always more advanced integration capabilities.

I want to thank our great engineering team that has put in long hours of work, our QA team that has carefully tested each feature, our technical documentation team that wrote the manuals and user guides, our Product Management and Product Marketing teams, without whom this product would be nowhere near where it is today.

I also want to thank the entire Talend community, who have contributed tremendously to this new version, through features requests & bug reports, through downloads and tests of Milestone and Release Candidate versions, and through continuous encouragement of our teams.

Download the products of Talend v5, try them, and let us know what you think! You can also view a summary of the new features here.

Fabrice

[2011-11-30] Talend Forum Announcement: Talend Unified Platform 5.0.0 is Available

Dear Community members,

We are proud to announce that Talend's 5.0.0 Unified Platform release is available.
Download Talend's Unified Platform General Availability version.

Compared to version 4.2.3, many new features have been added. Following are the most important features included in the milestones and release candidates for Talend Open Studio for Data Integration (previously Talend Open Studio), Talend Open Studio for Data Quality (previously Talend Open Profiler), Talend Open Studio for MDM (previously Talend MDM Community Edition) and Talend Open Studio for ESB (previously Talend ESB).

Talend Open Studio for Data Integration

In RC2:

- TDI-17358: new flow created via "*New Output* (Main)" does not appears in SCHEMAS parameter table

In RC1:

Studio :
- TDI-17374:Exchange view: create a new plugin for exchange system.(almost finished )
- TDI-17999:Import from repository should also import the MDM metadata

Components :
- TDI-17994: Add the option transfert mode in component "tFTPFileProperties component"
- TDI-17944: Create the component tMDMTriggerOutput
- TDI-17943: Create the component tMDMTriggerInput
- TDI-17941: XMLMap: Add an option to write all the input data in only one document
- TDI-17908: Manage the tXMLMap rejects like in the tMap
- TDI-17890: tXMLMap: Add the possibility to create (OR NOT) an empty element
- TDI-17463: Create the component tWriteJSONField

In M5:

Studio:
- TDI-17374: New Exchange system (not fully working)
- TDI-17372: Getting Started modified, use html page. (DI only)
- TDI-17373: New TalendForge account linking.
- TDI-17229: Usage Data Collector.
- TDI-17592: Review Talend Open Studio Login dialog.
- TDI-17258: Add a new attribute like: SHOW_IF with type:String for the ELEMENT:RETURN.

Components:
- TDI-17815: the path in the tOldDbOutput_begin.javajet&tOleDbOutput_main.javajet are incorrect
- TDI-17614: Create a generic component for Teradata TPT
- TDI-17492: Aggregating multiple 'rows' in tXMLMap component
- TDI-17498: Can't write binary data with tFileOutputLdif
- TDI-17470: Vertica components: Support of Vertica 5.0
- TDI-17463: Create the component tWriteJSONField
- TDI-17303: tfileinputmsxml uses wrong dtd file location in the input xml file
- TDI-13094: GlobalMap coming from tLoop doesn't work
- TDI-12941: tMysqlOutputBulk - no error when disk is full

In M4:

- TDI-17222: Wizard Db connection for Oracle RAC.
- TDI-17310: Remove Perl.
- TDI-17413: Review the system of libraries (keep jars in original location / avoid to use .JavaLibs in commandline)
- TDI-8697: Change display of repository for Services. (ESB)

Components:

- TDI-17448: Add the possibility to use the tXMLMap as a virtual component
- TDI-17383: Salesforce component: Add an advanced option to set the Client ID
- TDI-17285: Please update JSch to latest release
- TDI-17283: tFileOutputPositional should use toPlainString() for BigDecimal
- TDI-17276: tSybaseBulkExec component should support the interface file when calling bcp.exe
- TDI-17256: Option to mask data entry to Question tMsgBox
- TDI-17250: Refactor the tFileInputPositional
- TDI-17195: tOracleOutput : truncate table reuse storage
- TDI-17162: Support of Oracle RAC in the existing Oracle components
- TDI-17022: Amazon RDS components
- TDI-6890: "java Code of a method longer than 65535 bytes" in tFileOutputPositional
- TDI-16974: tXMLMap : Match model : support "Unique match" and "First match"

In M3:

- Prepare implementation for the future OBR.
- Change the storage to have separate file for screenshots.
- Improve OSGI export system (prepare for unified platform).
- Enhance the display of the tXmlMap.
- Add a possibility to link files to properties. (Allow to save easily xsd / wsdl in repository with all standard repository functionality like move/rename...).
- Use components/ext only for user components not for other components providers.
- For Talend ESB: add service repository (planned to finish in M4).

Components:

- tMicrosoftCrmExecute.
- Develop missing features in tXMLMap.
- Support Document type in XMLMap lookup.
- tXMLMap : Support "Left outer join" in lookup table.
- tXMLMap: Lookup model: support "Reload at each row" and "Reload at each row (cache)" mod.
- Development of tSAPBWInput.
- Refactor Oracle components.
- Improve tOracleInput.
- Improve tOracleOutputBulkExec.
- Amazon RDS components.
- Develop the Amazon components for Oracle.
- Download and verify the components tAmazonRDS* (Amazon RDS for Mysql).

In M1/M2:

- Look and feel enhancement of tXmlMap

Components:
- tFileFetch Cookies Are Not Being Applied
- Implicit conversion in DOS format when attached file with tsendMail component
- MQ non-destructive read
- Create a new tMDMConnection ant a tMDMClose components
- In the component tFileList, the text type of "Exclude FileMask" is a limitation to the function of the component
- "tMicrosoftCRMOutput won't connect to MS Dynamic CRM2011 Online"
- Cannot keep data in CDC table (Ingres)
- Upgrade jasper report library

Talend Open Studio for Data Quality

In RC3:

- TDQ-1837     Add a preference option to limit the number of analyzed columns

In M5:

- Phone number validation indicators

In M4:

- TDQ-1845 Add Filters in DQ Repository view
- TDQ-1760 Java UDI: not convinient to delete udi jar files

In M3:

- Add the phone number indicators in the column analysis report.
- Add filters in column or table selection dialogs.

In M1/M2:

- Add filters in column or table selection dialogs
- Create phone number validation indicators
- DQ Repository View: Add Filters in DQ Repository view
- Java UDI: not convinient to delete udi jar files
- Cheat sheet to deduplicate data
- Create the ParserRule object
- Create phone number validation indicators
- Add the phone number indicators in the column analysis report

Talend Open Studio for MDM

In RC3:

- [TMDM-2290] - Open /save Process in the repositoryView
- [TMDM-2416] - create/open picture resource in the repositoryView
- [TMDM-2491] - Share server connection information between Server View and Repository View
- [TMDM-2608] - Update must allow deletion on the server-side too
- [TMDM-2651] - The update server dialog must indicate what it is going to do
- [TMDM-2714] - Auto-deploy to last server on save
- [TMDM-2744] - Real classpath isolation in job ran by the "jobox" (zip) container
- [TMDM-2777] - logical delete+recyclebin

In RC2:

Studio

- [TMDM-2610] - Restore the "browse" action on views, rename as "Test"
- [TMDM-2633] - MDM Component Rework (tMDMTriggerInput / tMDMTriggerOutput)
- [TMDM-2653] - The confirmation dialog upon deploying objects should reflect what was actually done
- [TMDM-2707] - Deprecate MDM Server view
- [TMDM-2718] - Server selection basics

Web UI

- [TMDM-2489] - Item browser gxt refactor -- item FK relations
- [TMDM-2496] - To add default value rule for tree panel in new item browser refractor
- [TMDM-2612] - BrowseRecords---ForeignKey component display in ForeignKey Tab
- [TMDM-2627] - Polishing the current new web server -- Actions selector and truncated words for all webapp
- [TMDM-2628] - Polishing the current new web server -- Import and Export
- [TMDM-2655] - Composite View navigation
- [TMDM-2679] - FT search in web ui, displaying the entity name in the 2nd drop-down is misleading
- [TMDM-2728] - Open the detailed panel in the outermost tabpanel
- [TMDM-2734] - Increase the default # of lines per page in the search result
- [TMDM-2662] - CSS Tuning on the new item browser to make it comply with the expected Look&Feel (attachment).
- [TMDM-2678] - Look and feel rework

In RC1:

- [TMDM-1711] - Fr translations handling
- [TMDM-1826] - Item browser gxt refactor -- rules
- [TMDM-1843] - Item browser gxt refactor -- Inheritance & polymorphism
- [TMDM-1865] - Filter by object category in the repository view
- [TMDM-1868] - Item browser gxt refactor -- form
- [TMDM-2390] - Item browser gxt refactor -- Clone
- [TMDM-2391] - Item browser gxt refactor -- Composite view
- [TMDM-2392] - Item browser gxt refactor -- TreeDetail refactor and FK list display
- [TMDM-2395] - Item browser gxt refactor -- Entity Banner
- [TMDM-2397] - Item browser gxt refactor -- Custom UI Form Designer
- [TMDM-2434] - Item browser gxt refactor -- Banner
- [TMDM-2502] - Revisit 'Deploy All' action
- [TMDM-2530] - Import/Export custom form objects from MDM Repository View to the server
- [TMDM-2555] - implement context menu for data model entity/element
- [TMDM-2564] - add remove function for fk node list
- [TMDM-2566] - Repository studio Integration: Item property view
- [TMDM-2567] - MDM Repository cosmetic changes
- [TMDM-2607] - Primary Key info presentation in the header panel
- [TMDM-2609] - Browse container on the data-container repository object, like in the MDM server view
- [TMDM-2613] - BrowseRecords---Display ForeignKey Tab separately.link and create ForeignKey
- [TMDM-2696] - Omit the root element when displaying the default form layout

In M5:

Studio:
- MDM Repository view integrated with the platform repository fully functional
- Graphical custom form editor

Web UI:
- Most web modules ported to GXT
- Redevelopment of web forms with composite view

Server:
- Foreign key integrity constraint

In M3:

- Offer a duplicate action on a repeating element within a group (complex-type). An additional '+' button that clones the element instead of adding a new empty element. Should work with a repeating complex-type too, in which case it clones the complete fragment.

In M1/M2:

- Auto-expand annotation for data model
- Consistency of process results
- Regain some vertical real estate in data-model

Talend Open Studio for ESB

In RC3:

- [TESB-3705] – ESB Studio move to released camel (2.8.2) and cxf (2.5.0)
- [TESB-3760] – Route Builder: Add new cRecipientList component into Router

In RC2:

- [TESB-3352] – Routes: ESB Features support for cCXF components for SL & SAM
- [TESB-3486] – Rent-A-Car sample runs together with new Talend STS
- [TESB-3531] – Data Service: Custom Locator properties for Data Services Consumer
- [TESB-3590] – Data Service: Simplify selection of the services from the workspace in tESBConsumer component
- [TESB-3690] – Data Service: Improved security configuration in tESBConsumer
- [TESB-3583] - Data Service: Improve service metadata editing dialog
Change request
- [TESB-3527] – Locator Service: Enhancements / Interface updates and additional improvements
- [TESB-3546] – Routes: Camel components required by Route builder components preinstalled in Karaf container
- [TESB-3621] – Data Services: studio web service export: change way for generate runtime configuration to template mechanism
- [TESB-3637] – Data Services: Change the metadata editing action title
- [TESB-3655] – Data Services: Remove namespace schema folder when import WSDL schema

In RC1:

- [TESB-2366] – STS: Support KerberosToken in WS-Security header of RST
- [TESB-2367] – STS: Issued SAML Token must be extensible for custom AttributeStatements
- [TESB-2368] – STS: Custom Claims providers
- [TESB-2369] – STS: TokenProvider and TokenValidator must be able to give more context of the error condition
- [TESB-2370] – STS: Username/password authenticator
- [TESB-2371] – STS: An STS JVM must support several authentication systems (IDM)
- [TESB-2911] – STS: Token caching in CXF intermediary service
- [TESB-2971] – Authentication for Data Services
- [TESB-3056] - Enable security for Data Services
- [TESB-3081] - Samples in the software package instead of the documentation package
- [TESB-3104] - Plain ActiveMQ Distribution in Talend ESB Runtime package
- [TESB-3122] – STS: Signature and encryption username and passphrase configurable
- [TESB-3156] - Automated start / stop of Routes and Data Services
- [TESB-3279] - Karaf Cellar in TESB (Technical Prev.)
- [TESB-3351] - tESBProviderRequest Component should use new 'Service' item
- [TESB-3438] - Rename ESB runtime to reflect broadened scope
- [TESB-3525] - Add context names to the OSGi feature configuration
- [TESB-3530] - ESB CXF Features as OSGi Services

Change requests:
- [TESB-2133] - Include Karaf-Webconsole as preinstalled feature
- [TESB-3328] - Update tHttp request and response types to Document
- [TESB-3413] - Talend Runtime: Remove explicit start / stop commands & update list commands
- [TESB-3545] - Configurable CXF work queues in Karaf

In M5:

Tooling:

- [TESB-2498] [Studio] Export Service and Route as OSGI bundle(s)
- [TESB-3174] [RouteBuilder] Generated code for Routes now support stop and shutdown

Improvements:
- [TESB-3028] [Studio] Migrated to CXF 2.5.0-SNAPSHOT and Camel 2.8.2-SNAPSHOT
- [TESB-3186] [Studio] Renamed tHTTPRequest and tHTTPResponse to tRESTRequest and tRESTResponse
- [TESB-3229] [RouteBuilder] Renamed cCamelContext component to cConfig
- [TESB-2744] [Components] Jobs that uses tESB* components should rely on OSGi require-bundle instead of embedded bundles.

Runtime:

- TESB-1446 - Enable Service Locator for Rest based services.
- TESB-2692 - Implement the Rest interface for the Service Locator proxy
- TESB-2972 - Include CXF-STS in delivery
- TESB-3060 - Expose all Talend Runtime specific commands via JMX MBeans
- TESB-3068 - Ready to deploy Archiva package for Tomcat

Improvements:
- TESB-2938 - Same native OS metrics in Karaf embedded JobServer as in stand-alone JobServer

In M3:

- New tHTTPRequest and tHTTPResponse component for implementing REST-full service providers.
- Importing Camel Routes from Spring XML file.
- New specialized components for CXF, File, JMS, FTP and ActiveMQ.
- Displaying custom properties in TAC SAM UI.
- Paging and sorting in TAC SL UI.

In M1/M2:

ESB Studio Standard Edition

Data Services Builder:
- Automatic import of XML Schemas from WSDL to Metadata. It is simpler now to re-use WSDL types in tXMLMap.
- Custom properties support for SAM Events in tESB Components: now it's possible to attach custom attributes to SAM events sent from Data Services.
- Add WSDL-driven configuration for tESBFault component
- Missing namespace in job-first generated WSDL

Route Builder:
- Added first specific implementations of generic cMessagingEndpoint for Web Service (CXF), ActiveMQ, File and FTP.

ESB Runtime Standard Edition

- Starting of SAM Server as mvn jetty:run-war fails for examples/talend/tesb/sam/WSDLException while running WS-Security (STS) enabled Rent A Car tutorial.
- Specification of endpoint selection strategy per client
- Change of Default TESB Container HTTP Port  (it's now 8040 !)
- Fixed issue with Zookeeper start script on Windows machines

For more information on fixed bugs and new features, go to the TalendForge Bugtracker.

Thanks for being a part of our community,
The Talend Team.

[2011-11-28] Talend Blog: The Truth is That They Don’t Care as Long as you Pay

After Orlando, Barcelona, and before India, Gartner held their Symposium in Australia last week. A much expected session of Symposium, and many other Gartner events, discusses how to deal with mega-vendors. Brett Winterford from IT News was in the room when Dennis Gaughan presented this session, and came back with some interesting – but not at all surprising – insight into the strategy of IBM, Microsoft, Oracle and SAP, the four mega vendors that rule IT today.

There are so many sound bites in Brett’s article that I wouldn’t know where to start – and my goal is not to plagiarize his article anyway.  Still, I couldn’t resist picking a few, see at the end of this post.

At the end of the day, it comes down to a simple fact: customers are cash cows for the mega-vendors. They don’t care, as long as you pay. They don’t view customers as partners with whom to build strong technology, they view them as account balances in their ledgers.

We have always known that mega-vendors rarely innovate. Sure, they maintain research facilities, but how many great technologies have come out of these? In fact, the only recent exception that comes to mind is SAP’s HANA – but then, in typical SAP fashion, they have out-priced the market – says Brett: “Pilot customers of HANA had reported that the in-memory technology enabled ‘dramatic performance gains’, but once SAP told the customer how much the technology would cost in production, they struggled to make a business case for deployment.”  Compare HANA with Hadoop. Same general purpose – extreme scalability. Closed (and secretive) product vs. open source collaboration. Outrageous price tag vs. free download. Runs on super duper expensive hardware vs. myriad of commodity nodes.

Beyond (lack of) innovation, there are two areas where mega-vendors create special challenges.

The first one is integration. Their technologies “suites” or “platforms” have only one point of integration: the price list.  When you are dealing with integration technologies, as Talend is, this is obviously an important issue. Integrating data/applications/processes with disintegrated technology does not make a lot of sense, you simply end up with mode silos!. The reason?  M&A-driven growth, and a lack of interest to invest into really building a platform. M&A is a respectable strategy and is not in itself a justification for non-integration: look at Talend ESB, the fruit of the acquisition of Sopera, which is truly part of the Talend Unified Platform. Lack of willingness to invest for your clients, this is the problem.

The other challenge is pricing. I am always baffled when I hear that analysts such as Gartner’s or Forrester’s (and attorneys too), are needed to review a vendor proposal.  Not only are these prices outrageous but they are so hard to comprehend that you have no idea what you will be paying down the road. That’s unpredictability at best.

At the end of the day, why do they keep behaving this way?  Because they can.  Real alternatives are few.  But they do exist, and are gaining ground.  Of course, open source will (does) play a role here, and is already pushing some large vendors to change their behavior. Mega-vendors are next.

The best sound bites

Brett’s article is worth reading in its entirety, but I couldn’t resist a bit of copy/paste:

  • Most inquiries to Gartner from CIOs interested in working with IBM focused on contract negotiation. (…)  The number one question is: how do you avoid being managed by IBM?
  • Oracle is not looking for where integration is possible but where it is profitable. (…)  It’s a bundling on paper, not in the architecture.
  • SAP haven’t sorted out the pricing. (…)  Gartner was most often approached by SAP customers to get a better understanding of software licensing.
  • SAP has interesting licensing terms for getting data in and out of the system for use in other applications. (Licensing terms to access YOUR data??)

Yves

[2011-11-16] Talend Forum Announcement: Talend Unified Platform 5.0.0RC3 is Available For Testing

We are pleased to announce that Talend's 5.0.0RC3 Unified Platform release is available, for testing only.  This release candidate contains many bug fixes and new features, and is recommended for experienced users who need an early preview of upcoming 5.0 release.

Download Talend's third Unified Platform release candidate here.

Talend Open Studio (Data Integration part of the Unified Platform):

Fixed bugs in the Studio:
TDI-14509: tELTxxxMap does't work if column name is different from DB column name
TDI-18347: tXMLMap generates wrong xpath
TDI-17509: there are some missing issues in the generated documentation (HTML)
TDI-18161: implict context load on JDBC generate errors
TDI-18289: Can't retrive table schema with specific ODBC driver
TDI-18254: check the property fields does not work
TDI-18224: The old version of the job cannot be opened after import a job which has multi-version
TDI-18419: Demo project error if demo project name is different than TALENDDEMOSJAVA
TDI-18169: tXMLMap: An Element with an Atribute can not be used in an Input Map
TDI-18185: txmlMap: Expression can not be saved
TDI-18186: tXMLMap: Formula in Expression gets lost on save
TDI-18365: tXMLMap - Wrong path in XML to XMl mapping
TDI-18029: tXMLMap: renaming the root element is counterintuitive, provide in-place edition instead
TDI-18285: Can not Deploy a simple Web Service by "Axis WS(.war)" talend export method
TDI-18061: there is no error when change job's name same with routines' name
TDI-18211: Troublesome to create any table link from ELT input component...
TDI-18231: NPE error when try to export job and select "Override parameter's values"
TDI-18273: Show error after delete the copy of routine and sql template firstly
TDI-18418: NPE when opening the TOS demo project
TDI-17905: Opening Dialog: Create new project and cancel -> Nullpointer Exception
TDI-18423: Unable to use stats and logs with sybase
TDI-17986: some schemas were missing in a AS400 connection when multiple libraries were given in the additional parameters

Components Fixed bugs:
TDI-18490: tSalesforceConnection: jar required
TDI-18462: When "normal" is selected in "Error Output" field in tSystem component , there is a compile error.
TDI-18435: message from tMYsqlSP are not displayed
TDI-18432: Generation failed in the studio for one version of the job
TDI-18422: tSSH compile error
TDI-18401: TALEND DI creates wrong script entries for Teradata FASTLOADUTILITY component
TDI-18400: TALEND DI creates wrong script entries for Teradata FASTLOAD and MULTILOAD
TDI-18376: Wrong palette displayed for a route
TDI-18356: Exception in component tSalesforceBulkExec_1
TDI-18326: The generated code of tXMLMap is wrong when there's no namespace for some elements.
TDI-18318: For the tFileInputDelimited, when the input source is file stream and zip file, please igore the parameter of footer and radom number
TDI-18271: The extra column must be null after configuring "aggregate" and "reject" in the outgoing document
TDI-18268: For Stats and Logs, the Talend job just keeps building cursors on the Oracle server until maximum cursors are reached
TDI-18191: Suppose there is not backslash in the filename, the components tFileOutputJSON and tFileOutputLDIF throws NPE
TDI-18138: In the component tPivotToColumnsDelimited, Pivot does not work for "Date" type
TDI-18002: File Processing Components like tFileInputDelimited and tXsdValidator does not work as expected in CronTab enviroment
TDI-17977: if Data Service Job contains tSalesforceInput component - job start failed.
TDI-17971: problem with tBonitaDploy component whenever we run a job
TDI-17968: Problem with tCreateTable using Sybase
TDI-17948: tLDAPOutput is unable to import a BASE64 content correctly
TDI-17865: Invalid Reject Population at tApacheLogInput
TDI-17717: tLDAPInput, can't get binary values except objectGUID
TDI-17519: tWriteXMLFiled component is unable to process the XML document correctly mentioned in TDI-17347 when generation mode is "slow with no memory consumption"
TDI-17491: When choosing to run tRunJob as a separate process, you can not define the run paramenters that are passed to its sons.
TDI-17468: Logic error in the tXMLMap when matching mode is "All Match"
TDI-17388: When the subjob contains the compoment tParallelize, the problem TDI 8619 still exists

Talend Open Profiler (Data Profiling and Quality part of the Unified Platform):

New feature:
TDQ-1837     Add a preference option to limit the number of analyzed columns

Fixed bugs:
TDQ-3689     analysis run failed with db2 z/os connection
TDQ-3877     ColumnSet Comparison Analysis can not run correctly with nvarchar2 column type
TDQ-3866     TOP - Could not open the editor: 6
TDQ-3846     the node "ParserRule" should not show under Exchange for product lower than 5.0.0.
TDQ-3876     remove the updateDependency method
TDQ-3844     no judgement when add an indicator on a column
TDQ-3345     drag a UDI into analysis, seems no db restrain
TDQ-1554     run oracle column analysis with java engine "default value count" will not show.

Talend MDM Community Edition (MDM part of the Unified Platform):

New Features:
[TMDM-2290] - Open /save Process in the repositoryView
[TMDM-2416] - create/open picture resource in the repositoryView
[TMDM-2491] - Share server connection information between Server View and Repository View
[TMDM-2608] - Update must allow deletion on the server-side too
[TMDM-2651] - The update server dialog must indicate what it is going to do
[TMDM-2714] - Auto-deploy to last server on save
[TMDM-2744] - Real classpath isolation in job ran by the "jobox" (zip) container
[TMDM-2777] - logical delete+recyclebin

Fixed bugs:
[TMDM-1326] - error while input non-numeric characters in text box of lines per page
[TMDM-1703] - Items browser executes non optimized code for FK picker
[TMDM-1730] - Bug with "Edit item in grid" option and custom dates
[TMDM-1767] - When editing an entity directly on the search page of browse_items2, beforeSaving is triggered before xpath valdiation rules
[TMDM-1904] - Process through view does not seem to work
[TMDM-2206] - Grouping Hierarchy not sorted
[TMDM-2273] - Import from Exchange feature only fetches extensions compatible with 4.0.1
[TMDM-2369] - The search filter does not work well to select a foreign key.
[TMDM-2472] - tMDMOutput does not perform the update in MDM when the before process is called
[TMDM-2591] - When violate a validation rule, display the defined message directly
[TMDM-2664] - Search result can't be sorted by ascending or descending
[TMDM-2697] - Entity description is lost in the new design. Add the blue info badge and tooltip.
[TMDM-2729] - Can't save record When beforeSaving process return message type="info"
[TMDM-2731] - Failed to save datamodel in Repository after generate browse_items view for an entity
[TMDM-2732] - Status bar in search results, information not visible unless panel is set really wide
[TMDM-2733] - The collapsible state of the left (menus) and right (model/container) sliding panels is no longer persistent
[TMDM-2745] - English errors in message when jobs successfully deployed from MDM Repository to MDM Server
[TMDM-2746] - The toolbar should be switch to the same as when a record is edited
[TMDM-2747] - Add a xpath bring up the MDM server view
[TMDM-2749] - Fk picker: need to sort infos
[TMDM-2754] - Title for main tab of an entity is not displayed correctly
[TMDM-2755] - Duplicate action does not work correctly
[TMDM-2757] - Broken FK links prevents a record to be opened
[TMDM-2761] - When creating items in BrowseRecord, after item is saved, it does not appear in the list view on the left
[TMDM-2763] - The action to generate caller process from workflow is missing
[TMDM-2766] - Cannot create an object in the repository view with a '#' suffix in the names
[TMDM-2767] - Inconsistent state after a resource failed to save
[TMDM-2768] - Min occurs is inconsistent with XSD in the model prop sheet
[TMDM-2770] - Import from server causes an exception
[TMDM-2771] - Wrong model set by generate workflow caller wizard
[TMDM-2779] - Foreign key is wrong saved after edit it in grid
[TMDM-2786] - TMDMInput does not work when filter is EQUALS
[TMDM-2791] - Can't open FK record
[TMDM-2792] - Breadcrumb is not well displayed
[TMDM-2793] - Change view doesn't work on update in journal
[TMDM-2795] - Creating a record after the entity in search has been changed causes Web GUI to mess up.
[TMDM-2797] - Panel containing search toolbar in BrowseRecords should not be resizable
[TMDM-2798] - New L&F limitations
[TMDM-2800] - using MDM-5180, the bookmark for AdvancedSearch does not work
[TMDM-2802] - Use of correct naming 'Browse Data'
[TMDM-2804] - Save&Close button does not close tab
[TMDM-2805] - Bad display of journal search panel
[TMDM-2806] - WebUI does not retain paging sizes
[TMDM-2807] - The icon for success to deploy job to server view is not correct.
[TMDM-2809] - Cannot import a 4.2 export archive in 5.0
[TMDM-2813] - Entity must be displayed for every segment in breadcrumb, not just root segment
[TMDM-2814] - Hotkey not suppport copy/paste components
[TMDM-2816] - Change checkbox label in record editor
[TMDM-2817] - Displayed tab order is inconsistent in item detail view
[TMDM-2821] - Saving a new record requires successful message box to be explicitely closed
[TMDM-2822] - Can't view changes of a record
[TMDM-2823] - Saving changes to a record in Item Detail panel now causes the view to reset to displaying the first record in the list
[TMDM-2827] - FK integrity causes NPE, prevents deleting a record when entity was deleted or renamed in model
[TMDM-2829] - Some info is missing from FK picker dialog box
[TMDM-2830] - Be able to do an export/import of a data container from repository
[TMDM-2836] - Toolbar display issue
[TMDM-2837] - Stray editors in studio cause NullPointerException
[TMDM-2845] - Wrong behavior/workflow of the tab when creating a new record
[TMDM-2849] - Server selection empty when running a process in studio
[TMDM-2851] - A scroll bar appear when browsing records by firefox
[TMDM-2852] - What should be the first tab is not being displayed as the first tab when creating in ItemsDetailPanel in WebUI
[TMDM-2856] - Process field disappear in toolbar
[TMDM-2857] - Duplicate Browse item
[TMDM-2859] - If there are no modification on the item, the tab couldn't be closed
[TMDM-2860] - Duplicate doesn't work on AUTO key fields
[TMDM-2861] - Update Server wrong for new entity.
[TMDM-2868] - Save or save&close does nothing
[TMDM-2873] - Creating record by duplication doesn't refresh left list panel and closes tab when only Save (and not Save and Close) is pressed
[TMDM-2890] - Can't start studio
[TMDM-2892] - EE menus created in CE version
[TMDM-2903] - Creating subtypes using Model Editor in Studio causes XSD error

Talend ESB / Talend ESB Studio (Application Integration part of the Unified Platform):

Fixed bugs
•    [TESB-2871] – SAM: the value of field MESSAGE_CONTENT in events table of sam server database is wrong when log.messageContent=false in agent.properties
•    [TESB-2873] – Runtime: cCXF doesn't work in Runtime when start from JavaBean
•    [TESB-3503] – Runtime/Rest Services: tRESTRequest component needs to replace jsr311-api-1.1.1.jar with a ServiceMix bundle
•    [TESB-3628] - Talend Runtime scripts need to be executable on Linux
•    [TESB-3645] – Runtime: Make path to ZooKeeper dataDir relative for ZooKeeper in Talend Runtime
•    [TESB-3651] – SL: Incorrect response for lookup request through Service Locator SOAP Service
•    [TESB-3688] – Route:Builder: Content Based Router (cMessageRouter) Is not working in Studio
•    [TESB-3710] – Runtime/Routes: after uninstall route in runtime - endpoint still 'active' in TAC Service Locator
•    [TESB-3712] – Studio: tXMLMap component located only under "Processing" group in Studio palette and not under "XML" group as it can be expected
•    [TESB-3720] - Content is empty when importing remote WSDL schema through tESBConsumer
•    [TESB-3741] – Runtime/Data Services: can't call data service with selected both SAML and UsernamePassword security options
•    [TESB-3746] – SL: WSDL and WADL Files for the Locator Services are missing in the delivery
•    [TESB-3757] – Runtime: 'com.ctc.wstx.exc.WstxEOFException: Unexpected EOF in prolog' in case consumer with up security calls provider without security
•    [TESB-3765] – Runtime: PermGen space error when deploying STS+rentacar client in the same Karaf container when using 64-bit Java
•    [TESB-3768] - STS: can't create a RST with default users of the container user.properties file
•    [TESB-3769] - STS: Config - SAML Policy contains an absolute path
•    [TESB-3804] – Route Builder: Code generation in cDelayer is broken Change request
•    [TESB-2672] – ServiceLocator: Add more meaning log message for "NodeExists"
•    [TESB-2841] – Studio: Improve tRESTRequest/Response components
•    [TESB-3675] – Data Service: Improve usability of the 'New Service' wizard
•    [TESB-3745] – SL: Change Service Locator SOAP|REST Proxy to Locator SOAP|REST Service
•    [TESB-3761] – Runtime: Configuration enhancements in Karaf
•    [TESB-3858] – Runtime: Data Service support for multiple policies

New Features
•    [TESB-3705] – ESB Studio move to released camel (2.8.2) and cxf (2.5.0)
•    [TESB-3760] – Route Builder: Add new cRecipientList component into Router

Note: all product versions
Version Upgrade: With the RC3 we are now in all ESB products on the release version of Apache CXF 2.5.0, Apache Camel 2.8.2, Apache Karaf 2.2.4 and Apache ActiveMQ 5.5.0.
Please also note that the Route Builder cActiveMQ component has been replaced by the already existing cJMS component in the Route Builder please update your route models accordingly.
Please see also related new features, changes and bug fixes in the Components (related to the tESB…, tREST… and tXMLMap components) in the Talend Open Studio for DI notes within this forum post.
With the ‘TOS for ESB’ download we now also ship the entire Talend ESB SE also with the TOS for ESB download (sub-direcory Runtime_ESBSE) but keep in mind that ‘TOS for ESB’ is GPL v2 licensed.
The Talend ESB SE download is by this only required for people to likes to get the Runtime and the JAVA development tooling (non Studio based tooling) under a plain Apache License.

For more information on fixed bugs and new features, go to the TalendForge Bugtracker.

Thanks for being a part of our community,
The Talend Team

[2011-11-12] Talend Blog: Talend Community Coders

This week at ApacheCon in Vancouver, BC, we announced the Talend Community Coders program. This program showcases members of our engineering team who contribute to open source projects.

IMG 8870

If you’re a regular reader of this blog, the names of our Talend Community Coders will already be familiar to you. Dan Kulp, Hadrian Zbarcea, Glen Mazza, Jean-Baptiste Onofre, Sergey Beryozkin, Colm O hEigeartaigh, Christian Schneider, and Olivier Lamy have recently been featured as Highlighted Community Members on TalendForge.

Our decision to announce this program here at ApacheCon was not a coincidence. Most of our Coders’ open source work is in Apache Software Foundation projects like Apache CXF, Apache Camel, and Apache Karaf. There’s another reason, though: every day, The Apache Software Foundation shows us that companies and communities can work together towards a common goal, and nowhere is that more apparent than here at ApacheCon.

Some of this week’s highlights: Dan Kulp covered about 12 hours worth of web service security technology in just under 40 minutes; Hadrian Zbarcea showed how to build a modern social web application using enterprise SOA technology; and Jean-Baptiste Onofre gave a demonstration of how Apache Ace makes deployment easier, quicker, and more manageable.

Take a look at our new Talend Community Coders site to keep up with our open source community work. I gathered some interview footage of our Coders this week and you will see it there soon!

Ross

[2011-11-10] Talend Blog: Supporting French Software Vendors Expansion in the US

Last week we were proud to welcome the AFDEL team in our Los Altos office. AFDEL is the French Association of Software Vendors, and for two years now Talend has been chairing its open source subcommittee.

Among its many missions, AFDEL is committed to supporting the international development of its members. This is what they have decided to open a Silicon Valley branch office, and Talend is proud to provide logistical support to this move. The US branch of AFDEL is hosted in Talend’s Los Altos office and the official inauguration took place last week.

afdel.pngAmong the participants, we were happy to welcome Patrick Bertrand, the president of AFDEL and a very successful French entrepreneur, and Loïc Riviere, the AFDEL’s executive director.  A number of members had made the trip and were joined by local  entrepreneurs.

Talend is proud to have been selected to host the US branch of AFDEL, and of course we are committed to supporting and provide advice to any member who is interested.

Bertrand

[2011-11-09] Talend Blog: Live from Hadoop World in New York City

hadoop_world.pngThis week is the third edition of Hadoop World, and the event has been growing quite a bit from its first edition. From a couple hundred enthusiasts – to a 1400 strong crowd, that only got limited by the fire code restrictions of the meeting rooms. We don’t see a lot of tech conferences that are actually marked as “Sold Out” and refusing walk-in registrations!

I have been sitting in the keynotes and in several breakout sessions, and also got the opportunity to chat with speakers and attendees. The overall impression is overwhelmingly positive: energy is great, content is valuable, Hadoop is the technology.

The last time I saw this kind of energy was in the Web technology conferences of the late 90’s. People then were going crazy over HTML, J2EE, Web application servers… Of course, one could argue that this energy was more hype than anything and led us straight into a bubble – true, but after the bubble had burst, the market consolidated, these technologies were deeply entrenched.  How much of Big Data is a hype that will burst remains to be determined, but one thing is certain: it will have a long lasting impact on IT and the way the business utilizes data.

The twittosphere is abuzz with sound bites from the sessions so I won’t be getting here into the amazing stats we are hearing all day long – just follow the #hw2011 hashtag to get your “Big Data fix”. There are a couple other points I’d like to bring up.

  • One third of attendees are non-technical folks. That’s a lot, for a tech conference on Hadoop. And a proof that a lot of Big Data initiatives are business-led. Since IT is badly lacking resources to build all these complicated Map Reduce jobs (cf. the “we are all hiring” stuff below), they need tools to help them. This is why the ecosystem around Hadoop is growing so quickly, and one key capability is the ability to generate optimized Big Data Integration and Big Data Quality jobs, as Talend is delivering with its Big Data solution.
  • “We are hiring” is really extreme.  When was the last time you heard a keynote speaker starting his presentation by saying “we are hiring, and we are paying more than the previous speaker”?  It happened in ’99, when everyone who understood 3GL development was crash trained as a Java developer – and everyone else as a Web QA engineer.
  • We are witnessing the power – and openness – of open source at work here. Hadoop World is organized by Cloudera, arguably the first to take Hadoop to the enterprise. But Cloudera’s competitors are here, and involved in the program. Including non-Hadoop-based Big Data solutions. I agree with Cloudera that this is the right way to take Hadoop to the next level, but it’s still a gutsy move on their side. Now, you don’t get into the commercial open source business if you are not willing to make a few bets along the way.

Another day is starting in New York, with great content on the agenda. Can’t wait!

Yves

[2011-11-03] Talend Forum Announcement: Talend Unified Platform 5.0.0RC2 is Available For Testing

We are pleased to announce that Talend's 5.0.0RC2 Unified Platform release is available, for testing only.  This release candidate contains many bug fixes and new features, and is recommended for experienced users who need an early preview of upcoming 5.0 release.

Download Talend's second Unified Platform release candidate here.

Talend Open Studio (Data Integration part of the Unified Platform):

Studio :
Bugs:
•    TDI-17796: Entity is empty if DND schema to a new mdm input component
•    TDI-17346: Exported Job as Unix script can't work when you export as Unix script only
•    TDI-18129: Changing the Name of a Job belonging to a Service operation breaks the link
•    TDI-18275: Go to File --> Edit project properties,NPE error
•    TDI-18027: Import from XML Schema does not allow me to chose what element to import
•    TDI-18150: Linkage between TalendForge login and usage data capture
•    TDI-17773: Could not export job as war when export job first time
•    TDI-17824: There is nothing in recycle bin after delete SQL Templates
•    TDI-18140: Three issues related to studio login
•    TDI-18084: An error icon always show on txmlinput component
•    TDI-17764: "Show traditional tabs" option resets after restart Open Studio
•    TDI-17868: (Exchange) Need to fix non-working webservices

Features:
•    TDI-17358: new flow created via "*New Output* (Main)" does not appears in SCHEMAS parameter table

Components :
Bugs :
•    TDI-18262: tEdifactToXml : the component does not unlock the file
•    TDI-18255: tEdifactToXml does not accept Iterate link as Input
•    TDI-18235: TFileOutputDelimited not closed when a exception occurs
•    TDI-18232: tWebServiceInput problems after upgrading from 4.0.2 to 4.2.3.[...] in user defined routines "Error"
•    TDI-18192: tjasperOutputexec report strange casting error
•    TDI-18164: tWriteJSonField missing element
•    TDI-18155: tHash input/output failure with multiple instances in child jobs. 4.2.3
•    TDI-18136: tLDAPInput -> errorMessage on "on componentError"
•    TDI-18095: When the outgoing result contains both document and flat element, the reject cotent always be empty
•    TDI-18079: For tXMLMap, unable to filter reject content when the namespace is contained in the filter condition
•    TDI-18075: Job fails to import from 4.0.3 to 4.2.3
•    TDI-18065: Unable to output inner-reject content when enable lookup flow in the job and matching mode is all match
•    TDI-18057: tSchemaComplianceCheck length check problem.
•    TDI-18046: tSAlesforceGetDeleted - rows/sec extremely slow
•    TDI-18044: Add Conversion between Document & String in tConvertType
•    TDI-18034: When "set aggregate column" in the tXMLMap, null always be returned in the generated document no matter if "Create Empty Element" is checked or not
•    TDI-18033: Suppose an element is not existed in the both main flow and lookup flow, the element will not be generated even "create empty element" is checked
•    TDI-18032: Unable to process element "attribute" correctly
•    TDI-18031: Problems related the tXMLMap after implementing "Create empty element"
•    TDI-18020: The method talendTrim in the system routine TalendString had been changed and it do not support some char anymore.
•    TDI-17993: Getting java.lang.IllegalArgumentException when I try to read date column using tJDBCInput on SQLSERVER
•    TDI-17950: Optimize The code of method tFileInputDelimited_1Process(Map<String,O[...] to push back 65535 bytes limit
•    TDI-17811: When there is an extra column in the tXMLMap, the outgoing value of the extrac column always be empty
•    TDI-17810: When there is no document type in the both "main" and "lookup" parts, and match mode is "all match", wrong result will be generated
•    TDI-17809: Problems related with tXMLMap after implementing the Group by options
•    TDI-17678: tHttpRequest does not fetch full response
•    TDI-17648: tParAccelOutputBulkExec should use toPlainString() for BigDecimal
•    TDI-17595: Problem with multiply Rejects and tFileOutputDelimited Option "Create no empty file"
•    TDI-17284: [tFileOutputMSXML] File Name requires backslash
•    TDI-16992: [tPivotToColumnsDelimited] Pivot doesn't work for int (or other types than String?)
•    TDI-8542: Sometimes the generated order of siblings of the loop element is wrong
•    TDI-8508: tFileOutputDelimited produces empty file upon error in input component
•    TDI-8017: tFileInputDelimited fails if uncompressing, and footers are specified.

Talend Open Profiler (Data Profiling and Quality part of the Unified Platform):

Bugs:
•    TDQ-3795     run column set analysis with flat file, got a NPE
•    TDQ-3459     move dbconnection/analysis to a folder may lead to the file which depend on it lose items
•    TDQ-3812     change the dbconnection name then the analysis will be unavailable
•    TDQ-3797     edit file connection, seems noting happened
•    TDQ-3776     Cannot find the "getting started" cheat sheet
•    TDQ-3759     Date Frequency table indicators do not work as expected on a file delimited
•    TDQ-3758     Export of schema from FileDelimited connection wizard results in blank xml file
•    TDQ-3745     overview analysis can not show the view number correctly
•    TDQ-3621     can't run a column analysis with db2/ZOS connection
•    TDQ-3579     missing menu "generate pattern" on date pattern indicator
•    TDQ-3710     The TDQ feature ParserRule is visible in the Talend exchange node of DQ repository of TOP
•    TDQ-2269     Empty strings in cheat sheets display
•    TDQ-2066     error msg when regEx not correct

Talend MDM Community Edition (MDM part of the Unified Platform):

1) New Features Progress

Studio

[TMDM-2610] - Restore the "browse" action on views, rename as "Test"
[TMDM-2633] - MDM Component Rework (tMDMTriggerInput / tMDMTriggerOutput)
[TMDM-2653] - The confirmation dialog upon deploying objects should reflect what was actually done
[TMDM-2707] - Deprecate MDM Server view
[TMDM-2718] - Server selection basics

Web UI

[TMDM-2489] - Item browser gxt refactor -- item FK relations
[TMDM-2496] - To add default value rule for tree panel in new item browser refractor
[TMDM-2612] - BrowseRecords---ForeignKey component display in ForeignKey Tab
[TMDM-2627] - Polishing the current new web server -- Actions selector and truncated words for all webapp
[TMDM-2628] - Polishing the current new web server -- Import and Export
[TMDM-2655] - Composite View navigation
[TMDM-2679] - FT search in web ui, displaying the entity name in the 2nd drop-down is misleading
[TMDM-2728] - Open the detailed panel in the outermost tabpanel
[TMDM-2734] - Increase the default # of lines per page in the search result
[TMDM-2662] - CSS Tuning on the new item browser to make it comply with the expected Look&Feel (attachment).
[TMDM-2678] - Look and feel rework

2) Bug/QA Fixes (All Editions)

[TMDM-1733] - Classloading issue with Apache POI
[TMDM-1765] - When an entity is saved, auto incrementing should be disabled
[TMDM-2364] - Add a right_click menu for the opened tab(s)
[TMDM-2412] - Unable to launch a zip deployed job with tWebService
[TMDM-2435] - Moving items in the derived hierarchy leaves FKs with an incorrect format, causes exceptions when browsing
[TMDM-2481] - move code from tom to tem
[TMDM-2517] - Cannot make the packaged MDM groovy scripts work
[TMDM-2572] - Date type has default value
[TMDM-2593] - Expand the elment automatically when select the checkbox named 'Auto Expand' in property tab
[TMDM-2644] - Dynamic labels dosen't work on the web
[TMDM-2645] - Incorrect perspectives initial setup
[TMDM-2646] - Can' open a record from journal or workflow tasks
[TMDM-2647] - Set Role with no access to element doesn't work
[TMDM-2650] - Repository view does not update the visual cues upon deploy / update
[TMDM-2656] - Deploy to and update server must prompt to save first
[TMDM-2657] - When there is no layout defined the web ui must assume a default layout
[TMDM-2658] - Gradient header is all white, empty when creating a new record, it looks weird
[TMDM-2660] - Every time you open a custom form it opens a new editor instance
[TMDM-2663] - Can't open advanced search
[TMDM-2666] - The data container editor lets me save records that are not compliant with the model
[TMDM-2668] - Some minor issues on r70208-V5.0.0NB
[TMDM-2673] - In-place row edit in the search result resets all the other elements
[TMDM-2680] - Save & Close closes when it should not, it should only appear when you can actually close
[TMDM-2682] - WebUI issue by IE
[TMDM-2683] - Hierarchy explorer delete issue
[TMDM-2685] - Clone can not work when you configure the two column layout for entity.
[TMDM-2686] - Can't open foreign key picker when using recursive foreign key
[TMDM-2688] - Second custom form for the same entity overwrites the previous one
[TMDM-2694] - Add call to refresh() on Qizx library before using it
[TMDM-2698] - When the session times out the authentication error screens says "null"
[TMDM-2700] - Foreign key selector in Studio does not show up
[TMDM-2701] - Menus in the role editor are missing, can't assign a menu to a role/user
[TMDM-2702] - As administrator I can't see whom a task is assigned to
[TMDM-2703] - Create new view causes an RCP exception, but even though it looks like it fails, the resource was in fact created
[TMDM-2705] - Error in log during Qizx server start
[TMDM-2706] - Foreignkey validate issue
[TMDM-2708] - Can't delete entity from model
[TMDM-2709] - Removing from repository does not ask for confirmation
[TMDM-2710] - Custom Layout : No way to view/edit the associated role nor the entity it refers to
[TMDM-2713] - Generate browse_items view wizard should add the individual views in the role
[TMDM-2717] - Import actions bring up the MDM server view
[TMDM-2723] - Throw Exception When Duplicate or Copy action
[TMDM-2736] - Can't edit a record
[TMDM-2738] - Error while importing objects in server
[TMDM-2742] - Searching issue and date picker issue of Advanced Search
[TMDM-2743] - Could not save or delete (randomly) on browserecords
[TMDM-2752] - Calljob process plug-in tries to call a WS when the protocol is "ltj:", should use jobox instead
[TMDM-2753] - Boolean fields aren't editable
[TMDM-2758] - Could not import after deletion
[TMDM-2780] - Information of current record should be gone after deletion of all the records

Talend ESB / Talend ESB Studio (Application Integration part of the Unified Platform):

New Features
•    [TESB-3352] – Routes: ESB Features support for cCXF components for SL & SAM
•    [TESB-3486] – Rent-A-Car sample runs together with new Talend STS
•    [TESB-3531] – Data Service: Custom Locator properties for Data Services Consumer
•    [TESB-3590] – Data Service: Simplify selection of the services from the workspace in tESBConsumer component
•    [TESB-3690] – Data Service: Improved security configuration in tESBConsumer
•    [TESB-3583] - Data Service: Improve service metadata editing dialog
Change request
•    [TESB-3527] – Locator Service: Enhancements / Interface updates and additional improvements
•    [TESB-3546] – Routes: Camel components required by Route builder components preinstalled in Karaf container
•    [TESB-3621] – Data Services: studio web service export: change way for generate runtime configuration to template mechanism
•    [TESB-3637] – Data Services: Change the metadata editing action title
•    [TESB-3655] – Data Services: Remove namespace schema folder when import WSDL schema
Bugs
•    [TESB-3213] – ESB Studio: Duplicate Service does not work           
•    [TESB-3535] – Data Service: Service In case port name in wsdl is not the same as port type name the endpoint can not be opened
•    [TESB-3553] – Runtime: Stopping OSGI job containing tRESTRequest produces unexpected RuntimeException
•    [TESB-3584] – tESBConsumer Username token authentication does not work from Studio/Standalone
•    [TESB-3596] – Data Services: Simplify relative URL support for services.
•    [TESB-3653] – Data Services: Numbers are not supported in the namespace
Version Upgrade: With the RC2 we are now in the Talend ESB Standard Edition (Runtime) on Apache CXF 2.5.0 and Apache Camel 2.8.2
(and Apache Karaf 2.2.4 and Apache ActiveMQ 5.5.0 as already in RC1 )

Please see also related new features, changes and bug fixes in the Components (related to the tESB…, tREST… and tXMLMap components) in the Talend Open Studio for DI notes within this forum post.

For more information on fixed bugs and new features, go to the TalendForge Bugtracker.

Thanks for being a part of our community,
The Talend Team

[2011-11-03] Talend Blog: Another Magic Quadrant, Another Move for Talend

Last week, for the third year in a row, Gartner rated Talend a Visionary in their Magic Quadrant for Data Integration Tools. You can get a free copy here, courtesy of Talend (registration required).

Time for a little reflection on these past 3 quadrants, and the one before…

  • 2008: Disappointment. Readers of this blog and observers of the analyst relations world will remember my lashing at Gartner for ignoring not acknowledging the disruptive force of open source. Lots of interesting reactions (Wait, a vendor criticizing the Gartner process? — Whoa, these guys need to be taught proper AR. — Boy, are they toast forever!)
    A disappointment it was. We were set for changing forever the data integration landscape and Gartner did not see that. In retrospect, they probably saw it, but needed proof points. And truth be told, in 2008, we did not have many proof points.
  • 2009: We Made It! Not only did we make it to the MQ, but we made it as a visionary. The long and patient work of gathering the proper references, explaining our strategy to the analysts, and more importantly bringing them the proof points they needed, was finally paying off.  Talend was the first open source vendor to enter the MQ (still is), the only new entrant, and came in a very strong position.
  • 2010: Consolidating. The 2010 edition of the MQ came out right at the time where we were finalizing our acquisition of Sopera – too late to influence our position. Still, 2010 marked a significant jump in Ability to Execute, a further proof of Talend’s explosive growth.
  • 2011: Flirting with the Leaders… The 2011 vintage of the MQ is an excellent one. Or maybe I should say MQs – plural. Talend is now also a Visionary in the Data Quality Magic Quadrant, and a Cool Vendor. And with another jump in Ability to Execute, Talend is now flirting with the leaders…

Get your copy of the MQ to view the positions, and to read the verbatim on Talend and on the other players. Lots of interesting stuff in there, as usual.

Yves

[2011-10-26] Talend Document: Embedding Talend Technology to Deliver Superior Solutions and Boost Business

The Talend Unified Platform technology is the first and only unified platform that leverages data integration, data quality, master data management, and application integration in a single development and runtime environment, promoting reusability and consistency of the approach.

Easily embed and bundle one or several technology components:

  • Big Data - Hadoop and MapReduce architecture
  • Data Integration - Award-winning integration solution
  • Data Quality - First fully embeddable data quality solution
  • Master Data Management - based on the flexible and multi-domain Active Data Model
  • Enterprise Service Bus — a lightweight, standards-based web services stack, that includes messaging, mediation, and routing

More than 25 companies are partnering with Talend — including JasperSoft, WDCi Group and Bi3 Solutions. Learn how they and others are using Talend technology to augment their solutions and solve real-world business problems.

[2011-10-25] Talend Blog: Join us at ApacheCon!

Next month, Talend’s engineers are hitting the road. Their destination: ApacheCon North America 2011 in beautiful Vancouver, BC.

Since its inception in 1998, ApacheCon has been the premier gathering for developers building open source software The Apache Way. Talend is proud to support the Apache community, and has been given the honor of contributing content to this year’s conference.

Apachecon na 2011

On Thursday, November 10 at 2:30pm, Dan Kulp and Hadrian Zbarcea will share a practical approach to developing distributed applications in their talk: An Interactive Example of Enterprise SOA, Apache Style.

Later that day, Dan Kulp will show how Apache projects such as CXF, WSS4J, Camel, and Karaf can work together to provide a complete security solution for services in Security Problems (and Solutions) for Service Oriented Applications.

On Friday, November 11 at 2:00pm, Jean-Baptiste Onofre will share how to deploy reusable components in a large network environments during his Deployment With Apache Karaf and ACE talk.

Finally, I, Ross Turk, will talk about how companies and communities interact in the Business of Open Source panel discussion at 1:30pm on Wednesday, November 9.

We hope to see you there!

Ross

[2011-10-21] Talend Forum Announcement: Talend Unified Platform 5.0.0RC1 is Available For Testing

We are pleased to announce that Talend's 5.0.0RC1 Unified Platform release is available, for testing only.  This release candidate contains many bug fixes and new features, and is recommended for experienced users who need an early preview of upcoming 5.0 release.

Download Talend's first Unified Platform release candidate here.

Talend Open Studio (Data Integration part of the Unified Platform):

•    Studio :
o    Features :
TDI-17374:Exchange view: create a new plugin for exchange system.(almost finished )
TDI-17999:Import from repository should also import the MDM metadata
o    Bugs:
TDI-18007:the field type:"COMPONENT_LIST" in component.ecore doesn't work in virtual component
TDI-18027:Import from XML Schema does not allow me to chose what element to import
TDI-17853:Studio/Metadata/XML File: incomplete schema in Etape 4/5, when created from xsd file and XML is output
TDI-17770: Need to fix problem of refresh of repository view when use Talend product as plugin.
TDI-17778: Fix a problem for components who call javajet from another plugin. (adds new functionality for component's developpers)
TDI-17985: NPE when start studio. (Related to exchange)

•    Components :
o    Features:
TDI-17994: Add the option transfert mode in component "tFTPFileProperties component"
TDI-17944: Create the component tMDMTriggerOutput
TDI-17943: Create the component tMDMTriggerInput
TDI-17941: XMLMap: Add an option to write all the input data in only one document
TDI-17908: Manage the tXMLMap rejects like in the tMap
TDI-17890: tXMLMap: Add the possibility to create (OR NOT) an empty element
TDI-17463: Create the component tWriteJSONField
o    Bugs:
TDQ-3775: Cannot start the studio
TDI-17977: if Data Service Job contains tSalesforceInput component - job start failed.
TDI-18041: Unable to output reject content when enable lookup flow in the job
TDI-18022: Unable to import pervious job correctly after adding "All in One" option
TDI-18019: When there is a flat element in the tXMLMap mapping and "All In One" is true, NPE is thrown
TDI-17909: If an output has no document element, the output is wrong
TDI-17895: tXMLMap: GUI issue for the output tables reject options
TDI-17892: tMysqlLastInsertId causes fatal exception when a AUTO_NUMBER is BIGINT
TDI-17860: Utf8 encoding problem for the input fields from tFileInputJson
TDI-17859: Subjob's tHashInput component tries to read data from principal Job's tHashOutput_1 component
TDI-17756: getting encoding not supported exception
TDI-17753: MSELTOutput using invalid Connection String
TDI-17738: ignore DTD + Document type doesn't work in the tFileInputXML.
TDI-17707: Base 64 content read from LDIF with tFileInputLDIF is corrupted.
TDI-17661: tSQLtemplate component can not choose Netezza as db
TDI-17453: When there are two tXMLMap components in the job, and the second matching mode is "all match", the result of "all match" always be empty
TDI-17447: For tXMLMap component, when "all match" is selected, mutliple XML document will be generated.
TDI-17442: tAddCRCRow implementation with BigDecimal fails due to differing scales
TDI-17400: Option "Use cursor" in tNetezzaInput not working
TDI-7622 : tWriteXMLField: accumulates all the XML records in memory before sending them all downstream; causes Java heap space exception
TDI-7586 : tAddCRCRow is generating same CRC checksum for rows with different field values

Talend Open Profiler (Data Profiling and Quality part of the Unified Platform):

Bugs:
•    [TDQ-1324] - create jdbc mysql can not successfully by top
•    [TDQ-3307] - Open Profiler runs Oracle analyse compute statistics on every table when anakysing metadata
•    [TDQ-3425] - should Remove the "Text length" option panel in the minimal length indicator
•    [TDQ-3435] - Unable to view table data in oracle RDB schema
•    [TDQ-3510] - the analysis editor always dirty when open it
•    [TDQ-3554] - Incorrect results from overview analysis
•    [TDQ-3631] - No column available in indicator selection dialog

Talend MDM Community Edition (MDM part of the Unified Platform):

New features:
•    [TMDM-1711] - Fr translations handling
•    [TMDM-1826] - Item browser gxt refactor -- rules
•    [TMDM-1843] - Item browser gxt refactor -- Inheritance & polymorphism
•    [TMDM-1865] - Filter by object category in the repository view
•    [TMDM-1868] - Item browser gxt refactor -- form
•    [TMDM-2390] - Item browser gxt refactor -- Clone
•    [TMDM-2391] - Item browser gxt refactor -- Composite view
•    [TMDM-2392] - Item browser gxt refactor -- TreeDetail refactor and FK list display
•    [TMDM-2395] - Item browser gxt refactor -- Entity Banner
•    [TMDM-2397] - Item browser gxt refactor -- Custom UI Form Designer
•    [TMDM-2434] - Item browser gxt refactor -- Banner
•    [TMDM-2502] - Revisit 'Deploy All' action
•    [TMDM-2530] - Import/Export custom form objects from MDM Repository View to the server
•    [TMDM-2555] - implement context menu for data model entity/element
•    [TMDM-2564] - add remove function for fk node list
•    [TMDM-2566] - Repository studio Integration: Item property view
•    [TMDM-2567] - MDM Repository cosmetic changes
•    [TMDM-2607] - Primary Key info presentation in the header panel
•    [TMDM-2609] - Browse container on the data-container repository object, like in the MDM server view
•    [TMDM-2613] - BrowseRecords---Display ForeignKey Tab separately.link and create ForeignKey
•    [TMDM-2696] - Omit the root element when displaying the default form layout


Talend ESB / Talend ESB Studio (Application Integration part of the Unified Platform):

New Features:
•             [TESB-2366] – STS: Support KerberosToken in WS-Security header of RST
•             [TESB-2367] – STS: Issued SAML Token must be extensible for custom AttributeStatements
•             [TESB-2368] – STS: Custom Claims providers
•             [TESB-2369] – STS: TokenProvider and TokenValidator must be able to give more context of the error condition
•             [TESB-2370] – STS: Username/password authenticator
•             [TESB-2371] – STS: An STS JVM must support several authentication systems (IDM)
•             [TESB-2911] – STS: Token caching in CXF intermediary service
•             [TESB-2971] – Authentication for Data Services
•             [TESB-3056] - Enable security for Data Services
•             [TESB-3081] - Samples in the software package instead of the documentation package
•             [TESB-3104] - Plain ActiveMQ Distribution in Talend ESB Runtime package
•             [TESB-3122] – STS: Signature and encryption username and passphrase configurable
•             [TESB-3156] - Automated start / stop of Routes and Data Services
•             [TESB-3279] - Karaf Cellar in TESB (Technical Prev.)
•             [TESB-3351] - tESBProviderRequest Component should use new 'Service' item
•             [TESB-3438] - Rename ESB runtime to reflect broadened scope
•             [TESB-3525] - Add context names to the OSGi feature configuration
•             [TESB-3530] - ESB CXF Features as OSGi Services

Change request:
•             [TESB-2133] - Include Karaf-Webconsole as preinstalled feature
•             [TESB-3328] - Update tHttp request and response types to Document
•             [TESB-3413] - Talend Runtime: Remove explicit start / stop commands & update list commands
•             [TESB-3545] - Configurable CXF work queues in Karaf
Bug fixes:
•             [TESB-1991] - Confusing error message when trying to call wrong service method
•             [TESB-2080] - TSF STS example generates 404 with STS service call
•             [TESB-2385] - Karaf Console history - job:start and features:listurl - doesn't appear in console history
•             [TESB-2491] - STS: RequestContext Principal is null for 2-way TLS
•             [TESB-2492] - Problem validating signed SAML Assertions
•             [TESB-3007] - SAM: Message IDs for same message are different on sender and receiver side
•             [TESB-3098] - Routes and Data Service bundles have different restart behavior that other OSGi Features
•             [TESB-3238] - tESBConsumer: Error get service description from running provider in tESB Container
•             [TESB-3278] - WSDL Improvement of the SOAP based Service Locator proxy.
•             [TESB-3316] - Karaf execution environment does not include JavaSE-6
•             [TESB-3340] - Exporting and Importing a Route as Spring XML: can't import the exported route
•             [TESB-3383] - Can't start some bundles due to monitoring packages not found from the TalendRuntime under operating system Ubuntu 10
•             [TESB-3416] - export service: broken kar is created
•             [TESB-3448] - Remove custom SAM parameters from tESBResponse and tESBFault
•             [TESB-3529] - Running A Route with cFile in ESBContainer fails because routines.system.api.TalendESBJob
•             [TESB-3532] - Code Generation for service operation jobs /exporting Service fails
•             [TESB-3533] - tXMLMap: Wrong Error that Loop element is missing although it is there per default now

Known Issues:
•             In the Data Service Builder you can only use WSDL’s with RC1 where the porttype name attribute and the port name attribute in the WSDL have the same ‘name’ please make sure that your WSDL fulfills this (temporary) requirement [TESB-3535]

Please see also related new features, changes and bug fixes in the Components (related to the tESB…, tREST… and tXMLMap components) in the Talend Open Studio notes within this forum post.

For more information on fixed bugs and new features, go to the TalendForge Bugtracker.

Thanks for being a part of our community,
The Talend Team

[2011-10-20] Talend Blog: Talend’s OEM Program: Powered by Talend

We announced today an expanded OEM Partner Program, designed to help software vendors and SaaS providers embed Talend’s enterprise-grade open source integration technologies in their offerings.

With this new “Powered by Talend” program, OEM partners will be able to leverage all components of the Talend Unified Platform – not just data integration and data quality. This means, especially, that Big Data Management, Master Data Management (MDM) and ESB are added to the OEMizable portfolio.

What does OEM mean?
OEM stands for Original Equipment Manufacturer. It essentially means the manufacturer is embedding a third party product in their own product. For example, the tires in your BMW are not made by BMW but by Michelin. The hard disk in your laptop is not made by Dell or Apple but by Western Digital, and the LCD screen comes from some obscure LCD manufacturer in China. The whole idea is for each vendor to focus on their core competency and get “the rest” from best of breed vendors.

The same story is true in the software world. If you are a BI vendor, why try to build your own ETL when you can get the best one as an OEM? If you are an accounting software vendor that needs to sync data with Salesforce, why build this yourself?

Don’t try this at home, you might get hurt…
In other words: leave it to the pros. Talend knows data integration, data quality, Big Data, MDM and ESB better than anyone else. Frankly, the cost and hassle of redeveloping these technologies for a software vendor or a SaaS provider are not worth it, especially with solutions like Talend’s in the mix!

Want more use cases and happy partner references? Read the press release. Interested in learning more? The Powered By Talend – OEM & SaaS Partner Program page on our site is a good place to start, or you can email oem@talend.com.

Yves

[2011-10-20] Talend Forum Announcement: UPDATED: Maintenance on Talend Bug Tracker and Support Ticket System

During last Saturday's scheduled outage of these systems, we did not complete all of our planned maintenance tasks within the time allotted.  Unfortunately, additional maintenance is necessary to ensure their ongoing operational stability.

This Saturday evening, October 29, at 4:00 PM PDT / 1:00 AM CEST, our operations team will continue routine maintenance on the systems that power our Bug Tracker (http://jira.talendforge.org) and Support Ticketing System (http://support.talend.com).  Again, there will be a small amount of downtime during this service window, but we anticipate services to be fully accessible within 24 hours.


Original message:

This Saturday evening, October 22, at 4:00 PM PDT / 1:00 AM CEST, our operations team will be performing routine maintenance on the systems that power the Talend Bug Tracker (http://jira.talendforge.org) and the Support Ticketing System (http://support.talend.com).  There will be a small amount of downtime during this service window, but we anticipate services to be fully accessible within 24 hours.

Questions or concerns regarding this maintenance window may be directed to the comment section on this thread.

Sincerely,
The Talend Team

[2011-10-11] Talend Document: Cloud Integration: 4 Key Recommendations

As business solutions evolve to embrace cloud computing, it is only natural that integration solutions must change as well. This white paper presents the key characteristics of cloud computing, the integration challenges this deployment model introduces, four recommendations for what to look for in a solution and five key questions you should ask when considering cloud integration solutions.

[2011-10-05] Talend Blog: Happy Birthday to Talend Open Studio!

Exactly five years ago, on October 5, 2006, a new product was born in the R&D labs of Talend.  Talend Open Studio was born under a good star that would soon see it rise to the pinnacle of the most widely used data integration technology in the world.

Fast track to today. With over 15 million total downloads in the world, Talend Open Studio is now being downloaded twice per minute. Over 750,000 users are leveraging this groundbreaking technology for their data integration tasks. And, very importantly also, the business that Talend has built on top of this cornerstone technology sustains 400 employees in 8 countries and counts today over 2,500 paying enterprise customers.

Along the way, Talend Open Studio has won multiple awards and recognitions (for more, please visit our Wall of Fame):

  • Visionary in Gartner’s Magic Quadrant
  • Three BOSSIE awards, three years in a row
  • Intelligent Enterprise’s Editors’ Choice ward
  • Infoworld’s Technology of the Year
  • BeyeNETWORK’s Vision Impact Award
  • SourceForge’s Community Choice Awards
  • Computerworld’s Horizon Award

talend-cloud-studio1.pngToday, Talend Open Studio is still the cornerstone of our technology.  Multiple products have been built either on top of it or to complement it – but the first thing users see when they launch any of these products is our user friendly, Eclipse-based studio.

On this birthday, I would like to thank especially the entire Talend engineering team who are working hard to continue to build our technology, our community who is really helping us keep up with the requirements of our users, and of course the rest of the Talend teams (sales, marketing, consulting, admin, etc.) who are making our business a success and hence allow us to continue to fund the development of the products.

Fabrice

[2011-10-05] Talend Blog: Red Hat Brings more Innovation to Mainstream

Many were on the lookout for a database acquisition from Red Hat, most likely a NoSQL one.  In an interview with InternetNews.com last August, CEO Jim Whitehurst had made it clear they were not interested in a traditional, “me too” RDBMS technology.

Of course, Red Hat could have launched their own Hadoop distribution. But with Cloudera and Hortonworks already out there, the Hadoop market is getting a bit crowded.

Therefore, the acquisition of Gluster does not come as a surprise.  With Gluster, Red Hat invests in a lesser known technology but one that clearly complements their middleware stack, previously focused primarily on application development and deployment.

This acquisition brings a couple comments:

  • Red Hat reinforces an already prevalent open source enterprise IT stack. Open source has taken a lion’s share in enterprise IT, Red Hat is a key player in this field and their having a broader product portfolio will make open source deployments stronger.
  • Open source is clearly taking the lead when it comes to Big Data management. Hadoop, the leading Big Data solution, is before all an open source project.  Red Hat getting into Big Data will make this even more true. Especially in light of recent announcements by another “red” company, who is thinking primarily of Big Data as a way to bring more data into their RDBMS appliances.
  • Open source is also the foundation on which Cloud infrastructure is being deployed, and Gluster is bringing another brick to this foundation. Red Hat is making is clear that they view Gluster’s scaling out capabilities as key to seamlessly support hybrid environments.  This is aligned with Talend’s view that Cloud is before all about hybrid architectures: combination of on-premises, private clouds and public clouds.

One of Red Hat’s claims to fame has been to bring innovation to the mainstream, thanks to a well-executed go to market model.  Gluster clearly fits this bill. We will be looking forward to seeing this technology take off under Red Hat’s wing, and of course to leveraging it as we already leverage a large number of data management technologies – conventional or not.

Yves

[2011-10-04] Talend Document: Talend Solution Brief - Master Data Management for Product Data

There are many reasons to implement MDM for product data. These business cases have been well documented and many projects have benefited from these implementations.

Download this Solution Brief and find out a series of reasons why others have used MDM. For each item, the business value as well as the key technical values of MDM are presented.

[2011-10-04] Talend Document: Talend Solution Brief - Master Data Management for Employee Data

There are many reasons to implement MDM for employee data. These business cases have been well documented and many projects have benefited from these implementations.

Download this Solution Brief and find out a series of reasons why others have used MDM. For each item, the business value as well as the key technical values of MDM are presented.

[2011-10-03] Talend Document: Data Quality Solution Checklist

The following checklist provides key functional requirements for implementing and deploying data quality in an enterprise environment.

[2011-09-30] Talend Forum Announcement: Talend Unified Platform 5.0.0M5 is Available For Testing

We are pleased to announce that Talend's 5.0.0M5 Unified Platform release is available, for testing only.  This milestone contains many bugs fixes and new features, and is recommended for experienced users who need an early preview of upcoming 5.0 release.

Download Talend's fifth Unified Platform milestone here.

Talend Open Studio:

Features:

Studio:
TDI-17374: New Exchange system (not fully working)
TDI-17372: Getting Started modified, use html page. (DI only)
TDI-17373: New TalendForge account linking.
TDI-17229: Usage Data Collector.
TDI-17592: Review Talend Open Studio Login dialog.
TDI-17258: Add a new attribute like: SHOW_IF with type:String for the ELEMENT:RETURN.

Components:
TDI-17815: the path in the tOldDbOutput_begin.javajet&tOleDbOutput_main.javajet are incorrect
TDI-17614: Create a generic component for Teradata TPT
TDI-17492: Aggregating multiple 'rows' in tXMLMap component
TDI-17498: Can't write binary data with tFileOutputLdif
TDI-17470: Vertica components: Support of Vertica 5.0
TDI-17463: Create the component tWriteJSONField
TDI-17303: tfileinputmsxml uses wrong dtd file location in the input xml file
TDI-13094: GlobalMap coming from tLoop doesn't work
TDI-12941: tMysqlOutputBulk - no error when disk is full

Bugs:

Studio:
TDI-17302: functions do not appear on tRowGenerator
TDI-17714: Remove store on disk option from tXmlMap (not available for 5.0.0)
TDI-17288: In the tXMLMap component editor, it is unable to edit the element when the element has an attribute but no any subnodes
TDI-8690: OSGI Export bug fixes.
TDI-17413: bug fixes related to the new system of libraries.

Components:
TDI-17701: tMDMInput uses non-optimized code to retrieve result
TDI-17574: tExtractDelimitedFields ignores trailing empty fields and then throws error if "Check against schema" is checked
TDI-17558: NullPointerException at TMap storing temp data
TDI-17518: tFileOutputProperties : file handle not closed
TDI-17445: tAggregateRow doesn't work with deactivated rLogRow
TDI-17406: tSybaseOutputBulk doesn't use the default decimal parameter of Syabse IQ
TDI-8197:  tFileOutputExcel 'Write excel2007 file format(xlsx)' memory error

Talend Open Profiler:

Phone number validation indicators

Talend MDM CE:

Studio:
MDM Repository view integrated with the platform repository fully functional
Graphical custom form editor

Web UI:
Most web modules ported to GXT
Redevelopment of web forms with composite view

Server:
Foreign key integrity constraint

Talend ESB / Talend ESB Studio:

Tooling:

New features:
[TESB-2498] [Studio] Export Service and Route as OSGI bundle(s)
[TESB-3174] [RouteBuilder] Generated code for Routes now support stop and shutdown

Improvements:
[TESB-3028] [Studio] Migrated to CXF 2.5.0-SNAPSHOT and Camel 2.8.2-SNAPSHOT
[TESB-3186] [Studio] Renamed tHTTPRequest and tHTTPResponse to tRESTRequest and tRESTResponse
[TESB-3229] [RouteBuilder] Renamed cCamelContext component to cConfig
[TESB-2744] [Components] Jobs that uses tESB* components should rely on OSGi require-bundle instead of embedded bundles.

Bug fixes:
[TESB-2591] [RouteBuilder] New Route Wizard input name validation fails to notify an invalid name
[TESB-2879] [Components] Wrong name in the title of tESBProviderRequest & tESBConsumer components' editor
[TESB-3197] [RouteBuilder] Can’t Create Bean on RouteBuilder
[TESB-3199] [RouteBuilder] Remove the "Services" displays on RouteBuilder perspective
[TESB-3203] [RouteBuilder] Fix Export as Spring XML bugs for cFtp Component
[TESB-3205] [RouteBuilder] Additional Dependency libraries are not exported

Runtime:

New features:
TESB-1446 - Enable Service Locator for Rest based services.
TESB-2692 - Implement the Rest interface for the Service Locator proxy
TESB-2972 - Include CXF-STS in delivery
TESB-3060 - Expose all Talend Runtime specific commands via JMX MBeans
TESB-3068 - Ready to deploy Archiva package for Tomcat

Improvements:
TESB-2938 - Same native OS metrics in Karaf embedded JobServer as in stand-alone JobServer

Bug fixes:
TESB-2589 - Exception during container launch if no SAM server running
TESB-2936 - Passwords to secure commands in JobServer visible
TESB-2960 - Jar file container/lib/bin/karaf-client.jar corrupted in TESB EE


For more information on fixed bugs and new features, go to the TalendForge Bugtracker.

Thanks for being a part of our community,
The Talend Team

[2011-09-27] Talend Forum Announcement: For test only, Talend's 5.0.0M4 Unified Platform release is available

We are pleased to announce that Talend's 5.0.0M4 Unified Platform release is available, for testing only.  This milestone contains many bugs fixes and new features, and is recommended for experienced users who need an early preview of upcoming 5.0 release.

Download Talend's fourth Unified Platform milestone here.

Talend Open Studio:

Features:
TDI-17222: Wizard Db connection for Oracle RAC.
TDI-17310: Remove Perl.
TDI-17413: Review the system of libraries (keep jars in original location / avoid to use .JavaLibs in commandline)
TDI-8697: Change display of repository for Services. (ESB)

Components:

Features:
TDI-17448: Add the possibility to use the tXMLMap as a virtual component
TDI-17383: Salesforce component: Add an advanced option to set the Client ID
TDI-17285: Please update JSch to latest release
TDI-17283: tFileOutputPositional should use toPlainString() for BigDecimal
TDI-17276: tSybaseBulkExec component should support the interface file when calling bcp.exe
TDI-17256: Option to mask data entry to Question tMsgBox
TDI-17250: Refactor the tFileInputPositional
TDI-17195: tOracleOutput : truncate table reuse storage
TDI-17162: Support of Oracle RAC in the existing Oracle components
TDI-17022: Amazon RDS components
TDI-6890: "java Code of a method longer than 65535 bytes" in tFileOutputPositional
TDI-16974: tXMLMap : Match model : support "Unique match" and "First match"

Bugs:
TDI-17502: replace old mdm_webservice.jar
TDI-17331: tLDAPOutput incomplete
TDI-17273: Variable Local pstmt_tMSSqlOutput_2 double
TDI-17234: about byte[] type on tWebservice component
TDI-17208: Issue when we contextualized the file input delimited row separator
TDI-17183: When there is two tXMLMap in the job and the outgoing connector of the first tXMLMap is connected to the second tXMLMap as "lookup" connector, a compile error occurs
TDI-17164: tSybaseBulkExec does not use connection information
TDI-17161: XPathException in tExtractXMLField component
TDI-13293: Option "Use cursor" in tParAccelInput not working
TDI-8713: Compile error occurs by just placing tSAPConnection and tSAPCommit (or tSAPRollback)
TDI-8694: Trace debug shows value null for parameter name and parameter value
TDI-8680: Some components don't die on error (tFileInputXML)
TDI-8291: Optimize tFileInputPositional to push back 65535 bytes limit
TDI-7631: tXXDBOutput component is unable to aggregate the NB_LINE_XX when trigger the "BatchSize" option
TDI-17339: The same problem also exists in the tInformixOutput component
TDI-17291: When there is not document type in the "main" part of tXMLMap component, compile errors occur
TDI-17263: When the document in type of "Lookup" does not have any outgoing connector, the "inner join" is unable to work
TDI-17189: When there is "lookup" connector in the job of tXMLMap and a document type without any connector, a lot of compile error occur

Talend Open Profiler:

Features:
TDQ-1845 Add Filters in DQ Repository view
TDQ-1760 Java UDI: not convinient to delete udi jar files

Bug fixes:
TDQ-2140 Charts are not synchronized with the selection when the analysis contains several pages
TDQ-3209 indicator selection and execution errors
TDQ-2640 Cannot move a folder onto another folder

TDQ-3209 indicator selection and execution errors
TDQ-2594 the indicators can't be displayed right
TDQ-2468 TOP does not correctly close Netezza connections
TDQ-3376 get the table wrong for a table exist in two catalogs
TDQ-3281 Truncated filter icons in DQ repository view
TDQ-3518 Could not retireve db in dq repository view


For more information on fixed bugs and new features, go to the TalendForge Bugtracker.

Thanks for being a part of our community,
The Talend Team.

[2011-09-27] Talend Document: Using WS-Trust in Apache CXF

In this technical note, Apache CXF contributor Colm O hEigeartaigh provides a walkthrough of how to secure a Web service using WS-Trust in Apache CXF. This technical note walks readers through the WS-Trust sample that ships with Talend ESB. Also, watch the recorded webinar here.

[2011-09-27] Talend Document: Using WS-SecurityPolicy/SAML with Apache CXF

In this technical note, Apache CXF contributor Colm O hEigeartaigh provides a walkthrough of how to secure a Web service using WS-SecurityPolicy/SAML in Apache CXF. Using the WS-SecurityPolicy sample that ships with Talend ESB, this sample shows how to secure a Web service provider using both a UsernameToken and a SAML Assertion. Also, watch the recorded webinar here

[2011-09-27] Talend Document: Talend Solution Brief - Master Data Management for Customer Data

There are many reasons to implement MDM for customer data. These business cases have been well documented and many projects have benefited from these implementations.

Download this Solution Brief and find out a series of reasons why others have used MDM. For each item, the business value as well as the key technical values of MDM are presented.

[2011-09-27] Talend Document: Talend Solution Brief - Master Data Management for ERP Systems

There are many reasons to implement MDM for product and ERP data. These business cases have been well documented and many projects have benefited from these implementations.

Download this Solution Brief and find out a series of reasons why others have used MDM. For each item, the business value as well as the key technical values of MDM are presented.

[2011-09-26] Talend Document: Tools for Big Data: Talend Integration Suite MPx

Big Data is still a new and rapidly developing space, with solutions like high-end data warehouse appliances, Hadoop clusters, NoSQL databases and other technologies emerging to address varied use cases that are still being discovered.

Going forward, Talend is committed to data integration for Big Data and to helping incorporate Big Data technologies into the mainstream of analytic and operational data integration thanks to Talend Integration Suite MPx. Download this white paper to learn how.

[2011-09-14] Talend Press Release: Talend Named Winner in InfoWorld Bossie Awards for Third Consecutive Year

Talend Named Winner in InfoWorld Bossie Awards for Third Consecutive Year

Open source vendor recognized as “the undisputed leader in open source data management and integration”

Los Altos, CA - September 15, 2011 - Talend, a global open source software leader, today announced it has been honored for the third consecutive year by IDG's InfoWorld as a recipient of the Best of Open Source Software awards (Bossies). Talend Open Studio, the Company's leading-edge open source solution for data integration, was recognized as a winner in the “Best open source data center and cloud software” category for its ability to dramatically improve the efficiency of data integration job design, enable rapid deployment, reduce maintenance costs and offer support for all types of data integration, data migration and data synchronization operations.

Chosen by InfoWorld Test Center editors and reviewers, the annual Bossies recognize the best and most innovative open source software products for end users, businesses and IT professionals. The 2011 awards reflect the dominance of open source software on the desktop, in mobile technology, business applications, application development, the data center and the private cloud.

According to InfoWorld, “Talend Open Studio remains the undisputed leader in open source data management and integration. Although most of the improvements this year pertained to growing and assimilating Talend's own ESB, the addition of an XML data mapping application is a boon to anyone working XML docs into their data flows, and the Bonita plugin builds a bridge to open source BPM. Talend's many onboard wizards simplify setup, while new tools for data quality analysis are must-haves for master data management. With strong provisions for business modeling and ETL, and even native support for Apache Hadoop, Talend scales to enterprise heights with ease.”

“Through open source, we are continuing to lead innovation in integration technology and to disrupt the market,” said Bertrand Diard, co-founder and CEO, Talend. “The recent release of Talend Cloud, our Big Data initiatives, and of course our combining of multiple integration paradigms within a single platform all bring further proof points of the enterprise-readiness of open source. We are excited to see InfoWorld recognize these trends and reward us with a Bossie Award for the third year in a row.”

About InfoWorld's Best of Open Source Software Awards

InfoWorld's annual Bossies (Best of Open Source Software awards) recognize the best open source software for end users, business, and IT professionals. The Bossies are selected by InfoWorld Test Center editors and reviewers. To nominate your favorite open source project, please contact Test Center Executive Editor Doug Dineley.

About InfoWorld

InfoWorld is the leading resource for content and tools on “modernizing enterprise IT.” InfoWorld's Web site, focused seminars and custom programs provide a deep dive into specific technologies to help IT managers excel in their roles and provide opportunities for IT vendors to reach this audience. InfoWorld is published by IDG Enterprise, a subsidiary of International Data Group (IDG), the world's leading media, events, and research company. Company information is available at www.idgenterprise.com.

[2011-09-13] Talend Document: Agility in Data Governance

Bad data management doesn’t happen overnight. Therefore, you can’t expect to establish a complete data governance program to solve all of your data management woes at once. One approach to data governance is to be pragmatic, carving off small important aspects of the data management problem and solving them one by one.
This white paper discusses the techniques that have been successful for data champions as they embark to establish an agile form of data governance for their company. It examines guidelines for project selection as well as other techniques for introducing agility into a data governance program.

[2011-09-09] Talend Forum Announcement: Talend Open Studio 4.2.3 is available

Download Talend Open Studio 4.2.3.

This release is a bugfixing release: 98 bugs were fixed.

For more detailed information, please check the data integration page on the JIRA bugtracker: http://jira.talendforge.org/browse/TDI

All bugs fixed in this release:

For the Studio :

TDI-17361: Wrong export when have export dependencies checked
TDI-17213: Memory Leak results in studio crash after re-initializing the generation engine a few times.
TDI-17214: Issue with migration of Metadata DB Connection (type AS400) from 4.0.2 to 4.2.2
TDI-17046: the UI information is not right in tXMLMap plugins when there is two or more tXMLMap components.
TDI-13193: After import tWebService job,missing [*] when added new Element
TDI-12871: Avoid delete the default schema for salesforce
TDI-8662:  Dragging & dropping a schema from Salesforce metadata to a job always picks the next in sequence custom object
TDI-8661:  tSalesforceOutput option Retrieve Inserted ID produces several salesforce_id fields
TDI-13449: create xml,if the coloum name are same,after modify it,the priview can't use yet
TDI-12594: if one item is not saved ,then export project will fail.
TDI-16900: MEMO_SQL font under linux is unreadably small and hard coded
TDI-13289: Retrieve schema On IBM DB2 ZOS will throw error.
TDI-13036: Unable to load tables using IBM DB2 ZOS connection
TDI-13302: the db (oracle,sqlserver)with context ,copy schema into generic schema,paste the schema will throw error.
TDI-13262: tMap : output link , lose mapping when switch build in to repository
TDI-13231: connect oracle,sqlserver with jdbc ,export to context ,then guess schema will throw error.
TDI-13156: Cannot add note to Job
TDI-12998: Error when generate a job from a component without schema
TDI-12916: Job generation very slow with latest patch
TDI-12783: Libraries dependancies JAR are not loaded from community components
TDI-12775: Job does not display 'Built-in' when a change in the repository metadata schema is not wanted to impact a job
TDI-9885:  tRunJob hangs
TDI-8707:  "Import from repository" command of tXMLMap ignores child mapping of XML-File metadata
TDI-8687:  "Import from repository" command of tXMLMap uses last declared element in xsd ignoring <schema/> tag
TDI-8668:  "Import from repository" command of tXMLMap uses filesystem instead of embedded xsd
TDI-8704:  Click "retrieve schema" button will show nothing when the db connection login on MSSQL via "Windows Authentication"
TDI-8671:  DB Wizard open / close the connection too often
TDI-8553:  TOS crashes while attempt to edit database schema (fixed in TDI-8671)
TDI-8647:  Export of job causes duplicate .java routines files.
TDI-8646:  Export doesn't produce all jar requirments.
TDI-8638:  Clicking SQL Template on tCombinedSQLAggregate_1 component shows java.lang.Nullpointer Exception
TDI-8626:  Teradata retrieve schemas doesn't see any tables/views
TDI-12663: CREATE A DERBY DB CONNECTION.WHEN CLICK THE FINISH BUTTON ,THEN IT WILL THROW ERRO
TDI-12656: Impossible to create a new XML file
TDI-8567:  Greenplum's metadata wizard fails with "ClassNotFoundException"
TDI-8561:  [Metadata XML Files] Can't retrieve schema from xsd files with multiple import
TDI-8552:  Improve performance for import items
TDI-8503:  <Detect and update job>:Not support modify update if job closed
TDI-8465:  Font setting problem in appearance
TDI-8433:  User routines
TDI-8424:  Can't Save Db Version for Oracle metadata
TDI-8343:  Error if undo delete of component after tMap
TDI-8328:  Bad generated query in the tJDBCInput component
TDI-8310:  Database(DB2) schema not map to retrieve schema page
TDI-7962:  Don’t reflect to source code context change by Job change in tRunJob
TDI-7524:  tJDBC components not support H2 database
TDI-8615:  Concurrent access possible when retrieve schemas from db connection
TDI-14139: IMPOSSIBLE TO OBTAIN SAS METADATA using the Talend connector

Components :

•    TMDM-2204 : Webservice Error
•    TMDM-1775 : Problem with the twebservice component
•    TDQ-1705 : Talend v4.2.2 job execution error
•    TDI-17273 : Variable Local pstmt_tMSSqlOutput_2 double
•    TDI-17244 : tMDMBulkLoad client JAR needs to be updated
•    TDI-17234 : about byte[] type on tWebservice component
•    TDI-17208 : Issue when we contextualized the file input delimited row separator
•    TDI-17164 : tSybaseBulkExec does not use the connection information
•    TDI-17161 : XPathException in tExtractXMLField component
•    TDI-16955 : Run a job with a desactivated tLogRow failed
•    TDI-16944 : tPivotToColumnsDelimited_1_NB_LINE created with invalid type
•    TDI-16901 : tSalesforce timeout broken in 4.2.1
•    TDI-16981 : tFTPGet fails if parent of local directory does not exist
•    TDI-13425 : tHasjOut/tHashInput null pointer exception
•    TDI-13410 : Execution error with tMSSqlInput
•    TDI-13328 : the stats & logs information is missing when enable parallel execution
•    TDI-13293 : Option “Use cursor” in tParAccelInput not working
•    TDI-13081 : tMicrosoftCrmOutput issue
•    TDI-12957 : Problem with tAdvancedFileOutputXML
•    TDI-12938 : The parameter “timeout” in tWebServiceInput doesn’t work
•    TDI-12924 : Concurrency Error on Iterative link
•    TDI-12918 : tFileFetch doesn’t release connection
•    TDI-12823 : rRunJob doesn’t populate run status
•    TDI-12693 : tWaitForFile component:check box “Wait for file to be released” is invalid
•    TDI-12681 : tAggregateSortedRow component doesn’t understand “Input Rows count” parameter
•    TDI-12641 : Vertica Component tVerticaBulkExec locks bulk file
•    TDI-8713 : Compile error occurs by just placing tSAPConnection and tSAPCommit (or tSAPRollback)
•    TDI-8694 : Trace debug shows calue null for parameter name and parameter value
•    TDI-8636 : tMicrosoftCRMOutput component can’t update on Task module
•    TDI-8682 : tNormalize before tUnite produces compilation errors
•    TDI-8680 : Some components don’t die on error (tFileInputXML)
•    TDI-8663 : Duplicate local variable when using tUnite
•    TDI-8656 : error message “TreeNode_API_tXML_1 cannot be resolved to a type” when consumer job starts
•    TDI-8654 : Talend Sage X3 adapter (tSageX3Input) – Keys obtained incorrectly
•    TDI-8653 : tHashOutput re-initialised for each iteration
•    TDI-8652 : NB_LINE of tFixedflowInput doesn’t return a value
•    TDI-8645 : tXMLMap issues with lookups
•    TDI-8641 : tSybaseIQOutputBulkExec, DATA_SOURCE parameter isn’t sent to tSybaseIQBulkExec
•    TDI-8619 : Father job in multithread mode finised with statut “success” instead of ‘failure” when whild job finished with status “failure”
•    TDI-8618 : confusing typo proprtties in various MySQL components
•    TDI-8550 : NPE in tSalesforceGetDeleted
•    TDI-8536 : The problem described in the [Bugtracker] bug 22677 also exists for the database Informix
•    TDI-8528 : tFileCopy globalMap variables not populated
•    TDI-8500 : Enhance tWaitForFile to support multi-thread when “Wait for file to be released” is checked
•    TDI-8495 : Bug on tXMLMap component
•    TDI-8473 : Bug in tJasperOutput
•    TDI-8291 : Optimize tFileInputPositional to push back 65535 bytes limit
•    TDI-7652 : Monitor Connection (volumetrics) doesn’t work when used after an iterate link that has parallel execution enabled
•    TDI-7631 : tXXDBOutput component is unable to aggregate the NB_LINE_XX when trigger the “BatchSize” option.
•    TDI-6890 : “java Code of a method longer than 65535 bytes” in tFileOutputPositional”

[2011-09-08] Talend Forum Announcement: Talend ESB Standard Edition 4.2.1 is now available

Hello Community,

Download and try Talend ESB Standard Edition 4.2.1. This is a bugfixing release.

For more detailed information, please check the ESB page on the JIRA bugtracker: http://jira.talendforge.org/browse/TESB

All fixed bugs for this release include:

- For Talend ESB 4.2.1 Standard Edition:

• [TESB-1577] WSDLException while running WS-Security (STS) enabled Rent A Car tutorial
• [TESB-2049] during start zookeeper got error
• [TESB-2081] sam-agent jar should not contain generated code for customerservice
• [TESB-2091] Starting of SAM Server as mvn jetty:run-war fails for examples/talend/tesb/sam/
• [TESB-2109] SAM adds flowID twice for faults
• [TESB-2117] Documentation, SAMUserGuide : Hint where to find the database sql script files
• [TESB-2126] Error Schema update TAC ESB
• [TESB-2127] Documentation inconsistencies "Talend ESB Development Guide"
• [TESB-2136] Location of STS server sample in GettingStartedGuide is outdated
• [TESB-2156] Specification of endpoint selection strategy per client
• [TESB-2300] SAM agent send incorrect event time
• [TESB-2359] Make jmx samples work and align it with the other samples

- For Talend ESB Studio 4.2.1 Standard Edition:

• [TESB-2125] Data Service Builder: Missing namespace in job-first generated WSDL
• [TESB-2513] Data Service Builder: Component generated code should not include obvious variable names (to
avoid clashes with user defined e.g. flow names)
• [TESB-2693] Data Service Builder: ESB Provider will always require jobs with dom4j dependencies
• [TESB-2224] Route Builder: CXF is unavailable in Camel Router Builder (fix now supports cxf:// URI's in
cMessageEndpoint)

[2011-09-08] Talend Forum Announcement: Talend MDM Community Edition 4.2.3 is available

Download Talend MDM Community Edition 4.2.3.

This release is a bugfixing release: 84 bugs have been fixed.

For more detailed information, please check MDM's JIRA page: http://jira.talendforge.org/browse/TMDM

All bugs fixed in this release:

    * [TMDM-1271] - JMS error log on Mac OS X
    * [TMDM-1337] - unknown messages in datamodel detail view
    * [TMDM-1363] - tMDMBulkLoad ...
    * [TMDM-1488] - Default value rules are not usable on Primary key
    * [TMDM-1510] - Erasing a date doesn’t work
    * [TMDM-1518] - « Relation » functionality: the entities associated under an inheritance are not shown
    * [TMDM-1554] - A job involving tStewardShipTaskInput won't run properly when deployed in MDM server
    * [TMDM-1639] - The entity search doesn’t work
    * [TMDM-1640] - error while overwriting records
    * [TMDM-1680] - NullPointerException with Enter key
    * [TMDM-1682] - In fk picker, add a combobox to select the inheritance type(s)
    * [TMDM-1697] - deployment problem
    * [TMDM-1700] - Items browser table displays incorrect value in case FK values are the same
    * [TMDM-1704] - Exception on validation beforeSaving Job in the case of Validation Error found, if we launch the saving two times consecutively
    * [TMDM-1705] - Multi-occurrence of Complex type – problem when adding a second occurrence
    * [TMDM-1706] - Jobox fails to guess the proper main class of a job when using tRunJob
    * [TMDM-1710] - An exception is thrown with Delete key and CRTL +V
    * [TMDM-1712] - An error message is quickly triggered at save time, even though the record is valid
    * [TMDM-1717] - User isn't kicked out of browse_items2 when session is expired
    * [TMDM-1719] - Unable to execute runnable process
    * [TMDM-1723] - A new created user has its <properties> element empty in datacontainer 'PROVISIONING'
    * [TMDM-1725] - Navigation bar can't be shown if user don't  click search button
    * [TMDM-1726] - Modifying an record in the « CREATION » window generate a « CREATE » event in the UPDATE REPORT, instead of an UPDATE event
    * [TMDM-1731] - Custom display format for the date doesnt accept simple format...
    * [TMDM-1733] - Classloading issue with Apache POI
    * [TMDM-1739] - tMDMOutput writes data in repository before the validation process (beforeSaving) happens.
    * [TMDM-1741] - Facet attribute rule doesn't block the saving if the rule is not valid, and strange error message displayed in a popup
    * [TMDM-1742] - Language selection causes 'TypeError: node is null'
    * [TMDM-1753] - Exception when launching a job deployed on Talend MDM server
    * [TMDM-1755] - Complex type opening arrow added for a simple type occurrence from multi-occurrence
    * [TMDM-1766] - When switching between entities with browse_items2, the former one stays open in the bottom of the screen
    * [TMDM-1767] - When editing an entity directly on the search page of browse_items2, beforeSaving is triggered before xpath valdiation rules
    * [TMDM-1775] - Problem with the twebservice component
    * [TMDM-1777] - Set default value rule option is missing in right-click menu for primary key
    * [TMDM-1795] - Bad display (with scrollbars and the screen height too small ) for the smartview pages for small resolutions (big screens)
    * [TMDM-1798] - Can not search the tasks which have been sent to others with administrator
    * [TMDM-1924] - Visibility & default rules don't work with xpath functions
    * [TMDM-1938] - FK  infos displayed in grid are without '-'
    * [TMDM-1940] - Missing WebService method "partialPutItem" - mixed up code base
    * [TMDM-1946] - Run (test) a process in the Studio brings no result window anymore, which makes it useless
    * [TMDM-1975] - Textbox border in browse_items2 can disappear when using IE8
    * [TMDM-2041] - Info messages bumping onto the console
    * [TMDM-2045] - Impossibility to reinitialize Auto Increment values
    * [TMDM-2047] - The Validation Rule does not work in the 'Edit Item with Row Editor'
    * [TMDM-2075] - Make some checks before accepting a validation rule name
    * [TMDM-2079] - Search button must be clicked - Enter is not enough
    * [TMDM-2122] - Edit a MDM Server location
    * [TMDM-2194] - The FK Picker doesn's work correctly
    * [TMDM-2203] - After creating a new record, the create button keep sending you back to the same tab
    * [TMDM-2204] - Webservice Error
    * [TMDM-2224] - All the elements are displayed in the first column.
    * [TMDM-2248] - Report(by using store procedure) is not showing any data but in studio the data is viewble correctly
    * [TMDM-2279] - It isn't possible anymore to define an anonymous simple type in 4.2.2
    * [TMDM-2292] - Error [Object error] when displaying the occurrences
    * [TMDM-2302] - Throw exception when browsing records
    * [TMDM-2308] - Talend job components do not read well the Update Report item's modification containing accents
    * [TMDM-2309] - Could not load an item detail
    * [TMDM-2312] - Not able to delete a link with a FK
    * [TMDM-2319] - Error while clicking + button for a multi occurence polymorphism element
    * [TMDM-2324] - Error while save a record which primary key is auto increment
    * [TMDM-2325] - Search engine issue
    * [TMDM-2326] - Unexcepted popup when you click on refrech
    * [TMDM-2333] - Hidden elements must not be resetted to empty values
    * [TMDM-2335] - save a data from the browser view, the save button always stay available and this button should lock when saving
    * [TMDM-2336] - When date format is invalid, warning icon cover the search button 
    * [TMDM-2338] - In the data record, the date format are displayed in french, but the date picker still have some english words.
    * [TMDM-2341] - the arrows disappear
    * [TMDM-2346] - The Polymorphizm cannot be saved correctlly.
    * [TMDM-2349] - Synchronization Plan failed with an error
    * [TMDM-2351] - Cannot search or create an item in  ItemsRecords
    * [TMDM-2352] - The search using EQUALS does not work 
    * [TMDM-2367] - Query generation issue using joins and conditions
    * [TMDM-2368] - Cannot resolve foreign key when contained in a sequence
    * [TMDM-2378] - Throw exception while creating records for entities with inheritance
    * [TMDM-2384] - [Reports]Display nothing when there's only one element in Display fields
    * [TMDM-2387] - Problem with the multiple occurrence polymorphic element
    * [TMDM-2389] - After launch a runnable proccess, the current record detail would be repeated two times
    * [TMDM-2399] - The result of FK filter search is incorrect
    * [TMDM-2400] - Records can be saved while missing mandatory values
    * [TMDM-2403] - Change to a simple type in the Reusable Types doesn't work as expected
    * [TMDM-2404] - Journal does not work at all
    * [TMDM-2410] - Arjuna Error when you try to access the journal
    * [TMDM-2415] - Unable to display records in crossreference tables
    * [TMDM-2423] - Error when opening imported user in  PROVISIONING

[2011-09-08] Talend Forum Announcement: Talend Open Profiler 4.2.3 is available

Download Talend Open Profiler 4.2.3.

This release is a bugfixing release: 39 bugs were fixed.

For more detailed information, please check the data quality page on the JIRA bugtracker: http://jira.talendforge.org/browse/TDQ

All bugs fixed in this release:

•    [TDQ-1432] - SYSBASE Connection Analysis doesn't work
•    [TDQ-1494] - create some db connections ,then change to DQ view , the db view didnot show.
•    [TDQ-1531] - dill down of the generated pattern is not correct
•    [TDQ-1547] - Approximation in the result is wrong in analysis
•    [TDQ-1569] - the indicators can't be displayed right
•    [TDQ-1575] - the structure under the "Rules" folder is not correctly. SQL folder show as two levels
•    [TDQ-1645] - after importing a indicator, the message showed is not correctly
•    [TDQ-1662] - the rules file user created named "Rules" can be unfold and under it will show all the rule files without end.
•    [TDQ-1679] - Duplicate count indicator result is wrong for column analsis with OracleDB
•    [TDQ-1683] - analysis about mdm columns which being used for the first time
•    [TDQ-1711] - Pattern test view no longer shows result
•    [TDQ-1976] - error message not displayed that non-pattern-supporting-database has been detected
•    [TDQ-1998] - confirmation message for "Reload database list"
•    [TDQ-2095] - click "finish" in the edit wizard there is error.
•    [TDQ-2169] - SQL Pattern not shown in Pattern selector
•    [TDQ-2249] - Java.lang.NullPointerException message in log file
•    [TDQ-2268] - run the analysis with many columns and indicators have errors.
•    [TDQ-2302] - Multiple user connection to same Oracle database result in issues
•    [TDQ-2303] - cannot set thresholds on minimal length indicators with null or with blank
•    [TDQ-2333] - TDQFileItem lost in dataquality.ecore
•    [TDQ-2468] - TOP does not correctly close Netezza connections
•    [TDQ-2499] - cannot display the value valid in the Analysis Results
•    [TDQ-2503] - can not drag sqlPattern to columnSet Analysis
•    [TDQ-2522] - Headers are wrong for Java option in indicator editor.
•    [TDQ-2528] - Unable to drill down on Databases
•    [TDQ-2546] - java.sql.SQLException: SQLite only supports TYPE_FORWARD_ONLY cursors
•    [TDQ-2640] - Cannot move a folder onto another folder
•    [TDQ-2730] - analyses contain a pattern and a indicator and these pattern, indicator have a same name
•    [TDQ-3197] - [check regex pattern] Doesn't work
•    [TDQ-3206] - Pattern tester result is hidden
•    [TDQ-3209] - indicator selection and execution errors
•    [TDQ-3212] - Connection information can not changing correctly according to the data changed in the analysis for db created later than analysis opening
•    [TDQ-3240] - Sum indicator doesn't appear
•    [TDQ-3266] - check jdbc connection to netezza in ConnectionInfoPage will get an error
•    [TDQ-3267] - DQ view can't display
•    [TDQ-3307] - Open Profiler runs Oracle analyse compute statistics on every table when anakysing metadata
•    [TDQ-3336] - delete DB connections into recycle,not show the connection name 'mysql' but showing "Db Connections" .this action is the same as the analysis
•    [TDQ-3419] - Reopen column analysis,get an empty editor, and NPE
•    [TDQ-3436] - Could not import items from 4.0.x-4.2.3

[2011-09-08] Talend Blog: Three Bossie Awards, Three Years in a Row

It came yesterday, when the editorial team at Infoworld notified us: for the third year in a row, Talend won a Bossie Award! I like the way they said it: “Talend Open Studio remains the undisputed leader in open source data management and integration.”  Infoworld’s Bossie Awards recognize the best open source software of the year.

And indeed, with 15 million downloads, 750,000 users – Talend has taken open source integration to a level no one suspected it could go. The challenge of democratizing integration was an interesting one to tackle, and I am proud of what we have achieved.

bossie09.jpg   bossie10.jpg   bossie11.jpg

Now, as I am looking back at these three Bossie awards, there is an interesting pattern that is very reflective of Talend’s history.

  • Our first Bossie came in 2009. Talend Open Studio had been reviewed a few months ago by Infoworld’s Test Center and has passed with flying colors and an overall score of 9/10. At this time, all we were talking about at Talend was data integration, and indeed the Bossie highlighted this: “Talend Open Studio makes an excellent choice for data integration projects large or small.”
  • In 2010, our second Bossie highlighted an expanding suite of products for all data management needs. At that point, we were gaining recognition in the data quality field and had introduced the first open source MDM solution. And here we went: “This year Talend raised the bar with the addition of master data management tools and native support for Hadoop, laying the groundwork for large-scale, big data analysis.”  Also note the first reference to Big Data.
  • And today, in 2011, as our third Bossie comes along, two major shifts have happened. The first one is a category shift: while Talend was recognized in the “Best of open source platforms and middleware” categories for the past two years, the category is now “Best open source data center and cloud software”.  Any link to our strategic Cloud initiative earlier this year, with the release of Talend Cloud?  The second shift, very important also, is the extension of Talend’s focus from pure play data management into an overall integration solution. As Infoworld highlighted, “most of the improvements this year pertained to growing and assimilating Talend’s own ESB.”  And indeed, beyond the assimilation of Talend ESB, the Talend story has shifted significantly this year – from pure play data management into a much broader integration solution.

This Bossie will soon join the other ones on our Wall of Fame, as I like to (modestly) call our Awards and Industry Recognition web page.

Yves

[2011-09-05] Talend Press Release: East Sussex Fire & Rescue Service uses Talend Data Quality to fight the hazards of data management

East Sussex Fire & Rescue Service uses Talend Data Quality to fight the hazards of data management

East Sussex Fire & Rescue Service selects open source data management solution, Talend Data Quality

MAIDENHEAD, UK - September 6, 2011 - East Sussex Fire and Rescue Service has announced that it has selected Talend Data Quality, an open source data management solution, to integrate and manage over 100 applications within its IT infrastructure. The implementation of Talend Data Quality will enable the Service to dramatically reduce the time taken to compile reports, whilst ensuring the consistency and accuracy of information contained within them.

East Sussex Fire and Rescue Service operates from 24 fire stations handling a wide variety of calls – from major fires and complicated technical rescues to the routine and inevitable false alarms. Under-pinning this activity lies an IT infrastructure that includes over 100 applications, everything from a comprehensive human resources system, which keeps track of fire and rescue personnel, to the transactional systems that log every emergency call, fire station alarm, appliance deployment and actions taken in the field.

Steve Sims, project manager, East Sussex Fire and Rescue Service said: “East Sussex Fire and Rescue has to handle huge volumes of events each day and this information needs to be logged for reporting purposes and for business intelligence. With over 100 applications, it was no longer efficient to integrate and manage our data manually as this was a time-consuming process, on occasions prone to error”

Talend Data Quality will be implemented to cleanse inaccurate and inconsistent data, identify and resolve duplicate records and augment and enhance data.

Steve commented: “We selected Talend Data Quality as it has the capability to aggregate and reconcile all the data we need to pull together in-depth reports. As a public service, East Sussex Fire and Rescue Service has to provide regular reports to evidence that it is discharging its statutory obligations and meeting its own very challenging standards of service delivery. We also rely heavily on this data for business intelligence. It is vital that we have an accurate account of the volume and nature of the incidents we handle and reflecting the resources we use, so we can constantly review the efficiency and effectiveness of our response. It is of paramount importance to us to ensure that we have one consistently dependable source of integrated data, to ensure that all reports are completely accurate. We needed a cost effective data quality tool which could achieve this and Talend Data Quality met all of these requirements”

François Méro, VP of Global Sales, Talend said: “Starting a data management project of this scale can be very time consuming and complex due to license and contractual discussions; not to mention very costly. East Sussex Fire and Rescue Service has put efficiency and flexibility at the top of its agenda, and Talend Data Quality enables it to achieve on both fronts. Using Talend Data Quality, East Sussex Fire and Rescue Service can continue to push performance levels and provide the local community with an ever improving first class service.”