Spatial module for Talend Open Studio (aka. Spatial Data Integrator or SDI) is an ETL with geospatial capabilities.
Likewise regular TOS components, there are basically three sorts of Geo components: input, output and transform components. Input and output components read features from and write feature to datastores, respectively. Transform components read features from their input flows, possibly transform those features, and write features to their output flows. The term transform is to be taken loosely here as it represents any sort of operation.
Spatial components are using GeoTools and Java Topology Suite for read/write GIS formats and feature manipulation. Experimental works have been done to embed uDig to display GIS datasets and Sextante library to add RASTER processing.
SDI was first prototyped by Camptocamp team in summer 2007. It is based on Talend Open Studio (TOS), the general-purpose ETL developed by Talend.
The geospatial capabilities are provided in the components belonging to the Geo component family. We refer to the components within the Geo family either as the Geo or spatial components.
Note: the names of all Geo components start with an “s” (for SDI).
More recently in 2010, Neogeo Technologies worked with Talend in order to be able to plug spatial modules to an existing TOS installation. This has been achieve since 4.1.0 release.
| GIS Format | Read | Write | Component name | Comments |
|---|---|---|---|---|
| PostGIS | | | sPostgisInput/sPostgisOutput and tPostgresInput/tPostgresOutput | |
| ESRI Shapefile | | | sShapefileInput/sShapefileOutput | |
| MapInfo MIF/MID | | | sMapinfoInput/sMapinfoOutput | |
| GeoRss | | | sGeoRssOutput | Read using tFileInputXml component |
| Edigeo | | sEdigeoInput | ||
| WFS | | sWfsInput | ||
| GPX | | sGpxInput | ||
| OSM | | sOSMNodesInput/sOSMWaysInput | ||
| NetCDF | | sNetCDFInput | http://www.neogeo-online.net/blog/archives/1384/ | |
| OGR | | sOgrInput | http://www.gdal.org/ogr/ogr_formats.html |
svn co http://talendforge.org/svn/sdi/trunk/