Talend Open Studio for DI/DQ/BD Installation Guide

This wiki page is deprecated: although the information provided may still be applicable, it will not be maintained. Please visit the Talend Help Center website to read the latest and most accurate installation information, and update your bookmarks.

Talend Open Studio for DI (Data Integration), Talend Open Studio for DQ (Data Quality) and Talend Open Studio for BD (Big Data) are based on Eclipse RCP. Therefore their installation procedure is identical.

Prerequisites

Prior to installing Talend Open Studio for DI/DQ/BD, please read the following page: Software and Hardware Prerequisites
It is important to note that the disk and memory recommendations for Talend Runtime only apply to the ESB users.

To use Talend Open Studio for DI/DQ/BD properly, you need to install the following programs.

On Windows

:!: On Windows XP and Windows Server 2003, the GDI is already installed. However, on Windows 2000, this installation is required. The GDI can be downloaded from Microsoft website. For further information, visit Eclipse’s FAQ.

On Linux

If you are on Linux, you are required to install the xulrunner package for all versions of Talend Open Studio. For further information, visit Installation on Ubuntu

Downloading & installing

Download

Get the archive file from the download section of the Talend website.
:!: Note that the .zip file contains binaries for ALL platforms (Linux/Unix, Windows and MacOS).

Once the download is complete, extract the archive files on your hard drive.

Configure the memory settings

If you want to tune the memory allocation for your JVM, you only need to edit the .ini file corresponding to your executable file. For example:

  • For Talend Open Studio on Windows, edit the file: TOS_DI/DQ/BD-win32-x86.ini,
  • For Talend Open Studio on Linux, edit the file: TOS_DI/DQ/BD-linux-gtk-x86.ini.

The default values are:

 -vmargs -Xms40m -Xmx500m -XX:MaxPermSize=128m 

If you only have 512Mo of memory on your computer, you can specify the memory allocation as following:

-vmargs -Xms40m -Xmx256m -XX:MaxPermSize=64m, for example.

Learn more on http://www.oracle.com/technetwork/java/hotspotfaq-138619.html

Launch

On Windows, double-click the executable file to launch Talend Open Studio for DI/DQ/BD.

On Unix-like systems, add execution rights on the desired “TOS_DI/DQ/BD-*” binary before launching it.
On a standard Linux box, the command is:

$ chmod +x TOS_DI/DQ/BD-linux-gtk-x86
$ ./TOS_DI/DQ-linux-gtk-x86

On Mac OS X, unpack the TOS_DI/DQ/BD-macosx-cocoa.app folder and launch Talend Open Studio for DI/DQ/BD.

Public license, login and first project

Public license

First screen is a license screen.
In the [License] window that appears, read and accept the terms of the license agreement to proceed to the next step.

Login window: Setting up a project

You need to create a connection before logging to Talend Open Studio for DI/DQ/BD.

:doc:tos_install-project_creation.png

As first time user, you need to set up a new project or you can also import the Demo project which gathers numerous job samples:
- To select the demo project, select TALENDDEMOSJAVA and click Import.
- To create a new project, enter the name of your project in the corresponding field and click Create to complete the description of your project.
:doc:tos_install-project_desc.png
In the Project name field, type in the name of the project.
In the Project description field, type in a description for this project.
Click Finish when complete, and the newly created project is displayed in the Login window.

:doc:tos_install-project_selection.png

In the Login window, open the project you just created.
A registration window opens.

If required, follow the instructions provided to join Talend community or click Skip to open a welcome window and launch the Studio.

Configuring the Studio

Basic configuration

Talend Open Studio for DI/DQ/BD opens up on a four-panel window. The Repository is the left panel storing all items used to model your business and design your processes.

You can define various preferences according to your needs. To do so, open menu Window > Preferences… > Talend.
:doc:tos_install-preferences.png

Click Browse… to specify the path to the Temporary file directory, and the Java interpreter.

Install Jar dependencies

Talend Open Studio requires specific third-party Java libraries or database drivers (Jar files) to be installed to connect to sources and targets. Those Jar files, known as external modules, can be required by some Talend components. However, due to license restrictions, Talend may not be able to integrate certain external modules within Talend Open Studio.

Identify Jar dependencies

On your design workspace, if a component requires the installation of external modules before it can work properly, a red error indicator appears on the component. With your mouse pointer over the error indicator, you can see a tooltip message showing which external modules are required for that component to work.

See below an example when you use the tFTPGet component in Talend Open Studio for Big Data.

:doc:tos_install-missing_modules-tooltip.png

In this example, as the required Jar files are provided under the LGPL license while Talend Open Studio for Big Data is provided under the Apache license, these Jar files are not included in this distribution.

The Modules view lists all the modules required to use the components embedded in the Studio, including those missing Java libraries and drivers that you must install to get the relevant components working.

:doc:tos_install-missing_modules-modules_view.png

INFO If the Module is not shown under your design workspace, go to Window > Show View… > Talend and then select Modules from the list.

In addition to the Modules view, the Studio provides a mechanism that enables you to easily identify, download and install most of the required third-party modules from Talend website and directs you to valid websites for the rest.

A Jar installation wizard appears when you:

  • drop a component from the Palette if one or more external modules required for that component to work are missing in the Studio, or
  • click the Check button in a Metadata connection setup wizard in Talend Open Studio for Data Integration if one or more external modules required for the connection are missing in the Studio, or
  • click the Guess schema button in the Component view of a component if one or more external modules required for that component to work are missing in the Studio, or
  • click the button in the Modules view.

INFO When you click this button, the wizard that appears will list all the required external modules that are not integrated in the Studio.

tos_install-missing_modules-wizard

This wizard lists the external modules to be installed, the licenses under which they are provided, and the URLs of the valid websites where they are downloadable, and allows you to download and install automatically all the modules available on Talend website and download those not available on Talend website by following the links provided in the Action column and then install them into your Studio manually.

When you use a component that requires an external module for which neither the Jar file nor its download URL information is available on Talend website, the Jar installation wizard does not appear, but the Error Log view will present an error message informing you that the download URL for that module is not available. You can try to find and download it by yourself, and then install it manually into the Studio.

INFO To show the Error Log view on the tab system, go to Window > Show views, then expand the General node and select Error Log.

Install dependencies

To install missing modules automatically, do the following:

  1. In the Jar installation wizard, click the Download and Install button to install a particular module, or click the Download and install all modules available button to install all the required modules available on Talend website.
  2. Click Accept in the [License] dialog box that appears to continue with the installation.

INFO The [License] dialog box appears for each license under which the relevant modules are provided until that license is accepted.

Upon installation of the chosen external module or modules, a dialog box appears to notify you about the number of modules successfully installed and/or about the modules failed to install, if any.

To install manually an external module you already have in your local file system, do the following:

:!: Talend Open Studio for Big Data does not come with the JDBC drivers for Oracle databases due to Apache license restrictions. For Oracle9i, the required JDBC driver downloadable from Oracle website is named ojdbc14.jar, the same as that for Oracle 10g. To enable the JDBC driver for Oracle9i you have downloaded to work in Talend Open Studio for Big Data, you have to change the file name to ojdbc14-9i.jar before installing it into the Studio.

  1. In the Modules view, click the button at the upper right corner to browse your local file system.
  2. In the [Open] dialog box of your file system, browse to the Jar file you want to install.
  3. Double-click Jar file, or select it and then click Open to install it.
  4. Click Refresh in the Modules view. The component is ready for use.

You're now ready to create your first job.

 
doc/installation_guide.txt · Last modified: 2013/03/20 11:04 by lgaudens
 
 
Recent changes RSS feed Driven by DokuWiki