• Index
  •  » Talend Open Studio for Data Integration » Usage, Operation
  •  » Generic schema definition for files

#1 2008-09-17 13:43:03

tcimoli
Member
Company: Liberologico
Registered: 2008-09-02
Posts: 57

Generic schema definition for files

Tags: [file, metadata, schema]

I have some Excel files to migrate, all with the same structure (i.e. the same column names and column order).
In TOS I would like to create metadata for that schema without linking it to a specific file: the file will change, but the schema will stay the same.

So far, the only way I have found is to create a new Metadata -> File Excel entry, but then I can only use that schema with that one file.

I have two questions:
1) Is it possible to define generic schemas for Excel files?

2) Creating a file metadata entry takes a long time to finish (7 to 10 minutes). The Excel file is about 15 MB.

Why does it take so much time? I only need it to load the schema of the file; it does not need to read the whole file.

Maybe I misunderstand what Metadata means for files: does it keep the whole file in RAM so that jobs run faster?

Last edited by tcimoli (2008-09-17 14:09:54)


#2 2008-09-17 14:22:23

jcoutin
Member
Registered: 2008-05-30
Posts: 170

Re: Generic schema definition for files

Hello,

1) You can change the file name in the job (File Name in the component properties).

2) I guess creating a file metadata entry takes a long time because Talend reads the file to determine the field types (String, Integer, etc.).
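
To illustrate why guessing field types can mean reading every row, here is a minimal Java sketch. It is not Talend's actual code; the class and method names are made up for the example. The idea is only that a column can be declared Integer once every value has been checked, while a single non-numeric value is enough to fall back to String.

```java
import java.util.List;

public class ColumnTypeGuessSketch {

    // Hypothetical helper: guess a type for one column by inspecting its values.
    // A column of integers forces a scan of every value before we can answer;
    // the loop stops early only when a non-integer value is found.
    static String guessType(List<String> columnValues) {
        boolean allIntegers = true;
        for (String value : columnValues) {
            if (!value.matches("-?\\d+")) {
                allIntegers = false;
                break; // one non-numeric value is enough to decide on String
            }
        }
        return allIntegers ? "Integer" : "String";
    }

    public static void main(String[] args) {
        System.out.println(guessType(List.of("1", "42", "-7")));
        System.out.println(guessType(List.of("1", "n/a", "3")));
    }
}
```

With a 15 MB spreadsheet, a scan of this kind over every column would explain at least part of the delay, if that is indeed what the wizard does.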

JC

Last edited by jcoutin (2008-09-17 14:29:08)


#3 2008-09-17 14:48:16

tcimoli
Member
Company: Liberologico
Registered: 2008-09-02
Posts: 57

Re: Generic schema definition for files

>You can change the file name in the job.
I have to do it dynamically; I will not run all the jobs by hand. Is there a way to change job properties at runtime?

Anyway, I have to reuse the schema definition in many different components. I do not want to edit it manually each time; I would like to define it once. Is that possible?


#4 2008-09-17 15:05:34

jcoutin
Member
Registered: 2008-05-30
Posts: 170

Re: Generic schema definition for files

You can change the name dynamically: put a context variable in the "File Name" field.

You can also use "export as context" in the Metadata repository.
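
As a rough sketch of the context-variable approach, assuming a Java project: the "File Name" field takes a Java expression, so it can be built from context variables. The variable names `inputDir` and `fileName` below are hypothetical, and the small `Context` class only stands in for the `context` object that the job provides, so that the snippet runs on its own.

```java
import java.util.List;

public class ContextFileNameSketch {

    // Hypothetical stand-in for the job's "context" object.
    // In a real job these would be context variables defined in the Studio.
    static class Context {
        String inputDir;
        String fileName;

        Context(String inputDir, String fileName) {
            this.inputDir = inputDir;
            this.fileName = fileName;
        }
    }

    // The same expression you would type into the "File Name" field:
    //   context.inputDir + "/" + context.fileName
    static String resolveFileName(Context context) {
        return context.inputDir + "/" + context.fileName;
    }

    public static void main(String[] args) {
        // Same schema, different files: only the context values change.
        List<String> files = List.of("customers_2008_08.xls", "customers_2008_09.xls");
        for (String f : files) {
            Context context = new Context("/data/in", f);
            System.out.println(resolveFileName(context));
        }
    }
}
```

In the Studio itself only the expression goes into the field; the class around it is scaffolding for the example. The schema stays attached to the component, and only the context values change from one run to the next.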

