Talend Exchange is the place where Talend community can share items related to Talend opensource products, such as Data Integration, Data Quality and Data Master Management. Contribution is open to any user, no specific validation is needed. As soon as you have your forum account, you automatically get a Talend Exchange account.


tFileOutputOCR


  • Author: bennatigiuliano
  • Categories: Java, Component
  • First revision date: 2012-07-16
  • Latest revision date: 2012-07-16
  • Compatible with: Data Integration releases 5.0.0
  • Downloads: 29

About: This component based on Tesjeract allow you to convert any pictures containing text to .txt file

Revision list

expand/collapse all

Revision 0.1 29 Downloads, Released on 2012-07-16
Download revision 0.1

Compatible with: 5.0.0

This component based on Tesjeract ( http://code.google.com/p/tesjeract/ ) allow you to convert any pictures containing text to .txt file.

Please read Tesjeract FAQ :
`tessdll.dll` , `tesjeract.dll` and tessdata directory need to be in `C:/Windows/System32`
You also need the [http://www.microsoft.com/downloads/details.aspx?familyid=a5c84275-3b97-4ab7-a40d-3802b2af5fc2 Microsoft Visual C++ 2008 SP1 Redistributable Package].
See also JVM stacks settings : http://www.talendforge.org/forum/viewtopic.php?id=24838 .

or simply use tSystem(tesseract.exe) ;-)

Reviews (0)

Be the first to review this extension!

 

Submit review
Name:*
Email:*
Title:*
Please select your rating*
Review:*


Version Author Released on Rating Downloads
Indicator

EMail validation via mail server

5.3.0 mzhao 2013-06-03
190

This Java UDI check emails by sending a SMTP request to mail server. the code sample can be found at: http://www.rgagnon.com/javadetails/java-0452.html

Indicator

Frequency table of hours

2.0 scorreia 2013-04-25
277

This indicator helps to analyze the most frequent day hours that appear in date time columns.

Indicator

Sample Standard Deviation

1.1 scorreia 2013-04-25
195

This indicator computes the sample standard deviation of any numerical column

Indicator

Variance

1.1 scorreia 2013-04-25
182

This indicator computes the variance of numeric columns

Indicator

Trimmed

1.0 scorreia 2013-04-25
17

evaluate the number of data which are correctly trimmed

Indicator

Week Frequency

2.0 scorreia 2013-04-25
173

aggregates Date fields into weeks

Indicator

Duplicate Rows

2.0 scorreia 2013-04-25
598

this indicator counts the number of duplicate rows.
It's different from the system indicator called "duplicate count" because it counts the number of duplicate rows, not the number of duplicate values.

Indicator

Length Range Frequency

1.1 scorreia 2013-04-25
48

get length ranges of data.

group data according to their length range.
Ranges are the following:
data of length < 10
data of length < 20
data of length < 30
data of length >= 30
null data

Indicator

Order of Magnitude

1.1 scorreia 2013-04-25
58

measure the order of magnitude of numerical data

Indicator

phone_area_code_freq

1.0 scorreia 2013-04-24
11

Area codes of American phone numbers

Version Author Released on Rating Downloads
Export

Product Demo

3.0 ctoum 2012-05-31
433

Product & families, with Cafepress pictures.

Data-Model

Clinical Trials: Janus Model Basics

1.0 jaymce 2010-11-22
288

This is a model of the basic of the Janus Clinical Data Repository.
http://www.fda.gov/ForIndustry/DataStandards/StudyDataStandards/ucm155327.htm

Data-Model

D* Demo Model

1.0 ctoum 2010-08-13
516

Model used in the D* Demo.

Export

Talendshop Demo

1.0 ctoum 2010-08-04
1066

Talendshop Demo (Demo Project)


53 ms