• Index
  •  » Talend Open Studio for Data Integration » Usage, Operation
  •  » [resolved] parse html to extract some tags

#1 2011-09-07 18:41:33

fmarin156
Member
Registered: 2010-11-23
Posts: 48

[resolved] parse html to extract some tags

Tags: [html, java, jdom, parse]

Hi,

i have to parse some HTML pages to extract certains tags, and i choose to use JDOM (see scrrenshots for the job ans it's parameters)

I have downloaded jdom1.1 from http://www.jdom.org/downloads/index.html, i build it, and i copy jdom.jar in "../jre/lib"

i update the java path (see screenshot

when i execute the job, i have an error "java.lang.NoClassDefFoundError: org/jdom/Document" (and so for SAXBuilder) ...

where is the mistake ?

thanks for an idea

fred

Last edited by fmarin156 (2011-09-07 18:47:11)

Offline

#2 2011-09-07 19:04:57

fmarin156
Member
Registered: 2010-11-23
Posts: 48

Re: [resolved] parse html to extract some tags

i have some problems with my jre to join screenshots ... and i search to solve the problem

fred

Last edited by fmarin156 (2011-09-08 12:19:02)

Offline

#3 2011-09-08 12:21:31

fmarin156
Member
Registered: 2010-11-23
Posts: 48

Re: [resolved] parse html to extract some tags

hi,

it's all right for my jre

there are my screenshots

fred


Uploaded Images

Offline

#4 2011-09-09 12:03:27

shong
Talend team
Registered: 2007-08-29
Posts: 10297
Website

Re: [resolved] parse html to extract some tags

Hi
It miss the jar file, you need to load the jar using tLibraryLoad first, see my screenshot.
You can download the jar from here:
http://www.java2s.com/Code/Jar/JKL/Down … m10jar.htm

Best regards
Shong


Uploaded Images


Email:shong@talend.com
Choose Talend, Enjoy Talend!
New & Event: Talend Help Center
Talend-->the leader of open source data management and application integration solutions!

Offline

#5 2011-09-12 15:14:06

fmarin156
Member
Registered: 2010-11-23
Posts: 48

Re: [resolved] parse html to extract some tags

thanks, Shong

it's all right

fred

Offline

  • Index
  •  » Talend Open Studio for Data Integration » Usage, Operation
  •  » [resolved] parse html to extract some tags

Board footer

Powered by FluxBB