Topic review (newest first)

Guillaume_
2009-11-17 17:33:08

PS:
On a side note, using SAX parsing while still having Talend produce dom4j nodes when the "Get Nodes" mapping option is used should be relatively trivial to implement, thanks to the org.dom4j.io.SAXContentHandler class.
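
To illustrate the idea, here is a minimal sketch of streaming with SAX while materializing only one record subtree at a time. It is an assumption-laden example, not Talend's implementation: it uses the JDK's own org.w3c.dom classes in place of dom4j so that it stays self-contained, and the class name SubtreeStreamer, the method stream, and the recordName parameter are hypothetical.

```java
import java.util.function.Consumer;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.SAXParserFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper: builds a small DOM subtree for each "record" element
// seen by the SAX parser and hands it to a callback, instead of loading the
// whole file into one tree.
final class SubtreeStreamer extends DefaultHandler {
    private final String recordName;      // element name that marks one record (assumption)
    private final Consumer<Element> sink; // receives each finished subtree
    private final Document doc;           // factory for nodes; subtrees are never attached to it
    private Node current;                 // node being filled; null while outside a record

    SubtreeStreamer(String recordName, Consumer<Element> sink) throws Exception {
        this.recordName = recordName;
        this.sink = sink;
        this.doc = DocumentBuilderFactory.newInstance().newDocumentBuilder().newDocument();
    }

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        // Ignore everything until a record element opens.
        if (current == null && !qName.equals(recordName)) {
            return;
        }
        Element e = doc.createElement(qName);
        for (int i = 0; i < atts.getLength(); i++) {
            e.setAttribute(atts.getQName(i), atts.getValue(i));
        }
        if (current != null) {
            current.appendChild(e);
        }
        current = e;
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        // Text outside a record is dropped.
        if (current != null) {
            current.appendChild(doc.createTextNode(new String(ch, start, length)));
        }
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        if (current == null) {
            return;
        }
        Node parent = current.getParentNode();
        if (parent == null) {
            // The record root just closed: emit it and keep no reference to it.
            sink.accept((Element) current);
        }
        current = parent;
    }

    // Parses the input and calls sink once per record element.
    static void stream(InputSource in, String recordName, Consumer<Element> sink) throws Exception {
        SAXParserFactory.newInstance().newSAXParser().parse(in, new SubtreeStreamer(recordName, sink));
    }
}
```

With dom4j, the same handler could presumably forward the events inside a record to an org.dom4j.io.SAXContentHandler instead of building the nodes by hand. Memory use would then depend on the size of the largest record rather than on the size of the file, provided the callback does not hold on to the nodes it receives.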

Guillaume_
2009-11-17 17:15:48

Hello,

I need to parse a large (4.3 GB) and complex (several related hierarchical levels) XML file.

Due to the complexity of the file, the parsing is handled by a tFileInputXML component, several levels of tExtractXMLField, and the "Get Nodes" mapping option.

That works correctly for a small sample file, but when using the real file, it quickly runs into OutOfMemoryErrors.

At first I thought this would be no problem: simply switch from dom4j to SAX for the parsing. However, "Get Nodes" only works with dom4j parsing. Is there a practical solution?

Cheers,
G
