On a side note, using SAX parsing while still having Talend produce dom4j nodes when the "Get Nodes" mapping option is used should be relatively trivial to implement, thanks to the org.dom4j.io.SAXContentHandler class.
I need to parse a large (4.3 GB) and complex (several related hierarchical levels) XML file.
Due to the complexity of the file, the parsing is handled by a tFileInputXML component, several levels of tExtractXMLField and the "Get Nodes" mapping option.
That works correctly for a small sample file, but when using the real file, it quickly runs into OutOfMemoryErrors.
At first I thought this would be no problem: simply switch from dom4j to SAX for the parsing. But "Get Nodes" only works with dom4j parsing. Is there any practical solution?
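
To make the idea concrete, here is a minimal sketch of what I mean by streaming with SAX while still getting a small node tree per record. It uses only the JDK's SAX and DOM classes rather than dom4j, and it is not how Talend implements "Get Nodes". The class name RecordStreamer and the element name "order" are made up for illustration. Only one record subtree is built at a time, so it should become eligible for garbage collection once the callback returns, provided the callback does not keep a reference to it.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.function.Consumer;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.SAXParserFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.xml.sax.Attributes;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper: builds one DOM subtree per record element and hands it to a callback.
class RecordStreamer extends DefaultHandler {
    private final String recordName;           // element name that delimits one record
    private final Consumer<Element> sink;      // receives each completed record subtree
    private final Document nodeFactory;        // used only to create nodes, never filled
    private final Deque<Element> open = new ArrayDeque<>();
    private final StringBuilder text = new StringBuilder();

    RecordStreamer(String recordName, Consumer<Element> sink) throws Exception {
        this.recordName = recordName;
        this.sink = sink;
        this.nodeFactory = DocumentBuilderFactory.newInstance().newDocumentBuilder().newDocument();
    }

    @Override
    public void startElement(String uri, String local, String qName, Attributes atts) {
        // Ignore everything until a record element opens.
        if (open.isEmpty() && !qName.equals(recordName)) return;
        flushText();
        Element e = nodeFactory.createElement(qName);
        for (int i = 0; i < atts.getLength(); i++) {
            e.setAttribute(atts.getQName(i), atts.getValue(i));
        }
        if (!open.isEmpty()) open.peek().appendChild(e);
        open.push(e);
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        // Text outside a record is dropped.
        if (!open.isEmpty()) text.append(ch, start, length);
    }

    @Override
    public void endElement(String uri, String local, String qName) {
        if (open.isEmpty()) return;
        flushText();
        Element done = open.pop();
        // The stack is empty again only when the record element itself has closed.
        if (open.isEmpty()) sink.accept(done);
    }

    // Attach buffered character data to the innermost open element.
    private void flushText() {
        if (text.length() > 0) {
            open.peek().appendChild(nodeFactory.createTextNode(text.toString()));
            text.setLength(0);
        }
    }

    static void parse(InputStream in, String recordName, Consumer<Element> sink) throws Exception {
        SAXParserFactory.newInstance().newSAXParser().parse(in, new RecordStreamer(recordName, sink));
    }

    public static void main(String[] args) throws Exception {
        String xml = "<orders><order id=\"1\"><item>a</item></order>"
                   + "<order id=\"2\"><item>b</item></order></orders>";
        parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)), "order",
              e -> System.out.println(e.getAttribute("id") + " -> " + e.getTextContent()));
    }
}
```

With dom4j, org.dom4j.io.SAXContentHandler would presumably take the place of the hand-written node building here, but I have not tried that inside a Talend job.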