I am facing a similar issue. Both leading and trailing white spaces around &amp; are truncated when it is converted to &.
I am trying to use the SAXParser, and Talend has issues when it comes across &amp; and &apos;. Does anyone know of a workaround? The occurrences of the & character look properly encoded in the XML file. I have three XML files, each larger than 1 GB, so using SAX is required.
I am using TOS 2.4.
When I configure a tFileInputXML to use the SAXParser, it doesn't get the element text right as soon as the text contains (encoded) quotes or apostrophes.
<?xml version="1.0" encoding="utf-8"?>
It returns just a single decoded quote (") for column "title", whereas the DOM and Xerces parsers work fine.
I definitely need to use the SAXParser because of memory issues.
Anyone got an idea?
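
For anyone hitting the same symptom: one well-known cause in hand-written SAX code is that the parser may deliver the text of a single element in several characters() calls, often split at entity references such as &quot; or &amp;. A handler that keeps only the last chunk would then return just the trailing quote, which matches what is described above. I can't confirm that this is what TOS 2.4 does internally, so treat the following as a minimal sketch of the general technique in plain Java, not as Talend's code. The class name TitleTextDemo, the method readTitles and the <root> wrapper element are made up for illustration; only the "title" element comes from the post.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical demo class, not part of Talend.
public class TitleTextDemo {

    // Collects the text of every <title> element.
    static class TitleHandler extends DefaultHandler {
        private final StringBuilder buffer = new StringBuilder();
        private final List<String> titles = new ArrayList<>();
        private boolean inTitle = false;

        @Override
        public void startElement(String uri, String localName, String qName, Attributes attrs) {
            if ("title".equals(qName)) {
                inTitle = true;
                buffer.setLength(0); // start a fresh buffer for this element
            }
        }

        @Override
        public void characters(char[] ch, int start, int length) {
            // May be called several times per element, so append
            // each chunk instead of overwriting the previous one.
            if (inTitle) {
                buffer.append(ch, start, length);
            }
        }

        @Override
        public void endElement(String uri, String localName, String qName) {
            if ("title".equals(qName)) {
                titles.add(buffer.toString()); // whitespace is kept as-is
                inTitle = false;
            }
        }
    }

    // Parses the given XML string and returns the text of all <title> elements.
    static List<String> readTitles(String xml) {
        try {
            SAXParser parser = SAXParserFactory.newInstance().newSAXParser();
            TitleHandler handler = new TitleHandler();
            parser.parse(new InputSource(new StringReader(xml)), handler);
            return handler.titles;
        } catch (Exception e) {
            throw new RuntimeException("SAX parsing failed", e);
        }
    }

    public static void main(String[] args) {
        String xml = "<root><title>&quot;Tom &amp; Jerry&apos;s&quot;</title></root>";
        System.out.println(readTitles(xml));
    }
}
```

Because the handler only holds the text of the current element, memory use stays small even for files over 1 GB; for real files, an InputSource over a file stream would replace the StringReader.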