Unanswered posts

  • Index
  •  » Talend Open Studio for Data Integration » Usage, Operation
  •  » TFileInputXML using SAX parser : problem with quotes

#1 2008-06-27 18:07:55

msu-price
New member
Registered: 2008-06-27
Posts: 1

TFileInputXML using SAX parser : problem with quotes

I am using TOS 2.4.
When I configure a TFileInputXML to use the SAXParser, it doesn't get the element text right as soon as there are (encoded) quotes or apostrophes.

e.g.:

<?xml version="1.0" encoding="utf-8"?>
<root>
<title>&quot;my title&quot;</title>
</root>

It returns just a single decoded quote ( " ) for column "title", whereas the DOM and xerces parsers work fine.
I definately need to use the SAXParser because of memory issues.

Anyone got an idea?

Offline

#2 2009-04-17 16:05:26

fszeborg
Member
Registered: 2009-01-23
Posts: 17

Re: TFileInputXML using SAX parser : problem with quotes

I am trying to use the SAXParser and Talend is having issues when it comes across &amp; and &apos;.  Does anyone know of a workaround?  These instances involving the & character look proper in the xml file.  I have three XML files, each larger than 1GB.  So, using SAX is required.


Thanks

Offline

#3 2009-08-25 07:26:07

abharal
New member
Registered: 2008-04-02
Posts: 5

Re: TFileInputXML using SAX parser : problem with quotes

I am facing similar issue. Both leading and tailing white spaces &amp; are truncated when converted to &.

Thanks

Offline

  • Index
  •  » Talend Open Studio for Data Integration » Usage, Operation
  •  » TFileInputXML using SAX parser : problem with quotes

Board footer

Powered by FluxBB