You are not logged in.
Announcement
Unanswered posts
|
Pages: 1

Hi
i have xml file which contains HTML tags in in one of the column. but while creating schema of XML it removes all HTML from that column. please suggest how to solve this issue.
Offline

Here i have modified some text and pasted here but all the XML tags is same as we have in original XML except HTML Tags. see the <GoodsBODY> for HTML tags.
<GOODS>
<Good>
<GoodsTITLE>
<![CDATA[på Force Measurement]]>
</GoodsTITLE>
<NUMBER>
<![CDATA[51728536]]>
</NUMBER>
<GoodsCOMPANY><![CDATA[Tvärleden 2]]></GoodsCOMPANY>
<GoodsLOCATION><![CDATA[Västerås/018/Sverige]]></GoodsLOCATION>
<GoodsACTION><![CDATA[add]]></GoodsACTION>
<GoodsBODY><![CDATA[<p>ABB <table border="0" cellspacing="0" cellpadding="0" bgcolor="#FFFFFF">
<tr>
<td valign="bottom" rowspan="4" align="right"><img src="images/homepage/top24.gif" width="121" height="125" border="0"></td>
<td valign="bottom" rowspan="4" align="right" height="125"><img src="images/homepage/top23new1.gif" width="286" height="125"></td>
</tr>
<tr>
<td valign="bottom" height="41" align="right" bgcolor="#FFFFFF"> </td>
<td valign="bottom" height="41" align="right" bgcolor="#FFFFFF"></td>
</tr>
<tr>
<td valign="bottom" align="left" height="65" colspan="2" bgcolor="#000066"><img src="images/homepage/top23new2.gif" width="406" height="65" usemap="#MapMap" border="0"></td>
</tr>
<tr>
<td valign="bottom" align="left" bgcolor="#000066" height="19" colspan="2">
</td>
</tr>
</table>
<map name="MapMap">
<area shape="rect" coords="277,9,349,25" href="Contactus.htm" alt="Contact Us" title="Contact Us">
<area shape="rect" coords="9,37,49,51" href="index.asp">
</map> </p><br>]]></GoodsBODY>
<AREA><![CDATA[Projektledning]]></AREA>
<CONTRACT><![CDATA[Regelbundet/Permanent]]>
</CONTRACT>
<EMP_FRACTION><![CDATA[Skiftarbete]]></EMP_FRACTION>
<PUB_START_DATE><![CDATA[20120420]]></PUB_START_DATE>
<DE_FOR_APP><![CDATA[20120513]]></DE_FOR_APP>
<LAN><![CDATA[SV]]></LAN>
<COUNTRY><![CDATA[Sverige]]></COUNTRY>
<REGION_STATE><![CDATA[018]]></REGION_STATE>
<CITY><![CDATA[Västerås]]></CITY>
Last edited by umeshrakhe (2012-05-15 08:57:43)
Offline

Hi
Got you. Because this xml file is not complete. I can't reproduce it.
But in my opinion, Talend can handle XML file which is embedded with HTML well.
You need to add <![CDATA[]]> correctly.
So I think you'd better recheck your XML file. Or can you open them with browser?
Regards,
Pedro
Offline

Hi Pedro,
i pasted wrong XML this is the correct one, here HTML Tags from <BODY> encoded, so we need to decode it in its original form (HTML)
<WORKS>
<WORK>
<WORKTITLE>
<![CDATA[Ledande El-konstruktör för]]>
</WORKTITLE>
<WORKNUMBER>
<![CDATA[51853370]]>
</WORKNUMBER>
<COMPANY><![CDATA[Tvärleden 2]]></WORKCOMPANY>
<LOCATION><![CDATA[Västerås/018/Sverige]]></WORKLOCATION>
<ACTION><![CDATA[add]]></ACTION>
<BODY><![CDATA[<p>Some Text.</p><br><p>Some Text!</p><br>]]></BODY>
<F_AREA><![CDATA[Design och teknik]]></F_AREA>
<EMP_CONTRACT_TYPE><![CDATA[Regelbundet/Permanent]]></EMP_CONTRACT_TYPE>
<EMP_FRACTION><![CDATA[Skiftarbete]]></EMP_FRACTION>
<PUB_START_DATE><![CDATA[20120514]]></PUB_START_DATE>
<DE_FOR_APPLICATIONS><![CDATA[20120605]]></DE_FOR_APPLICATIONS>
<LAN><![CDATA[SV]]></LAN>
<COUNTRY><![CDATA[Sverige]]></COUNTRY>
<RE_STATE><![CDATA[018]]></RE_STATE>
<CITY><![CDATA[Västerås]]></CITY>
<NAME><![CDATA[Ingela Olsson]]></NAME>
<PHONE><![CDATA[+46 21 32 50 00]]></PHONE>
<TIMESTAMP><![CDATA[2012-05-14 14:38:52]]></TIMESTAMP>
</WORK>
</WORKS>
please suggest how to decode below text into its original HTML Form.
<BODY><![CDATA[<p>Some Text.</p><br><p>Some Text!</p><br>]]></BODY>
Last edited by umeshrakhe (2012-05-17 06:43:50)
Offline
Pages: 1