[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: DOM wrappers and UTF-8 encoding



This depends on where the decode takes place. Are you doing a
System.out.print to check the values or are you looking at the result in
a browser? How is the browser or system set up? Can the browser display
the character? Can the system? If this is on a terminal, the terminal
software might limit what can be shown. Is your browser set up for
UTF-8, Latin-1, iso-8990, whatever?

Remember that UTF-8 is an encoding standard, similar to base64. It needs
to be decoded somewhere to make sense. You need to discover where the
decoding happens or do it yourself.
Ed.

Poorav Chaudhari wrote:
> 
> thank you edward for replying to my query. i will take a look at the
> link you sent me. but just to answer your question about the language
> i am writing this in. well actually it is a jsp. what i am trying to
> do is read data from an xml file, the data is correctly represented as
> &# characters, but the problem is that when i read the data after
> parsing through the xml document using a DOM object. normal ASCII text
> (english characters basically) are displayed just fine, and there is
> no change, if the getnodevalue comes across this type of string
> combination (& followed by # followed by some numbers followed by ;)
> it some how seems to do its own interpretation. and changes the output
> to a resulting special character.
> 
> i hope that explanation made some sense. please let me know if you
> have any idea what i am talking about.
> 
> thanks
> 
> poorav
> 
> ----------------------------------------------------------------------
> Do You Yahoo!?
> Listen to your Yahoo! Mail messages from any phone with Yahoo! by
> Phone.