Lakshmi
2005-02-21 22:47:53 UTC
A japanese customer sent me a file, ids.txt, containing a list of Employeed
ids. These ids are a
mix of "Japanese Characters" and English characters. I can open this file
in notepad on a
Japanese enabled workstatation and when I do a File->Save, it shows the
encoding as unicode.
When I save this file as "ids_ansi.txt" with the encoding changed to ANSI
and issue a "type ids_ansi.txt"
command at a cmd prompt, I can see that the Japanese & English characters
are rendered
correctly.
I am under the impression that japanese characters can be rendered
correctly only if the
underlying charset is unicode. If so, how did the cmd window render the
Japanese & English
characters in the ANSI encoded file correctly ?
Here is the reason why I am asking the question.
We have a .net windows forms application which can save data to any rdbms
using the ADO.net
architecture. Some of the databases have been round for a while and have
only the cp850
character installed on them and customers resist changing to unicode. I
wanted to know if there is
an easy enough way to specify that string traffic between the windows
forms->ADO.Net->database is only in cp850 such that japanese character
rendering occurs on the forms
without any "loss of translation." In short, I would like to have the same
behaviour as the cmd
prompt that I described above. I understand that with this approach the
sort order, the size of local
language characters that can be stored, etc will be an issue.
Any tips, hints appreciated.
Thanks
ids. These ids are a
mix of "Japanese Characters" and English characters. I can open this file
in notepad on a
Japanese enabled workstatation and when I do a File->Save, it shows the
encoding as unicode.
When I save this file as "ids_ansi.txt" with the encoding changed to ANSI
and issue a "type ids_ansi.txt"
command at a cmd prompt, I can see that the Japanese & English characters
are rendered
correctly.
I am under the impression that japanese characters can be rendered
correctly only if the
underlying charset is unicode. If so, how did the cmd window render the
Japanese & English
characters in the ANSI encoded file correctly ?
Here is the reason why I am asking the question.
We have a .net windows forms application which can save data to any rdbms
using the ADO.net
architecture. Some of the databases have been round for a while and have
only the cp850
character installed on them and customers resist changing to unicode. I
wanted to know if there is
an easy enough way to specify that string traffic between the windows
forms->ADO.Net->database is only in cp850 such that japanese character
rendering occurs on the forms
without any "loss of translation." In short, I would like to have the same
behaviour as the cmd
prompt that I described above. I understand that with this approach the
sort order, the size of local
language characters that can be stored, etc will be an issue.
Any tips, hints appreciated.
Thanks