I know this is an old post, but I experienced the same thing an thought I'd add my two sense and how it might be possible to fix it.
First of all, MS (anything) handles CJK fonts and encoding pretty bad. One thing I noticed is that now most Japanese test reverts to Simsum with an encoding of CHINESE_GB2312. It should actually be a Japanese font, such as MS Gothic, and have a western encoding. Surely it should be UTF-8, but we can't expect MS to have a clue about widely used international standards.
Some documents I'm trying to rescue are only rescueable by directly editing the file in a plain text editor. If your file is RTF, this is not too difficult. If it's in MS Word, you may be out of luck. Of course you could always try opening it in another editor such as OpenOffice (Gasp!), then save it as another sane format such as HTML or something that you can copy and paste the rendered text from into a plain text document to remove all formating. Then copy again the plain text into your favorite editor and format to your liking - hopefuly with styles, etc.
God speed!
|