CJK characters garbled in MSG files (MailContentTransformer)

Question asked by irvingpop on Mar 29, 2011
Latest reply on Mar 30, 2011 by mrogers
Server:  Alfresco 3.4.d on Ubuntu 10.04,  MySQL (UTF-8) DB

I'm getting reports from users that text extraction is broken for double-byte characters (Chinese in my case) in Outlook MSG files.

The rest of the email text is extracted correctly, but the Chinese characters look like this:  @z/NHa
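
For anyone trying to reproduce this, here is a minimal sketch of one way output like that can arise. It assumes, without confirmation, that the message body is stored as UTF-16LE and that somewhere in the extraction path those bytes get decoded with a single-byte charset. This is a guess at the failure mode, not a description of what MailContentTransformer actually does. The class and method names (`MojibakeDemo`, `garble`, `repair`) are hypothetical.

```java
import java.nio.charset.StandardCharsets;

public class MojibakeDemo {

    // Hypothetical helper: encode text as UTF-16LE, then (wrongly) decode
    // those bytes as ISO-8859-1, so each CJK character becomes two characters.
    static String garble(String text) {
        byte[] utf16 = text.getBytes(StandardCharsets.UTF_16LE);
        return new String(utf16, StandardCharsets.ISO_8859_1);
    }

    // Hypothetical helper: undo the mistake above. This works because
    // ISO-8859-1 maps every byte value to exactly one character.
    static String repair(String garbled) {
        byte[] raw = garbled.getBytes(StandardCharsets.ISO_8859_1);
        return new String(raw, StandardCharsets.UTF_16LE);
    }

    public static void main(String[] args) {
        String original = "\u4E2D\u6587"; // the two characters of "Chinese (language)"
        String garbled = garble(original);
        System.out.println(garbled.length());                 // 4: two characters became four
        System.out.println(repair(garbled).equals(original)); // true
    }
}
```

If the garbled text in a real extraction survives `repair` and comes back as readable Chinese, that would point at a charset mix-up of this kind. If it does not, the cause is something else.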

I enabled previews for MSG files using these instructions:  http://issues.alfresco.com/jira/browse/ALF-6200

Also, you cannot search for Chinese terms that are contained in the emails.

Anyone else tried this?
