AnsweredAssumed Answered

Character Set

Question asked by helpertoo on Dec 29, 2010
We have an interesting issue where some of our articles were created from webforms that were forcing ISO-8859-1 while others were using UTF-8.  In an effort to standardize, we have changed everything to UTF-8 (which is what our Oracle 10i database has been since installation).  Our issue is basically around the double space that everybody wants to put in after a punctuation and various other areas - the pages that were originally ISO-8859-1 now have an odd "U+00C2: Latin Capital Letter A with Circumflex — Keystroke: Alt+0194."  We have been able to grep our content store for the character and globally remove the critter, but have been unable to get the fricken A out of the Alfresco side of the house.

Every time we manage to fix our external content store for our portal, somebody goes back in and edits and/or creates something new and injects a buch more of them fricken A's.

We have checked and rechecked our tomcat servers, alfresco configs, and FSR… all say UTF-8.  Unfortunately, we are still on Alfresco 3.1.1 with no resources to upgrade.

Any help would be extremely WELCOME!!!

from our tomcat server.xml:

<Connector port="8080"
       maxHttpHeaderSize="8192"
       maxThreads="150"
       minSpareThreads="25"
       maxSpareThreads="75"
       enableLookups="false"
       redirectPort="8443"
       acceptCount="100"
       connectionTimeout="500000"
       disableUploadTimeout="true"
       URIEncoding="UTF-8"
       protocol="HTTP/1.1" />

Outcomes