AnsweredAssumed Answered

Full text indexing and transforming

Question asked by itbeb on Apr 13, 2010
We are experiening a problem when uploading new documents and full text indexing. It does not happen instantaneously and also not even after a few days  :cry:

If I now look for words in documents uploaded in February I do find it.

I picked up the following in the catalina this morning (and it looks as if it is only now transforming something that was already uploaded in February):

INFO: Server startup in 38799 ms
11:33:02,113 User:ITBEB DEBUG [content.transform.ContentTransformerRegistry] Searched for transformer:
   source mimetype: application/msword
   target mimetype: text/plain
   transformers: [TextMiningContentTransformer[ average=0ms]]
11:33:02,217 User:ITBEB DEBUG [content.transform.AbstractContentTransformer2] Completed transformation:
   reader: ContentAccessor[ contentUrl=store://2010/2/16/15/18/d9f68990-d29d-422a-ad32-78f5e2dd410a.bin, mimetype=application/msword, size=388608, encoding=UTF-8, locale=en_US]
   writer: ContentAccessor[ contentUrl=store://2010/4/13/11/33/6b6b1774-596f-40d5-bbda-3ae57d021ff4.bin, mimetype=text/plain, size=1474, encoding=UTF-8, locale=en_US]

   options: org.alfresco.service.cmr.repository.TransformationOptions@4d33a040
   transformer: TextMiningContentTransformer[ average=102ms]
11:33:04,680 User:System DEBUG [content.transform.ContentTransformerRegistry] Searched for transformer:
   source mimetype: application/vnd.excel
   target mimetype: text/plain
   transformers: [PoiHssfContentTransformer[ average=0ms], ComplexContentTransformer[ average=0ms]]
11:33:04,858 User:System DEBUG [content.transform.AbstractContentTransformer2] Completed transformation:
   reader: ContentAccessor[ contentUrl=store://2010/2/15/16/35/fea594cb-ffdc-4427-b858-38f15b3de647.bin, mimetype=application/vnd.excel, size=25088, encoding=UTF-8, locale=en_US]
   writer: ContentAccessor[ contentUrl=store://2010/4/13/11/33/a58fc511-ccdc-4151-96b8-b338b8670554.bin, mimetype=text/plain, size=1440, encoding=UTF-8, locale=en_US]
   options: org.alfresco.service.cmr.repository.TransformationOptions@3f11cedd
   transformer: PoiHssfContentTransformer[ average=178ms]
11:33:04,883 User:System DEBUG [content.transform.ContentTransformerRegistry] Searched for transformer:
   source mimetype: application/msword
   target mimetype: text/plain
   transformers: [TextMiningContentTransformer[ average=102ms]]
11:33:04,887 User:System DEBUG [content.transform.AbstractContentTransformer2] Completed transformation:
   reader: ContentAccessor[ contentUrl=store://2010/2/15/16/35/d0089520-2423-4bd5-9eef-3eb7c739044c.bin, mimetype=application/msword, size=87552, encoding=UTF-8, locale=en_US]
   writer: ContentAccessor[ contentUrl=store://2010/4/13/11/33/8d17eb52-e37c-40f2-b6ec-36a2728343f3.bin, mimetype=text/plain, size=3532, encoding=UTF-8, locale=en_US]
   options: org.alfresco.service.cmr.repository.TransformationOptions@5935c72f
   transformer: TextMiningContentTransformer[ average=52ms]

Does anyone have any idea what on erath is happening here??

Outcomes