AnsweredAssumed Answered

Some docx documents crash the indexer

Question asked by wschwierz on Dec 1, 2011
We have problems with some MS Word (.docx) files. If such a file is checked into the system we get the following example error message in the logfile:


13:48:55,423 DEBUG [org.alfresco.repo.content.metadata.AbstractMappingMetadataExtracter] Starting metadata extraction:
   reader: ContentAccessor[ contentUrl=store:///opt/alfresco/tomcat/temp/Alfresco/alfresco2797023594067075767.upload, mimetype=application/vnd.openxmlformats-officedocument.wordprocessingml.document, size=1761185, encoding=UTF-8, locale=en_US]
   extracter: org.alfresco.repo.content.metadata.OpenOfficeMetadataExtracter@69c39cb2
13:48:55,423 DEBUG [org.alfresco.repo.content.filestore.FileContentReader] Opened write channel to file:
   file: /opt/alfresco/tomcat/temp/Alfresco/alfresco2797023594067075767.upload
   random-access: true
13:48:55,423 DEBUG [org.alfresco.repo.content.AbstractContentReader] Created callback byte channel:
   original: sun.nio.ch.FileChannelImpl@88b5bc1
   new: org.alfresco.repo.content.AbstractContentAccessor$CallbackFileChannel@7534e048
13:48:55,424 DEBUG [org.alfresco.repo.content.AbstractContentReader] Opened channel onto content: ContentAccessor[ contentUrl=store:///opt/alfresco/tomcat/temp/Alfresco/alfresco2797023594067075767.upload, mimetype=application/vnd.openxmlformats-officedocument.wordprocessingml.document, size=1761185, encoding=UTF-8, locale=en_US]
13:49:08,354 ERROR [net.sf.jooreports.openoffice.connection.SocketOpenOfficeConnection] disconnected unexpectedly
13:49:08,355 DEBUG [org.alfresco.repo.content.metadata.AbstractMappingMetadataExtracter] Metadata extraction failed:
   Extracter: org.alfresco.repo.content.metadata.OpenOfficeMetadataExtracter@69c39cb2
   Content:   ContentAccessor[ contentUrl=store:///opt/alfresco/tomcat/temp/Alfresco/alfresco2797023594067075767.upload, mimetype=application/vnd.openxmlformats-officedocument.wordprocessingml.document, size=1761185, encoding=UTF-8, locale=en_US]
com.sun.star.lang.DisposedException: java.io.IOException: com.sun.star.io.IOException: EOF reached - socket,host=localhost,port=8100,tcpNoDelay=1,localHost=Alfresco04,localPort=52819,peerHost=localhost,peerPort=8100
        at com.sun.star.lib.uno.bridges.java_remote.java_remote_bridge$MessageDispatcher.invoke(java_remote_bridge.java:237)
        at com.sun.star.lib.uno.bridges.java_remote.java_remote_bridge$MessageDispatcher.run(java_remote_bridge.java:144)
13:49:08,357 DEBUG [org.alfresco.repo.content.metadata.AbstractMappingMetadataExtracter] Completed metadata extraction:
   reader:    ContentAccessor[ contentUrl=store:///opt/alfresco/tomcat/temp/Alfresco/alfresco2797023594067075767.upload, mimetype=application/vnd.openxmlformats-officedocument.wordprocessingml.document, size=1761185, encoding=UTF-8, locale=en_US]
   extracter: org.alfresco.repo.content.metadata.OpenOfficeMetadataExtracter@69c39cb2
   changed:   {}
13:49:09,116 DEBUG [de.exchange.insight.alfresco.behaviours.PropagateProductTags] onCreateChildAssociation() parentRef=workspace://SpacesStore/f52e7f32-76a4-46eb-a10b-351cab9808f7 childRef=workspace://SpacesStore/58fc0eba-e4d9-47ca-8b04-3e5c53788ef8 new=true
13:49:09,133 DEBUG [org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl] Creating indexer
13:49:09,133 DEBUG [org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl] Create node workspace://SpacesStore/58fc0eba-e4d9-47ca-8b04-3e5c53788ef8
13:49:09,139 DEBUG [org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl] Update node workspace://SpacesStore/58fc0eba-e4d9-47ca-8b04-3e5c53788ef8
13:49:09,144 DEBUG [org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl] Update node workspace://SpacesStore/58fc0eba-e4d9-47ca-8b04-3e5c53788ef8
13:49:09,146 DEBUG [org.alfresco.repo.content.filestore.FileContentStore] Created content writer:
   writer: ContentAccessor[ contentUrl=store://2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin, mimetype=null, size=0, encoding=UTF-8, locale=en_US]
13:49:09,146 DEBUG [org.alfresco.repo.content.AbstractContentStore] Fetched new writer:
   Store:   FileContentStore[ root=/data/alfresco/contentstore, allowRandomAccess=true, readOnly=false]
   Context: NodeContentContext[ contentUrl=null, existing=false, nodeRef=workspace://SpacesStore/58fc0eba-e4d9-47ca-8b04-3e5c53788ef8, propertyQName={http://www.alfresco.org/model/content/1.0}content]
   Writer:  ContentAccessor[ contentUrl=store://2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin, mimetype=null, size=0, encoding=UTF-8, locale=en_US]
13:49:09,147 DEBUG [org.alfresco.repo.content.filestore.FileContentWriter] Opened write channel to file:
   file: /data/alfresco/contentstore/2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin
   random-access: true
13:49:09,147 DEBUG [org.alfresco.repo.content.AbstractContentWriter] Created callback byte channel:
   original: sun.nio.ch.FileChannelImpl@1fcd5198
   new: org.alfresco.repo.content.AbstractContentAccessor$CallbackFileChannel@60a7e277
13:49:09,147 DEBUG [org.alfresco.repo.content.AbstractContentWriter] Opened channel onto content:
   content: ContentAccessor[ contentUrl=store://2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin, mimetype=application/vnd.openxmlformats-officedocument.wordprocessingml.document, size=0, encoding=UTF-8, locale=en_US]
   channel: org.alfresco.repo.content.AbstractContentAccessor$CallbackFileChannel@60a7e277
13:49:09,620 DEBUG [org.alfresco.repo.content.ContentServiceImpl] Content property updated:
   Node Name:   qqDTI_Programming_Manual_V1.2.7-DRAFT.docx
   Property:    {http://www.alfresco.org/model/content/1.0}content
   Is new:      true
   Before:      contentUrl=|mimetype=application/vnd.openxmlformats-officedocument.wordprocessingml.document|size=0|encoding=UTF-8|locale=en_US_
   After:       contentUrl=store://2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin|mimetype=application/vnd.openxmlformats-officedocument.wordprocessingml.document|size=1761185|encoding=UTF-8|locale=en_US_
13:49:09,622 DEBUG [org.alfresco.repo.content.filestore.FileContentStore] Created content reader:
   url: store://2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin
   file: /data/alfresco/contentstore/2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin
   reader: ContentAccessor[ contentUrl=store://2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin, mimetype=null, size=1761185, encoding=UTF-8, locale=en_US]
13:49:09,623 DEBUG [org.alfresco.repo.content.filestore.FileContentStore] Created content reader:
   url: store://2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin
   file: /data/alfresco/contentstore/2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin
   reader: ContentAccessor[ contentUrl=store://2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin, mimetype=null, size=1761185, encoding=UTF-8, locale=en_US]
13:49:10,271 DEBUG [org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl] Update node workspace://SpacesStore/58fc0eba-e4d9-47ca-8b04-3e5c53788ef8
13:49:10,271 DEBUG [org.alfresco.repo.content.ContentServiceImpl] Stream listener updated node:
   node: workspace://SpacesStore/58fc0eba-e4d9-47ca-8b04-3e5c53788ef8
   property: {http://www.alfresco.org/model/content/1.0}content
   value: contentUrl=store://2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin|mimetype=application/vnd.openxmlformats-officedocument.wordprocessingml.document|size=1761185|encoding=UTF-8|locale=en_US_
13:49:10,271 DEBUG [org.alfresco.repo.content.AbstractContentAccessor] 1 content listeners called: close
13:49:10,276 DEBUG [org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl] Update node workspace://SpacesStore/58fc0eba-e4d9-47ca-8b04-3e5c53788ef8
13:49:10,279 DEBUG [org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl] Update node workspace://SpacesStore/58fc0eba-e4d9-47ca-8b04-3e5c53788ef8
13:49:11,780 DEBUG [org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl] Creating indexer
13:49:11,964 DEBUG [org.alfresco.repo.content.ContentServiceImpl] Content property updated:
   Node Name:   qqDTI_Programming_Manual_V1.2.7-DRAFT.docx
   Property:    {http://www.alfresco.org/model/content/1.0}content
   Is new:      true
   Before:      null
   After:       contentUrl=store://2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin|mimetype=application/vnd.openxmlformats-officedocument.wordprocessingml.document|size=1761185|encoding=UTF-8|locale=en_US_
13:49:11,966 DEBUG [org.alfresco.repo.content.filestore.FileContentStore] Created content reader:
   url: store://2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin
   file: /data/alfresco/contentstore/2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin
   reader: ContentAccessor[ contentUrl=store://2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin, mimetype=null, size=1761185, encoding=UTF-8, locale=en_US]
13:49:11,967 DEBUG [org.alfresco.repo.content.filestore.FileContentStore] Created content reader:
   url: store://2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin
   file: /data/alfresco/contentstore/2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin
   reader: ContentAccessor[ contentUrl=store://2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin, mimetype=null, size=1761185, encoding=UTF-8, locale=en_US]
13:49:12,351 DEBUG [org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl] Update node workspace://SpacesStore/58fc0eba-e4d9-47ca-8b04-3e5c53788ef8
13:49:12,360 DEBUG [org.alfresco.repo.content.filestore.FileContentStore] Created content reader:
   url: store://2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin
   file: /data/alfresco/contentstore/2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin
   reader: ContentAccessor[ contentUrl=store://2011/12/1/13/49/9c299763-c2f4-499c-b558-36f2ec341ed6.bin, mimetype=null, size=1761185, encoding=UTF-8, locale=en_US]
13:49:12,360 WARN  [org.alfresco.repo.content.transform.OpenOfficeContentTransformerWorker] [1:141] OpenOffice connection is down - will try for 100 times with 5 seconds delay between for OpenOffice to come back
13:50:00,016 DEBUG [net.sf.jooreports.openoffice.connection.SocketOpenOfficeConnection] connecting
13:50:00,028 WARN  [org.alfresco.util.OpenOfficeConnectionTester] [1:27] OpenOffice connection is down - will try for 100 times with 15 seconds delay between for OpenOffice to come back
13:50:15,043 DEBUG [net.sf.jooreports.openoffice.connection.SocketOpenOfficeConnection] connecting

The indexer does not start again, it always says: connecting.
If I then try to manually start the indexer again I get the following example error message in the logfile:


11:04:16,215 WARN  [org.alfresco.util.OpenOfficeConnectionTester] [1:27] OpenOffice now back online
11:04:16,411 WARN  [org.alfresco.repo.content.transform.OpenOfficeContentTransformerWorker] [1:1927] OpenOffice now back online
11:04:17,125 ERROR [net.sf.jooreports.openoffice.connection.SocketOpenOfficeConnection] disconnected unexpectedly
11:04:17,227 INFO  [org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl] Not indexed: Transformation failed at /{http://www.alfresco.org/model/application/1.0}company_home/{http://www.alfresco.org/model/application/1.0}user_homes/{http://www.alfresco.org/model/content/1.0}sokomyk/{http://www.alfresco.org/model/content/1.0}PrismaInstallationDraft-copy.docx
org.alfresco.service.cmr.repository.ContentIOException: 10181588 Content conversion failed:
   reader: ContentAccessor[ contentUrl=store://2011/11/18/10/44/6604e620-faaf-4b6a-bc2e-33e4705e0186.bin, mimetype=application/vnd.openxmlformats-officedocument.wordprocessingml.document, size=59588, encoding=UTF-8, locale=en_US]
   writer: ContentAccessor[ contentUrl=store://2011/11/18/11/4/1f5df9b1-c83d-4671-8188-b4b23453f108.bin, mimetype=text/plain, size=0, encoding=UTF-8, locale=en_US]
   options: org.alfresco.service.cmr.repository.TransformationOptions@5b25f2c9
        at org.alfresco.repo.content.transform.AbstractContentTransformer2.transform(AbstractContentTransformer2.java:197)
        at org.alfresco.repo.content.transform.AbstractContentTransformer2.transform(AbstractContentTransformer2.java:143)
        at org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl.indexProperty(ADMLuceneIndexerImpl.java:948)
        at org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl.createDocumentsImpl(ADMLuceneIndexerImpl.java:629)
        at org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl.createDocuments(ADMLuceneIndexerImpl.java:590)
        at org.alfresco.repo.search.impl.lucene.AbstractLuceneIndexerImpl.indexImpl(AbstractLuceneIndexerImpl.java:632)
        at org.alfresco.repo.search.impl.lucene.AbstractLuceneIndexerImpl.indexImpl(AbstractLuceneIndexerImpl.java:657)
        at org.alfresco.repo.search.impl.lucene.AbstractLuceneIndexerImpl.flushPending(AbstractLuceneIndexerImpl.java:799)
        at org.alfresco.repo.search.impl.lucene.ADMLuceneIndexerImpl.doPrepare(ADMLuceneIndexerImpl.java:1658)
        at org.alfresco.repo.search.impl.lucene.AbstractLuceneIndexerImpl.prepare(AbstractLuceneIndexerImpl.java:472)
        at org.alfresco.repo.search.impl.lucene.AbstractLuceneIndexerAndSearcherFactory.prepare(AbstractLuceneIndexerAndSearcherFactory.java:802)
        at org.alfresco.repo.transaction.AlfrescoTransactionSupport$TransactionSynchronizationImpl.beforeCommit(AlfrescoTransactionSupport.java:695)
        at org.springframework.transaction.support.TransactionSynchronizationUtils.triggerBeforeCommit(TransactionSynchronizationUtils.java:48)
        at org.springframework.transaction.support.AbstractPlatformTransactionManager.triggerBeforeCommit(AbstractPlatformTransactionManager.java:835)
        at org.springframework.transaction.support.AbstractPlatformTransactionManager.processCommit(AbstractPlatformTransactionManager.java:645)
        at org.springframework.transaction.support.AbstractPlatformTransactionManager.commit(AbstractPlatformTransactionManager.java:632)
        at org.springframework.transaction.interceptor.TransactionAspectSupport.commitTransactionAfterReturning(TransactionAspectSupport.java:314)
        at org.alfresco.util.transaction.SpringAwareUserTransaction.commit(SpringAwareUserTransaction.java:467)
        at org.alfresco.repo.transaction.RetryingTransactionHelper.doInTransaction(RetryingTransactionHelper.java:349)
        at org.alfresco.web.bean.dialog.BaseDialogBean.finish(BaseDialogBean.java:130)
        at org.alfresco.web.bean.dialog.DialogManager.finish(DialogManager.java:534)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.apache.myfaces.el.MethodBindingImpl.invoke(MethodBindingImpl.java:132)
        at org.apache.myfaces.application.ActionListenerImpl.processAction(ActionListenerImpl.java:61)
        at javax.faces.component.UICommand.broadcast(UICommand.java:109)
        at javax.faces.component.UIViewRoot._broadcastForPhase(UIViewRoot.java:97)
        at javax.faces.component.UIViewRoot.processApplication(UIViewRoot.java:171)
        at org.apache.myfaces.lifecycle.InvokeApplicationExecutor.execute(InvokeApplicationExecutor.java:32)
        at org.apache.myfaces.lifecycle.LifecycleImpl.executePhase(LifecycleImpl.java:95)
        at org.apache.myfaces.lifecycle.LifecycleImpl.execute(LifecycleImpl.java:70)
        at javax.faces.webapp.FacesServlet.service(FacesServlet.java:139)
        at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:290)
        at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
        at org.alfresco.web.app.servlet.AuthenticationFilter.doFilter(AuthenticationFilter.java:110)
        at sun.reflect.GeneratedMethodAccessor396.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.alfresco.repo.management.subsystems.ChainingSubsystemProxyFactory$1.invoke(ChainingSubsystemProxyFactory.java:122)
        at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:171)
        at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:204)
        at $Proxy195.doFilter(Unknown Source)
        at org.alfresco.repo.web.filter.beans.BeanProxyFilter.doFilter(BeanProxyFilter.java:88)
        at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:235)
        at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
        at org.alfresco.repo.web.filter.beans.NullFilter.doFilter(NullFilter.java:74)
        at sun.reflect.GeneratedMethodAccessor396.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.alfresco.repo.management.subsystems.ChainingSubsystemProxyFactory$1.invoke(ChainingSubsystemProxyFactory.java:122)
        at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:171)
        at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:204)
        at $Proxy195.doFilter(Unknown Source)
        at org.alfresco.repo.web.filter.beans.BeanProxyFilter.doFilter(BeanProxyFilter.java:88)
        at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:235)
        at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
        at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:233)
        at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:191)
        at org.apache.catalina.authenticator.AuthenticatorBase.invoke(AuthenticatorBase.java:525)
        at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:128)
        at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102)
        at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109)
        at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:286)
        at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:845)
        at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:583)
        at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:447)
        at java.lang.Thread.run(Thread.java:619)
Caused by: org.alfresco.service.cmr.repository.ContentIOException: 10181587 Content conversion failed:
   reader: ContentAccessor[ contentUrl=store://2011/11/18/10/44/6604e620-faaf-4b6a-bc2e-33e4705e0186.bin, mimetype=application/vnd.openxmlformats-officedocument.wordprocessingml.document, size=59588, encoding=UTF-8, locale=en_US]
   writer: ContentAccessor[ contentUrl=store:///opt/alfresco/tomcat/temp/Alfresco/ComplextTransformer_intermediate_docx_7441327923234372065.pdf, mimetype=application/pdf, size=0, encoding=UTF-8, locale=en_US]
   options: org.alfresco.service.cmr.repository.TransformationOptions@5b25f2c9
        at org.alfresco.repo.content.transform.AbstractContentTransformer2.transform(AbstractContentTransformer2.java:197)
        at org.alfresco.repo.content.transform.ComplexContentTransformer.transformInternal(ComplexContentTransformer.java:154)
        at org.alfresco.repo.content.transform.AbstractContentTransformer2.transform(AbstractContentTransformer2.java:185)
        … 68 more
Caused by: org.alfresco.service.cmr.repository.ContentIOException: 10181586 OpenOffice server conversion failed:
   reader: ContentAccessor[ contentUrl=store://2011/11/18/10/44/6604e620-faaf-4b6a-bc2e-33e4705e0186.bin, mimetype=application/vnd.openxmlformats-officedocument.wordprocessingml.document, size=59588, encoding=UTF-8, locale=en_US]
   writer: ContentAccessor[ contentUrl=store:///opt/alfresco/tomcat/temp/Alfresco/ComplextTransformer_intermediate_docx_7441327923234372065.pdf, mimetype=application/pdf, size=0, encoding=UTF-8, locale=en_US]
   from file: /opt/alfresco/tomcat/temp/Alfresco/OpenOfficeContentTransformer-source-8149674538285292681.docx
   to file: /opt/alfresco/tomcat/temp/Alfresco/OpenOfficeContentTransformer-target-7870562607794714779.pdf
        at org.alfresco.repo.content.transform.OpenOfficeContentTransformerWorker.transform(OpenOfficeContentTransformerWorker.java:343)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.alfresco.repo.management.subsystems.SubsystemProxyFactory$1.invoke(SubsystemProxyFactory.java:77)
        at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:171)
        at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:204)
        at $Proxy34.transform(Unknown Source)
        at org.alfresco.repo.content.transform.ProxyContentTransformer.transformInternal(ProxyContentTransformer.java:66)
        at org.alfresco.repo.content.transform.AbstractContentTransformer2.transform(AbstractContentTransformer2.java:185)
        … 70 more
Caused by: net.sf.jooreports.openoffice.connection.OpenOfficeException: conversion failed; com.sun.star.lang.DisposedException: java.io.IOException: com.sun.star.io.IOException: EOF reached - socket,host=localhost,port=8100,tcpNoDelay=1,localHost=Alfresco03,localPort=49153,peerHost=localhost,peerPort=8100
        at net.sf.jooreports.openoffice.converter.OpenOfficeDocumentConverter.convertInternal(OpenOfficeDocumentConverter.java:114)
        at net.sf.jooreports.openoffice.converter.AbstractOpenOfficeDocumentConverter.convert(AbstractOpenOfficeDocumentConverter.java:75)
        at org.alfresco.repo.content.transform.OpenOfficeContentTransformerWorker.transform(OpenOfficeContentTransformerWorker.java:338)
        … 80 more

Afterwards Alfresco tries to connect to OpenOffice again but it doesn't restart. Again it always says connecting.
So we have no chance to restart the Indexer when we get such a document into our system.

This error does not happen for each docx document. But we get more and more documents that cause this error.
As a workaround we now have deactivated indexing for all docx documents to keep our system running but this is no real solution for us because we need the indexer to work also for docx.

We are running Alfresco Community Edition Version 3.2.0, DB postgresql and OpenOffice 3.2  on Centos in a VM.

Any help would be appreciated!

Thanks, Wiebke

Outcomes