The AutoOCR Server is integrated via REST as a dynamically configurable Alfresco document transformer. AutoOCR creates searchable PDFs or other document formats such as TXT, DOC(X), XLS(X), PPT(X), XML, RTF and HTML from image or PDF files. The OCR functions can be used via Java, JavaScript or as a document transformer. Configuration is done from the Share UI, which also gains a new document action "Transform" that gives access to all Alfresco transformers.
Highlights / features:
- Direct AutoOCR integration as Alfresco transformer with REST web service interface.
- Separate AutoOCR service / server, so OCR processing does not strain the Alfresco server.
- Based on ABBYY – the leading OCR engine.
- Easy configuration by selecting OCR profiles: each profile combines the available ABBYY OCR engine settings.
- In addition to PDF, other output formats can be generated (TXT, RTF, DOC, etc.).
- Dynamic transformer configuration at runtime using the Alfresco Share Admin interface.
- JavaScript client for the AutoOCR service, available in Alfresco repository scripts (WebScripts, actions, etc.)
- Java client for the AutoOCR service, for use in Java code.
- The Java client itself has no dependencies on Alfresco.
- New Share document action “Transform” enhances Share not only with OCR but with all supported transformers.
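The REST integration and the Alfresco-independent Java client listed above could look roughly like the following from calling code. This is a minimal sketch only: the base URL, the `/convert` path, the `profile` and `format` query parameters and the class name `OcrRequestSketch` are all assumptions made for illustration and are not taken from the AutoOCR documentation. It uses only the JDK's `java.net.http` API (Java 11 or later) and builds the request without sending it.

```java
import java.net.URI;
import java.net.URLEncoder;
import java.net.http.HttpRequest;
import java.nio.charset.StandardCharsets;

public class OcrRequestSketch {
    // Hypothetical service address; the real AutoOCR endpoint may differ.
    static final String BASE_URL = "http://localhost:8080/autoocr";

    // Builds the target URI. The "profile" and "format" parameter names
    // are assumptions, standing in for the OCR profile and output format.
    static URI buildUri(String profile, String targetFormat) {
        String query = "profile=" + URLEncoder.encode(profile, StandardCharsets.UTF_8)
                + "&format=" + URLEncoder.encode(targetFormat, StandardCharsets.UTF_8);
        return URI.create(BASE_URL + "/convert?" + query);
    }

    // Wraps the raw document bytes (image or PDF) in a POST request.
    static HttpRequest buildRequest(byte[] document, String profile, String targetFormat) {
        return HttpRequest.newBuilder(buildUri(profile, targetFormat))
                .header("Content-Type", "application/octet-stream")
                .POST(HttpRequest.BodyPublishers.ofByteArray(document))
                .build();
    }

    public static void main(String[] args) {
        // Placeholder bytes stand in for a scanned document.
        HttpRequest request = buildRequest(new byte[] {1, 2, 3}, "Default Profile", "pdf");
        System.out.println(request.method() + " " + request.uri());
    }
}
```

To actually run a conversion, the request would be passed to `HttpClient.newHttpClient().send(request, HttpResponse.BodyHandlers.ofByteArray())` against the real service endpoint, and the returned bytes stored as the transformed document.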
Available in: German, English