I all! I'm evaluating Alfresco Community Edition to use in our organization, currently I have installed the 5.2 version on a Ubuntu Server 16.04 working with alfresco-simple-ocr using pdfsandwich 0.1.4.esearch I
I like to know if there is some way to get the content from a image or pdf file to use it as content for a new document. After some research I found a few reference to Content Transformers (and Renditions) but before continue I like to know if no one already do that and if not its it the correct path to follow? I'm new here so any clue is appreciated.
Cheers.
nueces...
There can be two reasons to integrate OCR with alfresco, either to make the image with text searchable or to capture specific information from the document in order to do the further operations based on that.
If you just simply want to make images containing text searchable then follow Configuring OCR in Alfresco | ContCentric
Please mention, for what reason you need OCR with Alfresco?
Thanks
The second one. Capture the content from the image/pdf and generate a new text based document.
thanks.
Ask for and offer help to other Alfresco Content Services Users and members of the Alfresco team.
Related links:
By using this site, you are agreeing to allow us to collect and use cookies as outlined in Alfresco’s Cookie Statement and Terms of Use (and you have a legitimate interest in Alfresco and our products, authorizing us to contact you in such methods). If you are not ok with these terms, please do not use this website.