AnsweredAssumed Answered

How to search words inside documents: alternatives?

Question asked by euleon on Feb 11, 2009
Latest reply on Apr 8, 2009 by mrogers
Hello everybody!

I've developed a layer for my company using RMI that connects to ALFRESCO and recover nodes and hence files from ALFRESCO.
Now, I'd like to search for content in the files (for example in a file.pdf search a word like "me").

I've been reading and reading for hours in the forum but I'm not satisfied with the information found, so I have to do several questions:

First of all, I'd like to present you the options I think I have (if anyone has more ideas or can correct some mistakes in my reasoning chain, is begged to do that).

1) I want to use the RMI connection in order to continue the with the proyect and the JCR layer, so Is it possible to search for content in files using the API of JCR??? I've found No examples of that.

2) Perhaps what I have to do is to use the ALFRESCO API? If so, Can I use it with my RMI connection? I've seen the example in the SDK called FirstFoundationClient, and I see that connects to an embebbed repository using RMI with this parameters:


If I put there my own RMI configuration would I be able to work with RMI and the ALFRESCO API? The fact is that I think that Alfresco supports native calls to LUCENE, but in my opinion, I don't need to create an index for LUCENE because Alfresco does it by itself(I don't need to reinvet the wheel)… so maybe what I have to do is to connect to Alfresco LUCENE INDEX  and do there the queries?? the API for that??

3)Use LUCENE. But I think this is (as we say in Spain) kill flies with cannons because I'd have to build the index. in order to do that, I'd have to visit all DOCUMENTS in Alfresco Repository, create the Lucene Documents, create an Index and do the queries there…. This does not seem to be efficient….

Are these all possible solutions??

Any help will be appreciated!