I have a small issue that I don't know how to tackle properly, and it concerns performance.
I need to process ~200k documents that are missing an aspect. First, I identified these documents with a MySQL query to find out how many there are. These nodes have aspect X but are missing aspect Y, so I need to add aspect Y to all of them.
I'm exploring the BatchProcessor approach right now, and I am basing my code on the class org.alfresco.repo.node.db.NodeStringLengthWorker. Is this the correct way to do it?
A few questions came up while writing my code:
- What is the best way to get the nodes in the WorkProvider: searchService (Solr/Lucene), nodeService, or nodeDAO.getNodesWithAspects (in which case I would have to check each node for the missing aspect)?
- How do I stagger this search? I see that NodeStringLengthWorker uses minNodeId and maxNodeId; is that a way to reduce the search load?
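To make the second question concrete, here is roughly what I have in mind, as a minimal sketch in plain Java with no Alfresco dependencies. The class IdWindowProvider, the helper idsInRange and the fetcher function are hypothetical names of mine; the fetcher stands in for whatever query (DAO or SQL) would return the IDs of nodes that have aspect X but not aspect Y within an ID range. getNextWork is named after what I understand the BatchProcessor work provider to expose, but the signature here is my own assumption.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.function.BiFunction;

/** Walks the node ID space in fixed-size windows so that each query only touches a bounded ID range. */
class IdWindowProvider {
    private final long maxId;       // exclusive upper bound of the ID space to scan
    private final long windowSize;  // number of IDs covered by one query
    // Hypothetical stand-in for the real lookup: (fromInclusive, toExclusive) -> matching node IDs
    private final BiFunction<Long, Long, List<Long>> fetcher;
    private long nextMinId;         // start of the next window

    IdWindowProvider(long minId, long maxId, long windowSize,
                     BiFunction<Long, Long, List<Long>> fetcher) {
        if (windowSize <= 0) {
            throw new IllegalArgumentException("windowSize must be positive");
        }
        this.nextMinId = minId;
        this.maxId = maxId;
        this.windowSize = windowSize;
        this.fetcher = fetcher;
    }

    /** Returns the next non-empty batch, or an empty list once the whole ID space has been scanned. */
    List<Long> getNextWork() {
        while (nextMinId < maxId) {
            long from = nextMinId;
            // Clamp the window to the remaining range (also avoids overflow near Long.MAX_VALUE).
            long to = from + Math.min(windowSize, maxId - from);
            nextMinId = to;
            List<Long> batch = fetcher.apply(from, to);
            if (!batch.isEmpty()) {
                return batch; // windows without matches are skipped silently
            }
        }
        return Collections.emptyList();
    }

    /** Demo helper: filters an in-memory list the way a ranged query would. */
    static List<Long> idsInRange(List<Long> all, long from, long to) {
        List<Long> out = new ArrayList<>();
        for (long id : all) {
            if (id >= from && id < to) {
                out.add(id);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Made-up IDs of nodes missing aspect Y, for illustration only.
        List<Long> missing = List.of(3L, 7L, 1500L, 2500L);
        IdWindowProvider provider = new IdWindowProvider(
                0L, 3000L, 1000L, (from, to) -> idsInRange(missing, from, to));
        List<Long> batch;
        while (!(batch = provider.getNextWork()).isEmpty()) {
            System.out.println(batch); // one line per window that contained matches
        }
    }
}
```

The idea is that each call issues one bounded query instead of one large search over all nodes, and the window size becomes the knob for trading total runtime against load per query.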
I saw there was a thread about this subject at https://community.alfresco.com/thread/214163-deleting-over-1000-nodes#comment-716898, but the link no longer works.
Our production environment is very fragile, so this process should produce as little load as possible.
Thanks in advance. If you need any more information, I will gladly provide it.