AnsweredAssumed Answered

contentstore:  unnecessary file duplication

Question asked by qasimh on Jun 12, 2007
Latest reply on Dec 7, 2007 by ero
Hi,

I'm using Alfresco 2.0 CE, on Windows Server 2003, Tomcat, and MS SQL Server.

As I was debugging something, I noticed that the files in the contentstore folder do not get cleaned up after a modification.

Using the NodeBrowser to find the contentURL and examining alf_data/contentstore, I see that files get duplicated in the backend after every modification.

For example, FileA.htm without versioning:

FileA.htm's content URL: 
contentUrl=store://2007/5/4/13/22/7b78124f-fa6c-11db-9240-eba5cf10755f.bin|mimetype=text/html|size=14261|encoding=UTF-8|locale=en_US_

FileA.htm's content URL, after a modification
contentUrl=store://2007/6/12/11/13/e3154088-18ff-11dc-8092-3142445812fe.bin|mimetype=text/html|size=14267|encoding=UTF-8|locale=en_US_

FileA.htm's content URL, after another modification:
contentUrl=store://2007/6/12/11/17/5f24bdd7-1900-11dc-8092-3142445812fe.bin|mimetype=text/html|size=14276|encoding=UTF-8|locale=en_US_

Each binary file above in the contentstore remains there after the modification.  This seems like a serious disk space consumer for no apparent reason.

Is this a bug, or is there more going on in the back that I'm not aware of?

Outcomes