AnsweredAssumed Answered

Alfresco 4.2.2 + AAAR Extraction problem

Question asked by pawelb on Sep 2, 2014
Latest reply on Nov 26, 2014 by fcorti
Hello everyone!

I have tried to parse Alfresco audit logs using AAAR. When I use default, freshly generated AAAR config I get authorization error for /alfresco/cmisatom service (which is good password). Full log attached in extract1.log file.

After changing cmisatom url:


USE AAAR_DataMart;
UPDATE dm_dim_alfresco SET url_cmis_suffix='/alfresco/api/-default-/public/cmis/versions/1.1/atom' WHERE id=1;


Extraction script continues to process logs, but shows some errors at the beginning: Attached in extract2.log file.

Example:


2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 - ERROR (version 5.1.0.0, build 1 from 2014-06-19_19-02-57 by buildguy) : Because of an error, this step can't continue:
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 - ERROR (version 5.1.0.0, build 1 from 2014-06-19_19-02-57 by buildguy) : org.pentaho.di.core.exception.KettleException:
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 - Error batch inserting rows into table [stg_cmis_folders_partial].
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 - Errors encountered (first 10):
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 -
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 -
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 - Error updating batch
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 - Duplicate entry '1-cd8a02c3-7770-4fb3-bf15-53981e5ce4e2' for key 'PRIMARY'
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 -
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 -
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 -    at org.pentaho.di.trans.steps.tableoutput.TableOutput.writeToTable(TableOutput.java:342)
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 -    at org.pentaho.di.trans.steps.tableoutput.TableOutput.processRow(TableOutput.java:118)
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 -    at org.pentaho.di.trans.step.RunThread.run(RunThread.java:62)
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 -    at java.lang.Thread.run(Unknown Source)
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 - Caused by: org.pentaho.di.core.exception.KettleDatabaseBatchException:
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 - Error updating batch
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 - Duplicate entry '1-cd8a02c3-7770-4fb3-bf15-53981e5ce4e2' for key 'PRIMARY'
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 -
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 -    at org.pentaho.di.core.database.Database.createKettleDatabaseBatchException(Database.java:1365)
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 -    at org.pentaho.di.trans.steps.tableoutput.TableOutput.writeToTable(TableOutput.java:289)
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 -    … 3 more
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 - Caused by: java.sql.BatchUpdateException: Duplicate entry '1-cd8a02c3-7770-4fb3-bf15-53981e5ce4e2' for key 'PRIMARY'
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 -    at com.mysql.jdbc.PreparedStatement.executeBatchSerially(PreparedStatement.java:1981)
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 -    at com.mysql.jdbc.PreparedStatement.executeBatch(PreparedStatement.java:1388)
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 -    at org.pentaho.di.trans.steps.tableoutput.TableOutput.writeToTable(TableOutput.java:285)
2014/09/02 09:48:24 - stg_cmis_folders_partial 2.0 -    … 3 more


When process finish, most of graps in Analyse are empty (Repository size, Documents per type etc.). Can this be related to errors in extract2.log file?

Why do I need to change cmis url? Is that because Alfresco 4.2.2 is too recent for AAAR 2.1? Is it not supported?

Please help me. I would be grateful for any answers, suggestions :)

Regards,
Paul

Attachments

Outcomes