Incremental cube update - Pros & Cons
Environment :
IBM Cognos 8.3
Data Flow for cube update process:
DWH(Data Warehouse) ->Data Mart->Package->Transformer(Create Cube and Package)->Create report
As shown above, we are building a cube using package that pulls data from the data mart.
The initial population of fact table will be approximately 230,000 rows.
I have built a cube. The model (.mdl) size is around 14 MB and cube (.mdc) size is around 21 MB.
Data Mart fact table will be populated on a daily basis. This requires us to refresh the cube on a daily basis too.
Consider this scenario :
Day 0 : Data Mart fact table has 230,000 rows. Cube is created by processing all these records. Total cube update time is 2 hours.
Day 1 : 15 additional records got inserted in data mart fact table. so the current count is 230,015. Refresh the cube. It processes all the records even though only 15 new records have been added.
and so on...
By using non-incremental process, existing records are unncessarily re-processed and hence increases the cube build time.
The only way to bring over the new data is to incrementally update/refresh the cube.
I have following questions ;
1. As per cognos transformer documentation, for incremental update to work, data source should have only the new data. This requires lot of work on ETL work. Is there any way we can handle this at Cognos model level (Framework Manager model or Transformer model).
2.What are the main disadvantages or difficulties of having incremental cube update.
3. Can any one share the best practices or their strategy to do incremental cube update?
4. Will I be able to batch file to update this cube using incremental approach ?
Thank you in advance!!!