|
Abstract:
|
At the early stages of a data warehouse design project, the main objective is to collect the business requirements and needs, and translate them into an appropriate conceptual, multidimensional design. Typically, this task is performed manually, through a series of interviews involving two different parties: the business analysts and technical designers. Producing an appropriate conceptual design is an errorprone task that undergoes several rounds of reconciliation and redesigning, until the business needs are satisfied. It isof great importance for the business of an enterprise to facilitate and automate such a process. The goal of our research is to provide designers with a semi-automatic means for producing conceptual multidimensional designs and also, conceptualrepresentation of the extract-transform-load (ETL)processes that orchestrate the data flow from the operational sources to the data warehouse constructs. In particular, wedescribe a method that combines information about the data sources along with the business requirements, for validatingand completing –if necessary– these requirements, producing a multidimensional design, and identifying the ETL operationsneeded. We present our method in terms of theTPC-DS benchmark and show its applicability and usefulness. |