Search Helium

Home > Computers & Technology > Computers & Technology (Other)

A guide to data warehousing

by Barry Marcus

Created on: October 11, 2007

In the good old days of "data processing", database experts and programmers focused their attention on storing and retrieving data in the best way to do real-time transactions. Being able to store and retrieve data to record a sale, to record a bank deposit and so on. Relational databases became the best thing since sliced-bread. These are built through a complex process of entity relationship modelling and are unintelligible to all but the technically inspired.

As relational databases became established and systems became more stable and reliable, management of organisations running these huge and complex systems began to realise that the databases contained more than data - they contained information. "Data Processing" became "Information Technology". The demand for information grew. IT departments became inundated with requests for management information reports of various types. They couldn't cope. Delivery of these reports was always a week too late. The extraction process was typically very complex and slow requiring data from a wide variety of tables. To make matters worse, the extraction of information for urgent reports often impacted negatively on the performance of the entire system.

A few bright sparks in the industry began to realise that accessing information for management reports required something other than a relational database. Something much simpler and more accessible was needed. Something that reflected the business rather than the technology. The operational database was not the ideal place from which to compile executive reports.

The data warehouse was born. It would contain the kind of information sought by management, organised in a way that was simple to understand and to easy to access.

One of the simplest and most amenable data warehouse model is the "Star Schema". "Facts" are kept in a Fact Table and "Dimensions" - the characteristics by which the facts are accessed - in a Dimension Table. For instance, Customer may be a fact file, while region, age, gender would be dimensions. The popular multi-dimensional cube works in a similar way.

Data for the warehouse is extracted from the operational database and undergoes a number of processes - extraction, cleaning, aggregating and loading. The extraction takes place at specific times - daily, weekly or monthly - and provides a "snapshot" of the organisation at a particular time and date.

Data for each Fact Table may be aggregated (or summarised)at a number of different levels. For example,

Helium Debate

Cast your vote!

Is Microsoft a dying computing brand?

Click for your side.

193696

Featured Partner

International Human Rights Group

IHRG Mission Statement: Standing for Religious Liberties for All We believe that religious liberties are the foundation of human rights for any civilized society. Governments, however, have not always respected this most foundation...more


CONNECT WITH US

Read
our blog
Helum for writers

Write and get published
Share with other writers
Polish your freelancing skills

Join our active writing community
Helium Content Source for Publishers

Quality articles from proven freelancers
Exclusive rights, fast turnaround
Brand engagement, business blogging -- our writers do it all

Get custom content today!

INFORMATION


Helium, Inc.
200 Brickstone Square Andover, MA 01810 USA
#