
Don’t use the cloud like a data warehouse


We’ve all heard the call for data consolidation. After all, enterprise data is strewn all over the place, and combining that sprawl would let us mine it more easily from a single location.

We’ve taken this path before: Early data warehouses migrated operational data to a single data store that was structured and combined to enable business intelligence. Huge batch jobs took place every night or every week, rolling up and aggregating the data for this "single source of truth" instance.

But let’s not live in 1996, when that was really the only technically viable option.

These days, we can access distributed data in place rather than relocate it to a common repository with a common, sometimes destructive, data structure. Today, if you have dozens of operational data stores, you can access them as if they were one consolidated database, even if they use inconsistent models (such as NoSQL versus SQL) or the data is unstructured.
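To make the idea of accessing separate stores as one concrete, here is a minimal sketch, not a production design. It assumes two stand-in stores that are purely illustrative: an in-memory SQLite table playing the SQL system, and a list of dictionaries playing a NoSQL document store. The names `make_sql_store`, `ORDER_DOCS`, and `customer_totals` are hypothetical. The point is only that the join happens at query time, so neither store is copied into a warehouse.

```python
import sqlite3
from typing import Dict, List

def make_sql_store() -> sqlite3.Connection:
    """Stand-in for an operational SQL database (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")
    conn.executemany(
        "INSERT INTO customers VALUES (?, ?)",
        [(1, "Acme"), (2, "Globex")],
    )
    return conn

# Stand-in for a document (NoSQL-style) store holding order records.
ORDER_DOCS: List[dict] = [
    {"customer_id": 1, "total": 120.0},
    {"customer_id": 1, "total": 80.0},
    {"customer_id": 2, "total": 45.5},
]

def customer_totals(conn: sqlite3.Connection, docs: List[dict]) -> Dict[str, float]:
    """Answer one question across both stores without relocating either."""
    # Aggregate order totals from the document store.
    totals: Dict[int, float] = {}
    for doc in docs:
        cid = doc["customer_id"]
        totals[cid] = totals.get(cid, 0.0) + doc["total"]
    # Join against the SQL store at query time.
    rows = conn.execute("SELECT id, name FROM customers ORDER BY id").fetchall()
    return {name: totals.get(cid, 0.0) for cid, name in rows}

if __name__ == "__main__":
    print(customer_totals(make_sql_store(), ORDER_DOCS))
```

In a real deployment this role would typically be filled by a data virtualization or federated query layer rather than hand-written glue, but the shape is the same: one access point, many untouched sources.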


In fact, that 1996 approach is a bad one today, causing problems we don’t need to have. The more you copy the data, the more likely the copies are to drift out of sync. Copying data also means using more storage, and thus spending more money, and you may need to buy more database licenses as well. And as you try to scale the consolidated database, you’ll find that its complexity spins out of control.

Distributed data, whether in the cloud or locally hosted, is a scary concept to many in enterprise IT. Working with distributed data does require a great deal of planning as well as a strong understanding of database access and abstraction approaches. Moreover, working with distributed data in the cloud makes things a bit more complex because of the different database technologies used in the cloud versus in traditional data centers.

However, that price is worth paying, especially as the new database technologies are becoming more pervasive and so should be learned anyway. Ultimately, the distributed approach is cheaper than going back to that 1996 approach.


