Data lakes, don’t confuse them with data warehouses, warns Gartner

In mid-2014, a pair of Gartner analysts levelled some trenchant criticisms at the increasingly hyped concept of data lakes.

"The fundamental issue with the data lake is that it makes certain assumptions about the users of information," said Gartner research director Nick Heudecker.

"It assumes that users recognize or understand the contextual bias of how data is captured, that they know how to merge and reconcile different data sources without 'a priori knowledge' and that they understand the incomplete nature of datasets, regardless of structure."

A year and a half later, Gartner's concerns do not appear to have eased. While there are successful projects, there are also failures, and the key success factor appears to be a strong understanding of the different roles of a data lake and a data warehouse.

Heudecker said a data lake, often marketed as a means of tackling big data challenges, is a great place to figure out new questions to ask of your data, "provided you have the skills".

"If that's what you want to do, I'm less concerned about a data lake implementation. However, a higher risk scenario is if your intent is to reimplement your data warehousing service level agreements (SLAs) on the data lake."

Heudecker said a data lake is typically optimised for different use cases, levels of concurrency and multi-tenancy.

"In other words, don't use a data lake for data warehousing in anger."

It's perfectly reasonable to need both, he said, because each is optimised for different SLAs, users and skills.

 


