Data governance process gets fine-tuned

Data governance process gets fine-tuned, as big data makes its mark

Data governance process gets fine-tuned, as big data makes its mark

As data-driven business models, digital transformations, big data analytics and the like continue to rise, they challenge the conventional data governance process.

They also provide opportunity to place data governance at the center of important business changes, according to participants in last week's Enterprise Data Governance Online 2017 webinar.

Among the most challenging new developments is the data lake, which, in its most basic form, eschews upfront curation and categorization of data. Curation, which includes cleansing data and assuring its consistency, is among the hallmarks of the data governance process.

Effective data governance can be applied to a Hadoop data lake, according to Shannon Fuller, director of data governance at Carolinas HealthCare System, based in Charlotte, N.C. The data-lake path was chosen for an innovative big data project, he said, because it could encourage more rapid application development and create a common repository, while protecting patients' information and protecting intellectual property.

"We decided this would not be another data warehouse," Fuller said. "It would be stand-alone assets available to the whole organization."

Read Also:
What does artificial intelligence mean for the creative mind?

One road to reports, another to sandbox Fuller said his organization is using a twofold path that prepares sets of curated data carefully for both business users and data scientists. Driving the project is Carolinas HealthCare's push to look at a patient's overall treatment plan, taking disparate data into account and making decisions on compensation models. Fuller described his operation as an IBM InfoSphere shop, but said the pilot data lake was accomplished using Microsoft's HDInsight and Azure Data Lake Store. Tresata software was used to catalog some of the source data, according to Fuller. Once treated, data is then pushed back into the Azure Data Lake Store to be further analyzed, or to feed reports and executive dashboards.



Big Data Innovation Summit London

30
Mar
2017
Big Data Innovation Summit London

$200 off with code DATA200

Read Also:
Why Big Data is in Trouble

Data Innovation Summit 2017

30
Mar
2017
Data Innovation Summit 2017

30% off with code 7wData

Read Also:
How to Intelligently Apply Data Integration and Visual Analytics Tools
Read Also:
The misanthrope’s vain struggle with big data

Enterprise Data World 2017

2
Apr
2017
Enterprise Data World 2017

$200 off with code 7WDATA

Read Also:
The misanthrope’s vain struggle with big data

Data Visualisation Summit San Francisco

19
Apr
2017
Data Visualisation Summit San Francisco

$200 off with code DATA200

Read Also:
Machine Learning: The New ‘Gold Rush’

Chief Analytics Officer Europe

25
Apr
2017
Chief Analytics Officer Europe

15% off with code 7WDCAO17

Read Also:
How big data will transform the ways people travel

Leave a Reply

Your email address will not be published. Required fields are marked *