Exploring open data quality

To contribute to discussions around open data quality, the ODI has been working with Experian to investigate quality in several UK Government open datasets. Here, ODI Associate Leigh Dodds introduces the project and its initial findings.

Open data communities are focusing on improving the quality of their data and sharing guidance. CC BY 2.0, uploaded by Neil Rickards.

Several initiatives are currently exploring data quality, with particular reference to describing, measuring and improving the quality of open data.

For example, the W3C Data on the Web Best Practices Working Group are producing a vocabulary for publishing and describing data quality metrics. There is also related work capturing best practices for sharing public sector data.

Various open data projects and communities are working to improve the quality of their open data and have started to share guidance. For example, data.gov.sg have recently shared their data quality guide for tabular data, and Mark Frank and Johanna Walker at Southampton University have recently published a paper exploring a user-centred view of data quality.

To contribute to this ongoing discussion, we recently undertook a small project with Experian to explore data quality in some open datasets.

The project had several goals:

For the initial exploratory project we used three datasets: the Land Registry Price Paid data, the Companies House register and the NHS Choices GP Practices and Surgeries data.

 


