light-bulb-549090_1280

Data Outliers: 10 Ways To Prevent Big Data Damage

Data Outliers: 10 Ways To Prevent Big Data Damage

Most business decision-makers aren't trained to understand data outliers, but they can learn the basics. Executives, managers, and employees without math degrees can ask smarter questions about analyses they're basing crucial judgments on. Here are some things to know.

Data analytics has its own vocabulary that business decision-makers are under pressure to learn. Beware, though, because technical terms are often used loosely, sometimes to the detriment of individuals and their companies. An outlier is a good example. A lot of people are talking about outliers, but not a lot of people understand why they exist, what causes them, and what should be done with them, if anything.

"An outlier is a member of a defined dataset which has a dramatically different value than the other members of the set. It can be the result of measurement or recording errors, or the unintended and truthful outcome resulting from the set's definition," said Tom Bodenberg, chief economist and data consultant at market research firm Unity Marketing in an interview.

Read Also:
4 Ways Data-First Competitors Are Killing You

Outliers make their way into reported statistics every day. Sometimes their inclusion or exclusion is obvious, and sometimes it isn't. For example, in 1984 the University of Virginia reported that the average starting salary of Rhetoric and Communications graduates was $55,000. However, an outlier was skewing the analysis. The dataset included one hundred graduates with $25,000 salaries and NBA first draft pick Ralph Sampson, another graduate. His starting salary exceeded $1 million.

Outliers can pop up for different reasons. Some are caused by mistakes made by humans or machines. Others represent actual data.

 



Data Innovation Summit 2017

30
Mar
2017
Data Innovation Summit 2017

30% off with code 7wData

Read Also:
Is your data warehouse still living in the 90s?

Big Data Innovation Summit London

30
Mar
2017
Big Data Innovation Summit London

$200 off with code DATA200

Read Also:
How big data can drive employee engagement

Enterprise Data World 2017

2
Apr
2017
Enterprise Data World 2017

$200 off with code 7WDATA

Read Also:
Don’t let big data make your head spin: Building a user-friendly marketing ecosystem
Read Also:
5 ethics principles big data analysts must follow

Data Visualisation Summit San Francisco

19
Apr
2017
Data Visualisation Summit San Francisco

$200 off with code DATA200

Read Also:
Data science is easy; making it work is hard

Chief Analytics Officer Europe

25
Apr
2017
Chief Analytics Officer Europe

15% off with code 7WDCAO17

Read Also:
3 Best Practices for Data Lake Deployment

Leave a Reply

Your email address will not be published. Required fields are marked *