Blog 4: Statistics Denial, Best Statistical Practice [Blog Series]

For more than a century, applied statisticians/quants have refined their accumulated wisdom in extracting information from numbers with uncertainty and leveraging it to make smarter decisions.  We are calling this Best Statistical Practice.

Best Statistical Practice

Best Statistical Practice, as characterized by Deming et al., requires mastering business knowledge and solving the Data Analysis problem within the broader considerations of the Business Analytics problem: Timeliness, Client Expectation, Accuracy, Reliability, and Cost.

[pullquote cite="Louise Wehrle, Certification Manager, INFORMS" type="right"]I keep saying that analytics today is like the wild, wild west: anyone can say they are an analyst and there’s no reason to disbelieve them. When they say they’re an analytics professional with a CAP, then you know they’ve achieved the industry standard for practice. [/pullquote]

There are three natural pillars we can leverage to improve statistical practice within an organization: Statistical Qualifications, Statistical Diagnostics, and Statistical Review (see Chapters 7-9 of my book 'A Practitioner's Guide To Business Analytics').  Aggressively applying statistics means leveraging the best Qualifications, using effective Diagnostics to measure results, and Reviewing everything that could go wrong.

When we can express our business need as a mathematics problem, we can deduce a unique answer.  Statistics problems, however, have an additional solution layer that arises from uncertainty in the numbers.  Hence, we need statistical assumptions and a corresponding level of precaution. (We provide a more thorough problem-based clarification of statistics in the May/June 2015 issue of Analytics Magazine, http://goo.gl/Wod3gk.)
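To make the distinction concrete, here is a minimal sketch in Python.  It is only an illustration: the function names, the simulated data, and the choice of a normal-approximation interval are my assumptions for this example, not a prescribed method.  The first function is a mathematics problem with one unique answer; the second is a statistics problem, where the answer comes with an interval that depends on the assumptions we are willing to make.

```python
import math
import random
import statistics


def exact_revenue(units: int, price: float) -> float:
    """A mathematics problem: known inputs give one unique answer."""
    return units * price


def mean_with_interval(sample, z=1.96):
    """A statistics problem: estimate a mean and report its uncertainty.

    Assumption (hypothetical, for illustration): observations are roughly
    independent and the sample is large enough for a normal approximation.
    """
    n = len(sample)
    mean = statistics.fmean(sample)
    std_err = statistics.stdev(sample) / math.sqrt(n)
    return mean, (mean - z * std_err, mean + z * std_err)


# Simulated data standing in for noisy business measurements.
rng = random.Random(0)
sample = [rng.gauss(100.0, 15.0) for _ in range(50)]

revenue = exact_revenue(120, 9.5)
mean, (low, high) = mean_with_interval(sample)

print(f"Exact answer: {revenue}")
print(f"Estimate: {mean:.1f}, approximate 95% interval: ({low:.1f}, {high:.1f})")
```

The interval is only as trustworthy as the assumptions behind it, which is where qualifications, diagnostics, and review come in.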

Acquiring this accumulated wisdom comes from working with other quants.  By themselves, statistics books and the internet can only prepare the hobbyist or augment the learning process for the professional in the field.  Just as we would strongly prefer a highly specialized team of professionals to perform our heart transplant, we need the same level of professionalism to perform important Data Analysis.

Circumventing Statistical Qualifications

Here, I will discuss the harm that comes from circumventing just one pillar of best practice, Statistical Qualifications.  The absence of proper training and experience leads to sloppy data analysis, a much narrower breadth of practice, and what we will call 'data hogging.'

[pullquote cite="Hal Varian" type="right"]I keep saying that the sexy job in the next 10 years will be statisticians [/pullquote]

A number of forces are pushing statistics expertise out of data analysis.  First, promotional hype is advising employers to look for generalists or 'unicorns,' who are great at everything.  This is a fool's errand.  It tries to find someone expert in both statistics and IT.  This objective might make sense for very small companies, but if you need advanced data analysis, it does not make sense to replace The Beatles with four one-man bands.  Such an approach leads to schizophrenic job descriptions; to a sacrifice in statistical prowess; and, unfortunately, to a healthy environment for hucksterism.

[block_grid type="three-up"][block_grid_item]blog4_Beatles[/block_grid_item][block_grid_item]blog4_One_Man_Band[/block_grid_item][block_grid_item]blog4_One_Man_Band2[/block_grid_item][/block_grid]

Second, the straddling terms “machine learning,” “data mining,” and “data science” are being used to repackage statistics/data analysis with IT/data management, as if there were some technical synergy between the two and, next up, as if data analysis were merely a subfield of data management.  Such repackagings provide a way to repurpose qualifications in IT as qualifications in statistics.  It is a leap to regard all data scientists as competent in data analysis, too.  We need accreditation specific to data analysis, rather than accreditation encompassing several distinct skills.  Also, splitting these straddling terms would improve communication: for example, Statistical ML, Statistical DM, and Statistical DS.

Third, it is relatively easy to set up shop as a data scientist.  That is part of the term's popularity.  Again, we need accreditation specific to data analysis to help consumers of advanced data analysis discern legitimate qualifications.

Fourth, claims that software is ready to make everyone a statistician can, if taken at face value, push statistical expertise out of data analysis.  Statistical software is very good at automating tasks for statisticians, yet it is not ready to replace judgment or creative thinking.  For many technical advances, such as replacing human judgment, the achievements are proclaimed before they are achieved.

Fifth, if these factors discourage specialization in data analysis, then we might be headed for another round of 'data hogging' (not seen since the days before Y2K).  If just anyone can perform data analysis, then there is no need to share the data, or review the analysis, for that matter ... and we are heading for an Orwellian order form.  Just fill it out; one size fits all.

Close:

We want new ideas from the 'information rush.'  However, there is a great risk from promotional hype that is extreme enough to adulterate statistics and circumvent Best Statistical Practice, which is our accumulated experience for extracting signal from noise, i.e., information from data.  In particular, we need to raise our level of practice by embracing Statistical Qualifications, Statistical Diagnostics, and Statistical Review.

We sure could use Deming right now.  Many of us who consume or produce data analysis hang out in the new LinkedIn group: About Data Analysis.  Come see us.


Randy Bartlett

Statistician/Statistical Data Scientist at Blue Sigma Analytics

Randy Bartlett, Ph.D. CAP® PSTAT® is a statistician/statistical data scientist with 20+ years of practice experience analyzing and reviewing data analysis, and leading business analytics teams. He designed 'A Practitioner’s Guide to Business Analytics' to be the foremost reference on how corporations can better implement business analytics in this era of Big Data.
