From Science to Data Science, a Comprehensive Guide for Transition

An in-depth roadmap for making the switch from 'science' to 'data science', also useful for data science beginners or anyone looking to get into the field.

After posting "What I do or: science to data science", I got a lot of emails asking how to make this transition.

In this post I try to summarize my advice. I don’t intend to write a complete walkthrough, but to provide a starting point with links to further materials. It is aimed at people with an academic, quantitative background (e.g. physics, mathematics, statistics), whether they are undergraduate students, PhD students, or researchers after a few postdocs. Some points may be valid for other backgrounds (but then, use them at your own risk).

Here and everywhere else: please don’t take the approach of "learn from books, then play". Start with playing!

All projects required me to learn something new, be it a library, a machine learning model or a software tool. The work itself is analyzing real, and often dirty, data using a mixture of programming and statistics. Or, as Josh Wills put it:

From my perspective the whole process looks like this:

And everything needs to be done in a reproducible way, so that others can interact with your code, or even run it on a server. Depending on the job, there may be more emphasis on one part or another. Or look at this tweet, which, while humorous, shows a balanced list of typical skills and activities of a data scientist:

If you want to learn more about what data science is, look at the following links:

When you have an academic title, no one will question your intelligence. But they are justified in questioning your practical skills. From my experience, you need to fulfill two requirements:

Most data science tasks are simple, and once you are able to use R or Python you can start working, gradually increasing your knowledge and experience. That is, after a few months you should be ready to start an entry-level job.

Initially, I was afraid that my lack of 10+ years of experience with C++ and Java would be a problem.

 


