Professional Documents
Culture Documents
Download 10+ Bonus Resources for Data Science & Machine Learning
CAREER (HTTPS://ELITEDATASCIENCE.COM/CATEGORY/CAREER)
RESOURCES (HTTPS://ELITEDATASCIENCE.COM/CATEGORY/RESOURCES)
Google (https://plus.google.com/share?
text=65+Free+Data+Science+Resources+for+Beginners&url=https%3A%2F%2Felitedatascie
nce.com%2Fdata-science-resources)
Linkedin (https://www.linkedin.com/shareArticle?
trk=65+Free+Data+Science+Resources+for+Beginners&url=https%3A%2F%2Felitedatascien
ce.com%2Fdata-science-resources)
(h
Tweet (https://twitter.com/intent/tweet?
text=65%20Free%20Data%20Science%20Resources%20for%20Beginners&url=https://eliteda
tascience.com/data-science-resources)
In this guide, we’ll share 65 free data science resources that we’ve hand-picked and
annotated for beginners.
If that sounds like a career you’d enjoy, then bookmark this page and read on because
we compiled this list just for you.
First Name
Send My Download
1. Foundational Skills
Foundational skills form the basis of true understanding, which will in turn allow you
to discover novel solutions, build more accurate models, and make better decisions.
First, you'll need to know at least one scripting language well enough to wrangle
datasets, prototype models, and perform analyses.
Python is more common in software startups, large tech firms, and adTech.
Python tends to be more flexible because it's a general purpose programming
language. It's also better for deep learning and processing data.
R / RStudio is popular in research, finance, and analytics. R is a statistical
programming language that has mature libraries for econometrics, statistics, and
machine learning.
We've also written a more detailed comparison of Python vs. R for data science
(http://elitedatascience.com/r-vs-python-for-data-science).
If you're still on the fence, we'd recommend starting with Python due to its breadth
and flexibility (and it's a bit more beginner-friendly).
Tip: Each resource link below opens in a new tab, so you won't lose your place.
Python Resources:
R / RStudio Resources:
R for Data Science (Online Book) (http://r4ds.had.co.nz/introduction.html) -
Recommended for beginners who want a complete course in data science with R.
Swirl (Interactive R Package) (http://swirlstats.com/) - Very cool R package that
you can install and learn the language directly from inside RStudio (the most
common interface used to run R).
Introduction to Data Science with R (Video Series)
(https://www.youtube.com/watch?v=32o0DnuRjfg&list=PLTJTBoU5HOCRrTs3cJK-
PbHM39cwCU0PF&index=1) - For those who learn better by watching someone
else walk through the steps.
2. Technical Skills
Data science is all about converting raw data into insights, predictions, software, and
so on. Therefore, you'll need to be comfortable working with data.
Core technical skills include collecting, cleaning, managing, and visualizing data, plus
the big umbrella of applied machine learning.
API Resources:
2.2. SQL
SQL is the lingua franca for database management and querying, and you should be
able to write complex queries.
Learning SQL also gives a better understanding of relational data in general (i.e. data
in "table" format), which will improve your data analysis skills in any language.
2.3. Data Visualization
3. Business Skills
3.1. Communication
If a tree falls in a forest but no one is around to hear it, does it make a sound? If data
is analyzed but no one can explain the results, does it really matter?
Effective communication skills are universal, but data scientists have the
added challenge of discussing highly technical or mathematical topics.
During data scientist interviews, you'll often be asked to "explain a technical concept
to a layperson" or "describe a previous project you've worked on." Employers will
specifically look for clarity, conciseness, and organization.
Data scientists are hired to build new products, perform complex analyses, and
invent valuable ways to use data.
In fact, they rarely solve the same problem twice. Even if you can apply the same
methods to an adjacent dataset, you'll need to be creative about feature engineering,
supplemental data, and business implications.
You'll naturally become a better creative thinker as you gain more experience, but the
following resources can help jumpstart your problem-solving and innovation skills.
Here's a question you should ask yourself every day: "What are some ways I can
improve this business?"
At the end of the day, companies don't hire you to analyze data... they hire you to help
them grow or become more profitable. This means that you should have an
understand how data can help make better decisions and build better products.
Data Driven Decisions (Video) (https://www.youtube.com/watch?
v=trbOW1TDOao) - How to take business objectives, extract testable hypotheses
from them, and then design experiments to evaluate.
How to be data driven and build great products by DJ Patil (Video)
(https://www.youtube.com/watch?v=54t7bSXniAs) - Lecture by DJ Patil before he
become Chief Data Scientist of the USA.
Big Data: New Tricks for Econometrics by Hal Varian (PDF)
(http://people.ischool.berkeley.edu/~hal/Papers/2013/ml.pdf) - Hal Varian, Chief
Economist at Google, gives an excellent overview of the technology and
methodology landscape for data analysis.
How data will transform business (TED Talk)
(https://www.ted.com/talks/philip_evans_how_data_will_transform_business) -
Thought-provoking discussion of the relationship between business strategy and
technology. Explains why the two long-standing theories of business strategy
have become invalidated by the rise of big data.
Victor Cheng's Case Interview Workshop (Video Series)
(https://www.youtube.com/watch?
v=fBwUxnTpTBo&index=1&list=PL8b_fmdDHHyCznYmSeWJrdrJN4UJhUrsh) -
Some employers like to ask consulting-style "case" questions during the interview.
This is more common for Data Scientists in business operations, strategy, or
analytics roles. This is an excellent crash course on tackling case interviews.
Business analytic skills are critical for data scientists in operational roles. Python and
R will allow you to perform more complex analyses than Excel can, thanks to the
flexibility of programming languages.
After you master the technical tools, building strong domain knowledge will lead to
greater business impact.
4. Supplementary Skills
They power many amazing websites and apps, including Amazon, Yelp, Netflix, and
Spotify. In a nutshell, recommendation systems find other users who have similar
tastes to you to make better recommendations for you. This produces a huge win-win
by improving user experience while driving up revenue.
Time Series Analysis deals with data series that are indexed by time. For example,
stock prices, precipitation amounts, and Twitter hashtags by hour would all be
considered time series. Time series analysis is commonly used in Finance,
Forecasting, and Econometrics.
1. They help you solidify concepts and practice pulling together all the moving
pieces of data science.
2. They arm you with something tangible to show employers. If a picture is worth
1000 words, a project is worth a million...
By nature, projects are personal undertakings, and you should pick topics you're
interested in. Here are a few places to find project ideas:
Google (https://plus.google.com/share?
text=65+Free+Data+Science+Resources+for+Beginners&url=https%3A%2F%2Felitedatascie
nce.com%2Fdata-science-resources)
Linkedin (https://www.linkedin.com/shareArticle?
trk=65+Free+Data+Science+Resources+for+Beginners&url=https%3A%2F%2Felitedatascien
ce.com%2Fdata-science-resources)
Tweet (https://twitter.com/intent/tweet?
text=65%20Free%20Data%20Science%20Resources%20for%20Beginners&url=https://eliteda
tascience.com/data-science-resources)
« Previous Post
Dimensionality Reduction Algorithms: Strengths and Weaknesses
(https://elitedatascience.com/dimensionality-reduction-algorithms)
Next Post »
WTF is the Bias-Variance Tradeoff? (Infographic)
(https://elitedatascience.com/bias-variance-tradeoff)
LINKS
Homepage (/)
Login (https://elitedatascience.com/login)
ARTICLES
Guides (https://elitedatascience.com/category/guides)
PRODUCTS
SHARE
Share (https://www.facebook.com/sharer.php?u=https%3A%2F%2Felitedatascience.com)
Google (https://plus.google.com/share?
text=65+Free+Data+Science+Resources+for+Beginners&url=https%3A%2F%2Felitedatascience.com)
Linkedin (https://www.linkedin.com/shareArticle?
trk=65+Free+Data+Science+Resources+for+Beginners&url=https%3A%2F%2Felitedatascience.com)
Tweet (https://twitter.com/intent/tweet?
text=65%20Free%20Data%20Science%20Resources%20for%20Beginners&url=https://elitedatascience.com)