Data Science Programming Resources

These days, being a data scientist means being a programmer too. While there are numerous languages out there, and every person has their opinion on which are worth learning, there are two that are most common for data science purposes: Python and R. Also popular are SQL, MATLAB, and Stata. In this section, you’ll find resources for learning these languages, with a mix of beginner’s guides to advanced graduate-level courses. If you are aware of any free resources not listed here, please share them in a comment below or in an email to hellomenti@gmail.com!


Python

General-purpose programming language, useful for data science, web development, and software development.

R/RStudio

Specialized language which is highly useful for statistical analysis and data visualization. Usually run through the RStudio interface.


SQL

A language for manipulating structured data, effective with large datasets. One of the most common languages for actually gathering the data needed to perform data science.

MATLAB

A mathematical programming meant for statistical computing, also has tools for data visualization.

Stata

A highly specialized tool best suited for running econometric and regression analysis.

Other Data Science Topics

Previous
Previous

A Guide to Learning Data Science

Next
Next

Case Study: Example of a Brief Data Science Project