Learn key data science essentials, including R, and machine learning, through real-world case studies to jumpstart your career as a data scientist.
The demand for skilled data science practitioners in industry, academia, and the government is rapidly growing. The HarvardX Data Science program prepares you with the necessary knowledge base and useful skills to tackle real-world data analysis challenges. The program covers concepts such as probability, inference, regression, and machine learning and helps you develop an essential skill set that includes R programming, data wrangling with dplyr, data visualization with ggplot2, file organization with Unix/Linux, version control with git and GitHub, and reproducible document preparation with RStudio.
In each course, we use motivating case studies, ask specific questions, and learn by answering these through data analysis. Case studies include Trends in World Health and Economics, US Crime Rates, The Financial Crisis of 2007-2008, Election Forecasting, Building a Baseball Team (inspired by Moneyball), and Movie Recommendation Systems.
Throughout the program, we will be using the R software environment. You will learn R, statistical concepts, and data analysis techniques simultaneously. We believe that you can better retain R knowledge when you learn how to solve a specific problem. Furthermore, HarvardX has partnered with DataCamp for all assignments, which use code checking technology that will permit you to get hands-on practice during the courses.
- R is listed as a required skill in 64% of data science job postings and was Glassdoor’s Best Job in America in 2016 and 2017. (source: Glassdoor)
- Companies are leveraging the power of data analysis to drive innovation. Google data analysts use R to track trends in ad pricing and illuminate patterns in search data. Pfizer created customized packages for R so scientists can manipulate their own data.
- 32% of full-time data scientists started learning machine learning or data science through a MOOC, while 27% were self-taught. (source: Kaggle, 2017)
- Data Scientists are few in number and high in demand. (source: TechRepublic)
What You'll Learn
- Fundamental R programming skills
- Statistical concepts such as probability, inference, and modeling and how to apply them in practice
- Gain experience with the tidyverse, including data visualization with ggplot2 and data wrangling with dplyr
- Become familiar with essential tools for practicing data scientists such as Unix/Linux, git and GitHub, and RStudio
- Implement machine learning algorithms
- In-depth knowledge of fundamental data science concepts through motivating real-world case studies
Learn more about Professional Certificate Programs
Created by leading companies and top universities, Professional Certificate programs are a series of in-demand courses designed to develop the critical skills needed for today’s top jobs.
Courses in this Program
- Data Science: R Basics
- Data Science: Visualization
- Data Science: Probability
- Data Science: Inference and Modeling
- Data Science: Productivity Tools
- Data Science: Wrangling
- Data Science: Linear Regression
- Data Science: Machine Learning
- Data Science: Capstone
- Average Length: 7 weeks per course
- Effort: 1-4 hours per week, per course (one course: 15-20 hours per week, for 2 weeks)
- Number Of Courses: 9 Courses in Program
- Subject: Data Science
- Institution: Harvard University (HarvardX)
- Languages: English
- Video Transcripts: English
- Price (USD): Originally $491 USD, now $441.90 USD for the entire program. You save $49.10 USD.