Chapter 1 Summary resources for DS learning

1.1 Statistics, Probability, and A/B Testing

1.1.1 Stanford Courses (NON-EDUC):

  1. STATS 160-Introduction to Statistical Methods
    • COVERS estimation | confidence intervals | test of hypotheses | t-tests | correlation && regression | analysis of variance and chi-square tests
  2. STATS 116-Theory of Probability or CS109-Introduction to Probability for Computer Scientists
    • COVERS probability spaces | discrete spaces (binomial, hypergeometric, Possion) | continuous spaces (normal, exponential) and densities | random variables | expectation | independence | conditional probability | the laws of large numbers and central limit theorem (CLT)

1.1.2 Online Resources

Additional Tips: A/B Testing is NOT required by all DS/DA jobs, but if you are interested in applying for a Product Data Scientist then it is REQUIRED. Be sure to browse DS/DA interns JD so that you know what skills would be needed.

1.2 SQL / R / Python / Visualization Tools

1.3 Machine Learning resources

1.3.1 Stanford Classes:

1.3.2 Online Resources

  1. Machine Learning Specialization by Andrew Ng on Coursera
  • Topics covered:

    • Supervised Learning: multiple linear regression, logistic regression, neural networks, & decision trees)

    • Unsupervised Learning: clustering, dimensionality reduction, recommender systems

    • Some AI & ML innovation: evaluating and tuning models, taking a data-centric approach to improving performance

  • Applied Learning Project:

    • Build ML models using Numpy & Scikit

    • Build & train a neural network with Tensorflow to perform multi-class classificaton

    • Build & use decision trees and tree ensemble methods, including forest and boosted trees

    • Build recommender systems with a collaborative filtering approach and a content-based deep learning method

  1. 15 hours of expert ML videos.

  2. 《An Introduction to Statistical Learning》(This book is also used for Stanford’s course STATS 202: Data Mining and Analysis)

  3. Machine Learning 101 on Towards Data Science and many other articles

Other useful resources (notes):

Additional Tips: All resources listed as other useful resources are mainly for your references when you need to actually implement certain methods / conduct a project / or to review certain syntax or concept. I personally DO NOT recommend beginners to start their learning journey with these resources, because it is much more important that you have already built a SOLID foundation in all fields mentioned above through SYSTEMATIC learning processes.

1.4 Interview Questions & Resume