Data Science

This page is for data science related items.


  • Modules and libraries
  • Numpy, Pandas, MySQL
  • Matplotlib, Seaborn
  • SciPy, SymPy
  • Online documentation, books, tutorials

  • Official Online Documentation

  • Python documentation
  • Pandas documentation
  • Matplotlib documentation
  • NumPy documentation
  • SciPy documentation
  • SymPy documentation
  • Books

  • Core Python Programming, 2nd Ed.
  • Python for Data Analysis, 2nd Ed.
  • Introduction to Computation and Programming Using Python, 2nd Ed.
  • Grokking Algorithms
  • Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd Ed.
  • Math for Programmers:3D graphics, machine learning, and simulations with Python
  • Tutorials

  • Taming math and physics using SymPy
  • How to Learn Pandas
  • Minimally Sufficient Pandas
  • How NOT to write pandas code
  • Python Plotting with Matplotlib
  • Look Ma, No For-Loops: Array Programming with NumPy
  • DataCamp: downloadable cheatsheets
  • Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations
  • Notes on python, python data analysis and visualization
  • Experimental Design and Analysis (Howard Seltman's online book)