Section 8.1
- Exercises
- 8.1.21. Luhn algorithm
Section 8.2
- Box 8.1. NumPy
Section 8.4
- USGS earthquake data
- USGS earthquake data — 1 month, all (local copy, downloaded 1/16/2015)
- USGS earthquake data — 1 month, 2.5+ (local copy, downloaded 1/16/2015)
- Exercises
- 8.4.2. SCRABBLE dictionary
- 8.4.3. Password file
- 8.4.4. World population, 1950-2050 (based on data from here)
- 8.4.5. Meteorite data (based on data from here)
- 8.4.6. Madison, WI temperature data
- 8.4.7. zebra.csv (Here is the story behind the data.)
- 8.4.8. Educational attainment data (source)
Section 8.7
- NYC cyclist collision data
- Exercises
- 8.7.3. NYC cyclist collision data
- 8.7.4. Wisconsin breast cancer data | description of data (source)
- 8.7.5. List of t-shirt measurements
Project 8.1
Note: Do not open these CSV files in Excel. Instead, save them directly to your hard drive.
- Deep sea isotope data: 2008CompilationData.csv (source)
- Vostok ice core data: co2nat.txt (source)
- Keeling CO2 data: weekly_mlo.csv (source)
Project 8.2
Project 8.4
Project 8.5
- Snake River peak discharge data: snake_peak.txt (source)
Project 8.6
- Voting results 1 (13 votes)
- Voting results 2 (100 votes)
- Voting results 3 (100 votes)
Project 8.7
- africa.tsp: coordinates of 96 African cities (a modified version of gr96.tsp)
- usa13509.tsp: coordinates of 13,509 U.S. cities with populations of at least 500
- tsp-ga-start.py: baseline genetic algorithm
- Genetic algorithms for the traveling salesman problem
- In Pursuit of the Traveling Salesman by William Cook
Further discovery
Recommended books and articles
- Doing Data Science by Rachel Schutt and Cathy O’Neil
- Computer Ethics by Deborah Johnson and Keith Miller