Prof. James Bagrow
|Email:||james.bagrow [at] uvm.edu|
|Lectures:||M/W 15:30–16:45 in Perkins 107|
|Office Hours:||Tu 9:30–10:30, Th 9:00–10:00, or by appointment|
|Office:||Farrell Hall 206 (Map to my office)|
Extracting meaning from data remains one of the biggest tasks of science and industry. The Internet and modern computers have given us vast amounts of data, so it is more important than ever to understand how to collect, process, and analyze these data. A picture is worth a thousand words, so visualizations, from scientific plots and infographics to interactive data explorers, are crucial to summarize and communicate new discoveries.
Course overview, motivation, logistics and computer setup. Introduction to Python. (Slides)
The "central dogma" of statistics, inference and prediction, brief review of probability. (Notes)
Random variables and their statistics, finish review. (Notes)
A data science workflow, the typology of data and levels of measurement, storing data. (Notes)
Data cleaning: rejecting bad data, combining data, filtering and processing data; Start on HISTOGRAMS!!! (Notes)
More on density estimation: histograms, box plots, kernel densities, violin plots, cumulative distributions. (Notes)
Review Q02, HW02. Building the XY-toolbox with scatterplots and trendlines. (Notes)
Spreadsheets considered harmful!
Gene name errors are widespread in the scientific literature. Genome Biology, 2016.
Error bars considered harmful!
Researchers Misunderstand Confidence Intervals and Standard Error Bars. Psychological Methods, 2005. (Journal link.)
Homework and projects are posted on Blackboard.