Data Modeling for the Environmental Sciences
PBIO 295 (3 credits)

Fall 2008 Tuesday / Thursday 2:00-3:15 PM

Instructor: Brian Beckage (Brian.Beckage@uvm.edu)

Location: Discussions & Lectures in Lafayette L100; Labs TBA

Office Hours: By appointment; Marsh Life Sciences 125


Course Description

This seminar course will provide an introduction to data visualization and analysis. This course will emphasize likelihood, information theorteic, and Bayesian approaches to data modeling in an interactive class environment that is based on discussions rather than lectures. This class will also stress applications to real world problems in the environmental sciences. Class time will be divided between discussions (with some lectures) and computer lab. Students will learn the open source R statistical computing language (R project website) and will analyze a data set for a final project. This course is designed to provide students with the analytic tools required for independent research in the environmental sciences.

Course Prerequisites

Intoductory calculus or permission of the instructor.  The course is intended for advanced undergraduates and graduate students.


Course Texts


Computing

Please install R on the laptop or desktop computer that you use. It would be helpful if you could bring a laptop, with R installed, to our labs. R can be freely downloaded and installed on a wide variety of computing platforms. To obtain R, go to the R project website, follow the link beneath 'Download' on the left, choose a site near your geographic location, download the pre-compiled binary distribution for your operating system, and then follow the instructions for installation. We will also use winBugs software for fitting Bayesian models.


Preliminary Outline of Grading.

Student grades will be based on four components:

  1. A final paper that analyzes a data set of the student's choosing. (40% of course grade).
  2. A corresponding 20 minute class presentation on this same research topic earlier in the semester (30% of course grade). The presentation should be 15 minutes with 5 minutes for questions. Students should provide an electronic presentation (using powerpoint, keynote, or similar software).
  3. Completion of homework assignments will encompass 20% of couse grade.
  4. Student participation in class. (10% of course grade) . This will include asking questions and contributing to discussions.

Research paper
Research presentation
Class participation

Syllabus (tentative):

(C1~Clark Chapter 1; L1~Lab Manual Chapter 1)

Week Date Topic Reading Activity Notes Code & Assignments
1 Sep 2 Introduction   Lecture/Discussion    
    4   Lab (L1)   Lab 1 Code (Game Show Problem)
2 Sep 9 Models in Context C1 Lecture/Discussion    
    11   Lab (L1)  
3 Sep 16 Model Elements C2 Lecture/Discussion    
    18   Lab (L1)    
4 Sep 23 Point Estimation (1) C3 Lecture/Discussion    
    25   Lab (L2)    
5 Sep 30 Point Estimation (2) C3 Lecture/Discussion    
    2   Lab (L2) Project Proposal due  
6 Oct 7 The Bayesian Approach (1) C4 Lecture/Discussion    
    9   Lab (L3)    
7 Oct 14 The Bayesian Approach (2) C4 Lecture/Discussion    
    16   Lab (L3)    
8 Oct 21 Confidence Envelopes (1) C5 Lecture/Discussion   Assignment; Data
    23   Lab (L4)   Example code
9 Oct 28 Confidence Envelopes (2) C5 Lecture/Discussion    
    30   Lab (L4)   Logistic Regression; Assignment; Data; Paper
10 Nov 4 Model Selection (1) C6 Lecture/Discussion    
  6   Lab (L5)   Solution to 30 Oct Assignment
11 Nov 11 Model Selection (2) C6 Lecture/Discussion    
    13   Lab (L5)   Assignment
12 Nov 18 Computational Bayes (1) C7 Lecture/Discussion    
    20   Lab (L6) Paper Draft due Data to winBugs
25   Thanksgiving Break
27 Thanksgiving Break
13 Dec 2 Computational Bayes (2) C7 Lecture/Discussion Paper Reviews due Assignment; Data
    4   Lab (L6)    
14 Dec 9 Hierarchical Structures C8 Lecture/Discussion    
    11   Presentations    
               
  Dec 16       Final Papers Due by 5pm