Data Science 6400

School of Data Science
University of Virginia


Course Overview
Course Calendar
Course Policies

View the Project on GitHub thomasgstewart/machine-learning-1-fall-2022

Course Calendar Fall 2022

Final Project Presentation

The final project presentations will be Thursday, December 8, 2022 from 2:00PM-5:00PM.

Deliverables

Title First Submission Due Date Resubmission Due Date
0. Student Profile 2021-08-29 Not available
1. Facial metrics 2022-09-16 2022-10-31
2A. Framingham Heart Study 2022-09-26 2022-10-31
2B. Framingham Heart Study Diagnostics 2022-10-12 2022-10-31
3. PCA 2022-11-04  
4. Case study 2022-11-16  
5. Stroke 2022-11-28  
6.    
7.    
8.    
9.    

Calendar

Material in monospaced font are in the course Teams site. The letter H denotes sections with the course text book.

Class date Topic Material
2022-08-26 Data types 01-data-types
2022-08-29 Course overview 02-course-outline
  Linear Regression  
  Model formulation lion-age-nose, H 2.2
  Bayesian vs MLE lion-age-nose-bayesian
Maximum Likelihood and Bayesian Analysis
salary-bayes.R
salary-mle.R
  Interactions 2.3
  Nonlinearity H 2.4 - 2.5
  Measures of model performance H 5
  → Discrimination  
  → Calibration  
  → Optimism  
  Measures of model performance H 5
  Carrying capacity of data  
  Model complexity  
  Strategies for right-sizing the model complexity  
  → regularization (LASSO, ridge, Bayesian)  
  → constraints (principle components, monotonicity)  
  Logistic Regression  
  Model formulation  
  Bayesian vs MLE  
  Interactions  
  Nonlinearity  
  Measures of model performance  
  → Discrimination  
  → Calibration  
  → Optimism  
  Measures of model performance  
  Carrying capacity of data  
  Model complexity  
  Strategies for right-sizing the model complexity  
  → regularization (LASSO, ridge, Bayesian)  
  → constraints (principle components, monotonicity)  
  Ordinal Regression  
  Model formulation  
  Bayesian vs MLE  
  Interactions  
  Nonlinearity  
  Measures of model performance  
  → Discrimination  
  → Calibration  
  → Optimism  
  Measures of model performance  
  Carrying capacity of data  
  Model complexity  
  Strategies for right-sizing the model complexity  
  → regularization (LASSO, ridge, Bayesian)  
  → constraints (principle components, monotonicity)  
  Hazard Regression  
  Model formulation  
  Bayesian vs MLE  
  Interactions  
  Nonlinearity  
  Measures of model performance  
  → Discrimination  
  → Calibration  
  → Optimism  
  Measures of model performance  
  Carrying capacity of data  
  Model complexity  
  Strategies for right-sizing the model complexity  
  → regularization (LASSO, ridge, Bayesian)  
  → constraints (principle components, monotonicity)  
  Random Forest  
  Model formulation  
  Bayesian vs MLE  
  Interactions  
  Nonlinearity  
  Measures of model performance  
  → Discrimination  
  → Calibration  
  → Optimism  
  Measures of model performance  
  Carrying capacity of data  
  Model complexity  
  Strategies for right-sizing the model complexity  
  → regularization (LASSO, ridge, Bayesian)  
  → constraints (principle components, monotonicity)