From 2b4efdae5583b18ce9177fe4fea3a91ec3b368c4 Mon Sep 17 00:00:00 2001 From: Alex Knudson Date: Wed, 28 Oct 2020 16:21:53 -0700 Subject: [PATCH] about to reflow this doc --- 080-discussion.Rmd | 6 ++++++ index.Rmd | 4 +++- scratch/outline.Rmd | 47 +++++++++++++++++++++++++++++++++++++++++++++ 3 files changed, 56 insertions(+), 1 deletion(-) create mode 100644 scratch/outline.Rmd diff --git a/080-discussion.Rmd b/080-discussion.Rmd index e9c89bc..2bbeaff 100644 --- a/080-discussion.Rmd +++ b/080-discussion.Rmd @@ -64,4 +64,10 @@ - He was interested in fitting a 2-AFC model in Julia, which did not provide a link function for such psychometric experiments - In the end of our discussion, I wrote my own N-AFC link function that can be dropped in to the `GLM.jl` package +- Model selection is not always the goal + - Building a model motivated by a set of principals and domain expertise should be the prefereed way + - Model comparison is important, especially in terms of predictive inference + - One model doesn't fit all + - Different models help to answer different questions + - Mean effect, individual effect, predicitve density, conditional densities, etc diff --git a/index.Rmd b/index.Rmd index bdf2120..09a9998 100644 --- a/index.Rmd +++ b/index.Rmd @@ -21,7 +21,7 @@ With the advances in computational power and high-level programming languages li I want to talk about how the advances in computational power have lead to a sort of mini revolution - resurrection - in statistics where Bayesian modeling has gained an incredible following thanks to projects like Stan. The steady adoption of computer aided statistical workflows also brings the need for multidisciplinary techniques from numerical analysis, probability theory, statistics, computer science, visualizations, and more. And with the age of computers, there is a strong push towards reproducibility. Concepts of modular design, workflows, project history and versioning, virtual environments, and human readable code all contribute to reproducible analyses. And somehow I also want to tie in how data is immutable - raw data should (must) be treated as a constant and unchangeable entity, and merely touching it will cause data mitosis. - +## Classical Approaches to Modeling - Overview of classical modeling methods - classical approaches to data analysis usually adhere to the flexibility-interpretability trade-off @@ -53,6 +53,8 @@ Computational Power - The denominator acts as a scaling value that forces $P(\theta|X)$ to be a probability distribution (i.e. area under PDF is equal to 1) - There are simulation-based techniques that let one approximate the posterior distribution without needing to know the analytic solution to the denominator +## Proposal of New Methods +## Organization I have organized this thesis as follows. In [Chapter 2](#motivating-data) I introduce the data set that drives the narrative and that motivates the adoption of Bayesian multilevel modeling. In [Chapter 3](#background) there is a review of common approaches approaches to modeling with psychometric data, and the benefits and drawbacks of such techniques. [Chapter 4](#bayesian-modeling) introduces Bayesian hierarchical modeling and programming frameworks for Bayesian inference. In [Chapter 5](#workflow) I describe and work through a principled Bayesian workflow for multilevel modeling. [Chapter 6](#model-checking) goes into more depth on checking the model goodness of fit and model diagnostics in a Bayesian setting. Finally in [Chapter 7](#predictive-inference) I demonstrate how to use the Bayesian model from the principled workflow for predictive inference, and use posterior predictive distributions to plot and compare models. diff --git a/scratch/outline.Rmd b/scratch/outline.Rmd new file mode 100644 index 0000000..e72c0d7 --- /dev/null +++ b/scratch/outline.Rmd @@ -0,0 +1,47 @@ +--- +title: "Outline" +author: "Alex Knudson" +date: "10/28/2020" +output: html_document +--- + +```{r setup, include=FALSE} +knitr::opts_chunk$set(echo = TRUE) +``` + +- Introduction + - Advances in CS as a lead-in to discussing newer modeling workflows + - Classical Approaches to Modeling + - Proposal of new methods (Thesis) + - Organization +- Psychometrics and Motivating Data + - Psychometric Experiments + - TOJ tasks + - Data Viz and Quirks +- Principaled Bayesian Workflow + - Discussion + - Introduce concepts but save details for later sections + - Building a model in steps + - Iter1 + - Iter2 + - ... + - Segue +- Model Checking + - Diagnostics + - Simulation study (bigsimr?) + - Consequence of using "wrong" model +- Predictive Inference + - Simulating/Predicting at different levels + - Average vs. Marginal effects +- Application Results + - Age Groups + - Parameter Scale (relative) + - Prediction Scale (absolute) + - Predictive distribution of new subject + - Consequence of removing outlier(s) +- Discussion + - Project history + - Data cleaning + - Model development + - Model comparison, but not model selection +- Conclusion