Review: Drawing the Bayesian Owl 🦉
- Establish Estimand
- Build Scientific model(s) (i.e. Causal model), depending on 1.
- Use 1 & 2 to build a Statistical Model
- Simulate data from from 2 and Validate you can recover from 3
- Analyze Real Data using 3.
In real life, it's never a linear path; you are jumping back and forth, iterating on 2-5, much like a branching path/choose your own adventure book.
Multi-level Adventures
Similarly there is no one-size-fits all approach to applying the methods in this course. In order to optimize for success when applying these methods, McElreath suggests a few strategies ("paths") moving forward:
- Return to the start -- McElreath suggests to return to the beginning of the course, reviewing the material now that you've observed a lion's share of the building blocks.
- It turns out that the notes presented in this repo are after the Third Pass of the course. I can't recommend strongly enough to take McElreath's advice and review the material from the beginning. It's a lot of material, but I was flabbergasted with how much I had forgotten in the short time between this lecture and the earlier ones. Similarly, I was surprised by how much easier it was to soak up the material the 2nd time around--a real testament to McElreath's outstanding teaching style.
- Skim & Index -- don't sweat the deatils, just aquaint yourself with the possiblities.
- One thing that I've found useful is to compile each of the model classes discussed in the course into a "recipe book" or "toolbox" of plug-and-play models that can be reused for different applications
Clusters vs Features
Clusterstanksstoriesindividualsdepartments​​⟶⟶⟶⟶​​Featuressurvivaltreatmenteffectaverageresponseadmissionrate​
- Clusters: subgroups in the data (e.g. tanks, participants, stories, departments)
- Adding clusters is fairly simple
- requires more index variables; more population priors
- Features: aspects of the model (i.e. parameters) that vary by cluster (e.g. survival, average response rate, admission rate, etc.)
- Adding features requires more complexity
- more parametrers, particularly dimensions in each population prior
Varying effects as confounds
- Varying effect strategy: using repeat observations and partial pooling to estimate unmeasured features of clusters that have left an imprent on the data
- Predictive perspective: regularization
- Causal Perspective: unobserved confounds are terrifying, but leveraging repeat observations give us some hope at more accurate inference
Previous Examples:
Grandparents & Education