Train, Test, And Validation Sets
Learn why it is best practice to split your data into training, testing, and validation sets, and explore the utility of each with a live machine learning model.
The Good, the Rad, and the Gnarly
'An exploration into the music of skateboarding'. Visual essay I created for The Pudding.
ROC & AUC
A visual, interactive explanation of Receiver Operating Characteristic (ROC) Curves and Area Under The Curve (AUC).
An explanation of the Decision Tree algorithm in machine learning: how the tree makes its splits, the concepts of Entropy and Information Gain, and why going too deep is problematic.
Precision & Recall
When it comes to evaluating classification models, accuracy is often a poor metric. This article covers two common alternatives, Precision and Recall, as well as the F1-score and Confusion Matrices.
Learn how the majority vote and well-placed randomness can extend the decision tree model to one of machine learning's most widely-used algorithms, the Random Forest.
Double Descent: A Visual Introduction
An introduction to the double descent phenomenon in modern machine learning: what it is, how it relates to the bias-variance tradeoff, and a theory of what lies behind.
Double Descent: A Mathematical Explanation
A mathematical explanation of the double descent phenomenon, building on the cubic spline example introduced in the prior article.
The Bias Variance Tradeoff
Understand the tradeoff between under- and over-fitting models, how it relates to bias and variance, and explore interactive examples related to LOESS and KNN.
A curated list of some cool datasets that I created.
The Permutation Test
A 'scrolly-telling' visual tutorial of statistical testing with hand-drawn SVG aesthetics.
A game where you control a sperm cell with your head and dodge various forms of birth control.
The Networks of Skateboarding Filmmakers
A look into the networks (graphs) of skateboarding filmmakers and the skateboarders they film.
4PLYMAG: All The Gall
An analysis of Fred Gall's prolific skateboarding career.
Nested Machine Learning Workflows
An introduction to nested machine learning workflows in R with tidyr & purrr.
4PLYMAG: Eric Koston
A look and analysis the professional skateboarding career of Eric Koston.
Sometimes I do data-driven stuff for Jenkem Mag, a skateboard magazine.