How Predictive Are Statcast Metrics?
Intro Baseball is a hard sport to predict because of all the random variation in it, and that alone probably accounts for like half of sabermetrics. I was curious about how predictive Statcast metrics are and it got me inspired. For this assignment, I looked at the 2017-2021 player-seasons that had at least 200 plate appearances. A lot has been written about how long it takes for Statcast metrics to stabilize, and it turns out they take much, much shorter than other stats like batting average. I’ve read that Statcast metrics stabilize within 70 batted balls (some by 40 even), which a player will easily achieve by the time he’s reached 200 plate appearances. This gives me a sample size of 1548 player-seasons. Statcast’s main metrics are the batted ball ones, such as average and max exit velocity, considering those are the ones that actually need Statcast in order to exist, but any player’s Baseball Savant dashboard will also show plate discipline metrics like out-of-strike-zone swi...