| Temporal Context in Deep Nets for MLB Pitch Modeling: A Study of Lag Length and Architecture | |||
| Henry Wang | |||
| Final project for 6.7960, MIT | |||
| Temporal Context in Deep Nets for MLB Pitch Modeling: A Study of Lag Length and Architecture | |||
| Henry Wang | |||
| Final project for 6.7960, MIT | |||
Run expectancy can be defined more granularly using base–out–count states: this is the same idea as base–out states, except now we also condition on the ball–strike count. In full, there are 12 possible ball–strike counts, yielding \(24 \times 12 = 288\) base–out–count states.
Every pitch moves the game from one base–out–count state to another, each with its own run expectancy. The pitcher's goal is to reduce the overall run expectancy. The target we use is delta run expectancy (\(\Delta RE\)), meaning the change in run expectancy from before to after the pitch. This provides a numerical representation of the pitch's effectiveness at preventing runs.
As Figure 1 shows, the expected number of runs varies dramatically depending on the game situation. The formula for delta run expectancy is:
As a concrete example, consider the following scenario: A relief pitcher enters the game with the bases loaded and one out. They allow a sacrifice fly to score the runner from third, with the other two runners not advancing.
| Feature | Variable Type | Plain-English Definition |
|---|---|---|
| Pitch Velocity (mph) | Kinematic | Speed of the pitch as it leaves the pitcher's hand. Generally, higher fastball velocity and the ability to "kill" velocity on offspeed pitches is better for the pitcher. |
| Spin Axis (degrees) | Kinematic | Direction the ball is rotating around as it travels toward the plate, directly determining the direction of the spin-induced movement (Magnus force) |
| Spin Rate (rpm) | Kinematic | How fast the ball is spinning around its axis, directly determining the magnitude of the spin-induced movement (Magnus force). |
| Plate X Position (ft) | Kinematic | Horizontal location where the pitch crosses the front of home plate (inside/outside). |
| Plate Z Position (ft) | Kinematic | Height of the pitch above the ground as it crosses the front of home plate. |
| Horizontal Break (ft) | Kinematic | Side-to-side movement of the pitch caused by the Magnus force and seam-shifted wake. |
| Induced Vertical Break (ft) | Kinematic | Extra rise or drop on the pitch due to spin, beyond what gravity alone would do, caused by the Magnus force and seam-shifted wake. |
| Arm Angle (degrees) | Kinematic | The angle of the pitcher's arm at release (over-the-top vs. sidearm, etc.). |
| Release Extension (ft) | Kinematic | How far in front of the rubber the pitcher releases the ball. Higher extension means the ball is released closer to home plate, typically better for the pitcher. |
| Balls (count) | Contextual | Number of balls in the count before the pitch. |
| Strikes (count) | Contextual | Number of strikes in the count before the pitch. |
| Runner on First Base | Contextual | Indicator of whether there is a runner on first base. |
| Runner on Second Base | Contextual | Indicator of whether there is a runner on second base. |
| Runner on Third Base | Contextual | Indicator of whether there is a runner on third base. |
| Batter Handedness (R/L) | Contextual | Whether the batter is hitting left-handed or right-handed. Typically, being the opposite handedness as the pitcher favors the batter. |
| Pitcher Handedness (R/L) | Contextual | Whether the pitcher throws left-handed or right-handed. Typically, being the same handedness as the batter favors the pitcher. |
| Game Year | Contextual | Season year in which the pitch was thrown. Accounts for the yearly variation in run scoring environment. |
| Delta Run Expectancy | Target variable | Change in expected runs for the inning from before to after the pitch (the pitch's run value). |