
I have had the opportunity to work with many MLB franchises and discuss what their priorities and challenges are related to data analytics. Coaching staff task their data teams with assisting them in making critical decisions, for example, should a pitcher throw another inning or make a substitution to avoid a potential injury? Does a player have a greater probability of success stealing from first to second base, or from second to third? Hindsight is always 20-20, but it goes to show how impactful data has become to the game. The Rays went on to lose the game and world series.

The Tampa Bay Rays were leading the Los Angeles Dodgers 1-0 in the sixth inning when Rays Pitcher Blake Snell was pulled from the mound while pitching arguably one of the best games of his career, a decision head coach Kevin Cash said was made with the insights from their data analytics. One of the more memorable examples of this was in Game six of the 2020 world series. The decisions made from real-time analytics can dramatically change the outcome of a game and a team’s season.
True game data pellington professional#
Here’s an inside look at how professional baseball teams use technologies like Databricks to create the modern-day Moneyball and gain competitive advantages that data teams provide to coaches and players on the field.įigure 1: Position and scope of Hawkeye cameras at a baseball stadiumįigure 2: Numbers represent events during a play captured by Statcastįigure 3: Sample of data collected by Statcastĭata teams need to be faster than ever to provide analytics to coaches and players so they can make decisions as the game unfolds. It’s been 20 seasons since the A's first introduced the use of data modeling to baseball. This explosion of data has created opportunities to analyze the game in real-time, and with the application of machine learning, teams are now able to make decisions that influence the outcome of the game, pitch by pitch. Statcast generates up to seven terabytes of data during a game, capturing every imaginable data point and metric related to pitching, hitting, running and fielding, which the system collects and organizes for consumption. In 2015, the Major League Baseball (MLB) introduced Statcast, a set of cameras and radar systems installed in all 30 MLB stadiums. Fast forward 20 years – the use of data science and quantitative modeling is now a common practice among all sports franchises and plays a critical role in scouting, roster construction, game-day operations and season planning.

The book Moneyball, written by Michael Lewis, highlighted the A’s ‘02 season and gave an inside glimpse into how unique the team’s strategic data modeling was, for its time. The Oakland Athletics baseball team in 2002 used data analysis and quantitative modeling to identify undervalued players and create a competitive lineup on a limited budget.
