Proposed Stats and Definitions
The following stats and definitions are based on my interpretation of Ryan Stimson's methods and definitions for the Passing Project. I'm also collecting additional pieces of data that have the potential for deeper insights into player and team performance.
Collected Game-State Data
This data accounts for the score, strength, and time at which each play takes place, and the team generating the play. The pieces collected are:
- Period - Marks the period in which the play takes place.
- Score - The score of the game at the time of the play. When a goal is scored, the score is recorded as it was before the goal took place because this is the score state at the time of the play. This data allows us to account for score effects.
- Strength - The first number is the number of away skaters on the ice, the second number is the number of home players on the ice. E.g. 5v4 means the home team is shorthanded. 5v6 means the home team has pulled their goalie.
- Time - The time listed on the clock when the play takes place. This allows us to sync our data with other sets.
- Team - The team making the play.
This data is collected for players who take a shot attempt in the offensive zone only. Shots that are taken from outside the offensive zone are rarely dangerous (aka this data is noise) and are therefore not recorded.
- SOG - Shot on Goal.
- MS - Missed Shot.
- BS - Blocked Shot. A shot that is blocked or tipped by a defending player, or a shot blocked by the body of a shooter's teammate.
- Posts - I'm tracking posts separately from MS (that's what they're normally tracked as, right?) because I think posts, as being almost goals, could indicate the play is a higher quality scoring chance.
- NS - Non-shot. A play in which a player has possession of the puck in the scoring chance area, has the opportunity to shoot, but for whatever reason does not get the shot off. This is a way of accounting for scoring chances that aren't recorded as shot attempts.
- SCS - Scoring Chance Shot. Credited in addition to any of Shooter Stats 1 to 5 if the shot attempt is taken from the scoring chance area.
- Goal - credited to any shooter whose SOG is also a goal.
Collected Passer Statistics
- A1 (similarly, A2) - Credited to a player who passes the puck to a player who then takes a shot attempt (A2 is credited to the passing player who is one pass removed from the shooter).
- A1 D/N/O (similarly A2 D/N/O) - The location from which the A1 (or A2) passer passes from; Defensive zone, Neutral zone, Offensive zone.
- SCA (similarly, SCA2) - Scoring Chance Assist. Credited to a player whose pass directly leads to a shot in the scoring chance area (SCA2 is credited to the A2 player who's pass directly leads to the A1 player's pass coming from the scoring chance area, which leads to a shot attempt. See example play below.)
Failed Passes
I'm also tracking FP - Failed Passes - and FPL - Failed Pass Location. These are passes that do not reach their intended target and result in a change of possession. The pass's starting location and intended location are recorded. Generally failed passes are the fault of the passer, but the pass receiver can instead be credited with the FP if the tracker judges that the receiver is at fault for not receiving the pass. Failed Passes are a potentially reliable way to capture player errors.
Recording the Data
The data is recorded in Excel using an event-based model (i.e. each play is recorded as an event). Each row represents a play, and each column represents a stat (listed in the header row) credited to a player involved in the play. Player numbers are inputted into the fields depending on what the player contributes to the play. Refer to the sample image below (game-state data not shown). Included is the header row, and two rows representing two separate plays.
The first row contains the data for this goal by Curtis Glencross, setup beautifully by Hudler and Monahan.
Glencross (20) is awarded with:
- SOG because he gets a shot on goal
- SCS because his shot is taken from the scoring chance area
- Goal because he gets a goal
- A1 because his pass leads to the shooter's shot
- O because his pass comes from the offenzive zone
- SCA1 because his pass leads directly to the shooter's shot which takes place in the scoring chance area
- A2 because he is one passer removed from the shot (and because this play results in a goal, he's awarded the second assist).
- N because he passed from the neutral zone.
- SCA2 because his pass leads to Hudler skating into the scoring chance area unimpeded (Muzzin is angling Hudler away from the net, but Hudler is able to skate into the scoring chance area as a direct result of Monahan's pass).
Data Outcomes
The event-based data model allows for a large amount of raw data to be collected relatively simply, and offers many advantages:
- Time-stamps allow this dataset to be synced with others.
- Each play is associated with the score of the game, making score effects easy to account for.
- Each play is associated with the manpower situation (strength), allowing us to collect data for both even strength and man advantage play.
- The raw nature of the data allows for an endless possibility of analyses, including the ability to calculate metrics such as Corsi and Stimson's SAGE.
- The location data captures where on the ice the different elements of a play are taking place.
- The data captures the interactions between players involved in the same plays.
***
Readers, methods can always be improved. If you have suggestions for better definitions or methods for the data collection, please let me know.
If you'd like to join the Passing Project and collect data for an NHL team, you can reach out to Ryan Stimson on Twitter @RK_Stimp / or by email hockeypassingstats@gmail.com
***
References
Stars GM Jim Nill wants to see the NHL implement SportsVU league-wide. Thomas Drance. http://www.thescore.com/nhl/news/542630
2013-2014 Devils Passing Review: A Passing Stats Primer. Ryan Stimson. http://www.inlouwetrust.com/2014/7/21/5899095/a-passing-stats-primer
Goal by Curtis Glencross. http://www.nhl.com/gamecenter/en/boxscore?id=2014020539
If you'd like to join the Passing Project and collect data for an NHL team, you can reach out to Ryan Stimson on Twitter @RK_Stimp / or by email hockeypassingstats@gmail.com
***
References
Stars GM Jim Nill wants to see the NHL implement SportsVU league-wide. Thomas Drance. http://www.thescore.com/nhl/news/542630
2013-2014 Devils Passing Review: A Passing Stats Primer. Ryan Stimson. http://www.inlouwetrust.com/2014/7/21/5899095/a-passing-stats-primer
Goal by Curtis Glencross. http://www.nhl.com/gamecenter/en/boxscore?id=2014020539