Expected points

Introduction

The first article on the concept of the Win Expectancy Index based on Statistical Exploration, the introduction of W.E.I.S.E., explained the process and concept of win expectation. Using historic race results by athletes with variables identical to those of the athlete being analyzed, it calculated the percentage of wins. This gave us an idea of how likely it was for our athlete to win as well.

The second article on the topic gave you some examples of the possible use cases of the WEI.

After further developing the WEISE, this third article explains and demonstrates a new calculation using the same data and logic as the WEI. In this case, however, it does not look at the results binarily (winning or not winning), but rather at every result that awarded athletes with points*. The results of this calculation are summarized in the Expected Points Index (EXPI).

^* see yellow box further down in the article

As a quick reminder, this is what the Win Expectancy Index looks like for all third laps of women’s sprint races (see the interactive version of the full WEI on Tableau Public):

Win Expectancy Index example
A female (W) athlete entering lap three of a SPrint race with 1 miss and ranked #3 in ski time, has a **20%** chance of winning the race.

Version 2 of W.E.I.S.E.

One of the limitations of the Win Expectancy is that it only looks at the race results binarily: you either win a race or you do not win the race. But since World Cup races are rewarded with points we can use the same approach to calculate the EXpected Points (EXP).

This is a good time to remind you that W.E.I.S.E. uses calculated points based on calculated ranks, which can significantly differ from the ranks (and points) you find on the IBU website. All pursuit race results are (re)ranked based on the isolated times, thus ignoring the time differences at the start of the races which are based on the sprint race results. Also, in the season point totals, in addition to including points for Olympic Races, I do not deduct the points of the worst two races of the season, as is common in the IBU total standings. Lastly, all points are recalculated based on the current rules on awarding points as defined by the IBU.

For all race participants in the last 22 seasons that were in the same race situation for cumulative misses and ski rank, we calculate the average of the points those athletes were awarded (link to interactive dashboard):

Again, like with the Win Expectancy, we can compare expected results with actual results. But now we can do so with far greater detail. Rather than saying “Eder had a 32% chance of winning after the second lap, but he didn’t win” we can now say “Eder could expecting 15 points after lap two, but got 18 points in the end”. And since we have this more detailed information at the points level, it allows us to calculate overperformance.

Overperformance

We simply subtract the EXpected Points on any lap during a race from the Actual Points (AP) at the end of that race. This measure tells us if the performance was above or below the expectation for every athlete. Overperformance is therefore defined as the actual points above the expected points. A negative value for overperformance simply indicates that the actual points were lower than the expected points. This can also be referred to as underperformance.

Let’s look at an example. In the image below, we have all races of the 2021-22 season for Julia Simon. Based on the overperformance calculation, we can see that she generally performed quite a lot better than expected:

Julia Simon has performed better than expected based on her last laps in all races in the 2021-22 season

When we turn this chart by 90 degrees and add lines for actual points (AP, green) and expected points (EXP, blue), it shows us Julia’s seasonal trend:

Again Julia Simon, comparing Actual Points (AP) to Expected Points (EXP) per final lap of the race
for the 2021-22 season (lines), and the overperformance (bars)

Running total of overperformance points

Now, rather than looking at the data race by race, let’s look at the overperformance cumulatively. Showing the running total of these values, we get a sense of how much Julia has overperformed this whole season. An amazing 120 points more than she could have expected based on historic results:

Same as above but with a running total of Over-and Under Performance based on final laps

Let’s move forward with that idea of the running total of overperformance points for the season. The following chart plots all female athletes of the 2021-22 season based on the total actual points (vertical axis) and the running total of overperformance points (horizontal axis):

Total actual points and running total of over-and under performance points for women’s final laps of races in the 2021-22 season

Based on this chart, Julia Simon, Dorothea Wierer and Lisa Hauser had the best total of overperformance points this season. They scored far more points than expected from historic results, based on their misses and ski rank going into the final lap of the races. Alina Stremous, Mari Eder and Vanessa Voigt were the worst overperforming athletes (aka underperformers) scoring much fewer points than expected.

Analyzing the past season, the athletes who overperformed did very well. Alternatively, when looking at the season ahead, athletes that underperformed last season could have expected to do better. So with some specific improvements over the summer, and perhaps some better luck, they may be able to make huge jumps next season if they can perform up to or above their expected points.

Season(s) averages and spread of overperformance

Per athlete, we can take the average of all the overperformances to see how they performed per season or over a specific timeframe. The other thing to look at is how large the difference was between their best overperformance and worst overperformance during that timeframe, the overperformance spread, or variance.

The last charts of this article shown below, also available interactively in the W.E.I.S.E. version 2 dashboard, show all athletes of the 2021-22 season plotted based on their average points overperformed per race (final laps only), and their overperformance spread:

Dorothea Wierer had an average of 4.56 point overperformance and an overperformance spread of 31.1

With overperformance it is fairly straightforward to determine good and bad: a positive overperformance is good (and the larger the better) and a negative one is not good, or at least leaves room for improvement. With the spread, it is a little less clear. As it is strictly based on the best and the worst overperformances during the selected season, it could be used, cautiously, as an indicator of consistency.

Another example. When we dig into Wierer’s data a little deeper, we can see that with an already strong average of +4.56 points of overperformance. Her average would be even better if the two negative outliers could be excluded. Did she perhaps brake a pole in the last lap of that individual race or had a fall in the last lap?

It will be up to Wierer and her team to figure out what happened in those two races to learn and improve. And for other athletes to find out how Wierer is so good at outperforming the expected points.

Dorothea Wierer had her best overperformance at 14.9 points above expected, and a worst of –16.2 overperformance, for a spread of 31.1

Conclusion and further development

In the first article on the Win Expectancy Index based on Statistical Information, I mostly focused on explaining the underlying data, processes and models, and the concept for the index itself. The second article gave some practical examples of the application of the index. This third article introduced an alternate and more detailed measurement based on the same concept: EXpected Points.

Tools like the W.E.I.S.E. are not suggested to eventually replace current and conventional wisdom, skill, expertise and knowledge in the field of biathlon. But it provides a different view based on actual data. And this is an approach not commonly and seriously used by many athletes and nations in the world of biathlon. Yet? Hopefully, articles like these will open some eyes to new possibilities that eventually, combined with, rather than instead of, current knowledge and expertise, will push biathlon athletes even further.

Thoughts or comments? You can find me on Twitter, or email me on the podcast Gmail account: PenaltyLoopPodcast

Introduction

In my last article, the introduction of W.E.I.S.E., I introduced the process for creating the calculation of the expectation of winning, based on historic results with identical combinations of discipline, gender, race lap, the cumulative number of misses and cumulative ski time rank. After further developing the WEISE, this article demonstrates some practical uses of this Win Expectancy Index (WEI).

As a quick reminder, the interactive WEI dashboard can be found here and looks like this (with lap 3 selected):

Examples of practical uses of the Win Expectancy Index

A tool like the Win Expectancy Index only makes sense if there is actual value in using it. Personally, I think it is very interesting to just look at the index, but I can imagine there are not too many people who share that passion for biathlon, data and data visualization. Below are some practical examples of how the Win Expectancy Index can be used to analyze athletes and races.

Athletes

Final lap Win Expectancy versus actual result

The conversion rate of opportunity to wins. For example, Elvira Oeberg’s conversion rate for races in which her Win Expectancy on the last lap was 33% or higher, was 60% (she won 3 of the 5 races).

Elvira race results in the 2021-22 season and Win Expectancy in the last lap of the race

Seasonal trend of average Win Expectancy per race for the final lap

This allows us to look at an athlete’s performance without the influence of the actual race results

The trend of JT Boe’s Win Expectancy in the final lap of every race

Average Win Expectancy per lap per discipline

Provides insight into a racer’s ability to balance their races well, and still perform well right to the final lap of a race

Looking at average Win Expectancy development per lap per athlete for sprint races

In this example, we are looking at all sprint races for women in the 2021-22 season. We then average each athlete’s Win Expectancy per lap. From this, we can see that for athletes like Roeiseland, Elvira Oeberg and Hauser the Win Expectancy increases towards the end of the race. On average, Sola, Alimbekava and Nilsson’s Win Expectancy declines in the last lap.

This gives athletes and coaches a tool to further analyse data at an individual race level to see if there is an issue or potential for improvement.

Races

Win Expectancy per race

How does the Win Expectancy change over the course of the race, and how did the eventual winner develop during the race?

Win Expectancy for athletes in the Women’s Sprint during World Cup 8 of the 2021-22 season

The trend of Win Expectancy per miss as ski rank increases

Here we can see that for athletes in the top-10 for ski rank, the difference in WE between 0 (dark green) or 1 (light green) miss is fairly consistent

Win Expectancy for all athletes in lap five of all five-lap-races with either
one (light green) or zero (dark green) misses as their race rank increases

The trend of Win Expectancy per ski rank (group) as misses increase

We can see here that the WE when skiing top-5 with one miss (~43%) is lower than when skiing top-10 with 0 misses (~49%)

Win Expectancy for all athletes in lap five of all five-lap-races with a ski rank as indicated by the labels as their race misses increase

Using Win Expectancy data to indicate some of the more exciting races

For biathlon viewers and fans, we can look at races where the highest value for Win Expectation (all athletes) during the race was 20% or less (a close pack). And within that group, the number of athletes that had that highest value of WE at some point during the race. If those numbers are 2 or more in sprint races or 4 or more in all other races, we then look at the final rank of the athlete(s) with the highest WE. If that was 10 or better we highlight the races with a pink star:

Based on this logic, the Women’s Sprint of World Cup 7 in the 2002-03 season would be interesting to watch (the highest WE of the race was only 14%, three athletes had that WE during the race and the highest rank of those three athletes was 9th). Or more recently, rewatch the 2019-20 World Cup 8 Women’s Pursuit (15%, 5 and #4).

Conclusion and further development

In the first article on the Win Expectancy Index based on Statistical Information, I mostly focused on explaining the underlying data, processes and models, and the concept for the index itself. This article dove into the application of the index by showing some examples of how it can provide value in analyzing the performances of athletes related to historic results with the same variables. I hope that this gives a better understanding of the concept of the WEI, and highlights its potential for performance analysis and athlete improvement based on a different, data-driven, view of biathlon performance.

Thoughts or comments? You can find me on Twitter, or email me on the podcast Gmail account: PenaltyLoopPodcast

Historic biathlon results create expectations. But what about points?

Introduction

Version 2 of W.E.I.S.E.

EXpected Points

Overperformance

Running total of overperformance points

Season(s) averages and spread of overperformance

Conclusion and further development

What do you expect? Practical applications of the W.E.I.S.E.

Introduction

Examples of practical uses of the Win Expectancy Index

Athletes

Final lap Win Expectancy versus actual result

Seasonal trend of average Win Expectancy per race for the final lap

Average Win Expectancy per lap per discipline

Races

Win Expectancy per race

The trend of Win Expectancy per miss as ski rank increases

The trend of Win Expectancy per ski rank (group) as misses increase

Using Win Expectancy data to indicate some of the more exciting races

Conclusion and further development

Recent Articles

Categories

Archives by Month

Search Articles