Analyzing Consistency of Athletes

Update (April 16th): Since originally posting I have added an age-weighted component to the numbers, so that newer results have a larger influence than older ones. As you can see from the changes in the numbers, this adds another interesting dimension to the numbers in this post.

I love getting feedback on my analysis and predictions – very often, they trigger some new, interesting way of looking at the data. For example, Linsey Corbin made the following remark to me:

I wish there was a way that your predictions could show consistency. One thing I pride myself on is being fairly consistent across the board.

Thanks for the suggestion, Linsey (and great to see you back to racing)! I have been looking at different ways of attacking this question, here is what I was able to come up with. I will continue to monitor these numbers for upcoming races, ~~maybe~~ and I’ll include them in future predictions.

Deviation

In statistics, there are a number of way to measure how “consistent” a set of data is. The most common way to express variability in data sets is the “Standard Deviation“. StdDev basically measures the distance of data points from the average value – the more “outliers” there are and the further off they are, the higher the standard deviation.

This was my first try of analyzing consistency. The data analysis part is pretty simple, as the function is built into all kinds of programs. However, the results were not very helpful: In essence it helped identify athletes that had one or more sub-standard results, e.g because of walking large parts of the marathon in a race. For example, Lucy Gossage showed up as an inconsistent athlete with a large deviation, but that was almost exclusively a result of her marathon walk resulting in an 11:32 finish in Kona 2014. It also didn’t value “good” results: The difference of a good result to an average – maybe 30 minutes or so – is much smaller than that of a bad result – walking easily adds an hour to the overall time.

Identifying Non-standard Results and Quantifying Consistency

Even when looking at the deviation of results of each athlete did not lead to a good measure, it formed the basis for another way of looking at the data. In the familiar “bell shape” curve of the normal distribution, 68% of results fall within one standard deviation around the average. When looking at the difference between an athlete’s “expected time” and their actual finishing time, roughly 68% of the results are within 20 minutes of the expected time. Based on this I classify results within 20 minutes of the expected finishing time as “normal”, and any result quicker as “better” results and anything slower and DNFs as “sub-par” results.

I can then aggregate all the results of an athlete into a figure like this:

Linsey Corbin: 83% +17% -0% (18)

Older results have less of a meaning than newer, so adding in an aging component gives the following numbers:

Linsey Corbin: 79% +21% -0% (18)

Each part has the following meaning:

Linsey Corbin: Name of the athlete
79%: Fraction of normal race results
+21%: Fraction of “better than expected” race results
-0%: Fraction of “sub-par” race results (including DNFs)
(Note: Technically, Linsey has at least one DNF in her Ironman races – she didn’t finish IM Texas in 2011. This is a limitation in my data – I have only been including DNF’s since 2014.)
(18): Total number of Ironman-distance results (including DNFs)

Average numbers are about 68% of normal results and roughly 15-20% each of better and sub-par results, but these numbers vary wildly between athletes.

Examples

Here are some more numbers from well known athletes – put into different groups. (As I have updated my algorithm a bit since posting for the first time, I am also including the originally posted numbers in [square brackets].)

Stable Athletes

Andy Potts: 100% +0% -0% (13) [originally posted: 100% +0% -0% (13)]
Yvonne Van Vlerken: 84% +0% -16% (23) [originally posted: 91% +0% -9% (23)]
Lucy Gossage: 92% +0% -8% (12) [originally posted: 91% +0% -9% (11)]
Sebastian Kienle: 85% +12% -3% (11) [originally posted: 82% +9% -9% (11)]

These are athletes where predictions are a very good indicator of how they’ll perform on race day – they usually perform on a very similar level from race to race.

Normal Stability

Jodie Swallow: 55% +0% -45% (10) [originally posted: 78% +0% -22% (9) – she has since DNF’d in South Africa]
Caroline Steffen: 92% +8% -0% (20) [originally posted: 75% +25% -0% (20)]
Meredith Kessler: 65% +14% -20% (23) [originally posted: 70% +17% -13% (23)]
Andreas Raelert: 48% +0% -52% (19) [originally posted: 63% +0% -37% (19)]
Luke McKenzie: 51% +30% -19% (26) [originally posted: 62% +23% -15% (26)]

For these athletes predictions give a good indication, but it is also interesting whether there is a higher potential for an “up-side”, better-than-expected result (larger percentage of faster results, e.g. Carolin Steffen) or for a “down-side” result (larger percentage of sub-par results, e.g. Jodie Swallow or Andreas Raelert). For other athletes, the day could go either way (e.g. Meredith Kessler or Luke McKenzie).

Lower Stability

Sarah Piampiano: 41% +47% -12% (14) [originally posted: 50% +43% -7% (14)]
Luke Bell: 23% +5% -72% (26) [originally posted: 38% +12% -50% (26)]
Dede Griesbauer: 41% +18% -40% (26) [originally posted: 32% +32% -36% (25)]
Tim O’Donnell: 14% +63% -23% (11) [originally posted: 27% +45% -27% (11)]
Pete Jacobs: 5% +16% -79% (26) [originally posted: 15% +42% -42% (26)]

Then there are athletes that have a lower fraction of “normal” results. Here it’s also interesting to look at the upside (e.g. Sarah Piampiano, Tim O’Donnell) or downside potential (e.g. Luke Bell). Some athletes’ results are very hard to predict from previous numbers – for example Dede Griesbauer and Pete Jacobs have had a good fraction of great results but also slower, disappointing results.

2 thoughts on “Analyzing Consistency of Athletes”

Martin May 12, 2016 at 8:30 pm

Hi Thorsten,

tbh I consider your approach quite off in general. You miss the whole dynamic of tapered/untapered athletes, rising season form,… Apart from that bike courses are more different than your numbers/estimations. Rolling hill, bike courses, one flat bike course with one long and steep hill, totally flat course,… Apart from that are weather conditions as well as increasing race experience.

So from my analysis approach it is quite different – less on pure numbers, more on course oddities (weather course, humidity) as well as athlete experience and current fitness for a specific race.

To give you an idea here my predictions for IM Texas, which completely deviate from your data. Lets see who is closer? The final result is in the “total row” where higher is better on a scale from 1-10.

Bib Surname Name total swim bike run
1 Matt Hanson 9,1 6,5 9,0 9,5
3 Eneko Llanos 9,1 9,0 9,5 8,0
40 Patrick Lange 9,0 9,5 9,0 9,0
25 Antony Costes 9,0 9,0 9,5 8,5
13 Jeff Symonds 8,9 9,0 8,5 10,0
6 Nils Frommhold 8,7 9,5 9,5 8,0
36 Jeremy Jurkiewicz 8,7 9,8 8,0 9,0
17 Balazs Csoke 8,7 9,3 8,5 8,5
18 Clemente Alonso McKernan 8,7 9,5 8,5 9,0
2 Jordan Rapp 8,6 7,5 8,5 9,5
7 Matthew Russell 8,5 6,0 8,5 9,0
14 Callum Millward 8,5 8,5 9,0 8,5
5 Justin Daerr 8,5 7,5 8,0 9,0
4 Terenzo Bozzone 8,5 9,0 9,5 7,0
8 Andrew Starykowicz 8,4 9,0 9,5 6,0
10 Michael Weiss 8,4 6,0 9,0 8,5
48 Francisco Serrano 8,4 8,5 9,0 8,0
19 Pedro Gomes 8,3 7,0 8,5 8,5
16 Richie Cunningham 8,3 8,5 8,5 8,5
9 Chris McDonald 8,2 9,0 8,0 7,0
53 Harry Wiltshire 7,7 9,8 7,0 7,5
1. Thorsten May 13, 2016 at 11:33 am
  
  Hello Martin,
  
  all of my analysis and predictions are based on previous results of an athlete. This tells an important part of the story – but of course not the complete story. Other aspects – such as those you mention – have to be factored into a complete picture. I try to highlight some of those in my comments in the section on odds, and these “softer”, more subjective factors also make following the races so interesting.
  Cheers
  Thorsten

Comments are closed.