When all is said and done, this rollout should stand as a case study of how not to use statistics to design a product & marketing campaign.
I wonder if you go outside of the top 20 percent of middle age men in Kona if the rest of us are closer to the 20 percent, than the rest of 20-39 year olds outside of the top 20 percent of young men in Kona. If thatās the case, it helps the local fast and fit guy a bit more at the local IM.
What if they took 100 percent of the Kona field, what do the coefficients look like? Because then you have the Kona pack fodder in every age group, and these pack fodder guys in Kona are the local heros who are largely the ones getting the tailwind of the coefficient and getting the KQ slots.
If you generate the coefficients off the most competitive at Kona but then you apply it to the next tier, of humans, maybe its not as applicable. Itās just a hypothesis based on what is playing out (having not gone thru the entire year cycle yetā¦maybe we get more young people from the European IM qual cycle next spring/summer).
One thing I donāt quite get here (or at least Iām not sure there is a consensus), is what weāre actually using finish times and rankings as a proxy for?
Would the ideal theoretical final ranking for slots be by:
-
Actual vs theoretical max for your age and gender? (So luck + genetics + how hard you trained)
-
Actual vs theoretical max relative to age and gender with adjustments for the fact that life gets in the way as you get older and women face greater societal barriers? (So luck + genetics + how hard you trained + societal adjustment)
-
something else?
like whatās the actual goal here? we seem to have already decided, whether we know it or not, what a fair outcome looks like, and now weāre all trying to work backwards to justify it. It might make more sense to do it the other way around.
i thought the goal was simple, create a metric to distribute slots beyond one per age group. but with all the talk of who is getting those distributed slots it seems my understanding is stupid.
Makes sense, so we could just add a cap and floor to each age groups slots? Either absolute or relative to entries in that age group. So keep the āperformanceā aspect, but say no group gets less than 3 slots, and no group gets more than 10 slots. Just a little tweak but along the same lines as what theyāve already got so should be doable without too much of a fuss (very naive statement I realise)
Floor doesnāt really make sense as it cant be over 1.
A floor of 2 would mean every race would have to have at least 52 slots - which is more slots than 2/3 of the qualifying races. so even if you increase the slots in each race this would be effectively just give slots to the top 2 in in each AG
@devashish_paul yes right now there isnt a huge gap which is why i dont think drastic action is needed. lets see how it balances out after the kona racers return. also this will be 100% different next year when the top men and women are all in kona during this period
it seems like course is making a difference here
in Cali (flat) the top AG time was 20 min faster than the top under 40 AG time
Maryland (also Flat) it was 23 min fast
in Wisconsin it was only 3 min faster
Chatt top qualifier was M30-34
You are mistaking one metric for another. What matters is who shows up. All those times you posted were done by racers, not the course. If the 8 or so guys in the 50 to 59 AG that swamped the spots didnt race that day, completely different qualifying outcome. Has nothing to do with he course.
Yes some courses will be tougher than others, some swims cancelled or shortened, but at the end of the day it just matters who shows up to race on the day. Thus far it has been men top heavy, while the top women had other fish to fry in the actual Kona race everyone is so amped up about..
that is the biggest thing (and to me always has been the biggest thing); my question is for the folks who recently finished second and are quite upset, where would have they placed had Kona not been the week or so before hand⦠maybe they would not have been so upset⦠we need to let this system breath.
I appreciate the efforts of the folks thus far trying to bring some data to this.
I went ahead and scraped all of the data for all of the participants in the 2026 Kona qualifying races that have happened so far (did not include 70.3 Hawaii).
HERE IS THE COMPLETE AGGREGATE DATA FOR EACH RACE AND AG:
NOTE ā This is just the people who were OFFERED slots, not who took them. Please let me know if you see any errors and I will do my best to fix them.
Please also send me ideas of new KQ systems to investigate. So far in this and the other thread I have seen:
- Re-weight performance pool slots on a for each race (rather than comparing to Kona times ā working on this one now)
- Separate womenās and menās performance pool + podium slots like with 70.3 worlds (AKA equal representation)
- Top 3 in each age group get offered a slot
- Time cut-off of some kind (would need a clearer way to do this)
- Distributing number of slots equally across all races
- Distributing number of slots proportionally based on number of entrants for that race
Idk who the @SlowtwitchSupport mods are here (@Bryancd ??) but Iād be happy to write up my findings from these in a full post.
(My bias ā *In my day job I work as a data journalist for one of the bigger news outlets here in the U.S. Iām a M25-29 Ironman and Iād love to go to Kona someday, but Iām currently about 4 hours too slow and my chances of qualifying in 30-39 about about 0.)
(Note: I noticed an error with the gender percentages that I went ahead and fixed)*
Go for it! This thread is sort of a in perpetuity rolling analyses as races drop. You can have a separate thread with just your data.
Thanks for sharing. In case you didnāt see, someone else on this thread posted a similar spreadsheet. The main comparison theyāre doing is versus the old allocation system. Maybe you can work together.
Mods, is there any way to pin these posts in the thread so people can easily reference the spreadsheet links?
Anyway, I think other system tweaks people have mentioned are:
- Using some other Kona results metric to derive the coefficients (average of top 3, top 3-20, top 5%, top 50% versus the current top 20%)
- Using data from all Ironman races to derive the coefficients (either using the top 20% or some other metric)
- Excluding legacy entrants from the results used to calculate the coefficients
Another thing I think would be interesting is a larger analysis of how this current system would have played out in past years (probably even pre 2020). Someone has done some partial analysis of the past few years for the fall races earlier in the thread.
A long wish list of some not so easy analysis, but you asked⦠![]()
Data journalists are the best!!
Cross posting this here as well. Using IMCA, I show that the issue with the older men getting slots has more to do with the fact that the depth of field in M55 is considerably deeper than elsewhere and that this is driving the slot allocations. I do think that the coefficients need tweaking (XC, Legacy, Kona basis), but in this case M55 was considerably deeper than M30, and the slots bear that out.
If anything the person on the outs (at least as I saw) was 2nd place in F30.
There were 163 M55-59 at IM CA or ~7% of total participants who finished, 8.7% of the men who participated.
(Note: I noticed an error with the gender percentages that I went ahead and fixed)
At quick glance something does not seem right here - I just know that i was the final member of the M30-34 AG to get a offered a slot at IM WI and I was 7th in that AG (you have 9)
Also the total number of slots for that race was 40, but you have 47.
For cali you have 64 slots but there was only 55
feel free to cross reference my sheet posted earlier in the thread
I have 55 slots for CA and and 40 slots for Wisconsin.
I fixed another error earlier today but that shouldnāt have affected the slot number.
9 represents the number of people in that age group that would have been offered slots based on their time. If you were 7th in your AG, then your AG ranking wouldnāt matter at all ā it would go to the performance pool. 2 others slower than you in the AG may have been eligible to claim one of those slots but did not show up/already had one from a previous race.
Iāll double check on this again though.
Maybe Iām reading it wrong but when you sum column c itās 47
I can confirm at WI no one lower than 7 in my age group was eligible in the performance piok
finally look at whats happening in more detail
It looks like you are grabbing the top X (55, 40, etc) in the performance pool and assuming they would all be offered the slot. This is not the case - If one of the AG winners is outside the top X then it reduces how deep the pool goes.
In the case of Wisconsin and California there were AG winner that didnāt crack the top 40 or the top 55 respectively in their performance pool ranks. So the pools only offered slots to the people in the top 32 and 46 in the performance pool respectively.
Wait Iām confused ā my understanding of the system was that:
-
Every AG winner gets offered a slot not matter their time
-
After every AG winner gets offered a slot they go to the performance pool for the remaining slots
This appears to be what Ironman says on their website:
So I did this:
df['adjusted_time'] = df['Finish Time'] * df['Factor']
df['auto_kq'] = False
#if you win your age group you get a slot
df.loc[df['Age Group Ranking'] == 1, 'auto_kq'] = True
...
#rank people on their adjusted times for each event
df['rank'] = df.groupby('EVENT')['adjusted_time'].rank(method='first').astype(int)
#if you don't already have a slot (auto_kq == False) and your adjusted time rank is below the total number of slots, you get offered a slot
df.loc[((df['auto_kq'] == False) & (df['rank'] <= df['SLOTS ALLOCATED'])), 'auto_kq'] = True
Is this incorrect?




