A Tale of Two Tires (long)

NOTE: I am leaving this original post intact, but look on page 2 of the thread for an update based on something somebody pointed out. I did 10 trials for each tire, but only pasted 9 of them into some of the original analysis. I’ve updated figures for the missing set of trials.

As promised, I am posting the results of some field testing I did comparing two different front wheel and tire setups. The results were both as expected and not as expected.

The two wheel setups were:

Zipp 404 (2001 model) shod with tubular Tufo S3, size 21. Glued with Continental glue about one year ago. The gluing job at that time involved a full cleaning of the rim, multiple coats, etc. That wheel+tire has been trained and raced on 4-5 times in the past year for a total of about 150 miles. It has been stored inflated. It does not have sealant in it. To the casual observer (me) the tire looks virtually new.

Zipp 808 (2005 model) sporting a clincher Michelin Pro Race, size 23, with standard weight inner tube (Specialized). That tire has about 1,500 miles on it.

Bike setup was my Yaqui DE, size 55 TT bike. Rear wheel was PowerTap hub on OpenPro rim and Michelin Pro Race size 23 clincher. No water bottles; two frame cages. Both tires at 115 psi. No adjustments made to any bike fit parameters during testing window.

I wore the exact same clothing for each trial – snug sleeveless tri-top, basic bike shorts, Bell X-Ray helmet, TriVent shoes. No arm warmers or leg warmers.

Trials took place in three batches in a six-day window. Course was Fiesta Island “short loop,” which is 2.5 miles, give or take 50 meters depending on exactly where I hit the interval button on the PT computer. I measured single laps, stringing together 1-3 in a row on each wheel before changing wheels. Each lap began with a flying start at about 20 mph, holding a given power output target for that lap. Each lap was completed at a constant power. For example, I would do a whole lap at ~175 watts, then start over for the next one at, say, 215 watts. A total of ten laps were completed for each wheel+tire.

Conditions ranged from a barely perceptible breeze to dead calm. I did each set at 5:30 to 6:30 am, and quit before the sun got up in the sky and kicked up the wind. Road surface there is fairly good – about what you’d expect on most N. American Ironman courses. Slight rise and fall of ~20 feet per lap.

I recorded the average watts and average speed off of the PT computer display. I was not able to complete downloads into CyclingPeaks on all of the runs, so I used the PT computer data. More on that point later.

The data:
404+Tufo

Power Speed (mph)

162 19.5
205 21.6
173 20.6
165 19.6
183 20.3
210 21.9
164 19.7
214 22
235 23

808+Michelin
Power Speed (mph)

173 21
211 22.7
222 23
215 22.9
190 21.8
188 21.6
160 20.5
242 23.9
245 23.9

A casual examination of these figures should convince anyone that the 808 setup is much faster (at the 99+% confidence level). It’s not even close.

The big question is, “Why?” Is it the tires or the wheels? Let’s first talk about the wheels. Zipp has published figures that indicate that the 2001 404 is about a half second per kilometer slower than the 2005 808. In terms of my data accuracy (tenths of a mile per hour) that is barely measurable. While I had intended to make an exacting adjustment to the data for the drag differences between the wheels, I think it is a waste of time to do so. The differences overwhelm an adjustment that small.

So, I start by assuming that the entire system of bike and rider had the same CdA for all trials. (If Josh at Zipp thinks this is a bad assumption, please feel free to say so…).

With full credit to technical assistance from my mate from the Great White North, rmur, assuming a CdA of 0.27 and some other constants, we would estimate the Crr’s to be:

404+Tufo: 0.0073

808+Michelin: 0.0050

This is equivalent to a 15-20 watt differential to travel the same speed. Note that these Crr estimates are for *both tires being the same. *In the case of the 404+Tufo setup, all of the difference is attributed to only one tire. In other words, these figures understate the Crr for the Tufo tire. Under our assumptions here about CdA, the Tufo is actually somewhat worse than 0.0073, and the wattage penalty for the Tufos is somewhat worse than 15-20 watts. Throw in a small speed adjustment for the 404, and maybe we get some of those watts back. But just a little. The difference here is 10-12 minutes in an Ironman, my friends. And that’s just one of two tires.

I could leave it at that and claim to have proved a point, but there is a puzzling anomaly that bears reporting.

In the method above, we made a (perfectly valid, IMO) assumption that the CdA was near-constant for both setups. It turns out that we can test that assumption. Via a method published by Allen Lim, one can analyze these types of trials at varying speeds and power levels and extract CdA and Crr separately. Kraig Willet was kind enough to crunch the numbers for me and show me an odd result.

According to the Lim method, the 404+Tufo trials exhibited a materially lower CdA than did the 808 setup trials. While we assumed above that they were the same, or that, at most, the 404 was a little slower, the data tells us the “slower” wheel was faster in terms of CdA. How much faster? A CdA of 0.24 versus 0.28. If you didn’t realize it, this is a shocking gap.

This has two serious implications. First, if it is true (and I don’t think it is), Zipp better go back to the wind tunnel! If it is true, it means that the Tufo tires performed far, far worse than rmur estimated. The Crr had to have been so bad as to overcome a profound aerodynamic advantage held by the 404 wheel. While I am convinced the Tufos performed badly versus the Michelins, they cannot be that much worse! Knobby mountain bike tires maybe, but not the Tufos.

The second implication is that the data has some flaw in either the way it was gathered or the way it was computed and recorded. The only “gathering” flaw would be if I had held the aero position while on the 404s and sat up while on the 808s. I assure you that was not the case, so we must look elsewhere.

Two possibilities: First, I discovered that the PT head unit reports average power and speed figures that are slightly different from the averages one gets if one downloads the data and analyzes the raw file. Average power and average speed are sometimes under-reported on the head unit, versus the data actually recorded on-the-fly. The differences are in the 0.4% to 1.1% range, and I cannot discern a pattern. So, it is possible that, by reading the results off the PT head unit instead of from a download, I have flawed data. It is unlikely to affect the basic conclusion from the first half of my discussion because the possibility of mis-calculated average figures would seem to apply equally to both wheel setups. While these changes might be enough to throw off the regression results from the Lim technique, they are quite unlikely to throw off the basic conclusions.

Second, Lim’s method argues for lots of data points. In particular, it begs for additional data points from low-power, low-speed trials. My slowest trial was 19.5 mph at 162 watts. Lim did trials starting at 100 watts in his paper. Kraig has suggested I go do a few more low-speed trials. I think I’ll do that and see if the Lim formula spits out results that are a bit closer to what we might expect. It is an open issue that needs resolution. It is, quite frankly, impossible that the 404 setup reduces the CdA of the entire bike + rider system by a whopping 0.04.

So, what do I conclude?

This and only this: A 2005-era Zipp 808 clincher front wheel mounted with a Michelin Pro Race 23 mm tire requires 15-20 less watts to push along at a given race-level speed than a 2001-era Zipp 404 tubular front wheel mounted with a Tufo S3 21 mm tire, on a bike set up as described, and piloted by me.

Of that much, I am quite certain. The data is overwhelming, and there is no room for the null hypothesis.

What I am less certain about is how much of that difference is strictly due to tires. I have an overwhelmingly strong hunch that it is all in the tires. But until I run some more trials, the Lim calculation leaves me scratching my head.

Questions?

Nice work; I like it. I might’ve been underestimating the difference among wheel/tire combos. Yikes.

Now with my critical cap on, your power is all over the place. Well, not really, but it’s certainly not as steady as you’d like it to be. When/if you get it into Cyclingpeaks, I’d be curious to see the diff between mean power and normalized power, and how that relates to speed.

Thanks Rick. The deck is becoming heavily stacked.

I know what I’m going to do…

Ash, your avg wattage (across all trials) for the 808 setup seems higher. So that might be a consideration.

I have two others that I will throw out there for you:

Clinchers – Josh has discussed, and really anyone can see from looking, that there is a major disconnect between the tire->rim joint (josh recommends filling with silicon as a possible fix) on ANY clincher. The 808 being 0.5 seconds faster is, I am fairly certain, a clincher-clincher, tubular-tubular wheel comparison. A clincher rim WILL produce more aero drag than a comparable tubular rim because of the joining of the tire at the rim. So this could be one point. Josh would have to be the one to say if 0.04 was reasonable for this.

The 21mm tire is narrower and may merge with the 404 rim better, further enhancing the difference.

I wouldn’t overlook the aerodynamics of the tire, tire+wheel, and clincher vs. tubular aerodynamics. That is my own perception. And any variance in your CdA may have been magnified by the higher wattage output (average) in the 808 tests over the 404 tests…

Nice work; I like it. I might’ve been underestimating the difference among wheel/tire combos. Yikes.

Now with my critical cap on, your power is all over the place. Well, not really, but it’s certainly not as steady as you’d like it to be. When/if you get it into Cyclingpeaks, I’d be curious to see the diff between mean power and normalized power, and how that relates to speed.

No – the power for each trial was spot-on constant. Avg power = normalized. If a trial above says “214 watts” that means I rode right on 214 watts for the entire trial. I then stopped, turned around for another rolling start, and did the next one at another power. I used different power levels to generate a variety of data points. It improves the confidence in the results, and allows us to use the Lim regression method.

Again – each trial was at a chosen power level, held constant for the entire 2.5 mile loop.

And any variance in your CdA may have been magnified by the higher wattage output (average) in the 808 tests over the 404 tests…

The math (forces) doesn’t work that way. CdA is a dimensionless number, and does not vary with speed or power.

Surely the two wheels+tires are not identical. We can go on for a long time about why the 404 with tubular “might” be faster than the 808 with clincher. Ok, fine, maybe it is (but I doubt it). But – the magnitude of the measured difference in the Lim regression is just not possible. You don’t get 0.04 CdA variation between any two wheels! Before we get too far into angels-on-heads-of-pins over rim-tire discontinuities, we need to back up and keep the size of the effects in mind.

The real conundrum is that the 808 was overwhelmingly faster, yet the 404 somehow comes up with better CdA numbers. That’s why I think the Lim regression result is wrong, and I need to do more low-speed trials, with downloaded PT data. Any result from that effort will not effect the main conclusion – 808+clincher is faster than 404+Tufo tubular.

It (CdA) does if your body position changes unknown to you as you start put the pedal down harder or faster. It could cause f the whole system to have changed in some way – such as if there was a change in cadence that accompanied the change in power. Cadence can play a HUGE role in determing drag. I have a BSE in MAE, I know my aero.

It is at least something to think about.

10 points also seems kind of lean for any kind of regression, I agree.

It’d be interesting to know how the hubs bearings of the two setups differ as well…

It’d be interesting to know how the hubs bearings of the two setups differ as well…

Didn’t test for that, of course. If somebody wanted to, I guess they could swap a 2001 Zipp hub with a 2005 Zipp hub and find out. Kinda pointless, since it’s the tires.

10 points also seems kind of lean for any kind of regression, I agree.

Not for *any *kind of regression – just ones with poor fit. We have R^2 of .9912 (808 plot) and .9198 (404 plot). Single-variable regressions don’t get much tighter than that!

The 808 setup has lower Crr with 90% confidence under the Lim method (allowing CdA to vary between setups), and at 99% confidence if assuming the same CdA.

All the same, something seems amiss in the CdA result. We can put the Crr question to bed – that is a done deal. I want to revisit the CdA issue with some more trials.

BTW – these power levels are nice comfortable riding for these 6-7 minute loops. No mashing or rocking was required. Cadence was my usual 78-82 throughout. This was “metronome riding.” Not perfect, but pretty close.

Glad to see you convinced yourself. Also, you state that a total of 10 laps per tire & wheel combo were done. However, you only show data for 9 data points each.

I remember reading that the newer 404’s were significantly faster than the older 404s. I believe that it was at the point in time when the sidewall bulge was introduced. Wasn’t that after 2001? That could be a significant contributing factor. I believe that the comparison is somewhere on the Zipp site.

Don’t know the exact date of bulged rim introduction, but my 2002 404 has the bulge.

Ashburn

I have no data whatsoever to back it up, but I can tell you that I only ever rode TUFO tubulars once. It was in my pre-powermeter days, but I didn’t need a PM to tell me that they were slow. I still have them hanging up in my garage and they are free to a good home - or any home for that matter.

Bill

Also, you state that a total of 10 laps per tire & wheel combo were done. However, you only show data for 9 data points each.

D’oh! I copied and pasted nine into the post I made on biketechreview, and then copied that paste in here. There is another data pair on my office computer. The calculations above don’t include the 10th set.

I’ll forward those to the math wizzards and update this post.

Thanks for doing this. People will come up with flaws in your approach, but whatever…

Maybe Tufo should consider selling them as a training aide… Tufo PowerTires!

All of your measured data is in 3 significant figures yet your rolling resistance numbers are in 4 significant figures.

How did you do that?

A better approach, but I would still like to see someone use the same wheels and mount different tires on them and compare them to a control set. That way you wouldn’t be comparing apples to oranges.

jaretj

Very nice work! Why not swap out the Tufo for something else before the next trial.

i’m giving up on all reasoning, it’s obvious that the mob mentality has taken over this issue. the extremly bright and motivated people doing these tests are making just small errors that can throw a test either way.

please everyone send me your tufo tires if you actually believe that a tire can make a 15-20 watt difference (1.0 to 1.2 mph)

jaretj

I’m wondering about the bulge theory. Rick: do your 404s have the ‘bulge’?

Here’s my wildly speculative hypothesis: deeper rims generally have minimal advantage at 0 yaw to begin with (there are tests, including Zipp’s I believe that show this); This combined with the fact that either the 23 tire and/or the ‘bulge’ increase Cda, gives the 808 worse numbers.

The more I think about it, the more I think it’s that 23 tire. I’ve always run my 808 with a 20 (it actually measures 21). One expert we know has expressed some reservation about Zipps claims that the 808 works fine with any tire width. Fortunately, it won’t be that hard to test since it’s just a clincher.

– Jens