• Contact
Posted by Huckleberry on January 8th, 2009 under Football
Many ratings systems are based on the idea that a sports team’s (or athlete’s) performance is normally distributed about their true strength. This true strength is estimated, then, to be the mean of the normal distribution and is what the system will spit out as the team’s final rating. This idea that a team has a true strength around which their performances are normally distributed is both logical and convenient for the computers.
The question is raised, though, about the standard deviation of performances. A consistent athlete or team should show a low standard deviation in their performances while an inconsistent team, of course, should have a high standard deviation. As usual, I will begin by issuing a few warnings regarding the information I’m going to present. In order to calculate a team’s standard deviation (or consistency), you have to assume that their opponent in each game was playing at its mean. With the limited number of data points in a college football season, it seems entirely possible that a team may be fortunate enough to catch more teams on bad days than good days or vice versa, and to do so in a high enough percentage to affect their consistency rating. And that warning includes what should always be considered when dealing with statistical analysis of college football – there aren’t enough data points to be completely sure about most things we care about. Sure there’s enough data to tell me that Florida should practically never lose to North Texas, but delineations between teams of similar strength don’t come with a great amount of confidence in this sport. (Playoff, anyone?)
I calculated each team’s consistency rating based on their power score for each game played. This power score can essentially be viewed to be the margin of victory in each game compared to the margin of victory an average opponent playing an average game would have against that opponent when the opponent played an average game. That’s a lot of average; I almost feel like I’m discussing Texas A&M recruiting. For this analysis I focused only on Division 1A teams and matchups.
| Most Consistent Teams | ||
|---|---|---|
| Rank | Team | SDEV |
| 1 | Texas A&M | 8.35 |
| 2 | Air Force | 8.50 |
| 3 | Georgia | 8.71 |
| 4 | Miami (FL) | 9.11 |
| 5 | TCU | 9.29 |
| 6 | Akron | 9.38 |
| 7 | Virginia Tech | 9.39 |
| 8 | Boston College | 9.57 |
| 9 | Nebraska | 9.71 |
| 10 | Syracuse | 9.72 |
| Least Consistent Teams | ||
|---|---|---|
| Rank | Team | SDEV |
| 1 | Tulsa | 23.39 |
| 2 | San Diego St. | 20.49 |
| 3 | New Mexico | 19.86 |
| 4 | Rutgers | 19.07 |
| 5 | UTEP | 18.68 |
| 6 | Maryland | 18.42 |
| 7 | North Carolina St. | 18.10 |
| 8 | Virginia | 18.08 |
| 9 | Texas Tech | 18.06 |
| 10 | Missouri | 17.82 |
| Top 10 Power Teams w/ Consistency | ||||
|---|---|---|---|---|
| Rank | Team | Power | SDEV | Cons Rk |
| 1 | Florida | 63.67 | 12.25 | 46 |
| 2 | Oklahoma | 62.63 | 14.37 | 86 |
| 3 | Southern Cal | 60.19 | 12.99 | 62 |
| 4 | Texas | 57.20 | 10.33 | 20 |
| 5 | Penn St. | 53.47 | 11.14 | 31 |
| 6 | Texas Tech | 47.40 | 18.06 | 112 |
| 7 | TCU | 46.23 | 9.26 | 5 |
| 8 | Missouri | 45.82 | 17.82 | 111 |
| 9 | Ohio St. | 45.60 | 13.44 | 70 |
| 10 | Utah | 45.11 | 11.92 | 40 |
A few items about the tables:
Pros and Cons
Normally when people discuss college football teams, everyone talks about wanting to be a consistent football team. What they mean, or should mean, is that they want to be a consistently good football team. When you consider the problem of maximizing your team’s wins during the season, if you are a good football team whose opponents are all or nearly all worse than your team (the mean of the distribution) then you will want to be extremely consistent. This is because upsets are more probable when an inconsistent team is involved. By the same token, if you are a bad football team that will be playing mostly opponents that are better than your team, then consistency isn’t really something you should strive for if you want to win the most games possible. A team that is bad and plays at a consistent level will not get up for an upset. A team that is bad and plays inconsistently might put out a real stinker, but they are also more likely to upset a better team.
Take Texas A&M for example. They were a bad football team this year, but their performances were the most consistent in the nation. If we know that the Aggies are team whose performances averaged out to their power rating of 19.48, doesn’t it make sense that they would rather play half their games at a 34.48 level and half at a 4.48 level instead of all at their average strength? When playing games against a schedule that averaged 35.45 in terms of power ratings, a team with a 19.48 rating will win more games by being inconsistent than by being a consistent squad. The bottom line is that if you’re going to suck, you want to do it erratically.
On the other hand, there’s Texas Tech. A team with a relatively high power rating will want to play consistently against lower-rated teams. I won’t elaborate here as the reasons should be obvious. However, it should be noted that a team’s power rating is less important to whether or not they should be consistent than their schedule. Tech, while #6 in the power ratings, actually played two teams with higher ratings. Being inconsistent helps against those two teams as far as increasing the expected value in terms of wins. And in reality, that’s what happened with the Red Raiders this year. With no standard deviation, they would have been expected to lose to both Texas and Oklahoma by comfortable margins. Instead they upset Texas in a close game and got destroyed by Oklahoma due to inconsistency (at least partially known as the Lubbock factor in Tech’s case).
Expected Wins in a True Round Robin
My power ratings only display the mean of a team’s distribution. Taking the distribution of all the team’s power ratings, then, and normalizing each team’s specific rating enabled us to give a percentage chance that one team would beat another. This method, though, did not account for the consistency of each team and we’ve shown that this will have an effect. Now that we have identified each team’s rating as a normal distribution of its own we can recalculate that percentage in each case thanks to the property of the sum of normally distributed variables.
Furthermore, thanks to my favorite toy, Excel, we can simulate a complete round robin of Division 1A football using this calculation. There are many things we can look at after doing this, including seeing how changing a specific team’s consistency changes their expected win total out of the 119 matchups. We can also compare a team’s rank in expected wins to their rank in power rating. We would expect that highly rated teams with poor consistency would fall in the rankings and that lowly rated teams with poor consistency would rise.
So let’s begin with the Aggies and Red Raiders. In a full round robin, based on their current rating and standard deviation, Texas A&M’s expected value in wins would be about 45.5 according to this calculation while the Red Raiders would have an EV of 94.6 wins. If we simply switch their standard deviation values, the Aggies EV goes up to 48.2 wins while the Red Raiders’ goes up to 101.0 wins. This makes sense based on the theory that consistency helps a good team’s expected wins and hurts a bad team’s expected wins.
Below is a table of the Top 25 in the power ratings along with their power rating, power rating rank, expected winning percentage in a true round robin, and their expected winning percentage rank. Thankfully it just so happened that nobody went into or out of the Top 25.
| Rank | Team | Power | SDEV | Rk | EWP | EWPRk |
|---|---|---|---|---|---|---|
| 1 | Florida | 63.67 | 12.25 | 46 | 0.952 | 1 |
| 2 | Oklahoma | 62.63 | 14.37 | 86 | 0.938 | 2 |
| 3 | Southern Cal | 60.19 | 12.99 | 62 | 0.931 | 3 |
| 4 | Texas | 57.20 | 10.33 | 20 | 0.925 | 4 |
| 5 | Penn State | 53.47 | 11.14 | 31 | 0.894 | 5 |
| 6 | Texas Tech | 47.40 | 18.06 | 112 | 0.795 | 10 |
| 7 | TCU | 46.23 | 9.29 | 5 | 0.832 | 6 |
| 8 | Missouri | 45.82 | 17.82 | 111 | 0.779 | 13 |
| 9 | Ohio State | 45.60 | 13.44 | 70 | 0.802 | 8 |
| 10 | Utah | 45.11 | 11.92 | 40 | 0.805 | 7 |
| 11 | Alabama | 44.81 | 12.45 | 48 | 0.798 | 9 |
| 12 | Oregon | 44.34 | 12.63 | 52 | 0.792 | 11 |
| 13 | Mississippi | 43.96 | 15.26 | 93 | 0.772 | 15 |
| 14 | Iowa | 43.33 | 15.68 | 97 | 0.762 | 17 |
| 15 | Boise St | 43.17 | 11.03 | 29 | 0.785 | 12 |
| 16 | Oklahoma St | 43.00 | 13.98 | 79 | 0.767 | 16 |
| 17 | California | 42.58 | 11.81 | 37 | 0.773 | 14 |
| 18 | Arizona | 41.14 | 12.56 | 51 | 0.750 | 18 |
| 19 | Florida St | 40.28 | 12.00 | 41 | 0.741 | 19 |
| 20 | Oregon St | 40.12 | 14.08 | 83 | 0.728 | 21 |
| 21 | Georgia | 39.21 | 8.71 | 3 | 0.740 | 20 |
| 22 | North Carolina | 37.34 | 17.73 | 109 | 0.672 | 24 |
| 23 | Nebraska | 36.80 | 9.71 | 9 | 0.698 | 22 |
| 24 | Kansas | 35.74 | 14.01 | 81 | 0.664 | 25 |
| 25 | Clemson | 35.26 | 9.77 | 11 | 0.673 | 23 |
According to this model, Florida is a little over a one-point favorite tonight, has a 52.2% chance of winning, and has a 39.6% chance of covering the current 6-point line.
2009 BCS Championship Game, College Football, Computer Ratings
D W commented on the blog post Texas Hoops vs. Wake Forest Open thread 4 minutes ago
We lead the NCAA in ‘dunks against’
SizzleChest commented on the blog post Texas Hoops vs. Wake Forest Open thread 10 minutes ago
These guys just flat out don’t care. Except for Damo, but he’s not exactly lighting it up.
Trips Right commented on the blog post Texas Hoops vs. Wake Forest Open thread 12 minutes ago
We’re so bad on offense that it looks like Wake is playing zone. They’re not. In the 36th fucking game of the year. We still stand around waiting for someone to do something against one of the worst defenses in the tournament.
D W commented on the blog post Texas Hoops vs. Wake Forest Open thread 13 minutes ago
Yikes, we’re getting blown out by a 9 seed
Trips Right commented on the blog post Texas Hoops vs. Wake Forest Open thread 16 minutes ago
Everything we do is individual. We have nothing to fall back on in the way of sets or motion. For that, Barnes should be ashamed of his fucking self.
Trips Right commented on the blog post Texas Hoops vs. Wake Forest Open thread 18 minutes ago
Words can’t describe how bad Wake is. Miami dropped 85 on these fools and didn’t break a sweat.
D W commented on the blog post Texas Hoops vs. Wake Forest Open thread 18 minutes ago
5 points in 5 minutes?
RRR commented on the blog post March Madness Open thread 18 minutes ago
Big East 1-3. I’m satisfied with Day 1.
SHARETHIS.addEntry({ title: ””, url: ”” });
Trips Right commented on the blog post Texas Hoops vs. Wake Forest Open thread 26 minutes ago
quette blew a fifteen point second half lead
Trips Right commented on the blog post Texas Hoops vs. Wake Forest Open thread 28 minutes ago
Bradley doing a good job on Smith so far. Make that fucker shoot it.
Trips Right wrote a new blog post: Texas Hoops vs. Wake Forest Open thread 32 minutes ago
If we can stop Randolph Childress we have a chance.
RRR commented on the blog post March Madness Open thread 48 minutes ago
Good job, Ohio. Huskies making a great comeback!
SHARETHIS.addEntry({ title: ””, url: ”” });
Trips Right commented on the blog post March Madness open thread 56 minutes ago
Ha ha ha ha. MAC 9th seed knocks off GTown.
Trips Right commented on the blog post March Madness open thread 57 minutes ago
My Ohio state pick looks better.
RRR commented on the blog post March Madness Open thread 1 hour, 23 minutes ago
I love rooting against the Big East in the tournament. Georgetown in trouble and Marqette in a battle with the Huskies (Grr!). Need the ‘Horns to hold serve for the Big XII tonight.
Btw, my Red Raiders beat down Seton Hall, so this is theme with me, Big XII vs Big East.
SHARETHIS.addEntry({ title: ””, url: ”” });
Luke wrote a new blog post: Never a doubt 1 hour, 39 minutes ago
If I had a quarter for every time I heard the word “guard” used in a positive way today, I would be a rich man.
It’s one of the favorite clichés in March that good guards win games in the postseason, and there’s no secret that’s good news for Kansas St. Against North Texas, it quickly
SHARETHIS.addEntry({ title: ””, url: ”” });
Seeing Red wrote a new blog post: Farmer Ted Receives 2 1/2 Years 1 hour, 46 minutes ago
ESPN reports that he was sentenced as a result of the illicit videotapes taken of Dancing With The Stars Hottie and ESPN correspondent who wants to be taken seriously as a journalist dammit, Erin Andrews.
A quote from Ted not credited in the article: “This is total bullshit!! Have you SEEN her??”
Erin later pointed out that
SHARETHIS.addEntry({ title: ””, url: ”” });
Boo Radley wrote a new blog post: First Round: Clemson Tigers 1 hour, 55 minutes ago
I have put the Nebraska debacle behind me as I was so pissed off by the effort, that the game did not even deserve a post mortem. Again, where have you gone Justin Safford? Who knew that losing Safford would seem to take so much out of this team. I guess when you only have
Farmer Ted commented on the blog post Chase Daniel makes The Economist 1 hour, 56 minutes ago
I find it impossible to hate Chase Daniel. I used to try really hard to hate the guy, mostly because he was shredding the Husker secondary. But he seems like a good dude (who just happened to have arguably the greatest job in the world for the past 9 months). There used to be talk
Seeing Red commented on the blog post Lucky 2 hours, 3 minutes ago
Fantastic write-up. I always thought Marlon ran harder than people gave him credit for. His freshman year, it seemed like time & again he was grabbed as soon as he got the pitch or the handoff but managed to turn 3-5 yard losses into one yard gains.
He never had the breakaway speed or top-notch elusiveness
SHARETHIS.addEntry({ title: ””, url: ”” });
skymonkeyhorn commented on the blog post March Madness Open thread 2 hours, 20 minutes ago
Barbee on the grill is like the sun gleaming on the Rio Bravo in the morning next to the taco stand by Riverside. ” I cry like a baby” when I heard about UTEP.
SHARETHIS.addEntry({ title: ””, url: ”” });
Kevin Berger commented on the blog post March Madness Open thread 2 hours, 30 minutes ago
Marquette is so fun to watch.
SHARETHIS.addEntry({ title: ””, url: ”” });
Kevin Berger commented on the blog post March Madness Open thread 2 hours, 34 minutes ago
I picked murray state so utep could beat them in the second round. Looked good at half.
Fuck tony barbee. Mainly because skymonkey loves him.
SHARETHIS.addEntry({ title: ””, url: ”” });
skymonkeyhorn commented on the blog post Final Four Prediction 2 hours, 34 minutes ago
“I Cry like baby” When I think of Texas.
“The Letter” from fuck chalk says FF.
RIP“The country roads take me home” WVU points to FF.
“The blues” of Dukies. FF
Plus the other #1 that Patron forgot. FFSHARETHIS.addEntry({ title: ””, url: ”” });
J Rog wrote a new blog post: A Magic Show in Orlando 2 hours, 39 minutes ago
As suspected a tough game in the second half of a back to back results in a loss for the Spurs. Might as well make it a law of physics.
The Spurs lost this one in spectacular fashion 110-84 in a game that honestly they barely showed up for. They looked tired and sluggish from the
SHARETHIS.addEntry({ title: ””, url: ”” });
skymonkeyhorn commented on the blog post March Madness Open thread 2 hours, 52 minutes ago
Fuck the Dance, and Kevin Berger is a Bitch !
Wait, who is who and there is also a guitar player right ?SHARETHIS.addEntry({ title: ””, url: ”” });
Patrick Bateman commented on the blog post March Madness Open thread 3 hours, 28 minutes ago
UTEP going down in flames. If Texas loses tonight, then someone will need to make sure Trips isn’t face down in a ditch somewhere….
SHARETHIS.addEntry({ title: ””, url: ”” });
Seeing Red wrote a new blog post: How To Make Your Woman a Football Fan 4 hours, 2 minutes ago
Many thanks to Larry Burton over at Bleacher Report for providing advice on how to turn your sports-hating significant other into a college football nut.
Now, full disclosure – I’m philosophically opposed to such a thing (further disclosure & incredible shocker: I’m divorced). I do not want to bring a wife/girlfriend who doesn’t want to be
SHARETHIS.addEntry({ title: ””, url: ”” });
I.M. Hipp commented on the blog post Lucky 4 hours, 3 minutes ago
Pete is right about Marlon’s running style, Lucky needed to have the ball in space to be effective at all. I think he had a nice career overall though, keep in mind he did have those injury issues his senior year.
Good player who should have looked out for himself and left early.
SHARETHIS.addEntry({ title: ””, url: ”” });
D W commented on the blog post March Madness open thread 5 hours, 35 minutes ago
Now Baylor talking trash to the #14 seed players. Seriously?
© 2009 Fantake. All rights reserved unless otherwise indicated.
Thornton Melon said:
January 8th, 2009 at 1:55 pm
I feel like I just gave birth… to an accountant!
dasmithjones said:
January 8th, 2009 at 2:03 pm
Okay dude!
t1climb1 said:
January 8th, 2009 at 2:12 pm
I’m too stupid to talk to you Huck.
Brian Combs said:
January 8th, 2009 at 2:19 pm
Just for fun, what happens if you extend it out multiple years?
Huckleberry said:
January 8th, 2009 at 2:20 pm
I’m too stupid to talk to you Huck.
I doubt that. I’ve talked to HenryJames before.
beowulf said:
January 8th, 2009 at 2:21 pm
Dude.
Huckleberry said:
January 8th, 2009 at 2:22 pm
I’m interested in performing the analysis on other seasons, but scoring environment changes might make combining seasons difficult. I haven’t looked at it that closely yet.
One of the reasons I want to run last year is to run a sanity check using Oregon. They should have a higher than average standard deviation because of Dixon’s injury.
Brian Combs said:
January 8th, 2009 at 2:29 pm
Using data from the previous season or two (addressing the reduced confidence in some way) might improve the data set. Yes, teams do sometimes have major differences from one season to the next, but in most cases I expect it would look more like a trend line.
Disclaimer: We’re well beyond my statistical knowledge here. Stats 301 was an awful long time ago.
HenryJames said:
January 8th, 2009 at 4:37 pm
What a fascinating….zzzzzz.
absolut said:
January 8th, 2009 at 4:51 pm
As long as your analysis confirms atm consistently sucks, I couldn’t give a rat’s ass for the rest of the information. I do, however, really appreciate all the hard work and effort. What do you forsee for the USA’s IRR on TARP fund investments?
LonghornScott said:
January 8th, 2009 at 6:12 pm
Huck,
I’ve been toying with the idea of a similar analysis for a long time. The std dev of a good team is hugely important, imo. I think the final ranking you came up with there might just be the best ranking system I’ve seen.
wickedceltics said:
January 8th, 2009 at 6:30 pm
Seems like the idea of playoffs is the best way to go the more I hear about it. Down with the BCS!
Here’s a very relevant video about how the BCS robbed us Longhorns of our rightful shot at the title this year:
“BCS without the “C” – Texas should be #1
wickedceltics said:
January 8th, 2009 at 6:35 pm
maybe not as relevant as I originally thought…but I have to say that this had me laughing:
“I feel like I just gave birth… to an accountant!”
DrkBgrk said:
January 8th, 2009 at 7:19 pm
Now, who here likes a good story about a bridge?
pleaseplaykindle said:
January 9th, 2009 at 3:10 am
Teams with higher power ratings will naturally have higher standard deviation. Basically, I’m saying its easier to be consistently bad than consistently good. This seems trivial, but it has the effect that teams that have higher power ratings have more of an opportunity to deviate from them, whereas teams with low ratings do not.
Huckleberry said:
January 9th, 2009 at 8:56 am
Actually, the top 25 in the power ratings have an average consistency number of 12.9, while the bottom 25 have a number of 13.4 – this means that the top 25 were more consistent on average than the bottom 25.