NCAA tournament upsets no surprise for these Furman math professors
This story was written prior to the start of the NCAA tournament.
For 12 years running, ESPN writer Peter Keating has been giving readers of his “Giant Killers” blog a “metrics-based forecast of the big upsets” in the early rounds of the NCAA men’s basketball tournament. Since 2013, those metrics have come from increasingly complex calculations by Furman math professors Kevin Hutson, Liz Bouzarth and John Harris ’91, and there’s a reason they keep getting asked back: They’re pretty darn good at it.
For instance, after crunching the numbers in 2016, the Furman trio zeroed in on four first-round games most likely to result in an upset (defined as a team defeating an opponent seeded at least five spots higher). Three of them—No. 14 Stephen F. Austin over No. 3 West Virginia (36 percent probability), No. 13 Hawaii over No. 4 California (30 percent) and No. 11 Wichita State over No. 6 Arizona (40 percent)—ended up being just that.
With that in mind, if you’re a Virginia, Cincinnati, Creighton, Minnesota, or, especially, a Maryland fan, you may want to stop reading now if you don’t want your tummy to be even more fluttery when your team takes the court this week.
“Our most likely upset is the Maryland-Xavier game,” Harris said. “It’s got a 51 percent upset probability, so our model would say that Xavier (seeded 11th) has a slightly better chance of winning that game than Maryland (seeded sixth) does even though it’s a 6 vs. 11.”
The models also had Kansas State knocking off Wake Forest in First Four battle of 11 seeds on Tuesday night, which it did, before projecting a 46 percent chance of the Wildcats of upsetting Cincinnati, also seeded sixth, on Friday. UNC-Wilmington, meanwhile, is given a 42 percent of defeating No. 5 Virginia, similar to the chances of No. 12 Middle Tennessee knocking off No. 5 Minnesota and No. 11 Rhode Island sending No. 6 Creighton home.
That would continue a trend that’s seen No. 12s win 50 percent of their 20 games against No. 5s since 2012.
“Virginia is classified for us as a first-cluster giant, and UNC-Wilmington is classified as a third-cluster killer. Historically speaking, at least over the past 11 years that we’ve been looking at data, there’s a 40 percent chance in this grouping that killers will upset the giants,” Hutson said.
Why? Well, that’s where it complicated. The exact methods Hutson and crew use are protected proprietary information by ESPN, and what information is made available isn’t exactly easily digestible to the average person.
The short version is “giants” – the higher seeds – are lumped into four broad families based on their style of play, as are the “killers.” Past results show trends in certain matchups between families, and as the data builds on itself the ability to predict future outcomes based on past performances theoretically becomes more accurate.
Virginia, for instance, falls in the category of being a good team that isn’t great at anything, and history shows those squads are vulnerable to free-wheeling, 3-point gunning opponents like UNCW.
“(These kinds of giants) are kind of generic. There’s nothing special about these giants. They don’t overly excel in any areas, as opposed to cluster-three killers,” Hutson said of Virginia. “These are kind of below average in certain things, but they are way above average in terms of creating steals and way above average in their tendency to shoot a lot of 3-pointers in terms of other killers.”
In general, the least vulnerable giants rebound well, especially offensively, and don’t turn the ball over, while the most successful killers play a high-risk, high-reward game. They shoot lots of 3s, they crash the offensive glass, and get steals.
What makes the work by the Furman professors unique is its ability to use information from past tournaments to increase the personalization of individual game predications in the present.
“There’s really no one-size-fits-all idea of what’s going to create an upset,” Hutson said.
The equation is always changing, too, thanks to new information. In 2017, the professors implemented a new wrinkle called “decision tree analysis” in an attempt to reflect “more intuitively how humans make tough choices.”
“In creating the decision tree, we rely on the power of statistics to help identify the things we wish we could identify by eye-natural groupings of complicated data,” Bouzarth told ESPN.
It might be natural to conclude that these are people with whom you should never compete in an NCAA tournament bracket pool, but the reality is Giant Killers isn’t meant to predict the entire tournament, and even if it were advanced analysis of data ultimately ends up in the same place you do: With a guess.
“Only friendly ones, and I wish that at we could say that we do better than we do,” Harris said with a laugh when asked if he and his colleagues take part in tournament bracket games.
“One of the hard things about this is the analysis we do gives us a lot of insights into these giant-killer matchups, but there’s a whole lot of the bracket that isn’t covered by this sort of analysis,” Bouzarth added. “We’re not looking at what happens when a 1 plays a 2 in the Final Four. Our data doesn’t really give us that insight, so this is certainly not a tool that is going to give you a perfect bracket the whole way.”
It’s also important to realize that much more often than not the Furman picks for most likely upsets are relative to the likelihood of other upsets—not to the likelihood of it actually happening. The underdogs are still the underdogs, just not by as much as you might think.
“It’s not all that common to have too many games that have an upset probability of greater than 50 percent, because if that were the case then an upset becomes more likely than not,” Harris said. “Some of the games that we get excited about are ones where the upset probability is in the 30-to-40 percent range. That’s a strange way to look at it, because three out of 10 times we’ll be right, and seven out of 10 times we’ll be wrong. But as far as an upset goes, a 30 percent chance of a killer beating a giant is pretty significant compared to a five percent chance.”
The predictions are built on their initial rankings of the 68 tournament teams, though, and Furman agrees with the NCAA selection committee about who the top teams are – almost.
“We had Villanova, Gonzaga and North Carolina as our 1-2-3, and those three all made it as top seeds,” Hutson said. Instead of Kansas, though, West Virginia is the professors’ No. 4 team despite being seeded third.
The professors’ rankings also aren’t as bullish on Duke as Las Vegas, which has made the Blue Devils the favorite to win it all at 5/1, instead ranking Duke 10th.
In fact, the Furman three give South Carolina, seeded seventh, a better-than-decent 28 percent chance to slay second-seeded Duke in the second round of the regional Furman and the Southern Conference are hosting at Bon Secours Wellness Arena in Greenville, and as volunteers Hutson and Bouzarth will get a chance to see in person how their predictions match what actually happens.
Because two of the games are 7-10 and 8-9 matchups and Texas Southern and Troy each have less than a five percent chance of defeating North Carolina and Duke, respectively, in the first round, the second round offers the only chance for Giant Killing. Arkansas has a 10 percent shot at beating UNC in the second round and Seton Hall half that, while Marquette has a 19 percent chance of knocking off Duke if it gets past South Carolina.
No matter what happens, though, the Furman math department can’t lose. “That’s the beautiful thing about talking in terms of probability,” Hutson said. “You’re pretty much right either way!”