← All posts
March 24, 2026

A Mea Culpa

Identifying some issues with my models that affected past results.

So, I screwed up.

This is not unexpected, as this was year 1 of rolling out the new models, so there were bound to be some mistakes, but it’s important that I’m transparent about what went wrong and what it affected so that you all know you can trust me.

The issues were with how Minnow calculated its efficiency metrics. Since Branchy also used those metrics, the issue affected Branchy as well. For those of you who don’t care about the math, feel free to skip to the next section where I break down how the fix affected the data. Essentially, I was failing to properly anchor my parameters, and I wasn’t stabilizing my alpha hyperparameter. This is why the issue became apparent when I went to update my rankings for the Sweet 16, the values all shifted.

Updated Evaluations

First, let’s look at how the changes affect the results of the 2025 MLBR (Machine Learning Battle Royale). Since Minnow and Branchy were both affected, the composite was naturally affected as well. Every other value remains the same. Next to the rank, I include in parentheses the change in rank from the old evaluation, where (+1) means a move up in the rank, and (-1) means a move down in the rank (+ means a better rank, whereas - is a worse rank).

Accuracy

RankModelOld ScoreNew ScoreDiff
1BranchyBrackets88.89%90.48%+1.59%
2 (+1)Composite82.54%85.71%+3.17%
3 (-1)Torvik84.13%NANA
4 (-1)Resumetric82.54%NANA
T5 (+1)Minnow79.37%80.95%+1.58%
T5MNPI80.95%NANA
7Chalk77.78%NANA

Log Loss

RankModelOld ScoreNew ScoreDiff
1Branchy Brackets0.34940.3509+0.0015
2Resumetric0.3628NANA
3Composite0.37530.3821+0.0068
4 (+1)Torvik0.4047NANA
5 (-1)Minnow0.38460.4072+0.0226
6MNPI0.4383NANA
7Chalk2.0468NANA

(For context, 0.693 is the log loss of a random classifier)

Tourney Points

RankModelOld ScoreNew ScoreDiff
1Branchy Brackets17601790+30
2 (+1)Composite16301760+130
3 (-1)Resumetric1740NANA
4Torvik1270NANA
5Minnow11401150+10
6Chalk1090NANA
7MNPI1060NANA

Expected Tourney Points

RankModelOld ScoreNew ScoreDiff
1Resumetric1114.85NANA
2Branchy Brackets1079.351079.66+0.31
3Composite955.24944.28-10.96
4Minnow904.86865.30-39.56
5MNPI764.66NANA

Most of the differences were subtle, but a couple stood out. In particular, the composite saw a moderate leap in accuracy, and a huge leap in tourney points. I’m going to chalk that up to random chance more than anything significant analytically, since the difference is just the shifting of percentage points in a few places just barely over 50%. We can confirm our suspicions by seeing that the composite’s expected tourney points actually decreased (if only slightly). However, if the composite performs similarly well after this year’s tournament upon review, then we can start to sing its praises.

At first glance, the updated scores might seem a bit concerning, as my “fix” to Minnow resulted in Minnow scoring worse across all four evaluation metrics, and all three affected models scoring worse in log loss, which I identified in my original post as likely the most important metric. We have to remember that we’re looking at a relatively small sample size, and that the 2025 tournament was exceptionally chalky. The issues that I fixed in Minnow had a few different practical effects, but one of them was that it made Minnow more confident in favorites. That served it well in a tourney as chalky as 2025, but I remain confident that a more balanced approach will benefit it in the long-run, especially since our fixes are well-founded and principled. I’ll keep an eye on it moving forward, and if the old versions are routinely outperforming the new ones once we’ve built up a dataset of 3-5 years, then we can contemplate switching back, and perhaps do some research into why having untethered parameters benefits these models.

Updated 2026 Odds

There are changes up and down the model, with Minnow in particular (and, to a lesser extent, Branchy) being slightly more favorable to underdogs, but the most obvious changes are at the top, so let’s focus on that:

Championship Odds by Model

MinnowBranchy
Duke30% (-1%)42% (-10%)
Arizona12% (+0%)36% (+13%)
Michigan19% (-3%)10% (-2%)
The Field39% (+4%)12% (-1%)

As expected, the model flattened out somewhat, and Minnow reduced some confidence in the favorites to return odds to the field.

As for Branchy, honestly, who knows what that model is thinking. This might not inspire a lot of faith in me, but your guess is as good as mine when it comes to why that model decided to start liking Arizona more. The only inputs that changed for Branchy were the adjusted efficiency metrics provided by Minnow. My best guess is that those subtle differences taught Branchy new interactions that snowballed into a big change at the top.

Here’s a breakdown of the changes to Minnow rankings further down the list:

Minnow Net Rating — Ranking Changes (Old → New)

Top 10 Non-Tournament Teams by Ranking Change

#TeamOld RankNew RankChange
1Maryland134182🔻 48
2Utah122167🔻 45
3Penn St.142181🔻 39
4Navy163127🔺 36
5Rutgers144179🔻 35
6Georgia Tech158186🔻 28
7East Tennessee St.150123🔺 27
8Oregon103130🔻 27
9Boston College149176🔻 27
10Austin Peay183158🔺 25

All 68 Tournament Teams by Ranking Change

#TeamOld RankNew RankChange
1Howard225192🔺 33
2Siena186154🔺 32
3UMBC202172🔺 30
4North Dakota St.130103🔺 27
5Tennessee St.168147🔺 21
6High Point7656🔺 20
7LIU233215🔺 18
8Hawaii121104🔺 17
9Cal Baptist133116🔺 17
10Idaho177160🔺 17
11Wright St.151135🔺 16
12McNeese St.6652🔺 14
13Troy140126🔺 14
14Akron5645🔺 11
15UCF5162🔻 11
16Miami OH8170🔺 11
17Northern Iowa6858🔺 10
18Furman174164🔺 10
19Prairie View A&M323313🔺 10
20Queens203194🔺 9
21Hofstra7971🔺 8
22Lehigh281273🔺 8
23Penn129136🔻 7
24Saint Louis3125🔺 6
25Utah St.3328🔺 5
26Santa Clara4035🔺 5
27South Florida4540🔺 5
28Missouri5459🔻 5
29Connecticut1713🔺 4
30Virginia1814🔺 4
31VCU4238🔺 4
32Texas3842🔻 4
33Gonzaga118🔺 3
34Arkansas1316🔻 3
35Saint Mary’s2522🔺 3
36Wisconsin2730🔻 3
37Ohio St.2831🔻 3
38SMU3639🔻 3
39Michigan St.911🔻 2
40Texas Tech1517🔻 2
41Alabama1618🔻 2
42Kansas1921🔻 2
43Kentucky2224🔻 2
44North Carolina2426🔻 2
45N.C. State3032🔻 2
46Villanova3941🔻 2
47Purdue89🔻 1
48Tennessee1415🔻 1
49St. John’s2019🔺 1
50Nebraska2120🔺 1
51Iowa2627🔻 1
52Miami FL3433🔺 1
53Clemson3534🔺 1
54UCLA3736🔺 1
55Texas A&M4344🔻 1
56TCU4746🔺 1
57Kennesaw St.164165🔻 1
58Duke11— 0
59Michigan22— 0
60Arizona33— 0
61Florida44— 0
62Houston55— 0
63Illinois66— 0
64Iowa St.77— 0
65Vanderbilt1010— 0
66Louisville1212— 0
67BYU2323— 0
68Georgia2929— 0