How Does Team Age Profile Affect Winning?
Our initial simple analysis here identified that the older the team, the more likely they would win. My underlying theory for why this is the case isn't that older players are necessarily better (though they may be), but rather that teams that tend to be older are more likely to be in a “win now” situation. Once teams no longer feel like they can realistically challenge for a premiership they will start to play kids and look to the draft.
In some situations, though, there could be "losing" teams that have a high number of older players—i.e. they are on the way down, haven’t brought the kids in yet, and their average age is high. This theory leaves open the idea that it might not be just the average age, but potentially an age profile, such as having more players in the 24–28 year old bucket, that could be a better indicator of what a winning team looks like.
Before getting into the analysis, I wanted to understand the trend of player ages over time. I had a general theory that players were getting older and that 30 years old was no longer considered the cliff in which footy players were put out to pasture (a trend that has played out in other professional sports over the same period).
Looking at the numbers, it seems that teams have been getting older:
- Average age has risen from ~24.6 to ~25.75 years old
- Covid does make a couple years look a bit funny, but overall the trend is clear
Average Age by Season
Looking at the make up of players over that time, it is almost completely driven by an increase in players over 30, with the average per team going from ~1.7 to ~4 per team per game.
Team Age Profile Proportions by Season
The biggest drop off over this time was in players aged 20–23. So, footy has been experiencing a consistent aging of players over the last ~15 years or so: the future is not yet now old man.[1]
What does this mean for winning and losing? Well, nothing directly, but what it does tell me is that teams view age less and less as a cut off for whether or not a player can contribute to the team. Teams are more likely to play older 30+ players if they think those players can help them win, and this potentially adds credence to age, or age profiles, being important variables to look at.
To begin with, does the older team win more often? Yes! Not only does the older team win more often, but they win 63% of the time: this puts "just pick the older team" up there with all the different models I have looked at so far:
Model | Performance (Accuracy %) |
---|---|
Home team only | 56.6 |
Older team | 63.0 |
Top-down (Model v1) | 61.7 |
Player-based sum (Model v2a) | 63.7 |
Player-based sum (Model v2b) | 65.8 |
To dive into the next layer down: is it just the older team, or can the age profile of a team make a difference? And to what extent does being older make a difference, a goal a game, 3 goals a game? To understand these questions I will conduct both logistic and linear regression. One looks at the binary outcome of win/loss (using logistic regression like we have been using previously) and the other looks at the size of victory or defeat as a continuous outcome (linear regression).
Starting with logistic regression:
Feature Name | Coefficient | p-value |
---|---|---|
Age difference | 0.6556 | <0.001 |
Under 20 (difference) | -0.3153 | 0.004 |
Between 20 & 23 (difference) | -0.2278 | 0.034 |
Between 23 & 26 (difference) | -0.2309 | 0.030 |
Between 26 & 30 (difference) | -0.2617 | 0.014 |
Over 30 (difference) | -0.4023 | <0.001 |
Average age (raw) | -0.0050 | 0.880 |
What does above tell us? Well:
- Age difference matters: Being older than your opponent increases your chance of winning (the only strong positive coefficient).
- Bucket differences add nothing positive: Once you control for overall age difference, having more players in any specific age group—even the “prime” years—does not improve your odds. In fact, all these coefficients are negative.
- Absolute age is irrelevant: The average age of a team by itself (without considering the opponent) has no predictive value (coefficient ≈ 0, p = 0.88).
Linear regression:
Feature Name | Coefficient | p-value |
---|---|---|
Age difference | 10.91 | <0.001 |
Under 20 (difference) | -8.41 | <0.001 |
Between 20 & 23 (difference) | -4.67 | <0.001 |
Between 23 & 26 (difference) | -4.67 | <0.001 |
Between 26 & 30 (difference) | -4.53 | <0.001 |
Over 30 (difference) | -7.16 | <0.001 |
Average age (raw) | ~0 | 1.000 |
Linear regression tells us the same thing:
- Age difference matters: For every year a team is older than its opponent, the average margin increases by nearly 11 points.
- Team make-up by age buckets, and absolute average age, add no positive value once you know the age difference.
- All age bucket differences are negative, so, piling up players in any single age range (even “prime” years) doesn’t help beyond simply being older overall.
Overall, what this tells us is that age matters, and that it is a good enough blunt instrument that it overrides any and all value of trying to put together a complex model based on different age profiles of different teams.
Next up, I will take a close look at home ground advantage.
For more information on the notebook used to conduct this analysis, please take a look at the AFL Data Analysis GitHub repository.