How Does Team Age Profile Affect Winning?

Our initial simple analysis here identified that the older the team, the more likely they would win. My underlying theory for why this is the case isn't that older players are necessarily better (though they may be), but rather that teams that tend to be older are more likely to be in a “win now” situation. Once teams no longer feel like they can realistically challenge for a premiership they will start to play kids and look to the draft.

In some situations, though, there could be "losing" teams that have a high number of older players—i.e. they are on the way down, haven’t brought the kids in yet, and their average age is high. This theory leaves open the idea that it might not be just the average age, but potentially an age profile, such as having more players in the 24–28 year old bucket, that could be a better indicator of what a winning team looks like.

Before getting into the analysis, I wanted to understand the trend of player ages over time. I had a general theory that players were getting older and that 30 years old was no longer considered the cliff in which footy players were put out to pasture (a trend that has played out in other professional sports over the same period).

Looking at the numbers, it seems that teams have been getting older:

Average Age by Season

Looking at the make up of players over that time, it is almost completely driven by an increase in players over 30, with the average per team going from ~1.7 to ~4 per team per game.

Team Age Profile Proportions by Season

The biggest drop off over this time was in players aged 20–23. So, footy has been experiencing a consistent aging of players over the last ~15 years or so: the future is not yet now old man.[1]

What does this mean for winning and losing? Well, nothing directly, but what it does tell me is that teams view age less and less as a cut off for whether or not a player can contribute to the team. Teams are more likely to play older 30+ players if they think those players can help them win, and this potentially adds credence to age, or age profiles, being important variables to look at.

To begin with, does the older team win more often? Yes! Not only does the older team win more often, but they win 63% of the time: this puts "just pick the older team" up there with all the different models I have looked at so far:

Model Performance (Accuracy %)
Home team only56.6
Older team63.0
Top-down (Model v1)61.7
Player-based sum (Model v2a)63.7
Player-based sum (Model v2b)65.8

To dive into the next layer down: is it just the older team, or can the age profile of a team make a difference? And to what extent does being older make a difference, a goal a game, 3 goals a game? To understand these questions I will conduct both logistic and linear regression. One looks at the binary outcome of win/loss (using logistic regression like we have been using previously) and the other looks at the size of victory or defeat as a continuous outcome (linear regression).

Starting with logistic regression:

Feature Name Coefficient p-value
Age difference0.6556<0.001
Under 20 (difference)-0.31530.004
Between 20 & 23 (difference)-0.22780.034
Between 23 & 26 (difference)-0.23090.030
Between 26 & 30 (difference)-0.26170.014
Over 30 (difference)-0.4023<0.001
Average age (raw)-0.00500.880

What does above tell us? Well:

Linear regression:

Feature Name Coefficient p-value
Age difference10.91<0.001
Under 20 (difference)-8.41<0.001
Between 20 & 23 (difference)-4.67<0.001
Between 23 & 26 (difference)-4.67<0.001
Between 26 & 30 (difference)-4.53<0.001
Over 30 (difference)-7.16<0.001
Average age (raw)~01.000

Linear regression tells us the same thing:

In fact, the R-squared value is 18.6%. This means that age features (which we know is pretty much just the difference in age) explains 18.6% of the difference in margin between teams.

Overall, what this tells us is that age matters, and that it is a good enough blunt instrument that it overrides any and all value of trying to put together a complex model based on different age profiles of different teams.

Next up, I will take a close look at home ground advantage.

For more information on the notebook used to conduct this analysis, please take a look at the AFL Data Analysis GitHub repository.


1 The future is not yet now, old man.