The format changes at Majors are the most prominent of a number of rule changes that are being considered to reduce the length of tennis matches in order to make the tennis ‘product’ more attractive. In this guest post, Francesco Lisi and Matteo Grigoletto of the Department of Statistical Sciences at the University of Padua investigate how removing the first serve or advantage points could impact match lengths.
After the 2019 men’s Wimbledon final became the first singles match to be affected by the new tiebreak at 12-12 in the 5th format, many are wondering whether the change was a good thing. Whatever your opinion about that specific change, the fact that every Grand Slam in 2019 is using a different format for men’s singles is an indication of how much tennis' governing boards are concerned about match lengths.
The format changes at Majors are the most prominent of a number of rule changes that are being considered to reduce the length of tennis matches in order to make the tennis ‘product’ more attractive.
These changes range from harmless ones, such as limiting the medical time outs per match or the time for warming up, to more impacting ones, such as modifying the structure of sets and of tie-breaks.
‘We are currently in the process of gathering feedback from various stakeholders, including players, media, broadcasters, sponsors, and fans’ said Simon Higson, ATP VP of Corporate Communications and PR. ‘No conclusions will be drawn until we have completed this process and gone through the necessary review process.’
To test some of these new rules, the ATP opted for an entirely new competition, the Next Gen ATP Finals, where matches are played best of 5 sets, with first to four games sets, tie-break at 3-All, no-advantage score, no let, shot-clock after 25 seconds. This format, called Fast4, strongly reduces the length of matches, but has been criticized in that it completely changes the traditional set structure and is often perceived by players and audience as ‘another sport’.
A fundamental question for all of these changes is whether the length of matches can be reduced while preserving the major features of the sport’s scoring system?
I believe there are basically three possibilities: abolishing advantages, or abolishing the second service, or both. The rationale being that these alternatives would strongly reduce the dominance of the serve.
To understand this point, Figure 1 shows the average number of aces per match by year and Figure 2 the fraction of matches requiring one or more tiebreaks for the years from 1990 to 2018 (source www.atptour.com and OnCourt). The trends in these plots (measured by nonparametric regression curves in red) suggest that as the strength of serve has risen, so too has the number of service games.
What impact these changes could have is a bit tricky, since no match is currently played with only one of these changes. But we can make some headway using simulation. The purpose of these simulations is to demonstrate how the probability distribution of match durations would shift under each rule change.
- the current standard
- the standard played with no advantages on 40-40
- the standard format played with only one serve
- the standard format with no advantages and one serve
To simulate a match with only one serve we have to consider which are the consequences of abolishing the two-service rule on the server’s performance. We can follow the viewpoint of Klaassen and Magnus (2014)3: a player with only one service is equivalent to a player with two services who has fault the first one. So, with only one service, a player should use his current second service, and not something between the first and second one. Thus, when the first serve is eliminated, the simulation uses only data on second serve performance to set the parameters that determine the likely outcomes on serve.
Building and setting the simulator requires the following pieces of information: kind of tournament (Grand Slam, ATP1000, ATP500, ATP250), surface, round, final score, duration, number of aces, number of double faults, number of first and second serves, number of first and second serves in, number of winning first and second serves, total points won, as well as the distributions of the ball’s speed and of the number rallies for each point. For the distribution of the shot number per point we used data from the Match Charting Project (www.tennisabstract.com); for the ball’s speed we referred to this post and this post; all other data come from OnCourt.
Data refer to matches played in the period from January 2011 to December 2018. We discarded cases with missing durations, Olympics and Next Gen ATP Finals, as well as walk-out and retired matches. After that, our sample consists of 3744 best of five matches and 16246 best of three matches.
Figure 3 shows kernel density estimations of match length distribution together with average durations. The humps present in the right tail of both distributions are the effect of the different number of sets with which the match can end. The table below lists the values of some specific quantiles, from the median (Q50) to the 99.9% quantile (Q999). Median lengths are 92 minutes (mean 98) for best of 3 matches and 142 minutes (mean 150) for best of 5 matches.
|Best of 3||92||119||142||154||200|
|Best of 5||142||179||214||233||319|
Results, in terms of median durations and probabilities of a match lasting more that $k$ hours, are given in the following table. They are based on 20,000 matches played best of 5. Model’s parameters were estimated using only data from Grand Slam’s championships.
|No Ad, One Service||106||4.1||0.8||0.2||0.03|
Results show that both the ‘No Ad’ and ‘One service’ options are effective in reducing the probability of matches lasting more than three hours, with a stronger effect produced by the ‘One service’ format. More specifically, the probability of a best of five match lasting more than three hours is around 25% with the current format, but it reduces to around 15% by abolishing advantages on 40-40 and to around 10% by abolishing the first serve.
The effect is even more drastic if both modifications are introduced, with only a 4.1% probability of exceeding three hours. Apart from the duration itself, it is known that shorter formats also imply an increase in the number of upsets and, thus, in the uncertainty about the final result of the match.
Let us consider, now, the match played in the 2019 men’s Wimbledon final, lasted 297 minutes. What would the expected duration have been under each of the 4 scenarios, according to the simulator? 20000 matches have been simulated using only the rallies distribution for grass and probabilities estimated on the previous matches between Federer and Djokovic. Expected durations and probabilities of the match lasting more than $k$ hours are given in the following table
|No Ad, One Service||131||7.9||1.7||0.33||0.05|
Both ‘No Ad’ and ‘One service’ scenarios do not dramatically change the average duration but markedly reduce the probability of a long match. For example, the probability of a match lasting more than 4.5 hours, under current rules, is 1.7%. This is a quite low probability but not completely negligible. The ‘No Ad’ format reduces it by almost 5 times and the ‘One service’ format by more than 4 times.
Thus, from a statistical viewpoint these two modifications are not very different, and the choice between them could be based on the players' preferences. The joint application of ‘No Ad’ & ‘One service’ rules, instead, seems too impacting and, probably, not necessary.
Kovalchik, S. A., & Ingram, M. (2018). Estimating the duration of professional tennis matches for varying formats. Journal of Quantitative Analysis in Sports, 14(1), 13-23. ↩︎
Lisi, F. and Grigoletto, M. (2019). Modeling and simulating durations of professional tennis matches by resampling match features. (Under Review) ↩︎
Klaassen, F., & Magnus, J. R. (2014). Analyzing Wimbledon: The power of statistics. Oxford University Press, USA. ↩︎