Introduction

Article 
MATH 

Google Scholar
 
Article 

Google Scholar
 
Pfeiffer, M. Modeling the relationship between training and performance-a comparison of two antagonistic concepts. Int. J. Comput. Sci. Sport. 7 (2), 13–32. (2008)Article 

Google Scholar
 
Article 
PubMed 
PubMed Central 

Google Scholar
 

Material & methods

Data description

The authors declare no competing interests.

The 2019 dataset

Fifteen (15) national elite Short-track speed skaters voluntary participated to a previous study18. One of the skaters was injured and removed from the study, leaving 14 athletes in. Athletes had the same coach for both datasets. Out of these 14 athletes, 2 were also represented in the 2019 data set.Bayesian modelling is a statistical paradigm that allows for combining information from observed data (the likelihood) with information from background knowledge (the prior). It also enables solving models with Monte-Carlo Markov Chains (MCMC), an approach that unveils the whole distribution of the resulting, concordant posterior estimations. On one hand, the inclusion of a prior knowledge makes a special sense in the context of a deterministic model, from which the components have a biological meaning. On the other hand, unveiling the posterior distribution is a powerful tool for the diagnosis of statistical issues such as ill-conditioning or an overall lack of information15. Tackling FFMs from a Bayesian perspective has not been done until recently16. Peng et al. identified a gain in the plausibility of the parameter estimates as compared to the frequentist counterpart (i.e. non-Bayesian, based on pure likelihood). However, the predictive ability of the FFM fitted in a Bayesian framework was not investigated by these authors16.On the other hand, the chains that did not converge exhibited a strong autocorrelation within chains for all parameters and inconsistency between chains for most parameters. In the unconverging chains, τG and τH displayed a mirror behaviour: they took a close value for a given chain at a given iteration (Fig. 4a and c, and Appendix C). The mirror posture was also observed for kG and kH that were almost identical for a given non-converged chain at a given iteration (Fig. 5, and Appendix B). Note that on Fig. 5 (and Appendix C), the sample dispersion for kG and kH for the converged chain(s) is too small to be noticeable at the given scale, and only one chain converged for this athlete.

The 2016 dataset

Using a Fitness-Only Model (FOM), we showed that the optimal value for the time exponent was not estimable from the likelihood. The Fig. 1 (c-d) exemplified this pattern: using a flat prior for the time exponent in the FOM led to estimates with no physiological sense at all. Adding biologically meaningful information to the fitness dynamic, by fixing its time exponent or by means of a bayesian prior, led to the model with the best predictive ability in cross-validation. Adding a fatigue component implies two challenges: the estimation of the fatigue time exponent itself and dealing with an antagonism between the fitness and the fatigue components during the estimation process. The diagnosis of the Markov chains used to fit the FFM in a Bayesian framework illustrated how this antagonism prevented the model from converging. Noteworthily, maxima of likelihood were easily identifiable for the fatigue time exponent when the fitness time exponent had been fixed, but never when both parameters were simultaneously estimated. Consequently, augmenting the model complexity with supplementary state variables (i.e. fatigue) does not ensure enhanced model performance, as parsimony appears to prevail in time-invariant FFMs23.Ludwig, M., Asteroth, A., Rasche, C. & Pfeiffer, M. Including the past: performance modeling using a Preload Concept by means of the fitness-fatigue model. Int. J. Comput. Sci. Sport. 18 (1), 115–134. https://doi.org/10.2478/ijcss-2019-0007 (2019).In the previous section, we observed that providing information for τG was beneficial to the predictive ability of the model. This questions the appropriate value to which τG should be informed. We addressed this question by the mean of a grid search. The grid ranged from one day to 100 days with a step of 1 day for the fitness half-life. The time exponent parameter τG was computed according to each fitness half-life, fixed to this value, and all other parameters (a0, kG, σ²) were estimated. The fitness (Fig. 3a) and the predictive ability (Fig. 3b) of the model were then assessed.

Models

Article 
PubMed 
MATH 

Google Scholar
 

$$:y_sleft(tright)sim:N(widehaty_sleft(tright),:sigma:_s^2)$$
(1)

RMSE in cross-validation depending on the way the fitness-only model was fitted. The boxplot corresponds to the 7 athletes for each fold of the cross-validation. “Frequentist”: all parameters, including τG, were directly estimated by minimization of the RMSE in the training set. “Fixed”: is identical to “Frequentist” with the exception that τG was fixed to 43.28 (corresponding to a half-life of 30 days). “Flat” and “Informative” correspond to bayesian fitting with a gamma distribution for τG. “Flat”: the prior for τG was Γ(a = 1, b = 0.5). “Informative”: the prior for τG was Γ(a = 2 × 43.28, b = 2).

$$:widehaty_FFM,::sleft(tright)=:a_0,s+k_G,ssum:_i=1^t-1w_sleft(iright)e^frac-(t-i)tau:_G,s:-k_H,ssum:_i=1^t-1w_sleft(iright)e^frac-(t-i)tau:_H,s$$

You can also search for this author in
PubMed Google ScholarYou can also search for this author in
PubMed Google Scholar

$$:widehaty_FOM,sleft(tright)=:a_0,s+k_G,ssum:_i=1^t-1w_sleft(iright)e^frac-(t-i)tau:_G,s$$

Racine, J. Consistent cross-validatory model-selection for dependent data: hv-block cross-validation. J. Econ. 99 (1), 39–61 (2000).

Model fitting

Prior knowledge on the time exponents

A frequentist framework where all parameters are estimated;

$$:e^frac-xtau:=0.5iff:::tau:=frachtextltextnleft(2right)$$

Matabuena, M. & Rodríguez-López, R. An Improved Version of the classical Banister Model to Predict Changes in Physical Condition. Bull. Math. Biol. 81 (6), 1867–1884. https://doi.org/10.1007/s11538-019-00588-y (2019).

Frequentist model fitting

Seenovate, Paris, 75009, FranceDMeM, INRAe, Univ Montpellier, Montpellier, 34000, France

Table 1 Boundaries used to constraint the frequentist parameter estimation.

DOI: https://doi.org/10.1038/s41598-025-88153-7

Bayesian model fitting

Non-convergent chains for τG and τH did not converge for the other parameters neither;Busso, T. Variable dose-response relationship between exercise training and performance. Med. Sci. Sports Exerc. 35 (7), 1188–1195 (2003).Stan Development Team. – Stan Modeling Language Users Guide and Reference Manual. Available at: https://mc-stan.org (2023).

Cross-validation

The predictive ability of the models was assessed by the mean of cross-validation. Since we deal with time series, the data were time-ordered and split accordingly in subsets in respect of hidden-value block cross-validation with an incremental window12,22. We used minimal values of 35 days and 15 days for training and test sets, respectively. Due to potential dependencies between training and test data, a 5-day gap between each training and test subsets was considered to ensure unbiased model predictions. This parameterization yielded 2 folds for the 2019 dataset and 3 folds for the 2016 dataset as described in Table 2.

Table 2 Cross-validation folds description for each of the 2 datasets involved in the study.

Using a Bayesian framework, we found that some parameters could not be estimated solely from the available data. However, the model demonstrated a strong capacity to incorporate prior information. The formulation incorporating prior knowledge about fitness dynamics outperformed all others in this study.

Results

Estimation of the fitness time exponents

Informativeness for the fitness time exponent

Article 

Google Scholar
 

  • We used root mean square error (RMSE) between predicted and observed performance to quantify the predictive ability of the models. RMSE of the training data, i.e. the residual deviation of the model σ, was also employed to measure the goodness of fit of a model to its training data.
  • Maxima of likelihood emerged for τH in the investigated range and for all athletes. These maxima were specific to each athlete and comprised between fatigue half-life = 2 days ( τH = 2.89) to fatigue half-life = 6 days ( τH = 8.66) (Fig. 6a). These maxima of likelihood were not associated with better predictive abilities, excepted for fold 1 of athlete 8 and fold 1 of athlete 4 (Fig. 6b).
  • Following a 15-minutes standardised warm-up, participants performed weekly flying starts for a one-lap maximal run. Four photocells were settled for recording skating times. The dependent variable further modelled was the individual maximal acceleration (from speed derivation); hence a higher performance value was sought. On-ice and off-ice training sessions were used as the explanatory variables, as described by Méline et al.18 and consistently with the 2019 dataset.
  • This is referred to as the fitness-only model (FOM) in this study.

We considered the Banister’s fitness-fatigue model (FFM) from a stochastic perspective, in which the predicted performance of athlete s at time t: (:y_sleft(tright)), was considered as the expectation from a gaussian distribution:Initiating the model state variables with relevant values has a potential implication in the inference26. Null state variables come with the assumption of modelling performances of untrained athletes, which is usually not realistic. Consequently, we investigated simulations of previous training session (up to 2 years priors to the observed datasets) to start from non-null fitness and fatigue states. However, it did not lead to a reliable improvement of the predictive ability of the model, neither for the FOM, neither for the FFM model (data not shown).

Fig. 1
figure 1
Despite a meaningful field of application around sports performance, FFMs have been criticized due to their lack of accuracy and their unreliability for predicting athletic performances12. A few authors questioned their foundation, highlighting methodological and mathematical flaws amongst ill-conditioning and failure to capture meaningful information regardless of the model complexity7,12,13,14. Estimating one or more states variables (e.g. fitness, fatigue) from a single input, FFMs suffer from explanation and predictive capability due to a lack of information. Since the model stems from prior knowledge (i.e. the physiological kinetics underpinning FFMs transfer functions), estimating model parameters in a Bayesian framework could reinforce the framework and benefit from a priori expertise.
Table 3 Uncorrected p-value of the difference between the RMSE in cross-validation depending on the way the fitness-only model was fitted. Repetitions correspond to the 7 athletes and 2 folds of the cross-validation. The p-values were obtained with a two-sided student mean comparison. Significance threshold after Bonferroni correction: α = 5%/6 = 0.0083.
Fig. 2
figure 2
Van De Schoot, R., Depaoli, S., King, R., Kramer, B., Märtens, K., Tadesse, M. G., Vannucci, M., Gelman, A., Veen, D., Willemsen, J. & Yau, C. Bayesian statistics and modelling. Nat. Rev. Methods Primers. 1 (1), 1. https://doi.org/10.1038/s43586-020-00001-2 (2021).

Grid search for an optimal fitness time exponent

Article 
PubMed 
MATH 

Google Scholar
 
Sorry, a shareable link is not currently available for this article.

Fig. 3
figure 3
Imbach, F., Perrey, S., Chailan, R., Meline, T. & Candau, R. Training load responses modelling and model generalisation in elite sports. Sci. Rep. 12 (1), 1–14 (2022).

Estimation of the fatigue time exponents

Informativeness for the fatigue time exponent

Non-convergent chains exhibited a mirroring pattern with nearly identical τG and τH, and kG and kH for a given chain at a given iteration.Thibaut MélinePeng, K., Brodie, R. T., Swartz, T. B. & Clarke, D. C. Bayesian inference of the impulse-response model of athlete training and performance. International Journal of Performance Analysis in Sport, 24(1), 74–89. https://doi.org/10.1080/24748668.2023.2268480 (2023).We compared two strategies for the time exponent estimations. On the one hand, we used non-informative priors (:tau:_GsimvarGamma:(a=1,:b=0.5)) which is equivalent to a (:upchi:^2) distribution with 2 degrees of freedom; same for (:tau:_{H}). On the other hand, we fitted the models with informative priors for the time exponents: (:tau:_GsimvarGamma:(a=2*43.28,:b=2)) and (:tau:_{H}simvarGamma:(a=2*2.89,:b=2)). The average of a gamma distribution defined by its shape (a) and rate (b) is a/b, hence these priors are consistent with the background knowledge target as developed in 2.3.1. We emphasize that these targets have been designed without information from the data. The rate parameter b = 2 was set high but not extreme.

Fig. 4
figure 4
Article 
CAS 

Google Scholar
 
Fig. 5
figure 5
Due to their interest for identifying an optimal training program in which training doses, scheduling and recovery lead the performance potential of each athlete, several extensions of the common FFM relate a collection of FFMs henceforth1,2,5,6,7,8,9,10,11. Among these modifications, Busso8 added nonlinearity to the model by weighting the fatigue states from the accumulation of training stimuli. Two other alternatives relied on delayed effects of training, using delay and serial functions supported by physiological evidence6,11.

Hellard, P., Avalos, M., Lacoste, L., Barale, F., Chatard, J. C. & Millet, G. P. Assessing the limitations of the Banister model in monitoring training. J. Sports Sci. 24 (5), 509–520. https://doi.org/10.1080/02640410500244697 (2006).

Grid search for an optimal fatigue time exponent

Calvert, T. W., Banister, E. W., Savage, M. V. & Bach, T. A systems model of the effects of training on physical performance. IEEE Trans. Syst. Man. Cybern Syst. 2, 94–102 (1976).Alexandre Marchal, Othmène Benazieb, Yisakor Weldegebriel & Frank Imbach

Fig. 6
figure 6
The frequentist fitting process consisted in 10 independent runs from which we retained runs that converged and minimized the residual standard deviation. Since the model was ill-conditioned, as we explored in depth throughout this study, the parameters estimation became unstable even under constraints optimization (i.e. boundaries) for convergence (Table 1).

Performance of the fitness-fatigue model

Marchal, A., Benazieb, O., Weldegebriel, Y. et al. Statistical flaws of the fitness-fatigue sports performance prediction model.
Sci Rep 15, 3706 (2025). https://doi.org/10.1038/s41598-025-88153-7For two athletes (athletes 3 and 5, see Appendix B), kH exhibited instability that was expressed in the converged chains: it wandered to extreme values before coming back to the maximal density area of the chain. These expeditions caused local autocorrelation for kH and were locally associated with close-to-0 samples for τH.

Confirmation on a new independent dataset

Informativeness for the fatigue time exponent

Accepted:

  • The performance data were collected each week and consisted in the time to perform a 166.68 m race after a standardised warm-up; hence a lower performance value was sought. Timing gate systems (Brower timing system, USA) were used to record valid individual time trial performance17. On-ice and off-ice training loads were used as the explanatory variable as described in the Appendix 1 from Imbach, Perrey, et al.12.
  • Conceptualisation, A.M., F.I.; methodology and investigation, A.M., Y.W., O.B., F.I.; data curation, T.M., Y.W.; recruitment, T.M.; resource development, A.M., Y.W., O.B.; formal analysis, A.M.; writing original draft preparation, A.M., F.I.; writing-review and editing, A.M., Y.W., O.B., F.I.; supervision, F.I.; project administration, F.I. All authors have read and agreed to the published version of the manuscript.
  • A frequentist framework with the fitness time exponent τG being fixed to 43.28 (see Sect. 2.3.1 for details);
  • A Bayesian framework with an informative prior for τG.

Article 
CAS 
PubMed 

Google Scholar
 
Vermeire, K., Ghijs, M., Bourgois, J. G. & Boone, J. The fitness–fatigue model: what’s in the numbers? Int. J. Sports Physiol. Perform. 17 (5), 810–813. https://doi.org/10.1123/ijspp.2021-0494 (2022).

Performance of the fitness-fatigue model

The identifiability of τG and τH was investigated in a Bayesian framework with informative priors (see Fig. 4 for a representative athlete and Appendix B for all athletes involved in the 2019 dataset).

Discussion

Imbach, F., Sutton-Charani, N., Montmain, J., Candau, R. & Perrey, S. The use of fitness-fatigue models for Sport Performance Modelling: conceptual issues and contributions from machine-learning. Sports Med. Open. 8 (1), 1–6 (2022).The FFM has already been extensively criticized in the literature13,24,25. In particular, Hellard et al.13 highlighted the fitness-fatigue model ill-conditioning by estimating the asymptotic correlation matrix of parameters. The authors showed a high correlation between kG and kH ((:text{c}text{o}text{r}({textk}_G,{:textk}_{H}):=:0.91)) and τG and τH ((:text{c}text{o}text{r}({{uptau:}}_G,{:{uptau:}}_{H}):=:0.99)) and concluded that the model was not reliable. We reproduced and illustrated this result from a Bayesian perspective using Markov chains. The extra step undertook with our paper, aside confirming the model ill-conditioning on two new independent datasets, was to show that adding information with a prior was not sufficient to overcome this issue.Méline, T., Mathieu, L., Borrani, F., Candau, R. & Sanchez, A. M. Systems model and individual simulations of training strategies in elite short-track speed skaters. J. Sports Sci. 37 (3), 347–355. https://doi.org/10.1080/02640414.2018.1504375 (2019).The latter aspect was associated with strong correlations between the fitness and fatigue states which resulted in a poor quality of fit to the training data. This situation was particularly problematic for the 5 athletes for whom no chain converged at all, as in this situation, the fitness and fatigue components were exactly equal at each time point.Raw Markov chains of kG (a) and kH (b) estimated from the model with fitness and fatigue, for athlete 1, using informative priors for τG and τH. Each color represents one of 10 chains, the x-axis is the iteration number and the y-axis are the kG (a) and kH (b) values. kG and kH both followed a flat priors k = -|X| with X ~ N(0, 5²). Note that only the yellow Markov chain converged.Frank ImbachIn this study, we investigated the estimability of the Fitness-Fatigue Model (FFM) parameters for sports performance modelling. We concluded that fitness and fatigue parameters are antagonistic in the estimation process, therefore non-identifiable, and that the fatigue component does not improve the predictive ability of the model. The results were supported by an analysis conducted over two independent datasets.A Bayesian framework using a non-informative prior for τG;

Practical considerations

  • of variance (:sigma:_s^2) and expectation (:widehaty_sleft(tright)) a function of the daily training doses or workloads (:w_sleft(iright))1:
  • Quality of the fitness-fatigue model fitted with fixed values for the time exponent τH. The time exponent is computed from the resulting half-life for fatigue, provided as the x-axis and varying from 1 to 29 (a) and 1 to 22 (b). Plot (a) shows the quality of fit in the training processus, assessed using the likelihood. Plot (b) shows the predictive ability assessed as the RMSE for each fold (fold 1: plain line and fold 2: hashed line). Each athlete is represented with a different color detailed in the legend. Likelihood and RMSE in prediction are returned as percentage of the difference between their maximum value and their minimum value on the y-axis. The vertical black line corresponds to x = 2 days, i.e. the half-life we informed elsewhere in this paper.
  • kH was very instable in converged chains, frequently wandering at extreme values;
  • Busso, T. & Chalencon, S. Validity and accuracy of impulse-response models for modeling and Predicting Training effects on performance of swimmers. Med. Sci. Sports Exerc. 55 (7), 1274–1285. https://doi.org/10.1249/MSS.0000000000003139 (2023).