Inferential or analytical statistics is a branch of statistics that allows researchers to make inferences or generalisations about a larger population based on data collected from a sample. This process is crucial in sports science, where it is often impractical or impossible to collect data from every athlete or event. Instead, researchers use inferential statistics to draw conclusions about a population from a carefully selected sample (Chapter 1.8 Sampling).
The primary goal of inferential statistics is to test hypotheses or predictions. This involves comparing groups, examining relationships between variables, and modelling data to make predictions about future events or behaviours. The following sections will explore the main types of inferential statistical tests, including comparison tests, correlation tests, and regression tests, with examples relevant to sports science.
2.6.1 The Lady Tasting Tea Experiment
One of the classic examples of hypothesis testing is the Lady Tasting Tea
experiment conducted by Ronald Fisher. This experiment involved a lady who claimed she could distinguish whether tea or milk was poured first into a cup. Fisher designed an experiment to test this claim scientifically.
Experiment Setup: The lady was presented with eight cups of tea, four of which had the tea poured first and four where the milk was poured first. The cups were presented in random order, and the lady was asked to identify which cups had the tea poured first.
Statistical Analysis: Fisher calculated the probability of the lady correctly identifying all four cups by chance, which is 1/70 (approximately 1.43 %). This low probability led Fisher to reject the null hypothesis that the lady was guessing, concluding instead that she could indeed tell the difference at a significance level of 1.43 %. If Lady Ottoline Violet Anne Morrell (1873–1938) successfully guessed 3 out of 4 cups, the chance increased to 24.3 %. Because the lady picked 4 of the 4 cups correctly, Fisher was willing to reject the null hypothesis and accepted the lady's ability at the 1.14% significance level.
This experiment is a fundamental example of hypothesis testing and introduces the concept of statistical significance. Fisher's Exact Test, which was developed from this experiment, is still used today for small sample sizes and categorical data.
2.6.2 Hypothesis Testing
Hypothesis testing is a fundamental aspect of inferential statistics. It involves making a prediction (the hypothesis) and then using statistical tests to determine whether the observed data supports this hypothesis or whether it should be rejected (1.7 The Research Hypotheses).
Comparison Tests
Comparison tests are used to determine if there are significant differences between groups or conditions. These tests are crucial in sports science for comparing performance metrics across different teams, training methods, or time periods.
-
t-test: This test is used to compare the means of two groups. For example, a t-test could determine if there is a significant difference in average sprint times between two different training programmes.
- One sample t-test: Compares the mean of a single sample to a known reference value.
- Two independent-sample t-test: Compares the means of two independent groups.
- Paired sample t-test: Compares means from the same group at different times (e.g., pre- and post-training).
-
ANOVA (Analysis of Variance): Used when comparing the means of three or more groups. For instance, an ANOVA could assess whether different diets lead to different average performance levels across several athlete groups.
- One-way ANOVA: Compares means across multiple groups based on one independent variable.
-
Repeated Measures ANOVA: Used when the same subjects are measured under different conditions or at different times. For example, this test could compare the performance of a group of athletes across different stages of a training cycle.
-
Mann-Whitney U Test: A nonparametric test used to compare differences between two independent groups when the data do not follow a normal distribution. In sports science, this test might be applied to compare the recovery times of athletes using two different rehabilitation techniques when the data are not normally distributed.
-
Wilcoxon Signed-Rank Test: A nonparametric alternative to the paired-sample test. This test is used when comparing two related samples (as t-test), such as the performance of athletes before and after a specific training intervention, where the data do not meet the assumptions of parametric tests.
-
Kruskal-Wallis H Test: An extension of the Mann-Whitney U test for comparing three or more independent groups. It is useful in situations where the data do not meet the assumptions required for ANOVA. For example, this test could be used to compare the effectiveness of different training programmes across several groups of athletes when the performance data are not normally distributed.
Correlation Tests
Correlation tests measure the strength and direction of the relationship between two variables. In sports science, these tests help researchers understand how variables, such as training intensity and performance, are associated (related).
Pearson’s r: This parametric test measures the linear correlation between two quantitative variables. For example, Pearson’s r could be used to assess the relationship between the amount of time spent training and improvements in an athlete’s speed.
Example: A study might use Pearson’s r to determine if there is a significant positive correlation between the number of hours of strength training and the increase in vertical jump height among basketball players.
Spearman’s rho: A nonparametric test used to measure the correlation between two variables when the data are not normally distributed or when dealing with ordinal data. For example, Spearman’s rho might assess the relationship between athletes' rankings in a competition and their self-reported motivation levels.
Chi-Square Test ($\chi^2$): This test is used to assess the relationship between two categorical variables. In sports, the chi-square test could evaluate whether there is an association between the type of training programme and the likelihood of injury.
Example: A chi-square test could be used to examine whether the distribution of different types of injuries differs significantly between male and female athletes.
Regression Tests
Regression analysis involves modelling the relationship between one dependent variable and one or more independent variables. This technique is widely used in sports science for predicting outcomes based on various predictors.
Simple Linear Regression: This test examines the relationship between one independent variable and one dependent variable. In sports, it might be used to predict an athlete's performance based on a single predictor, such as the number of training hours.
Example: A coach might use simple linear regression to predict the distance a shot-put athlete can throw based on their strength training hours.
Multiple Linear Regression: This test is used when predicting a dependent variable based on multiple independent variables. In sports, this could be used to predict performance outcomes based on various factors, such as training intensity, diet, and psychological state.
Example: Researchers might use multiple linear regression to predict marathon finishing times based on variables like weekly mileage, age, and VO2max.
Logistic Regression: This test is used when the dependent variable is binary (nominal categorical variables that contain only two, mutually exclusive categories; e.g., success/failure, injury/no injury). In sports science, logistic regression could be applied to predict the likelihood of an athlete sustaining an injury based on factors such as training load and recovery time.
Example: A logistic regression model could be used to assess the probability that a football player will experience an injury during a season based on their playing time.
-
Nominal Regression: Used when the dependent variable is categorical with more than two categories. This type of regression might be applied to predict the type of injury (e.g., sprain, fracture, strain) based on factors like sport type, training regimen, and equipment used.
Ordinal Logistic Regression: Used when the dependent variable is ordinal. This test might be used to predict an athlete's performance level (e.g., excellent, good, average, poor) based on their training hours and competition experience.

Understanding these different statistical tests and when to use them is crucial for conducting rigorous and valid research in sports science. Each test provides specific insights that can inform training, performance analysis, and overall sports strategy.
2.6.3 Choosing a Comparison Test
When conducting research, it is crucial to choose the appropriate statistical test based on the research question, the scale of measurement of the data, and the distribution of the data.
Scales of Measurement: The scale of measurement for your data determines which tests are appropriate:
Nominal Data: Categorical data without a specific order (e.g., type of sport). Suitable tests include the chi-square test, Odds ratio test (OR)
Ordinal Data: Categorical data with a specific order but without consistent intervals between categories (e.g., athlete rankings). The Mann-Whitney U test and the Wilcoxon test are often used.
Interval and Ratio Data: Numeric data where intervals between values are meaningful, with ratio data having a true zero point (e.g., time, distance). The t-test and ANOVA are typically used for these types of data.
Parametric vs Nonparametric Tests: Parametric tests assume the data follows a normal distribution (Gauss normal distribution), while nonparametric tests do not. A normal distribution test should be conducted before deciding on the appropriate test.
Paired vs Independent Samples: Paired (or repeated measures) samples involve the same subjects measured under different conditions, while independent samples involve different subjects.
2.6.4 Choosing a Correlation Tests
Choosing the correct correlation test depends on the type of data and whether it meets the assumptions for parametric tests.
Pearson’s r: Used for parametric, quantitative data where both variables are continuous and normally distributed.
Spearman’s rho: Used for nonparametric data (it is therefore an alternative to Pearson's r) or ordinal data. It assesses the strength and direction of the monotonic relationship between two variables.
Chi-Square Test ($\chi^2$): Used for testing the association between categorical variables.
In addition, dimension reduction techniques like Factor Analysis and Principal Component Analysis (PCA) can be used when researchers need to identify underlying factors or reduce the number of variables.
2.6.5 Choosing a Regression Tests
Regression tests are used to model the relationship between dependent and independent variables.
Simple Linear Regression: Suitable when predicting a dependent variable based on one metric independent variable.
Multiple Linear Regression: Used when predicting a dependent variable based on two or more independent variables.
Logistic Regression: Applied when the dependent variable is binary.
Nominal Regression: Used for categorical dependent variables with more than two categories.
Ordinal Logistic Regression: Appropriate when the dependent variable is ordinal.
Review Questions
-
What is the primary goal of inferential statistics in sports science?
-
Explain the difference between parametric and nonparametric tests. Provide an example of each type relevant to sports studies.
-
What is the purpose of hypothesis testing, and how does it relate to inferential statistics?
-
Describe the main differences between the t-test, ANOVA, and Mann-Whitney U test. In what situations would you use each?
-
How does Pearson’s r differ from Spearman’s rho, and in what situations would each test be appropriate?
-
What is the significance of the Lady Tasting Tea experiment in the context of hypothesis testing?
-
Outline the steps involved in choosing the appropriate regression test for a given set of data. Provide an example relevant to sports science.
-
What factors should be considered when selecting a comparison test for your research?
-
Explain the concept of statistical significance and its importance in the context of inferential statistics.
-
Discuss how logistic regression differs from linear regression and provide an example of when logistic regression might be used in sports science.
Exercises
-
Hypothesis Testing Exercise: Design a simple hypothesis testing experiment related to sports. For example, compare the average recovery time of athletes following two different rehabilitation protocols. State the null hypothesis and alternative hypothesis, and determine which statistical test would be appropriate for analysing the data.
-
Comparison Test Application: Using a provided data set, perform a t-test to compare the performance of two groups of athletes following different training regimens. Interpret the results and discuss whether the differences are statistically significant.
-
Correlation Analysis: Collect data on two variables relevant to sports performance (e.g., hours of training and improvement in speed). Calculate the correlation between these variables using Pearson’s r and Spearman’s rho. Compare the results and discuss any differences.
-
Regression Modelling: Given a data set with multiple predictors (e.g., training hours, diet quality, and psychological readiness), perform a multiple linear regression analysis to predict an athlete's performance score. Interpret the coefficients and discuss which variables are most strongly associated with performance.
-
Lady Tasting Tea Experiment Replication: Conduct a simplified version of the Lady Tasting Tea experiment with a small group of participants. Use Fisher's Exact Test to analyse the results, and discuss the implications of the findings in terms of hypothesis testing.
-
Choosing the Right Test: You are given a data set with mixed data types (nominal, ordinal, and continuous variables). Select the appropriate statistical tests for analysing different aspects of the data, explaining your reasoning for each choice.