Statistical Correlation Tests, Bias, Risk & More

Statistical Correlation Tests

Statistical correlation tests are performed with the aim of finding out whether a relationship exists between variables, and then determining the magnitude and action of that relationship. In other words, correlation is a statistical measure that expresses the extent to which two variables are linearly related. It’s commonly used for describing simple relationships without making a statement about cause and effect.

If two variables tend to move up or down together, they are considered to be positively correlated. However, if they tend to move in opposite directions, they are considered to be negatively correlated.

Parametric tests are based on an assumption that the data being analyzed follows a normal distribution. The more data points a distribution has, the more it can approach a normal distribution. Lack of data points would require the use of non-parametric tests.

Non-parametric tests a.k.a. distribution-free tests are methods of statistical analysis that do not require a distribution to meet the required assumptions to be analyzed, especially if the data is not normally distributed.

Student T-Test

statistical correlation tests
Retrieved from https://researchbasics.education.uconn.edu/t-test/ on 11th March 2023

UNPAIRED T-TEST testing two means

The unpaired t-test a.k.a. independent t-test is a statistical test which aims to determine whether there is a difference between two unrelated groups. The unpaired t-test is used to make a statement about the population based on two independent samples. To make this statement, the mean value of the two samples is compared.

ANOVA testing more than two means

ANOVA a.k.a. Analysis of Variance, is a statistical test used to investigate the difference between the means of more than two groups. A one-way ANOVA uses one independent variable, whereas a two-way ANOVA uses two independent variables.

CHI-SQUARED TESTtesting the association between two categorical variables

A chi-squared test is a statistical test that is used to compare observed and expected results. The main aim of the chi-squared test is to identify whether a disparity between actual and predicted data is due to chance or to a link between the variables under consideration.

Choosing the Right Test

statistical correlation tests
Retrieved from https://www.scribbr.com/statistics/statistical-tests/ on 11th March 2023
statistical correlation tests
Retrieved from https://www.healthknowledge.org.uk/public-health-textbook/research-methods/1b-statistical-methods/parametric-nonparametric-tests on 11th March 2023

Bias Potential Sources

Bias is the systematic deviation from the truth. Different sources of bias may include:

  • selection or sampling bias – inadequate selection of study participants usually due to high non-response rate, inadequate follow-up, inadequate sampling method use, or inadequate controls or comparisons groups
  • confounding bias – existing differences between comparison groups in one or more parameters which may be directly associated with the outcome and the candidate risk factor in question
  • instrumental bias – faulty measurement instrumentation due to lack of calibration, inaccurate diagnostic tests, etc
  • information bias – systematically incorrect measurements or responses, or from differential misclassification of disease or exposure status of participants eg. due to questionnaire ambiguity or insensitivity
  • systematic bias – one observer may underestimate readings, leading to his respondents having lower readings than those observed by someone else
  • respondent bias – misunderstandings, lack of interest, or recall issues in the unaffected group
  • random bias – observer may underestimate or overestimate measurements, which mistakes tend to even out on averaging

Calculating Statistical Risk

In statistics, the risk for a particular group to develop a disease refers to the rate of disease in the group concerned.

RISK DIFFERENCE / ABSOLUTE RISK – the excess risk that exposed individuals have.

RISK RATIO – the measurement of the risk in the exposed group as a multiple of the risk in the unexposed group.

ODDS RATIO – odds refer to the chance of developing the disease rather than not developing the disease. Odds Ratio refers to the chances of developing the condition for an exposed individual relative to an unexposed individual.

statistical correlation tests
Retrieved from https://www.researchgate.net/publication/249313828_Houwing_etal_AAP2013/figures?lo=1 on 12th March 2023

The Relevance of Testing the Sensitivity & Specificity of a Screening Diagnostic Test

Screening programs need to provide diagnostic tests with the least disturbance possible for the individual, yet with enough sensitivity and specificity to detect the disease in question. Assessing the sensitivity and specificity of a test requires its outcome to be compared against a gold standard eg. comparing the Faecal Occult Blood Test sensitivity and specificity to a colonoscopy, in this case considered to be the gold standard.

statistical correlation tests
Retrieved from https://www.researchgate.net/publication/49650721_Sensitivity_specificity_predictive_values_and_likelihood_ratios/figures?lo=1 on 15th March 2023

Multi-Variate Analysis

Linear regression analysis is used to predict the value of a variable based on the value of another variable. The variable you want to predict is called the dependent variable, while the variable you are using to predict the other variable’s value is called the independent variable.

statistical correlation tests
Example of Linear Regression – Retrieved from https://sphweb.bumc.bu.edu/otlt/MPH-Modules/BS/BS704-EP713_MultivariableMethods/ on 15th March 2023

Logistic regression is a statistical analysis method to predict a binary outcome eg. yes or no, based on prior observations of a data set. A logistic regression model predicts a dependent data variable by analyzing the relationship between one or more existing independent variables. This technique results in an odds ratio rather than a rate of change per unit change in the independent variable.

statistical correlation tests
Linear Regression vs Logistic Regression – Retrieved from https://www.datacamp.com/tutorial/understanding-logistic-regression-python on 15th March 2023

Poisson Regression models are best used for modeling events where the outcomes are counts. Or, more specifically, count data: discrete data with non-negative integer values that count something, like the number of times an event occurs during a given time-frame or the number of people in line at the grocery store. This technique results in a risk ratio instead of a rate of change per unit change in the independent variable.

Example of Poisson Regression – Retrieved from https://sherrytowers.com/2018/03/06/poisson-regression/ on 15th March 2023

Survival Analysis is concerned with studying the time between entry to a study and a subsequent event. Originally the analysis was concerned with time from treatment until death, hence the name, but survival analysis is applicable to many areas as well as mortality.

Example of Survival Analysis – Retrieved from https://www.graphpad.com/guides/survival-analysis on 15th March 2023

Relative Survival is defined as the ratio of the proportion of observed survivors in a cohort of cancer patients to the proportion of expected survivors in a comparable set of cancer free individuals. The formulation is based on the assumption of independent competing causes of death.

Example of Relative Survival Analysis – Retrieved from https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0202044 on 15th March 2023

Meta-analysis is a research process used to systematically synthesise or merge the findings of single, independent studies, using statistical methods to calculate an overall or ‘absolute’ effect. Meta-analysis does not simply pool data from smaller studies to achieve a larger sample size.

Example of Meta-Analysis – Retrieved from https://www.mdpi.com/2624-8611/4/4/49 on 15th March 2023

Did you find the above nursing information useful? Follow us on Facebook and fill in your email address below to receive new blogposts in your inbox as soon as they’re published 🙂

Cohort Studies Critical Appraisal

Cohort Studies are observational studies on groups of people with defined characteristics in which outcomes related to particular exposure (or lack thereof) are compared. Cohort Studies are usually indicated in studies where manipulated exposure is considered to be unethical (eg. no group of people should be asked to smoke for the purpose of outcome comparison). Similarly, these are observational studies, thus they lack the opportunity to control or prevent the expected outcome.

cohort studies
Retrieved from https://www.pinterest.com/pin/435512226447421378/ on 24th February 2023

Hierarchy of Evidence

Retrieved from https://www.sketchbubble.com/en/presentation-hierarchy-of-evidence.html on 18th February 2023

Cohort Studies Advantages & Disadvantages

Cohort Studies need to include a control group – a group which is not exposed to the risk factor of interest. Participants are selected based on their exposure status at the start of the study, and exposed and unexposed groups need to be selected from the same population.

Advantages

  • exposure to the risk factor of interest is measured prior to disease onset, which reduced bias
  • rare exposures can be examined by appropriate selection of study cohorts
  • multiple outcomes can be studied for a single type of exposure
  • calculates incidence and relative risk of disease in both exposed and unexposed participants over time

Disadvantages

  • changes in the participants’ exposure status and diagnostic criteria that may happen over time can affect the individuals’ classification based on exposure and disease status; the researcher should think about what measures may need to be taken if the participants change their patterns throughout the study period
  • risk of information bias – outcome may be influenced by information on the participant’s exposure status
  • loss of follow-ups may introduce attrition bias, where the characteristics of drop-outs and those completing the study may be significantly different, leading to a reduction in the validity of the study
  • expensive and time consuming

Preventing Loss to Follow Up

During the recruitment process, the researcher should obtain all information required so that the participant can be easily contacted. In addition, the researcher should exclude participants that are likely to be lost (eg. a prospective participant may have plans to move to another country).

During the follow-up period, the researcher should maintain regular contact through different means, and possibly provide tokens or gifts to encourage continued participation.

Prospective VS Retrospective Cohort Studies

In Prospective Cohort Studies, participants are identified at the time of exposure. They are followed up over time until outcome occurs.

Advantages: Prospective Cohort Studies are designed with specific data collection methods.

Disadvantages: Such studies entail a long indefinite follow-up period until an outcome occurs. They are susceptible to loss of follow-up, and are usually expensive.

cohort studies
Retrieved from https://sphweb.bumc.bu.edu/otlt/mph-modules/ep/ep713_analyticoverview/ep713_analyticoverview3.html on 24th February 2023

In Retrospective Cohort Studies, the chosen participants would have already been exposed to and subsequently experienced an outcome. Thus, outcome data measured in the past is then reconstructed for analysis.

Advantages: Retrospective Cohort Studies are cheaper and quicker than prospective studies, and make use of past data, which can be accessed immediately.

Disadvantages: Such studies are susceptible to both recall bias and information bias, and may be subjected to incomplete, inaccurate, or inconsistent data due to limited control over data collection.

cohort studies
Retrieved from https://sphweb.bumc.bu.edu/otlt/mph-modules/ep/ep713_analyticoverview/ep713_analyticoverview3.html on 24th February 2023

Cohort Studies Critical Appraisal

Casp Tool

CASP Tool for Cohort Studies Critical Appraisal can be found here.

To view blogpost featuring Cochrane videos on all types of studies please click here.

Types of Statistical Tests Used in Cohort Studies

  • Risk Ratio (RR)
  • Odds Ratio (OR)
  • Confidence Interval (CI)

Did you find the above nursing information useful? Follow us on Facebook and fill in your email address below to receive new blogposts in your inbox as soon as they’re published 🙂