Have a language expert improve your writing
Run a free plagiarism check in 10 minutes, generate accurate citations for free.
- Knowledge Base
Methodology
- What Is a Case-Control Study? | Definition & Examples

What Is a Case-Control Study? | Definition & Examples
Published on February 4, 2023 by Tegan George . Revised on June 22, 2023.
A case-control study is an experimental design that compares a group of participants possessing a condition of interest to a very similar group lacking that condition. Here, the participants possessing the attribute of study, such as a disease, are called the “case,” and those without it are the “control.”
It’s important to remember that the case group is chosen because they already possess the attribute of interest. The point of the control group is to facilitate investigation, e.g., studying whether the case group systematically exhibits that attribute more than the control group does.
Table of contents
When to use a case-control study, examples of case-control studies, advantages and disadvantages of case-control studies, other interesting articles, frequently asked questions.
Case-control studies are a type of observational study often used in fields like medical research, environmental health, or epidemiology. While most observational studies are qualitative in nature, case-control studies can also be quantitative , and they often are in healthcare settings. Case-control studies can be used for both exploratory and explanatory research , and they are a good choice for studying research topics like disease exposure and health outcomes.
A case-control study may be a good fit for your research if it meets the following criteria.
- Data on exposure (e.g., to a chemical or a pesticide) are difficult to obtain or expensive.
- The disease associated with the exposure you’re studying has a long incubation period or is rare or under-studied (e.g., AIDS in the early 1980s).
- The population you are studying is difficult to contact for follow-up questions (e.g., asylum seekers).
Retrospective cohort studies use existing secondary research data, such as medical records or databases, to identify a group of people with a common exposure or risk factor and to observe their outcomes over time. Case-control studies conduct primary research , comparing a group of participants possessing a condition of interest to a very similar group lacking that condition in real time.
A faster, more affordable way to improve your paper
Scribbr’s new AI Proofreader checks your document and corrects spelling, grammar, and punctuation mistakes with near-human accuracy and the efficiency of AI!

Proofread my paper
Case-control studies are common in fields like epidemiology, healthcare, and psychology.
You would then collect data on your participants’ exposure to contaminated drinking water, focusing on variables such as the source of said water and the duration of exposure, for both groups. You could then compare the two to determine if there is a relationship between drinking water contamination and the risk of developing a gastrointestinal illness. Example: Healthcare case-control study You are interested in the relationship between the dietary intake of a particular vitamin (e.g., vitamin D) and the risk of developing osteoporosis later in life. Here, the case group would be individuals who have been diagnosed with osteoporosis, while the control group would be individuals without osteoporosis.
You would then collect information on dietary intake of vitamin D for both the cases and controls and compare the two groups to determine if there is a relationship between vitamin D intake and the risk of developing osteoporosis. Example: Psychology case-control study You are studying the relationship between early-childhood stress and the likelihood of later developing post-traumatic stress disorder (PTSD). Here, the case group would be individuals who have been diagnosed with PTSD, while the control group would be individuals without PTSD.
Case-control studies are a solid research method choice, but they come with distinct advantages and disadvantages.
Advantages of case-control studies
- Case-control studies are a great choice if you have any ethical considerations about your participants that could preclude you from using a traditional experimental design .
- Case-control studies are time efficient and fairly inexpensive to conduct because they require fewer subjects than other research methods .
- If there were multiple exposures leading to a single outcome, case-control studies can incorporate that. As such, they truly shine when used to study rare outcomes or outbreaks of a particular disease .
Disadvantages of case-control studies
- Case-control studies, similarly to observational studies, run a high risk of research biases . They are particularly susceptible to observer bias , recall bias , and interviewer bias.
- In the case of very rare exposures of the outcome studied, attempting to conduct a case-control study can be very time consuming and inefficient .
- Case-control studies in general have low internal validity and are not always credible.
Case-control studies by design focus on one singular outcome. This makes them very rigid and not generalizable , as no extrapolation can be made about other outcomes like risk recurrence or future exposure threat. This leads to less satisfying results than other methodological choices.
If you want to know more about statistics , methodology , or research bias , make sure to check out some of our other articles with explanations and examples.
- Student’s t -distribution
- Normal distribution
- Null and Alternative Hypotheses
- Chi square tests
- Confidence interval
- Quartiles & Quantiles
- Cluster sampling
- Stratified sampling
- Data cleansing
- Reproducibility vs Replicability
- Peer review
- Prospective cohort study
Research bias
- Implicit bias
- Cognitive bias
- Placebo effect
- Hawthorne effect
- Hindsight bias
- Affect heuristic
- Social desirability bias
Here's why students love Scribbr's proofreading services
Discover proofreading & editing
A case-control study differs from a cohort study because cohort studies are more longitudinal in nature and do not necessarily require a control group .
While one may be added if the investigator so chooses, members of the cohort are primarily selected because of a shared characteristic among them. In particular, retrospective cohort studies are designed to follow a group of people with a common exposure or risk factor over time and observe their outcomes.
Case-control studies, in contrast, require both a case group and a control group, as suggested by their name, and usually are used to identify risk factors for a disease by comparing cases and controls.
A case-control study differs from a cross-sectional study because case-control studies are naturally retrospective in nature, looking backward in time to identify exposures that may have occurred before the development of the disease.
On the other hand, cross-sectional studies collect data on a population at a single point in time. The goal here is to describe the characteristics of the population, such as their age, gender identity, or health status, and understand the distribution and relationships of these characteristics.
Cases and controls are selected for a case-control study based on their inherent characteristics. Participants already possessing the condition of interest form the “case,” while those without form the “control.”
Keep in mind that by definition the case group is chosen because they already possess the attribute of interest. The point of the control group is to facilitate investigation, e.g., studying whether the case group systematically exhibits that attribute more than the control group does.
The strength of the association between an exposure and a disease in a case-control study can be measured using a few different statistical measures , such as odds ratios (ORs) and relative risk (RR).
No, case-control studies cannot establish causality as a standalone measure.
As observational studies , they can suggest associations between an exposure and a disease, but they cannot prove without a doubt that the exposure causes the disease. In particular, issues arising from timing, research biases like recall bias , and the selection of variables lead to low internal validity and the inability to determine causality.
Sources in this article
We strongly encourage students to use sources in their work. You can cite our article (APA Style) or take a deep dive into the articles below.
George, T. (2023, June 22). What Is a Case-Control Study? | Definition & Examples. Scribbr. Retrieved November 28, 2023, from https://www.scribbr.com/methodology/case-control-study/
Schlesselman, J. J. (1982). Case-Control Studies: Design, Conduct, Analysis (Monographs in Epidemiology and Biostatistics, 2) (Illustrated). Oxford University Press.
Is this article helpful?
Tegan George
Other students also liked, what is an observational study | guide & examples, control groups and treatment groups | uses & examples, cross-sectional study | definition, uses & examples, what is your plagiarism score.
- Open access
- Published: 27 August 2021
A case–control study of factors associated with SARS-CoV-2 infection among healthcare workers in Colombia
- Merida Rodriguez-Lopez ORCID: orcid.org/0000-0001-8245-0811 1 ,
- Beatriz Parra 2 ,
- Enrique Vergara 1 ,
- Laura Rey 1 ,
- Mercedes Salcedo 2 ,
- Gabriela Arturo 3 ,
- Liliana Alarcon 3 ,
- Jorge Holguin 1 , 3 &
- Lyda Osorio 2
BMC Infectious Diseases volume 21 , Article number: 878 ( 2021 ) Cite this article
16k Accesses
8 Citations
7 Altmetric
Metrics details
Healthcare Workers (HCW) are repeatedly exposed to SARS-CoV-2 infection. The aim of this study was to identify factors associated with SARS-CoV-2 infection among HCW in one of the largest cities in Colombia.
We conducted a case–control study, where cases had a positive reverse transcription-polymerase chain reaction and controls had a negative result. Participants were randomly selected and interviewed by phone. Analyses were performed using logistic regression models.
A total of 110 cases and 113 controls were included. Men (AdjOR 4.13 95% CI 1.70–10.05), Nurses (AdjOR 11.24 95% CI 1.05–119.63), not using a high-performance filtering mask (AdjOR 2.27 95% CI 1.02–5.05) and inadequate use of personal protective equipment (AdjOR 4.82 95% CI 1.18–19.65) were identified as risk factors. Conversely, graduate (AdjOR 0.06 95% CI 0.01–0.53) and postgraduate (AdjOR 0.05 95% CI 0.005–0.7) education, feeling scared or nervous (AdjOR 0.45 95% CI 0.22–0.91), not always wearing any gloves, caps and goggles/face shields (AdjOR 0.10 95% CI 0.02–0.41), and the use of high-performance filtering or a combination of fabric plus surgical mask (AdjOR 0.27 95% CI 0.09–0.80) outside the workplace were protective factors.
This study highlights the protection provided by high-performance filtering masks or double masking among HCW. Modifiable and non-modifiable factors and the difficulty of wearing other protective equipment needs to be considered in designing, implementing and monitoring COVID-19 biosafety protocols for HCW.
Peer Review reports
Introduction
Over 55 millions of people were infected worldwide by the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) in 2020 [ 1 ]. In previous coronavirus pandemic outbreaks, SARS-CoV-1 in 2003 and the Middle East Respiratory Syndrome in 2012, between 10 and 20% of infected people were Healthcare Workers (HCW) [ 2 , 3 ]. In the current pandemic, the prevalence among HCW varies between countries from 2 to 30% [ 4 ]. The Coronavirus Disease 2019 (COVID-19) caused by SARS-CoV-2, affects people’s lives and threatens their biological [ 5 , 6 ], physiological [ 7 , 8 ], family and social health [ 9 , 10 ]. HCW are repeatedly exposed to the virus leading to an increased risk of the disease [ 11 ] and sequelae [ 12 ] compared to the general population. Hence, COVID-19 could reduce the workforce availability to respond to this emergency.
The first case of SARS-CoV-2 in Colombia was reported in March 2020. Seven months later, the Colombian National Institute of Health has informed over 16,500 infected HCW, most of whom were associated to the workplace [ 13 ]. In a descriptive study of HCW in Cali, one of the largest cities in Colombia, 65% of infections were related to the workplace and the most affected were women and nursing assistants [ 14 ]. To date, there is scarce evidence in Latin America, concerning risk factors for the infection particularly among HCW, who are exposed to both, workplace and community transmission. Studies are mostly from Asia, Europe and North America [ 15 , 16 , 17 , 18 ]. They have focused on nurses and medical staff, and they have mainly evaluated the presence of symptoms and the exposures to occupational factors, including aerosol-generating procedures [ 15 ]. Cultural differences and availability of resources between countries and institutions, limit a direct extrapolation of previous findings. Less is known about the effect of factors related to potential community transmission or the risk among other hospital workers. Moreover, there is controversy abound the appropriate types of masks for HCW in community settings [ 19 ]. Therefore, the aim of this study was to determine the factors associated with SARS-CoV-2 infection among HCW in Cali, Colombia.
Subjects and methods
Study design.
We conducted a case–control study in HCW who served in health care institutions in Cali, Colombia. Participants were identified by merging the database of positive reverse transcription-polymerase chain reaction (RT-PCR) results with the routine surveillance system of COVID-19 (event code 346) or acute respiratory infections (event codes 345 and 348), who were reported with or without symptoms (as part of cluster investigations), between June 10 th and July 25 th , 2020. This time framework matches the first peak of the epidemic curve in Cali [ 20 ]. Cases and controls were randomly (simple random sampling without replacement) selected from those identified as HCW with a positive and negative test, respectively. The outcome status was confirmed with the database of epidemiological investigation of COVID-19 in health care facilities compiled by the local health authorities of Cali and during the telephone interview. This strategy ensures a representative sample of different health care institutions independently of size, patient type, care level, management or service provided. Sample size was estimated as 111 participants for each group with 80% power, 95% level of confidence, 18% of exposure among controls, Odds Ratio (OR) of 2.5, 1:1 allocation ratio, and 10% of withdrawal.
HCW were defined as those working in healthcare environments regardless of whether they were directly or indirectly involved in clinical activities such driving an ambulance or worked in a hospital or in homecare. Potential participants were contacted by phone and eligibility criteria were confirmed (18 years or older, not being pregnant or having a coagulopathy, and working in a health care institution that have the potential to assist COVID-19 patients, or being in contact before they had a RT-PCR test with infectious materials such as body fluids and contaminated surfaces and supplies). The study protocol was part of the public health research to face the pandemic and was revised by the Universidad Javeriana Cali Ethics Committee. Inform consent was obtained online for all participants.
Data collection
Data was collected by two trained researchers via telephone and using a structured questionnaire. The questionnaire included modifiable and non-modifiable factors: sociodemographic, clinical and lifestyle factors referred to six months before the test result, psychological factors referred to one month before the test. Occupational, exposure to COVID-19 cases, social behavior and personal protection equipment (PPE) factors referred to two weeks before the RT-PCR test. Feeling scared or nervous or having insomnia were evaluated by a five-point Likert scale, and further dichotomized as never or anytime. Height, weight, and compliance to recommended PPE use were self-reported. The exposure to a positive person was evaluated by the question: “To your knowledge, were you in contact with a person diagnosed with COVID-19, at least 2 weeks prior to the test?” A high-performance filtering mask was considered as the use of N95, P100 or M3. The frequency of use of each PPE at work were classified as always wearing them or not. Self-perception of the adequate use of PPE was evaluated as many times, sometimes or few times. The use of medicines for prophylaxis purposes included hydroxychloroquine and ivermectin. Vitamins, nutritional supplements, and hormonal contraceptives, usually taken for a long period were also included. Interviewers were blinded to the case status. At the end of each interview, blindness was broken to confirm the status of each participant as to prevent potential misclassification bias due to controls having a positive test after their report to the surveillance system.
Statistical analysis
Normality assumption was checked using Shapiro Wilk test. Then, study groups were described and compared using median (interquartile range) and relative frequencies for quantitative and qualitative variables, respectively. Body Mass Index (BMI) was estimated from self-reported weight and height and categorized as obese (≥ 30 kg/m 2 ), overweight (25 to < 30 kg/m 2 ) and not overweight nor obese (< 25 kg/m 2 ). Epidemiological weeks were calculated based on the date of the test result. To account for correlation among exposures in multiple analysis, new variables were defined. For example, the use of surgical caps, goggles/face shields, and gloves were grouped as single PPE. As HCW may have work in more than one hospital area, these were classified according to risk as “high-risk” if working in COVID-19-designated zones and any of emergency room, inpatient ward or intensive care unit (ICU), as “middle risk” if did not work in a COVID-19-designated zone but in emergency or ICU, and as “low risk” if did not work in COVID-19-designated zone nor emergency nor ICU.
Mann–Whitney U -test and Chi-Square or Fisher test were used for comparisons as appropriate. Multiple Logistic regression models were fitted using the backward strategy and the likelihood ratio test. A variable remained in the model when partial F had a P ≤ 0.10, when confounding effect was observed, or by its clinical relevance on the outcome (i.e.; epidemiological weeks and hospital area). Model fit was evaluated by Hosmer and Lemeshow test. Calibration, specificity, and collinearity was also checked. The final model was selected considering the highest explicative ability measured by PseudoR 2 . Analyses were performed using Stata version 15 (StataCorp. LP, College Station,TX).
The flow diagram of the study population is shown in Fig. 1 . Among those contacted that met the eligibility criteria, 5% of cases and 14% of controls declined to participate, resulting in a final sample of 110 cases and 113 controls. RT-PCR was ordered because of symptoms in 59.2% of participants and the remaining 40.8% as part of contact tracing or institutional screening. At the time of the interview, all HCW reported to wear some type of facemask in both the institutional and community settings. Oral contraceptives were the most common type of hormonal contraception (82%). Differences between cases and control are shown in Tables 1 , 2 and 3 . Among females, the difference between cases and controls in the use of hormonal contraceptives was observed mainly in symptomatic women (OR = 2.05 95% CI 0.75–5.64).

Flowchart of the study participants
Modifiable and non-modifiable risk factors remained in the multivariate model as shown in Table 4 . The use of a high-performance mask or a combination of fabric and surgical mask outside the workplace showed a protective effect (AdjOR = 0.27 95% CI 0.09–0.80). Not wearing any of surgical caps, face shields/goggles or gloves (AdjOR = 0.10 95% CI 0.02–0.41) and feeling scared or nervous (AdjOR = 0.45 95% CI 0.22–0.91) were also protective. On the contrary, not always wearing high-performance mask within the workplace (AdjOR = 2.27 95% CI 1.02–5.05) and not using PPE properly (AdjOR = 4.82 95% CI 1.18–19.65) were positive associated with the infection. Male gender (AdjOR = 4.13 95% CI 1.70–10.05) and being nurse AdjOR = 11.24 95% CI 1.05–119.63) increased the risk, while college graduate AdjOR = 0.06 95% CI 0.01–0.53) and postgraduate education (AdjOR = 0.05 95% CI 0.005–0.47) reduced the risk of a positive RT-PCR.
This study identified modifiable and non-modifiable factors associated to a positive RT-PCR among HCW. Particularly, a greater protective effect of high-performance masks, or double masking outside the workplace was observed when compared to other types. Conversely, surgical caps, face shields/goggles and gloves were found to increase risk. Psychological factors that prevented being overconfident about SARS-CoV-2 transmission were protective. For non-modifiable factors, male gender increased the risk while higher level of education was protective.
Concerning face-masks, those HCW always-wearing high-performance filtering masks had a better protection when compared to those wearing them occasionally or wearing other types of facemasks. This protective effect is controversial in the literature, with results suggesting greater [ 21 ], similar [ 22 ] or even lower [ 23 ] protection compared to surgical masks. Different types of masks, manufacturer standards, and the evaluation of potential confounders may explain discordances between studies. In addition, there is not a clear recommendation for the type of mask that HCW need to wear outside the workplace [ 19 , 24 ]. In line with previous studies [ 25 , 26 ], our results suggest that fabric and surgical masks performed similarly, while wearing high-performance filtering masks or a combination of fabric plus surgical mask reduces the risk of infection compared to the use of surgical mask exclusively. Therefore, HCW could be advised to wear high-performance mask even when they are not directly taking care of COVID-19 patients, or in case of a shortage, low resource settings or high cost of high-performance masks, a combination of fabric plus surgical mask as an alternative.
Controversially, our study reported a greater risk among those who always wore face shields/goggles, gloves and surgical caps. In this regard, the evidence is limited [ 24 ] and the statistically significant protective effect disappears after covariates adjustment [ 27 ]. A false sense of safety resulting in self-contamination, sharing reusable PPE without appropriate disinfection protocols, or relaxing their use [ 28 , 29 , 30 ] could explain this result. In any case, emphasis needs to be given to the proper use of PPE during and after patient´s care, as previously stated [ 15 , 31 , 32 , 33 ].
Another modifiable psychological factor showing a protective effect was feeling scared or nervous. Despite the fact that we did not evaluate the source of stress, anxious individuals are less confident in their abilities to managing threated situations [ 34 ]. Therefore, they are more sensitive to feedback and to be hypervigilant in monitoring their surroundings and themselves which leads to strategic actions to avoid harm [ 35 ]. Whether this apparent protective effect will persist through the duration of the pandemic needs to be elucidated.
Non-modifiable risk factors included sex, education and occupation.Our results support a greater risk of having a positive RT-PCR among men. The testosterone suppression effect on the innate immune responses [ 36 ], the differential expression of ACE2 between males and females [ 37 ], and a better compliance among women with biosafety measures [ 38 ] could explain the gender differences in COVID-19 susceptibility. Notably, we observed a differential but no significant risk among women according to the use of hormonal contraceptives, which requires further evaluation. The greater risk among less-educated adults compared to university graduated is consistent with a previous report [ 39 ]. Our study reports a greater risk among nurses when compared to nursing assistants; however, the precision of this estimation was low. Despite these factors are not modifiable, some strategies focusing on high risk groups could be implemented to reduce their risk, e.g. special training and monitoring for men and less educated groups.
To prevent misclassification bias, interviewers were masked to the participant´s case or control status. Although we did not quantify the possible effect of recall bias, phone questionnaires have been used in other pandemics [ 40 ] and are as valid as face-to-face interviews for collecting behavioural information [ 41 , 42 ]. Moreover, we expect recall bias to be non-differential given that the time between the RT-PCR results and the interview were similar between groups. Self-report of anthropometric measures has been found to be accurate in terms of weight classification [ 43 , 44 ]. The reasons for declining participation were similar between groups and were mainly related to availability (in terms of time), which made selection bias unlikely. Residual confounding could be present due to unmeasured variables such as quality of training, doffing practices, or the prevalence of the infection in the place of residence. In addition, residual confounding could be due to remaining differences in variables such as the type of hormonal contraceptives and the number of mask layers. Our results should not be extrapolated to the general population because health care workers are likely to behave differently regarding PPE use and risk of infection.
In conclusion, modifiable and non-modifiable factors were associated to SARS-CoV-2 infection among HCW, independent of the level of exposure. High-performance masks or double masking, adequate use of PPE and feeling scared or nervous were protective factors. In addition, gender, level of education along with occupational characteristics, were also associated with the risk of infection and need to be considered when planning public health and health care facilities prevention strategies.
Availability of data and materials
The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.
Abbreviations
Angiotensin Converting Enzyme
Adjusted Odds Ratio
Confidence Interval
Coronavirus Infection Disease 2019
Diabetes Mellitus
Health Care Worker
High Blood Pressure
Personal Protection Equipment
Intensive Care Unit
Reverse Transcription-Polymerase Chain Reaction
Severe Acute Respiratory Syndrome Coronavirus 2
1Progressier. The Coronavirus App 2020. https://coronavirus.app/map . Accessed 1 Jan 2021.
World Health Organization. Summary of probable SARS cases with onset of illness from 1 November 2002 to 31 July 2003 2003. Available at: https://www.who.int/csr/sars/country/table2004_04_21/en/ (accessed 1 January 2021).
World Health Organization. Epidemic and pandemic-prone diseases MERS situation update, January 2020 2019. https://applications.emro.who.int/docs/EMCSR246E.pdf?ua=1 . Accessed 1 Jan 2021.
Harrison D, Muradali K, El Sahly H, Bozkurt B, Jneid H. Impact of the SARS-CoV-2 pandemic on health-care workers. Hosp Pract. 2020;48(4):161–4. https://doi.org/10.1080/21548331.2020.1771010 .
Article Google Scholar
Azer SA. COVID-19: pathophysiology, diagnosis, complications and investigational therapeutics. New Microbes New Infect. 2020;37: 100738. https://doi.org/10.1016/j.nmni.2020.100738 .
Article CAS PubMed PubMed Central Google Scholar
Wu L, O’Kane AM, Peng H, Bi Y, Motriuk-Smith D, Ren J. SARS-CoV-2 and cardiovascular complications: from molecular mechanisms to pharmaceutical management. Biochem Pharmacol. 2020;178: 114114. https://doi.org/10.1016/j.bcp.2020.114114 .
Aksoy YE, Koçak V. Psychological effects of nurses and midwives due to COVID-19 outbreak: the case of Turkey. Arch Psychiatr Nurs. 2020;34(5):427–33. https://doi.org/10.1016/j.apnu.2020.07.011 .
Article PubMed PubMed Central Google Scholar
Luo M, Guo L, Yu M, Jiang W, Wang H. The psychological and mental impact of coronavirus disease 2019 (COVID-19) on medical staff and general public—a systematic review and meta-analysis. Psychiatry Res. 2020;291: 113190. https://doi.org/10.1016/j.psychres.2020.113190 .
Solis J, Franco-Paredes C, Henao-Martínez AF, Krsak M, Zimmer SM. Structural vulnerability in the U.S. revealed in three waves of COVID-19. Am J Trop Med Hyg. 2020;103(1):25–7. https://doi.org/10.4269/ajtmh.20-0391 .
Fisayo T, Tsukagoshi S. Three waves of the COVID-19 pandemic. Postgrad Med J. 2020. https://doi.org/10.1136/postgradmedj-2020-138564 .
Article PubMed Google Scholar
Barrett ES, Horton DB, Roy J, Gennaro ML, Brooks A, Tischfield J, et al. Prevalence of SARS-CoV-2 infection in previously undiagnosed health care workers in New Jersey, at the onset of the U.S. COVID-19 pandemic. BMC Infect Dis. 2020;20(1):853. https://doi.org/10.1186/s12879-020-05587-2 .
Ngai JC, Ko FW, Ng SS, To KW, Tong M, Hui DS. The long-term impact of severe acute respiratory syndrome on pulmonary function, exercise capacity and health status. Respirology. 2010;15(3):543–50. https://doi.org/10.1111/j.1440-1843.2010.01720.x .
Instituto Nacional de Salud. COVID-19 en personal de salud en Colombia | Boletín No. 52 | 21-10-2020 2020. https://www.ins.gov.co/Noticias/Paginas/coronavirus-personal-salud.aspx . Accessed 13 Jan 2021.
Torres M, Holguín J, Alarcón L, Arturo G, Luna A, Murillo S, et al. Análisis descriptivo de los casos positivos para COVID-19 del sector salud, Santiago de Cali-Colombia 2020. Gac Sanit. 2020;34:285–6.
Google Scholar
Gómez-Ochoa SA, Franco OH, Rojas LZ, Raguindin PF, Roa-Díaz ZM, Wyssmann BM, et al. COVID-19 in healthcare workers: a living systematic review and meta-analysis of prevalence, risk factors, clinical characteristics, and outcomes. Am J Epidemiol. 2021;190(1):161–75. https://doi.org/10.1093/aje/kwaa191 .
Iversen K, Bundgaard H, Hasselbalch RB, Kristensen JH, Nielsen PB, Pries-Heje M, et al. Risk of COVID-19 in health-care workers in Denmark: an observational cohort study. Lancet Infect Dis. 2020;20(12):1404–8. https://doi.org/10.1016/S1473-3099(20)30589-2 .
Liu T, Liang W, Zhong H, He J, Chen Z, He G, et al. Risk factors associated with COVID-19 infection: a retrospective cohort study based on contacts tracing. Emerg Microbes Infect. 2020;9(1):1546–53. https://doi.org/10.1080/22221751.2020.1787799 .
Ran L, Chen X, Wang Y, Wu W, Zhang L, Tan X. Risk factors of healthcare workers with corona virus disease 2019: a retrospective cohort study in a designated hospital of Wuhan in China. Clin Infect Dis. 2020;71(16):2218–21. https://doi.org/10.1093/cid/ciaa287 .
Article CAS PubMed Google Scholar
Chou R, Dana T, Jungbauer R, Weeks C. Update alert 3: masks for prevention of respiratory virus infections, including SARS-CoV-2, in health care and community settings. Ann Intern Med. 2020. https://doi.org/10.7326/L20-1292 .
COVID-19 en Colombia. https://www.ins.gov.co/Noticias/Paginas/coronavirus-filtro.aspx . Accessed 1 Jan 2021.
Chu DK, Akl EA, Duda S, Solo K, Yaacoub S, Schünemann HJ, et al. Physical distancing, face masks, and eye protection to prevent person-to-person transmission of SARS-CoV-2 and COVID-19: a systematic review and meta-analysis. Lancet. 2020;395(10242):1973–87. https://doi.org/10.1016/S0140-6736(20)31142-9 .
Bartoszko JJ, Farooqi MAM, Alhazzani W, Loeb M. Medical masks vs N95 respirators for preventing COVID-19 in healthcare workers: a systematic review and meta-analysis of randomized trials. Influenza Other Respir Viruses. 2020;14(4):365–73. https://doi.org/10.1111/irv.12745 .
Piapan L, De Michieli P, Ronchese F, Rui F, Mauro M, Peresson M, et al. COVID-19 outbreak in healthcare workers in hospitals in Trieste. North-east Italy J Hosp Infect. 2020;106(3):626–8. https://doi.org/10.1016/j.jhin.2020.08.012 .
Garcia LR, Jones AE, Anderson TN, Fisher CL, Seeley KML, Beeson EA, et al. Facial protection for healthcare workers during pandemics: a scoping review. BMJ Glob Health. 2020;5(5): e002553. https://doi.org/10.1136/bmjgh-2020-002553 .
Xiao J, Fang M, Chen Q, He B. SARS, MERS and COVID-19 among healthcare workers: a narrative review. J Infect Public Health. 2020;13(6):843–8. https://doi.org/10.1016/j.jiph.2020.05.019 .
Brooks J, Beezhold DH, Noti JD, et al. Maximizing fit for cloth and medical procedure masks to improve performance and reduce SARS-CoV-2 transmission and exposure 2021. MMWR. 2021. https://doi.org/10.15585/mmwr.mm7007e1 .
Chou R, Dana T, Buckley DI, Selph S, Fu R, Totten AM. Update Alert 3: Epidemiology of and Risk Factors for Coronavirus Infection in Health Care Workers. Ann Intern Med. 2020;173(6):W123–4. https://doi.org/10.7326/L20-1005 .
Reddy SC, Valderrama AL, Kuhar DT. Improving the use of personal protective equipment: applying lessons learned. Clin Infect Dis. 2019;69(Suppl 3):S165–70. https://doi.org/10.1093/cid/ciz619 .
Zellmer C, Van Hoof S, Safdar N. Variation in health care worker removal of personal protective equipment. Am J Infect Control. 2015;43(7):750–1. https://doi.org/10.1016/j.ajic.2015.02.005 .
Katanami Y, Hayakawa K, Shimazaki T, Sugiki Y, Takaya S, Yamamoto K, et al. Adherence to contact precautions by different types of healthcare workers through video monitoring in a tertiary hospital. J Hosp Infect. 2018;100(1):70–5. https://doi.org/10.1016/j.jhin.2018.01.001 .
Chatterjee P, Anand T, Singh KJ, Rasaily R, Singh R, Das S, et al. Healthcare workers & SARS-CoV-2 infection in India: a case-control investigation in the time of COVID-19. Indian J Med Res. 2020;151(5):459–67. https://doi.org/10.4103/ijmr.IJMR_2234_20 .
Chou R, Dana T, Buckley DI, Selph S, Fu R, Totten AM. Epidemiology of and risk factors for coronavirus infection in health care workers: a living rapid review. Ann Intern Med. 2020;173(2):120–36. https://doi.org/10.7326/M20-1632 .
MacIntyre CR, Wang Q, Cauchemez S, Seale H, Dwyer DE, Yang P, et al. A cluster randomized clinical trial comparing fit-tested and non-fit-tested N95 respirators to medical masks to prevent respiratory virus infection in health care workers. Influenza Other Respir Viruses. 2011;5(3):170–9. https://doi.org/10.1111/j.1750-2659.2011.00198.x .
Eysenck MW, Derakshan N. New perspectives in attentional control theory. Pers Individ Dif. 2011;50(7):955–60. https://doi.org/10.1016/j.paid.2010.08.019 .
Cheng BH, McCarthy JM. Understanding the dark and bright sides of anxiety: A theory of workplace anxiety. J Appl Psychol. 2018;103(5):537–60. https://doi.org/10.1037/apl0000266 .
Conti P, Younes A. Coronavirus COV-19/SARS-CoV-2 affects women less than men: clinical response to viral infection. J Biol Regul Homeost Agents. 2020;34(2):339–43. https://doi.org/10.23812/Editorial-Conti-3 .
Komukai K, Mochizuki S, Yoshimura M. Gender and the renin-angiotensin-aldosterone system. Fundam Clin Pharmacol. 2010;24(6):687–98. https://doi.org/10.1111/j.1472-8206.2010.00854.x .
Moran KR, Del Valle SY. A meta-analysis of the association between gender and protective behaviors in response to respiratory epidemics and pandemics. PLoS ONE. 2016;11(10): e0164541. https://doi.org/10.1371/journal.pone.0164541 .
Rezende LFM, Thome B, Schveitzer MC, Souza-Júnior PRB, Szwarcwald CL. Adults at high-risk of severe coronavirus disease-2019 (Covid-19) in Brazil. Rev Saude Publica. 2020;54:50. https://doi.org/10.11606/s1518-8787.2020054002596 .
Rubin GJ, Bakhshi S, Amlôt R, Fear N, Potts HWW, Michie S. The design of a survey questionnaire to measure perceptions and behaviour during an influenza pandemic: the Flu TElephone Survey Template (FluTEST). Southampton: NIHR Journals Library; 2014.
Smith A, Lyons A, Pitts M, Croy S, Ryall R, Garland S, et al. Assessing knowledge of human papillomavirus and collecting data on sexual behavior: computer assisted telephone versus face to face interviews. BMC Public Health. 2009. https://doi.org/10.1186/1471-2458-9-429 .
Fenig S, Levav I, Kohn R, Yelin N. Telephone vs face-to-face interviewing in a community psychiatric survey. Am J Public Health. 1993;83(6):896–8. https://doi.org/10.2105/ajph.83.6.896 .
Bowring AL, Peeters A, Freak-Poli R, Lim MS, Gouillou M, Hellard M. Measuring the accuracy of self-reported height and weight in a community-based sample of young people. BMC Med Res Methodol. 2012;21(12):175. https://doi.org/10.1186/1471-2288-12-175 .
Wright FL, Green J, Reeves G, Beral V, Cairns BJ. Validity over time of self-reported anthropometric variables during follow-up of a large cohort of UK women. BMC Med Res Methodol. 2015. https://doi.org/10.1186/s12874-015-0075-1 .
Download references
Acknowledgements
This work was supported by Pontificia Universidad Javeriana-Cali and Universidad del Valle. The content is solely the responsibility of the authors and does not necessarily represent the official view of Pontificia Universidad Javeriana or Universidad del Valle.
This work was supported by Pontificia Universidad Javeriana-Cali and Universidad del Valle.
Author information
Authors and affiliations.
Pontificia Universidad Javeriana Seccional Cali, Calle 18 No.118 - 250 Edificio Raúl Posada S.J. Tercer Piso, Cali, Colombia
Merida Rodriguez-Lopez, Enrique Vergara, Laura Rey & Jorge Holguin
Universidad del Valle, Cali, Colombia
Beatriz Parra, Mercedes Salcedo & Lyda Osorio
Secretaria de Salud Pública Municipal de Cali, Cali, Colombia
Gabriela Arturo, Liliana Alarcon & Jorge Holguin
You can also search for this author in PubMed Google Scholar
Contributions
MR-L: Conception and design of the study, data collection, funding acquisition, statistical analysis, data interpretation, writing of the article and review and approval of the final version. LO: conception and design of the study, data collection, funding acquisition, statistical analysis, data interpretation, review and approval of the final version BP: Conception and designed of the study, funding acquisition, data collection and interpretation, review and approval of the final version. MS: Conception and designed of the study, data collection and interpretation, review and approval of the final version. EV & LR: Data collection, review and approval of the final version. GA: Conception and designed of the study, data collection and interpretation, review and approval of the final version. LA & JH: Data collection and interpretation, review and approval of the final version. All authors read and approved the final manuscript.
Corresponding author
Correspondence to Merida Rodriguez-Lopez .
Ethics declarations
Ethics approval and consent to participate.
Informed consent was obtained from all participants. All methods were carried out in accordance with relevant guidelines and regulations. The study protocol was approved by the Universidad Javeriana Cali Ethics Committee.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ . The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/ ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
Reprints and Permissions
About this article
Cite this article.
Rodriguez-Lopez, M., Parra, B., Vergara, E. et al. A case–control study of factors associated with SARS-CoV-2 infection among healthcare workers in Colombia. BMC Infect Dis 21 , 878 (2021). https://doi.org/10.1186/s12879-021-06581-y
Download citation
Received : 21 April 2021
Accepted : 17 August 2021
Published : 27 August 2021
DOI : https://doi.org/10.1186/s12879-021-06581-y
Share this article
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
- Risk factors
- Healthcare workers
BMC Infectious Diseases
ISSN: 1471-2334
- Submission enquiries: [email protected]
- General enquiries: [email protected]
Quantitative study designs: Case Control
Quantitative study designs.
- Introduction
- Cohort Studies
- Randomised Controlled Trial
Case Control
- Cross-Sectional Studies
- Study Designs Home
In a Case-Control study there are two groups of people: one has a health issue (Case group), and this group is “matched” to a Control group without the health issue based on characteristics like age, gender, occupation. In this study type, we can look back in the patient’s histories to look for exposure to risk factors that are common to the Case group, but not the Control group. It was a case-control study that demonstrated a link between carcinoma of the lung and smoking tobacco . These studies estimate the odds between the exposure and the health outcome, however they cannot prove causality. Case-Control studies might also be referred to as retrospective or case-referent studies.
Stages of a Case-Control study
This diagram represents taking both the case (disease) and the control (no disease) groups and looking back at their histories to determine their exposure to possible contributing factors. The researchers then determine the likelihood of those factors contributing to the disease.

(FOR ACCESSIBILITY: A case control study is likely to show that most, but not all exposed people end up with the health issue, and some unexposed people may also develop the health issue)
Which Clinical Questions does Case-Control best answer?
Case-Control studies are best used for Prognosis questions.
For example: Do anticholinergic drugs increase the risk of dementia in later life? (See BMJ Case-Control study Anticholinergic drugs and risk of dementia: case-control study )
What are the advantages and disadvantages to consider when using Case-Control?
* Confounding occurs when the elements of the study design invalidate the result. It is usually unintentional. It is important to avoid confounding, which can happen in a few ways within Case-Control studies. This explains why it is lower in the hierarchy of evidence, superior only to Case Studies.
What does a strong Case-Control study look like?
A strong study will have:
- Well-matched controls, similar background without being so similar that they are likely to end up with the same health issue (this can be easier said than done since the risk factors are unknown).
- Detailed medical histories are available, reducing the emphasis on a patient’s unreliable recall of their potential exposures.
What are the pitfalls to look for?
- Poorly matched or over-matched controls. Poorly matched means that not enough factors are similar between the Case and Control. E.g. age, gender, geography. Over-matched conversely means that so many things match (age, occupation, geography, health habits) that in all likelihood the Control group will also end up with the same health issue! Either of these situations could cause the study to become ineffective.
- Selection bias: Selection of Controls is biased. E.g. All Controls are in the hospital, so they’re likely already sick, they’re not a true sample of the wider population.
- Cases include persons showing early symptoms who never ended up having the illness.
Critical appraisal tools
To assist with critically appraising case control studies there are some tools / checklists you can use.
CASP - Case Control Checklist
JBI – Critical appraisal checklist for case control studies
CEBMA – Centre for Evidence Based Management – Critical appraisal questions (focus on leadership and management)
STROBE - Observational Studies checklists includes Case control
SIGN - Case-Control Studies Checklist
NCCEH - Critical Appraisal of a Case Control Study for environmental health
Real World Examples
Smoking and carcinoma of the lung; preliminary report
- Doll, R., & Hill, A. B. (1950). Smoking and carcinoma of the lung; preliminary report. British Medical Journal , 2 (4682), 739–748. Retrieved from https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2038856/
- Key Case-Control study linking tobacco smoking with lung cancer
- Notes a marked increase in incidence of Lung Cancer disproportionate to population growth.
- 20 London Hospitals contributed current Cases of lung, stomach, colon and rectum cancer via admissions, house-physician and radiotherapy diagnosis, non-cancer Controls were selected at each hospital of the same-sex and within 5 year age group of each.
- 1732 Cases and 743 Controls were interviewed for social class, gender, age, exposure to urban pollution, occupation and smoking habits.
- It was found that continued smoking from a younger age and smoking a greater number of cigarettes correlated with incidence of lung cancer.
Anticholinergic drugs and risk of dementia: case-control study
- Richardson, K., Fox, C., Maidment, I., Steel, N., Loke, Y. K., Arthur, A., . . . Savva, G. M. (2018). Anticholinergic drugs and risk of dementia: case-control study. BMJ , 361, k1315. Retrieved from http://www.bmj.com/content/361/bmj.k1315.abstract .
- A recent study linking the duration and level of exposure to Anticholinergic drugs and subsequent onset of dementia.
- Anticholinergic Cognitive Burden (ACB) was estimated in various drugs, the higher the exposure (measured as the ACB score) the greater likeliness of onset of dementia later in life.
- Antidepressant, urological, and antiparkinson drugs with an ACB score of 3 increased the risk of dementia. Gastrointestinal drugs with an ACB score of 3 were not strongly linked with onset of dementia.
- Tricyclic antidepressants such as Amitriptyline have an ACB score of 3 and are an example of a common area of concern.
Omega-3 deficiency associated with perinatal depression: Case-Control study
- Rees, A.-M., Austin, M.-P., Owen, C., & Parker, G. (2009). Omega-3 deficiency associated with perinatal depression: Case control study. Psychiatry Research , 166(2), 254-259. Retrieved from http://www.sciencedirect.com/science/article/pii/S0165178107004398 .
- During pregnancy women lose Omega-3 polyunsaturated fatty acids to the developing foetus.
- There is a known link between Omgea-3 depletion and depression
- Sixteen depressed and 22 non-depressed women were recruited during their third trimester
- High levels of Omega-3 were associated with significantly lower levels of depression.
- Women with low levels of Omega-3 were six times more likely to be depressed during pregnancy.
References and Further Reading
Doll, R., & Hill, A. B. (1950). Smoking and carcinoma of the lung; preliminary report. British Medical Journal, 2(4682), 739–748. Retrieved from https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2038856/
Greenhalgh, Trisha. How to Read a Paper: the Basics of Evidence-Based Medicine, John Wiley & Sons, Incorporated, 2014. ProQuest Ebook Central, http://ebookcentral.proquest.com/lib/deakin/detail.action?docID=1642418 .
Himmelfarb Health Sciences Library. (2019). Study Design 101: Case-Control Study. Retrieved from https://himmelfarb.gwu.edu/tutorials/studydesign101/casecontrols.cfm
Hoffmann, T., Bennett, S., & Del Mar, C. (2017). Evidence-Based Practice Across the Health Professions (Third edition. ed.): Elsevier.
Lewallen, S., & Courtright, P. (1998). Epidemiology in practice: case-control studies. Community Eye Health, 11(28), 57. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1706071/
Pelham, B. W. a., & Blanton, H. (2013). Conducting research in psychology : measuring the weight of smoke /Brett W. Pelham, Hart Blanton (Fourth edition. ed.): Wadsworth Cengage Learning.
Rees, A.-M., Austin, M.-P., Owen, C., & Parker, G. (2009). Omega-3 deficiency associated with perinatal depression: Case control study. Psychiatry Research, 166(2), 254-259. Retrieved from http://www.sciencedirect.com/science/article/pii/S0165178107004398
Richardson, K., Fox, C., Maidment, I., Steel, N., Loke, Y. K., Arthur, A., … Savva, G. M. (2018). Anticholinergic drugs and risk of dementia: case-control study. BMJ, 361, k1315. Retrieved from http://www.bmj.com/content/361/bmj.k1315.abstract
Statistics How To. (2019). Case-Control Study: Definition, Real Life Examples. Retrieved from https://www.statisticshowto.com/case-control-study/
- << Previous: Randomised Controlled Trial
- Next: Cross-Sectional Studies >>
- Last Updated: Aug 21, 2023 10:12 AM
- URL: https://deakin.libguides.com/quantitative-study-designs

An official website of the United States government
The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.
The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.
- Publications
- Account settings
- Advanced Search
- Journal List
- Am J Public Health
- v.79(8); Aug 1989
A case-control study of pancreatic cancer and cigarettes, alcohol, coffee and diet.
A pancreatic cancer case-control study was conducted in the Minneapolis-St. Paul area. Family members were interviewed about the subject's usage of cigarettes, alcohol, coffee, and other dietary factors in the two years prior to death (cases, n = 212) or prior to interview (controls, n = 220). The adjusted odds ratio for two packs or more of cigarettes per day was 3.92 (95% CI = 1.18, 13.01) and four or more drinks per day OR 2.69 (95% CI = 1.00, 7.27). Coffee was not a risk factor (seven cups or more per day; OR 0.58 (95% CI = 0.27, 1.27). A positive trend was observed for beef and pork consumption, and a negative trend from cruciferous vegetables.
Full text is available as a scanned copy of the original print version. Get a printable copy (PDF file) of the complete article (724K), or click on a page image below to browse page by page. Links to PubMed are also available for Selected References .
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- MacMahon B. Risk factors for cancer of the pancreas. Cancer. 1982 Dec 1; 50 (11 Suppl):2676–2680. [ PubMed ] [ Google Scholar ]
- Heuch I, Kvåle G, Jacobsen BK, Bjelke E. Use of alcohol, tobacco and coffee, and risk of pancreatic cancer. Br J Cancer. 1983 Nov; 48 (5):637–643. [ PMC free article ] [ PubMed ] [ Google Scholar ]
- Durbec JP, Chevillotte G, Bidart JM, Berthezene P, Sarles H. Diet, alcohol, tobacco and risk of cancer of the pancreas: a case-control study. Br J Cancer. 1983 Apr; 47 (4):463–470. [ PMC free article ] [ PubMed ] [ Google Scholar ]
- Hinds MW, Kolonel LN, Lee J, Hirohata T. Associations between cancer incidence and alcohol/cigarette consumption among five ethnic groups in Hawaii. Br J Cancer. 1980 Jun; 41 (6):929–940. [ PMC free article ] [ PubMed ] [ Google Scholar ]
- MacMahon B, Yen S, Trichopoulos D, Warren K, Nardi G. Coffee and cancer of the pancreas. N Engl J Med. 1981 Mar 12; 304 (11):630–633. [ PubMed ] [ Google Scholar ]
- Wynder EL, Hall NE, Polansky M. Epidemiology of coffee and pancreatic cancer. Cancer Res. 1983 Aug; 43 (8):3900–3906. [ PubMed ] [ Google Scholar ]
- Gold EB, Gordis L, Diener MD, Seltser R, Boitnott JK, Bynum TE, Hutcheon DF. Diet and other risk factors for cancer of the pancreas. Cancer. 1985 Jan 15; 55 (2):460–467. [ PubMed ] [ Google Scholar ]
- Norell SE, Ahlbom A, Erwald R, Jacobson G, Lindberg-Navier I, Olin R, Törnberg B, Wiechel KL. Diet and pancreatic cancer: a case-control study. Am J Epidemiol. 1986 Dec; 124 (6):894–902. [ PubMed ] [ Google Scholar ]
- Mack TM, Yu MC, Hanisch R, Henderson BE. Pancreas cancer and smoking, beverage consumption, and past medical history. J Natl Cancer Inst. 1986 Jan; 76 (1):49–60. [ PubMed ] [ Google Scholar ]
- Falk RT, Pickle LW, Fontham ET, Correa P, Fraumeni JF., Jr Life-style risk factors for pancreatic cancer in Louisiana: a case-control study. Am J Epidemiol. 1988 Aug; 128 (2):324–336. [ PubMed ] [ Google Scholar ]
- Hsieh CC, MacMahon B, Yen S, Trichopoulos D, Warren K, Nardi G. Coffee and pancreatic cancer (Chapter 2) N Engl J Med. 1986 Aug 28; 315 (9):587–589. [ PubMed ] [ Google Scholar ]
- Olsen GW, Mandel JS. Selection of elderly controls using random digit dialing. Am J Public Health. 1988 Nov; 78 (11):1487–1488. [ PMC free article ] [ PubMed ] [ Google Scholar ]
- Percy C, Stanek E, 3rd, Gloeckler L. Accuracy of cancer death certificates and its effect on cancer mortality statistics. Am J Public Health. 1981 Mar; 71 (3):242–250. [ PMC free article ] [ PubMed ] [ Google Scholar ]
- McLaughlin JK, Blot WJ, Mehl ES, Mandel JS. Problems in the use of dead controls in case-control studies. I. General results. Am J Epidemiol. 1985 Jan; 121 (1):131–139. [ PubMed ] [ Google Scholar ]
- Kolonel LN, Hirohata T, Nomura AM. Adequacy of survey data collected from substitute respondents. Am J Epidemiol. 1977 Dec; 106 (6):476–484. [ PubMed ] [ Google Scholar ]
- Marshall J, Priore R, Haughey B, Rzepka T, Graham S. Spouse-subject interviews and the reliability of diet studies. Am J Epidemiol. 1980 Nov; 112 (5):675–683. [ PubMed ] [ Google Scholar ]
- Sandler DP, Shore DL. Quality of data on parents' smoking and drinking provided by adult offspring. Am J Epidemiol. 1986 Nov; 124 (5):768–778. [ PubMed ] [ Google Scholar ]
- Velema JP, Walker AM, Gold EB. Alcohol and pancreatic cancer. Insufficient epidemiologic evidence for a causal relationship. Epidemiol Rev. 1986; 8 :28–41. [ PubMed ] [ Google Scholar ]
- Hakulinen T, Lehtimäki L, Lehtonen M, Teppo L. Cancer morbidity among two male cohorts with increased alcohol consumption in Finland. J Natl Cancer Inst. 1974 Jun; 52 (6):1711–1714. [ PubMed ] [ Google Scholar ]
- Robinette CD, Hrubec Z, Fraumeni JF., Jr Chronic alcoholism and subsequent mortality in World War II veterans. Am J Epidemiol. 1979 Jun; 109 (6):687–700. [ PubMed ] [ Google Scholar ]
- Ishii K, Nakamura K, Takeuchi T, Hirayama T. Chronic calcifying pancreatitis and pancreatic carcinoma in Japan. Digestion. 1973; 9 (5):429–437. [ PubMed ] [ Google Scholar ]
- Pour PM, Reber HA, Stepan K. Modification of pancreatic carcinogenesis in the hamster model. XII. Dose-related effect of ethanol. J Natl Cancer Inst. 1983 Nov; 71 (5):1085–1087. [ PubMed ] [ Google Scholar ]
- Armstrong B, Doll R. Environmental factors and cancer incidence and mortality in different countries, with special reference to dietary practices. Int J Cancer. 1975 Apr 15; 15 (4):617–631. [ PubMed ] [ Google Scholar ]
- Longnecker DS, Wiebkin P, Schaeffer BK, Roebuck BD. Experimental carcinogenesis in the pancreas. Int Rev Exp Pathol. 1984; 26 :177–229. [ PubMed ] [ Google Scholar ]
- Wattenberg LW. Inhibition of carcinogen-induced neoplasia by sodium cyanate, tert-butyl isocyanate, and benzyl isothiocyanate administered subsequent to carcinogen exposure. Cancer Res. 1981 Aug; 41 (8):2991–2994. [ PubMed ] [ Google Scholar ]
- Research article
- Open access
- Published: 14 May 2020

Application of the matched nested case-control design to the secondary analysis of trial data
- Christopher Partlett ORCID: orcid.org/0000-0001-5139-3412 1 , 2 ,
- Nigel J. Hall 3 ,
- Alison Leaf 4 , 2 ,
- Edmund Juszczak 2 &
- Louise Linsell 2
BMC Medical Research Methodology volume 20 , Article number: 117 ( 2020 ) Cite this article
16k Accesses
12 Citations
3 Altmetric
Metrics details
A nested case-control study is an efficient design that can be embedded within an existing cohort study or randomised trial. It has a number of advantages compared to the conventional case-control design, and has the potential to answer important research questions using untapped prospectively collected data.
We demonstrate the utility of the matched nested case-control design by applying it to a secondary analysis of the Abnormal Doppler Enteral Prescription Trial. We investigated the role of milk feed type and changes in milk feed type in the development of necrotising enterocolitis in a group of 398 high risk growth-restricted preterm infants.
Using matching, we were able to generate a comparable sample of controls selected from the same population as the cases. In contrast to the standard case-control design, exposure status was ascertained prior to the outcome event occurring and the comparison between the cases and matched controls could be made at the point at which the event occurred. This enabled us to reliably investigate the temporal relationship between feed type and necrotising enterocolitis.
Conclusions
A matched nested case-control study can be used to identify credible associations in a secondary analysis of clinical trial data where the exposure of interest was not randomised, and has several advantages over a standard case-control design. This method offers the potential to make reliable inferences in scenarios where it would be unethical or impractical to perform a randomised clinical trial.
Peer Review reports
Key messages
A matched nested case-control design provides an efficient way to investigate causal relationships using untapped data from prospective cohort studies and randomised controlled trials
This method has several advantages over a standard case-control design, particularly when studying time-dependent exposures on rare outcomes
It offers the potential to make reliable inferences in scenarios where unethical or practical issues preclude the use of a randomised controlled trial
Randomised controlled trials (RCTs) are regarded as the gold standard in evidence based medicine, due to their prospective design and the minimisation of important sources of bias through the use of randomisation, allocation concealment and blinding. However, RCTs are not always appropriate due to ethical or practical issues, particularly when investigating risk factors for an outcome. If beliefs about the causal role of a risk factor are already embedded within a clinical community, based on concrete evidence or otherwise, then it is not possible to conduct an RCT due to lack of equipoise. It is often not feasible to randomise potential risk factors, for example, if they are biological or genetic or if there is a strong element of patient preference involved. In such scenarios, the main alternative is to conduct an observational study; either a prospective cohort study which can be complicated and costly, or a retrospective case-control study with methodological shortcomings.
The nested case-control study design employs case-control methodology within an established prospective cohort study [ 1 ]. It first emerged in the 1970–80s and was typically used when it was expensive or difficult to obtain data on a particular exposure for all members of the cohort; instead a subset of controls would be selected at random [ 2 ]. This method with the use of matching has been shown to be an efficient design that can be used to provide unbiased estimates of relative risk with considerable cost savings [ 3 , 4 , 5 ]. Cases who develop the outcome of interest at a given point in time are matched to a random subset of members of the cohort who have not experienced the outcome at that time. These controls may develop the outcome later and become a case themselves, and they may also act as a control for other cases [ 6 , 7 ]. This approach has a number of advantages compared to the standard case-control design: (1) cases and controls are sampled from the same population, (2) exposures are measured prior to the outcome occurring, and (3) cases can be matched to controls at the time (e.g. age) of the outcome event.
More recently, the nested case-control design has been used within RCTs to investigate the causative role of risk factors in the development of trial outcomes [ 8 , 9 , 10 ]. In this paper we investigate the utility of the matched nested case-control design in a secondary analysis of the ADEPT: Abnormal Doppler Enteral Prescription Trial (ISRCTN87351483) data, to investigate the role of different types of milk feed (and changes in types of milk feed) in the development of necrotising enterocolitis. We illustrate the use of this methodology and explore issues relating to its implementation. We also discuss and appraise the value of this methodology in answering similar challenging research questions using clinical trial data more generally.
ADEPT: Abnormal Doppler Enteral Prescription Trial (ISRCTN87351483) was funded by Action Medical Research (SP4006) and investigated whether early (24–48 h after birth) or late (120–144 h after birth) introduction of milk feeds was a risk factor for necrotising enterocolitis (NEC) in a population of 404 infants born preterm and growth-restricted, following abnormal antenatal Doppler blood flow velocities [ 11 ]. Consent and randomisation occurred in the first 2 days after birth. There was no difference found in the incidence of NEC between the two groups, however there was interest in the association between feed type (formula/fortifier or exclusive mother/donor breast milk) and the development of NEC. Breast milk is one of few factors believed to reduce the risk of NEC that has been widely adopted into clinical practice, despite a paucity of high quality population based data [ 12 , 13 ]. However, due to lack of equipoise it would not be ethical or feasible to conduct a trial randomising newborn infants to formula or breast milk.
With additional funding from Action Medical Research (GN2506), the authors used a matched nested case-control design to investigate the association between feed type and the development of severe NEC, defined as Bell’s staging Stage II or III [ 14 ], using detailed daily feed log data from the ADEPT trial. The feed type and quantity of feed was recorded daily until an infant had reached full feeds and had ceased parenteral nutrition, or until 28 days after birth, whichever was longest. Using this information, infants were classified according to the following predefined exposures:
Exposure to formula milk or fortifier in the first 14 days of life
Exposure to formula milk or fortifier in the first 28 days of life
Any prior exposure to formula milk or fortifier
Change in feed type (between formula, fortifier or breast milk) within the previous 7 days.
In the remainder of the methods section we discuss the challenges of conducting this analysis and practical issues encountered in applying the matched nested case-control methodology. In the results section we present data from different aspects of the analysis, to illustrate the utility of this approach in answering the research question.
Cohort time axis
For the main trial analysis, time of randomisation was defined as time zero, which is the conventional approach given that events occurring prior to randomisation cannot be influenced by the intervention under investigation. However, for the nested case-control analysis, time zero was defined as day of delivery because age in days was considered easier to interpret, and also it was possible for an outcome event to occur prior to randomisation. Infants were followed up until their exit time, which was defined by the first occurrence of NEC, death or the last daily feed log record.
Case definition
An infant was defined as a case at their first recorded incidence of severe NEC, defined as Bell’s staging Stage II or III [ 14 ]. Infants could only be included as a case once; subsequent episodes of NEC in the same infant were not counted. Once an infant had been identified as a case, they could not be included in any future risk sets for other cases, even if the NEC episode had been resolved.
Risk set definition
One of the major challenges was identifying an appropriate risk set from which controls could be sampled, whilst also allowing the analysis to incorporate the time dependent feed log data and adjust for known confounders. A diagnosis of NEC has a crucial impact on the subsequent feeding of an infant, therefore it was essential that the analysis only included exposure to non-breast milk feeds prior to the onset of NEC. A standard case-control analysis would have produced misleading results in this context, as infants would have been defined as a cases if they had experienced NEC prior to the end of the study period, regardless of the timing of the event in relation to exposure to non-breast milk. Using a matched nested case-control design allowed us to match an infant with a diagnosis of NEC (case) at a given point in time (days from delivery) to infants with similar characteristics (with respect to other important confounding factors), who had not experienced NEC at the failure time of the case. Figure 1 is a schematic diagram of this process. Each time an outcome event occurred (case), infants that were still at risk were eligible to be selected as a control (risk set). A matching algorithm was used to select a sample of controls with similar characteristics from this risk set. Infants selected as controls could go on to become a case themselves, and could also be included in the risk sets for other cases.

Schematic diagram illustrating the selection of controls from each risk set. Three days following delivery, an infant develops NEC. At this point, there are 11 infants left in the risk set. Four controls with the closest matching are selected, including one infant that becomes a future case on day 18
Selection of matching factors
An important consideration was the appropriate selection of matching factors as well as identifying the optimum mechanism for matching. Sex, gestational age and birth weight were considered to be clear candidates for matching factors, as they are all associated with the development NEC. Gestational age and birth weight in particular are both likely to impact the infant’s feeding and thus their exposure to non-breast milk feeds. Both gestational age and birth weight were matched simultaneously, because of the strong collinearity between gestational age and birth weight, illustrated in Fig. 2 . This was achieved by minimising the Mahalanobis distance from the case to prospective controls of the same sex [ 15 ]. That is, selecting the control closest in gestational age and birth weight to the case while taking into account the correlation between these characteristics.

Scatterplot of birth weight versus gestational age for infants with NEC (cases) and those without (controls)
Typically, treatment allocation would be incorporated as a matching factor since in a secondary analysis it is a nuisance factor imposed by the trial design, which should be accounted for. However, in this example, the ADEPT allocation is associated with likelihood of exposure, since it directly influences the feeding regime. For example, an infant randomised to receive early introduction of feeds is more likely to be exposed to non-breast milk feeds in the first 14 days (44%) than an infant randomised to late introduction of feeds (23%). The main trial results also demonstrated no evidence of association with the outcome (NEC) and therefore there was a concern about the potential for overmatching. Overmatching is caused by inappropriate selection of matching factors (i.e. factors which are not associated with the outcome of interest), which may harm the statistical efficiency of the analysis [ 16 ]. Therefore, we did not include the ADEPT allocation as a matching factor, but we conduct an unadjusted and adjusted analysis by trial arm, to examine its impact on the results.
Selection of controls
Another important consideration was the method used to randomly select controls from each risk set for each case. This can be performed with or without replacement and including or excluding the case in the risk set. We chose the recommended option of sampling without replacement and excluding the case from the risk set, which produces the optimal unbiased estimate of relative risk, with greater statistical efficiency [ 17 , 18 ]. However, infants could be included in multiple risk sets and be selected more than once as a control. We also included future cases of NEC as controls in earlier risk sets, as their exclusion can also lead to biased estimates of relative risk [ 19 ].
Number of controls
In standard case-control studies it has been shown that there is little statistical efficiency gained from having more than four matched controls relative to each case [ 20 , 21 ]. Using five controls is only 4% more efficient than using four, therefore there is no added benefit in using additional controls if a cost is attached, for example taking extra biological samples in a prospective cohort setting. However gains in statistical efficiency are possible by using more than four controls if the probability of exposure among controls is low (< 0.1) [ 4 , 5 ]. Neither of these were issues for this particular analysis, as there were no additional costs involved in using more controls and prevalence of the defined exposures to non-breast milk was over 20% among infants without a diagnosis of NEC. However, there was a concern that including additional controls with increasing distance from the gestational age and birth weight of the case may undermine the matching algorithm. Also, increasing the number of controls sampled per case would lead to an increase in repeated sampling, resulting in larger number of duplicates present in the overall matched control population. This was a particular concern as control duplication was most likely to occur for infants with the lowest birth weight and gestational ages, from which there is a much smaller pool of control infants to sample from. This would have resulted in a small number of infants (with low birth weight and gestational age) being sampled multiple times and having disproportionate weighting in the matched control sample. Therefore, we limited the number of matched controls to four per case.
Statistical analysis
The baseline characteristics of infants with NEC, the matched control group, and all infants with no diagnosis of NEC (non-cases) were compared. Numbers (with percentages) were presented for binary and categorical variables, and means (and standard deviations) or medians (with interquartile range and/or range) for continuous variables. Cases were matched to four controls with the same sex and smallest Mahalanobis distance based on gestational age and birth weight. Conditional logistic regression was used to calculate the odds ratio of developing NEC for cases compared matched controls for each predefined exposure with 95% confidence intervals. Unadjusted odds ratios were calculated, along with estimates adjusting for ADEPT allocation.
The results of the full analysis, including the application of this method to explore the relationship between feed type and other clinically relevant outcomes, are reported in a separate clinical paper (in preparation). Of the 404 infants randomised to ADEPT, 398 were included in this analysis (1 infant was randomised in error, 1 set of parents withdrew consent, 3 infants had no daily feed log data and for 1 infant the severity of NEC was unknown). There were 35 cases of severe NEC and 363 infants without a diagnosis of severe NEC (non-cases). Of the 140 matched controls randomly sampled from the risk set, 109 were unique, 31 were sampled more than once, and 8 had a subsequent diagnosis of severe NEC.
The baseline characteristics of infants with severe NEC (cases) and their matched controls are shown in Table 1 , alongside the characteristics of infants without a diagnosis of severe NEC (non-cases). The matching algorithm successfully produced a well matched collection of controls, based on the majority of these characteristics. There were, however, a slightly higher proportion of infants with the lowest birthweights (< 750 g) among the cases compared to the matched controls (49% vs 38%). The only other factors to show a noticeable difference between cases and matched controls are maternal hypertension (37% vs 49%) and ventilation at trial entry (6% vs 21%), neither of which have been previously identified as risk factors for NEC. Figure 3 shows scatter plots of birth weight and gestational age for the 35 individual cases of NEC and their matched controls, which provides a visual representation of the matching.

Scatterplots showing the matched cases and controls for each case of severe NEC. Each panel contains a separate case of NEC and the matched controls
The main results of the adjusted analysis are presented in Fig. 4 . Unadjusted analyses are included in Table A 1 in the supplementary material, alongside a post-hoc sensitivity analysis that additionally includes covariate adjustment for gestational age and birthweight. While the study did not identify any significant trends between feed-type and severe NEC the findings were consistent with the a priori hypothesis, that exposure to non-breast milk feeds is associated with an increased risk of NEC. In addition, the study identified some potential trends in the association of feed-type with other important outcomes, worthy of further investigation.

Forest plot showing the adjusted odds ratio comparing severe NEC to exposures. Odds ratios are adjusted for sex, gestational age and birthweight (via matching) and trial arm (via covariate adjustment). a Odds ratio and 95% confidence interval. b 109 unique controls
Employing a matched nested case-control design for this secondary analysis of clinical trial data overcame many of the limitations of a standard case-control analysis. We were able to select controls from the same population as the cases thus avoiding selection bias. Using matching, we were able to create a comparable sample of controls with respect to important clinical characteristics and confounding factors. This method allowed us to reliably investigate the temporal relationship between feed type and severe NEC since the exposure data was collected prospectively prior to the outcome occurring. We were also able to successfully investigate the relationship between feed type and several other important outcomes such as sepsis. A standard case-control analysis is typically based on recall or retrospective data collection once the outcome is known, which can introduce recall bias. If we had performed a simple comparison between cases and non-cases of NEC without taking into account the timing of the exposure, this would have produced misleading results. Another advantage of the matched nested case-control design was that we were able to match cases to controls at the time of the outcome event so that they were of comparable ages. The methodology is especially powerful when the timing of the exposure is of importance, particularly for time-dependent exposures such as the one studied here.
While the efficient use of existing trial data has a number of benefits, there are of course disadvantages to using data that were collected for another primary purpose. For instance, it is possible that such data are less robustly collected and checked. As a result, researchers may be more likely to encounter participants with either invalid or missing data.
For instance, the some of the additional feed log data collected in ADEPT were never intended to be used to answer clinical research questions, rather, their purpose was to monitor the adherence of participants to the intervention or provide added background information. In this study, it was necessary to make assumptions about missing data to fill small gaps in the daily feed logs. Researchers should take care that such assumptions are fully documented in the statistical analysis plan in advance and determined blinded to the outcome. Another option is to plan these sub-studies at the design phase, however, there needs to be a balance between the potential burden of additional data collection and having a streamlined trial that is able to answer the primary research question.
Another limitation of the methodology is that it is only possible to match on known confounders. This is in contrast to a randomised controlled trial, in which it is possible to balance on unknown and unmeasured baseline characteristics. As a consequence, particular care must be given to select important matching factors, but also to avoid overmatching.
The methodology allows for participants to be selected as controls multiple times, so there is the possibility that systematic duplication of a specific subset of participants (e.g. infants with a lower birthweight and smaller gestational age) could lead to a small number of participants disproportionately influencing the results. Within this study, we conducted sensitivity analyses with fewer controls, and were able to demonstrate that this had a minimal impact on the findings.
We have demonstrated how a matched nested case-control design can be embedded within an RCT to identify credible associations in a secondary analysis of clinical trial data where the exposure of interest was not randomised. We planned this study after the clinical trial data had already been collected, but it could have been built in seamlessly as a SWAT (Study Within A Trial) during the trial design phase, to ensure that all relevant data were collected in advance with minimal effort. This method has several advantages over a standard case-control design and offers the potential to make reliable inferences in scenarios where unethical or practical issues preclude the use of an RCT. Moreover, because of the flexibility of the methodology in terms of the design and analysis, the matched nested case-control design could reasonably be applied to a wide range of challenging research questions. There is an abundance of high quality large prospective studies and clinical trials with well characterised cohorts, in which this methodology could be applied to investigate causal relationships, adding considerable value for money to the original studies.
Availability of data and materials
ADEPT trial data are available upon reasonable request, subject to the NPEU Data Sharing Policy.
Abbreviations
Abnormal Doppler Enteral Prescription Trial
- Randomised controlled trial
Necrotising enterocolitis
Continuous positive airway pressure
Umbilical artery catheter
Umbilical venous catheter
Study within a trial
Breslow N. Design and analysis of case-control studies. Annu Rev Public Health. 1982;3(1):29–54.
Article CAS Google Scholar
Mantel N. Synthetic retrospective studies and related topics. Biometrics. 1973;29(3):479–86.
Breslow NE. Statistics in epidemiology: the case-control study. J Am Stat Assoc. 1996;91(433):14–28.
Breslow NE, Lubin J, Marek P, Langholz B. Multiplicative models and cohort analysis. J Am Stat Assoc. 1983;78(381):1–12.
Article Google Scholar
Goldstein L, Langholz B. Asymptotic theory for nested case-control sampling in the cox regression model. Ann Stat. 1992;20(4):1903–28.
Ernster VL. Nested Case-Control Studies. Prev Med. 1994;23(5):587–90. https://doi.org/10.1006/pmed.1994.1093 .
Article CAS PubMed Google Scholar
Essebag V, Genest J Jr, Suissa S, Pilote L. The nested case-control study in cardiology. Am Heart J. 2003;146(4):581–90. https://doi.org/10.1016/S0002-8703(03)00512-X .
Article PubMed Google Scholar
Nieuwlaat R, Connolly BJ, Hubers LM, Cuddy SM, Eikelboom JW, Yusuf S, et al. Quality of individual INR control and the risk of stroke and bleeding events in atrial fibrillation patients: a nested case control analysis of the ACTIVE W study. Thromb Res. 2012;129(6):715–9.
Fox GJ, Nhung NV, Loi NT, Sy DN, Britton WJ, Marks GB. Barriers to adherence with tuberculosis contact investigation in six provinces of Vietnam: a nested case–control study. BMC Infect Dis. 2015;15(1):103.
Mattson CL, Bailey RC, Agot K, Ndinya-Achola J, Moses S. A nested case-control study of sexual practices and risk factors for prevalent HIV-1 infection among young men in Kisumu, Kenya. Sex Trans Dis. 2007;34(10):731.
Google Scholar
Leaf A, Dorling J, Kempley S, McCormick K, Mannix P, Linsell L, et al. Early or delayed enteral feeding for preterm growth-restricted infants: a randomized trial. Pediatrics. 2012;129(5):e1260–e8. https://doi.org/10.1542/peds.2011-2379 .
Lucas A, Cole TJ. Breast milk and neonatal necrotising enterocolitis. Lancet. 1990;336(8730):1519–23.
McGuire W, Anthony MY. Donor human milk versus formula for preventing necrotising enterocolitis in preterm infants: systematic review. Arch Dis Child Fetal Neonatal Ed. 2003;88(1):F11–F4.
Walsh MC, Kliegman RM. Necrotizing enterocolitis: treatment based on staging criteria. Pediatr Clin N Am. 1986;33:179–201.
Mahalanobis PC. On the Generalized Distance in Statistics. Proceedings of the National Institute of Science of India. 1936;2:49-55.
Brookmeyer R, Liang K, Linet M. Matched case-control designs and overmatched analyses. Am J Epidemiol. 1986;124(4):693–701.
Lubin JH. Extensions of analytic methods for nested and population-based incident case-control studies. J Chronic Dis. 1986;39(5):379–88.
Robins JM, Gail MH, Lubin JH. More on" biased selection of controls for case-control analyses of cohort studies". Biometrics. 1986;42(2):293–9.
Lubin JH, Gail MH. Biased selection of controls for case-control analyses of cohort studies. Biometrics. 1984;40(1):63–75.
Ury HK. Efficiency of case-control studies with multiple controls per case: continuous or dichotomous data. Biometrics. 1975;31(3):643–9.
Gail M, Williams R, Byar DP, Brown C. How many controls? J Chronic Dis. 1976;29(11):723–31.
Meeting abstracts from the 5th International Clinical Trials Methodology Conference (ICTMC 2019). Trials. 2019;20(Suppl 1):579 Brighton, UK. 06–09 October 2019. doi: 10e.1186/s13063-019-3688-6 .
Download references
Acknowledgements
This work was presented at the International Clinical Trials Methodology Conference (ICTMC) in 2019 and the abstract is published within Trials [ 22 ].
This work was supported by Action Medical Research [Grant number GN2506]. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Author information
Authors and affiliations.
Nottingham Clinical Trials Unit, University of Nottingham, Nottingham, UK
Christopher Partlett
National Perinatal Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
Christopher Partlett, Alison Leaf, Edmund Juszczak & Louise Linsell
University Surgery Unit, Faculty of Medicine, University of Southampton, Southampton, UK
Nigel J. Hall
Department of Child Health, Faculty of Medicine, University of Southampton, Southampton, UK
Alison Leaf
You can also search for this author in PubMed Google Scholar
Contributions
NH, AL, EJ and LL conceived the project. CP performed the statistical analyses under the supervision of LL and EJ. CP and LL drafted the manuscript and EJ, AL and NH critically reviewed it. All authors were involved in the interpretation of results. The author(s) read and approved the final manuscript.
Corresponding author
Correspondence to Christopher Partlett .
Ethics declarations
Ethics approval and consent to participate.
No ethical approval was required for this study, since it used only previously collected, fully anonymised research data.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interest.
Additional information
Publisher’s note.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Additional file 1..
Table A1 Association between exposures and the development of Severe NEC. Each case is matched to 4 controls with the same sex and the smallest distance in terms of the Malhalanobis distance based on gestational age and birthweight.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ . The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/ ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
Reprints and Permissions
About this article
Cite this article.
Partlett, C., Hall, N.J., Leaf, A. et al. Application of the matched nested case-control design to the secondary analysis of trial data. BMC Med Res Methodol 20 , 117 (2020). https://doi.org/10.1186/s12874-020-01007-w
Download citation
Received : 03 December 2019
Accepted : 05 May 2020
Published : 14 May 2020
DOI : https://doi.org/10.1186/s12874-020-01007-w
Share this article
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
- Preterm infants
- Neonatology
- Statistical methods
- Nested case-control
BMC Medical Research Methodology
ISSN: 1471-2288
- Submission enquiries: [email protected]
- General enquiries: [email protected]
Click through the PLOS taxonomy to find articles in your field.
For more information about PLOS Subject Areas, click here .
Loading metrics
Open Access
Peer-reviewed
Research Article
A Case-Control Study on the Risk Factors for Meningococcal Disease among Children in Greece
* E-mail: [email protected]
Affiliation Department of Hygiene and Epidemiology, Faculty of Medicine, University of Thessaly, Larissa, Greece
Affiliation Aghia Sophia Children’s Hospital, National and Kapodistrian University of Athens School of Medicine, Athens, Greece
Affiliation National Reference Centre for Meningitis, National School of Public Health, Athens, Greece
- Christos Hadjichristodoulou,
- George Mpalaouras,
- Vasiliki Vasilopoulou,
- Antonios Katsioulis,
- George Rachiotis,
- Kalliopi Theodoridou,
- Georgia Tzanakaki,
- Vassiliki Syriopoulou,
- Maria Theodoridou

- Published: June 28, 2016
- https://doi.org/10.1371/journal.pone.0158524
- Reader Comments
The aim of this study was to identify environmental or genetic risk factors that are associated with invasive meningococcal disease (IMD) in children in Greece.
A case-control study was performed in 133 children (44 cases and 89 controls) aged between 0–14 years, who were hospitalized in a children's hospital in Athens. Demographics and possible risk factors were collected by the use of a structured questionnaire. To investigate the association of mannose binding lectin (MBL) with IMD, a frequency analysis of the haplotypes of the MBL2 gene and quantitative measurement of MBL serum protein levels were performed using Nanogen NanoChipR 400 technology and immuno-enzyme techniques, respectively.
The multivariate analysis revealed that changes in a child's life setting (relocation or vacation, OR = 7.16), paternal smoking (OR = 4.51), upper respiratory tract infection within the previous month (OR = 3.04) and the density of people in the house/100m 2 (OR = 3.16), were independent risk factors associated with IMD. Overall 18.8% of patients had a MBL2 genotype with low functionality compared to 10.1% of healthy controls, but this was not statistically significant (p = 0.189).
Prevention strategies aimed at reducing parental smoking and other risk factors identified in this study could decrease the risk of IMD among children in Greece.
Citation: Hadjichristodoulou C, Mpalaouras G, Vasilopoulou V, Katsioulis A, Rachiotis G, Theodoridou K, et al. (2016) A Case-Control Study on the Risk Factors for Meningococcal Disease among Children in Greece. PLoS ONE 11(6): e0158524. https://doi.org/10.1371/journal.pone.0158524
Editor: Daniela Flavia Hozbor, Universidad Nacional de la Plata, ARGENTINA
Received: January 26, 2016; Accepted: June 16, 2016; Published: June 28, 2016
Copyright: © 2016 Hadjichristodoulou et al. This is an open access article distributed under the terms of the Creative Commons Attribution License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper.
Funding: The authors received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Invasive meningococcal disease (IMD) is a contagious bacterial disease caused by a meningococcus ( Neisseria meningitidis ), a Gram-negative bacterium that is classified into 13 capsular groups according to its capsular polysaccharides. Six of these (A, B, C, Y, X and W), are of clinical significance as they cause invasive infections. In Europe, groups B and C are mainly responsible for IMD [ 1 ]. In the USA, groups B, C and Y cause a high proportion of IMD [ 2 ], while in Africa group A is predominant and groups W, X and C are also endemic [ 3 ]. Meningitis and septicemia are the two main clinical forms of IMD, and sometimes both clinical forms are found in the same patient. Meningococcal meningitis is a serious infection of the meninges that can cause severe brain damage and other sequelae. In meningococcal septicaemia, the onset of the symptoms is sudden and death can follow within hours. IMD has a high fatality rate and many survivors develop permanent sequelae [ 4 – 5 ].
Meningococcal infections are transmitted between people through respiratory droplets or secretions. N . meningitidis inhabits the mucosal membrane of the nose and throat, where it usually causes no harm [ 6 – 7 ]. There is substantial evidence that approximately 10% of the general population are asymptomatic carriers, although this rate varies with age, and is associated with a peak in early adulthood [ 8 – 10 ]. Several polysaccharide and conjugate vaccines are available for the protection of humans from the most common capsular groups of IMD. Polysaccharide vaccines are available in bivalent (A, C), trivalent (A, C, W), and quadrivalent (A, C, W, Y) forms. Conjugate vaccines, which are more immunogenic and can provide herd immunity, are available in monovalent (A or C), quadrivalent (A, C, W, Y), or combinatorial (group C and Haemophilus influenzae type b) forms [ 10 ]. Recently, a new vaccine against group B has been developed based on reverse vaccinology [ 11 ]. In Greece, the vaccine against group C has been included in the National Vaccination Programme (NVP) since 2006; recently, a vaccine against group B was made available in Greece but is not yet included in the NVP.
The risk of meningococcal infection in an individual is dependent on the balance of the virulence of the strain and the host’s immune response. Moreover, several environmental risk factors have been associated with the disease in several countries [ 12 – 13 ]. Both active and passive smoking in particular have been found to increase the risk of IMD in pediatric populations [ 14 – 16 ]. Other risk factors include crowded living conditions, close contact with an infected person, a history of recent upper respiratory tract infections and low socio-economic status [ 17 – 20 ]. Finally, individual risk factors such as an underlying disease (e.g., malignancies) or asplenia are also associated with a higher risk of developing IMD [ 21 – 22 ].
Genetic mutations of MBL(an acute protein phase that contributes to the elimination of bacteria by activating the complement system) have been identified as possible risk factors associated with IMD in several studies [ 23 – 25 ]. As contradictory results exist regarding the role of MBL in IMD, and there is no available evidence from Greece, we decided to investigate the role of MBL further as a predisposing factor for IMD. The aim of the current study was therefore to identify possible environmental or genetic factors that increase the predisposition of children in Greece to developing IMD, including an analysis of MBL serum protein levels and haplotype analysis of MBL2 gene.
Materials and Methods
Ethics statement.
Approval of the study protocol was received by the Ethics Committee of Aghia Sofia Children’s Hospital, which waived the need for written consent. Parents or guardians were informed about the aim of the study, and they provided written consent for their child’s participation in the study.
Study design
A case-control study was performed using 133 children (44 cases and 89 controls). All participants were children aged between 0–14 years, who had been hospitalized in two children's hospitals (Aghia Sophia and P & A Kyriakou) in Athens, Greece, within a 2-year period from January 2011 to December 2012.
Cases were had been hospitalized with a diagnosis of IMD (meningococcal meningitis and/or sepsis). In all cases, N . meningitidis was identified in samples of biological material (blood or CSF) in the laboratory and isolated using bacterial cultures or molecular techniques, such as polymerase chain reaction (PCR). In addition, to increase the reliability and the power of our genetic research, the frequency analysis of the gene polymorphisms of MBL2 , included 45 extra blood samples from the National Reference Centre for Meningitis (NRCM). These samples were collected from cases of IMD in children aged 0–14 years, who had become ill during the same period (2011–2012) and had been admitted to various hospitals across Greece. Our study was designed to encompass two spring seasons in order to capture any respective seasonal increase in IMD.
Controls were children hospitalized in the surgical wards of the two hospitals with a diagnosis that was unrelated to IMD or other infections. All controls were matched to cases using the sex and age (year of birth) of each child, and the week of admission. When it was possible, at least two controls were randomly selected for every case.
Data collection
In both cases and controls, a whole blood sample (3–4 mL) was collected on the day the child was admitted to the hospital, in order to study the MBL2 haplotypes and the serum levels of MBL protein. For each blood sample, 0.5 mL was stored immediately at -80°C, in order to be used in the frequency analysis of MBL2 polymorphisms. After centrifugation of the remaining blood sample, plasma was collected and kept readily cryopreserved at -80°C for the quantitative measurement of MBL.
Questionnaire
A questionnaire was distributed to the parents of both cases and controls, in order to obtain information on the following: their child’s demographic details (sex, age, race, height, weight, mother’s/father’s educational status, social security status, use of paediatrician or general practitioner services, family income, specific population group); family history (number of family members, number of children, birth order, family medical history, family medical history of meningitis); housing environment (size of the house in square meters, number of household members, heating system, type of house [single family house or block of flats], exposure to passive smoking at home). Moreover, the questionnaire included questions regarding perinatal history and breastfeeding; history of hospitalization; vaccination history; medications/ social history during the last month: attendance at nursery school, elementary/high school/college attendance; participation in sporting events; attendance at parties and playgrounds; church attendance; visits to restaurants or coffee bars; kissing other people; use of public transport; and change of life setting (relocation or vacation). If the answer to the question of change in life setting was positive the parents were asked to specify. Parents were also asked to report if their children had signs and symptoms compatible with upper respiratory infection (fever and/or cough and/or sore throat and/or rhinitis) during the last month. It should be noted that some questions in the questionnaire (e.g., kissing other people and coffee bars attendance) were relevant to older children (≥12 years old), while other questions (e.g., nursery and play areas attendance) were relevant to children aged ≤10 years.
Genetic analysis of MBL
MBL2 includes three polymorphic sites in exon 1 (cd52, cd54, cd57) forming the haplotypes AO, OO, and AA, three polymorphisms in the area of the promoter (-550, -221, +4) forming the haplotypes HY, LY and LX and four polymorphic sites located in exon 4 (carbohydrate recognition domain-CRD). The frequency analysis of MBL2 polymorphisms was performed by the Choremeio Research Laboratory of Medical Genetics at the University of Athens. Of the 44 cases of IMD, only 36 agreed to participate in the genetic analysis while 69 out of the 89 of the controls agreed to participate (response rates of 81.8% and 77.5%, respectively). To increase the power of the study, 45 cases of IMD from NRCM were included in the genetic analysis to achieve a total number of 81 cases of IMD analyzed.
Genomic DNA was isolated from 350 μl of peripheral blood, using the BioRobotR M48 System (Qiagen, Hilden, Germany) and the MagAttractR DNA Blood Midi M48 Kit (Qiagen, Hilden, Germany). For the characterization of the six SNPs in MBL2 , we developed an advanced high throughout methodology using the Nanogen NanoChipR 400 system (NC400, Nanogen Inc). Three separate regions of the MBL2 gene containing SNPs were designed for PCR amplification using the PrimerQuestSM tool provided by IDTR (Integrated DNA Technologies). The set-up of all PCR reactions was performed automatically using the Biorobot 3000 platform (Qiagen, Hilden, Germany) and carried out in a Techne TC-412 thermal cycler using the amplicon-down format of the Nanogen protocol. Results were estimated by the instrument's software.
The quantitative measurement of the MBL levels of serum was performed in 43 of 44 (97.7%) patients, and in 73 of 89 (82.0%) controls at the Department of Hygiene and Epidemiology at the University of Thessaly, using the MBL Oligomer ELISA Kit (BioPorto Diagnostics Co.).
Statistical analysis
All data collected from the study participants (questionnaire, clinical and laboratory results) were entered into an electronic database using Epi-info software (version 3.5.3, CDC, Atlanta). Statistical analysis was performed using IBM SPSS Statistics software (v.22.0. Armonk, NY: IBM Corp.).
Quantitative variables were presented either as mean values with standard deviation or as a median value with the interquartile range (IQR). Qualitative variables were presented as absolute and relative frequencies with the corresponding 95% confidence intervals (95% CI). The receiver operating characteristic (ROC) curve analysis was conducted to determine the optimal cutoff values of the quantitative variable density (number of persons per 100 m 2 ), which was used to distinguish between IMD cases and controls.
In the univariate analysis, the Chi-square test or the Fisher's exact test was used to investigate the associations between the qualitative variables. The Chi-square test for trend was used to assess any dose response relationship between the ordinal factors (e.g., number of cigarettes per day) and IMD. The Mann-Whitney test was used to explore differences between IMD cases and controls with regards to quantitative non-normally distributed variables.
In the multivariate analysis, multiple logistic regression analysis was performed, using the backward stepwise conditional method with a removal criterion of p-value equal to or greater than 0.10, inorder to identify the independent risk factors for the onset of IMD by calculating the odds ratios (ORs) and the corresponding 95% CI. The dependent variable was IMD and as independent variables were used all the statistically significant risk factors found in the univariate analysis together with age and gender were used as independent variables. For the genetic analysis of MBL, the necessary sample size was calculated to be 200 children (100 per group) assuming a study power of 0.80, alpha 0.05, and considering 10.0% of low MBL in controls to identify a significant OR 3.00 (cases vs. controls). A result with a p-value <0.05 was considered to be statistically significant.
Participant characteristics
During the initial study period, 50 cases of IMD were hospitalized in both hospitals. In two cases, the diagnosis was not confirmed and the guardians of four patients (all females) refused to participate in the study. Overall, 44 cases of IMD participated in the study (response rate: 91.1%), of whom 27 (61.3%) were males and 17 (38.7%) females. The median age was 3 years (IQR: 1–4 years). Regarding race, 93.2% of them were white, 2.3% black, and 4.5% were from an other racial background. Also, 18 IMD cases (~41%) were second-generation immigrants, of whom 13 (~72%) were born in Greece while their parents were from Albania. Seven patients presented with septicemia, nine had meningitis and 28 patients had both. Sixteen patients were diagnosed using PCR and 28 through culture tests. The majority of cases were caused by group B meningococcus (36/44), one patient had group A and one had group W, while meningococcus was nongroupable in six patients. Furthermore, 23 out of 40 (57.5%) cases of IMD had been vaccinated with meningococcal meningitis C vaccine.
Regarding the controls, 99 patients were initially recorded but only 89 controls participated in the study (response rate: 89.9%). Out of these 89 controls, 65 (73%) were males and 24 (27%) were females, while 98.8%were white, 0% were black and 1.2% had another racial background. The median age of controls was 3 years (IQR: 1–5 years). Approximately 24% of the controls were second-generation immigrants.
Univariate analysis
All statistically significant risk factors, according to the univariate analysis, are presented in Table 1 . None of the other parameters included in the questionnaire, including gender, age, race, pets, normal birth, breastfeeding insurance status, annual family income, parental education level, maternal smoking, maternal cigarettes consumption per day, fever, cough, headache, vaccination status for N . meningitidis type C, or parental occupation, were found to have any statistically significant association with the occurrence of IMD. Moreover, factors such as the use of antibiotics, participation in sports activities, participation in parties, or kissing other people during the previous month were not statistically significant. Finally, no statistically significant difference was found in the serum protein levels of MBL between cases and controls.
- PPT PowerPoint slide
- PNG larger image
- TIFF original image
https://doi.org/10.1371/journal.pone.0158524.t001
As shown in Table 1 , a change in life setting in the previous month (vacation or relocation) was identified as a risk factor (p = 0.010). The most frequent answer within the positive responses in the relocation/vacation question among cases, was vacation to another town within Greece (60%) or abroad (20%) for ≥5days. Moreover, 20% reported a relocation without specifying whether it was in the same town or in another town. Finally, it should be noted that most of the vacations were to celebrate Easter, or to visit relatives and friends in the migrants’ country of origin (mainly Albania).
Parental smoking (p = 0.030, OR = 2.48) and paternal smoking (p = 0.004, OR = 3.19) were identified to be associated with IMD, although the difference for maternal smoking was not statistically significant (p = 0.150, OR = 1.74). Moreover, for fathers a dose-response was revealed between number of cigarettes per day and therisk of IDM. The OR for smoking ≥20 cigarettes per day was 3.32 (95% C.I.: 1.46–7.58), compared to 2.56 (95% C.I.: 0.62–10.53) for smoking 1–19 cigarettes per day. The mean number of cigarettes per day smoked by the fathers was double of that of the mothers (20 vs. 10).
Multivariate analysis
As shown in Table 2 , the multivariate analysis revealed the following independent risk factors: relocation or vacation during the last month (OR = 7.16; 95% CI: 1.80–28.50), paternal smoking (OR = 4.51; 95% CI: 1.60–12.67), recent history (past 30 days) of viral respiratory infection (OR = 3.04; 95% CI: 1.17–7.91) and crowd density at home (OR = 3.16; 95% CI: 1.16–8.60).
https://doi.org/10.1371/journal.pone.0158524.t002
MBL2 haplotype analysis
According to the analysis of the MBL polymorphisms in exon 1 and in the promoter region, the patients were classified into three groups of high (HYA / HYA, HYA / LYA, LYA / LYA, and LYA / LXA), moderate (LXA / LXA, HYA / O and LYA / O) and low (LXA / O and O / O) functionality of the MBL2 gene, except for one sample. The low functionality of MBL2 , was observed in 18.8% of the patients and in 10.1% of the control group, but this difference was not statistically significant (p = 0.189) ( Table 3 ).
https://doi.org/10.1371/journal.pone.0158524.t003
This study revealed that a recent history of relocation or holiday, paternal smoking, a recent history of viral upper respiratory tract infection and crowded home conditions were independent risk factors for IMD. One interesting and possibly important novel finding of our study was the association between a change in the life setting either by relocation or vacation in the previous month with IMD (OR = 7.16). During these trips, extensive social activities are expected together with changes in pharyngeal flora as indicated by previous studies [ 26 ]. The precise mechanism by which travelling for a vacation could be a risk factor is unknown, but a recent case-crossover study reported that travelling abroad was independently associated with meningococcal carriage [ 27 ]. Relocation/vacation activities could be linked with changes in the flora of the nasopharynx as a result of environmental changes, including colonization by strains of N . meningitidis . It is known that the risk of invasive IMD is higher in newly colonized people [ 28 ]. This finding could have implications related to the systematic vaccination of unvaccinated young people who will be travelling or relocating. This prevention activity has a number of practical limitations related to the cost of the vaccine, the need for booster doses and the time needed to achieve protective immunity. Moreover, as reported by Tzanakaki et al., the suggested coverage of the 4CmenB vaccine would be 89% in Greece [ 29 ].
Our findings are in line with well-known risk factors for IMD, such as a history of viral respiratory tract infections [ 30 ], and parental passive smoking. A stronger association between paternal smoking together with a dose–response effect has been identified previously, while in a previous study maternal smoking was identified as more important risk factor [ 31 ]. The most plausible explanation for our findings was the fact that the fathers in our study smoked a higher average number (20 per day) of cigarettes compared to mothers (10 per day), while both parents contribute almost equally to child care. A meta-analysis found a significant association, between passive smoking and IMD in children [ 32 ]. In addition, a positive, statistically significant association between passive smoking and the carriage of N . meningitidis has also been documented in previous studies. It has been suggested that this increased risk may be attributed to the increased ability of the bacteria to adhere to mucosa in the presence of smoke [ 33 ].
Finally, multiple logistic regression analysis has revealed that the crowd density in the house (number of persons per 100 square meters of the house) was an independent risk factor for IMD. It should be noted that a nationwide population-based case-control study among preschool children reported that the risk of IMD increased with an increasing household density [ 34 ]. A number of studies have demonstrated a positive correlation between low socio-economic status and the risk of invasive meningococcal infections [ 17 – 20 ]. For this purpose, increased population density in the household has been considered as an indirect indicator of low socio-economic status.
The present study also attempted to explore the correlation between a particular genotype of the MBL2 gene and its predisposition to IMD. The patients were classified into three groups of high, moderate and low functionality of the MBL2 gene, based on the combination of polymorphisms in exon 1 and in the region of the promoter. Patients mostly had the low functionality genotype, compared to healthy controls ( Table 3 , 18.8% vs. 10.1%, p = 0.189). The difference was not statistically significant, probably because of the relatively small number of cases and controls, which resulted in an underpowered study. Several studies have reported different results regarding the association between the MBL protein and IMD. Summerfield et al. [ 34 ] were the first to report the possible link between polymorphisms of the MBL2 gene and the appearance of unusual and severe infections in adults. Moreover, the correlation between IMD and MBL2 polymorphisms was found in a case-control study by Hibberd et al. [ 23 ]. This study revealed a significantly higher frequency of homozygosity and heterozygosity in patients with IMD compared to healthy controls, which was associated with an increased risk for IMD [ 23 ]. The authors estimated that approximately 32% of IMD cases might be attributed to MBL2 polymorphisms. Other studies supported these findings [ 24 – 25 ]. However, another case-control study including 5500 Europeans (296 case and 5196 controls) questioned the above findings and the authors concluded that there was no correlation between MBL2 polymorphisms and IMD [ 35 ].
Our results are subject to several limitations, with the most significant being the limited number of participants. The study power for the effect of MBL was estimated at 0.31,which was low. Thus, the fact that we did not identify a statistically significant difference does not mean that one does not exist in reality. Another limitation was related to the case-control design. Given the absence of the criterion for a temporal association in the case-control studies, we cannot claim that the associations observed in the present study between several risk factors and the outcome are causative. Moreover, our case-control study design is prone to information bias. In particular, the parents of the cases may report (recall) several exposures more readily than controls. However, on the other hand, it is not likely that parents of cases over-reported their smoking habits, and consequently, the exposure of their children to passive smoking. On the contrary, it would be expected that parents would under-report their smoking activities, which could lead to an underestimation of the impact of parental smoking on the risk associated with IMD. A further limitation of our study was that symptoms that were suggestive of an infection of the upper respiratory tract were based on self-reports, without the implementation of serological tests for the detection of viral antigens. Finally, we matched cases to controls by age and thus we lost the opportunity to verify the age as a risk factor. However, as age is a well-known strong confounder, we preferred to control for age to be able to study the other possible risk factors.
In conclusion, our case-control study indicated that paternal smoking, a recent history of upper respiratory tract viral infection, crowded households and recent relocation/vacation activities were independent risk factors for IMD. Additional studies are needed to explore in detail the role of relocation or holidays as risk factors for IMD and to assess the actual risk posed. Preventive activities aimed at reducing parental smoking and other risk factors could decrease the risk of IMD among children in Greece.
Author Contributions
Conceived and designed the experiments: CH VV MT. Performed the experiments: VV KT. Analyzed the data: CH AK. Contributed reagents/materials/analysis tools: GT VS. Wrote the paper: CH GM GR. Provided expertise and editing: GR. All the authors provided constructive comments and approved the final version of the manuscript.
- 1. European Centre for Disease Prevention and Control. Meningococcal disease. Available: http://ecdc.europa.eu/en/healthtopics/meningococcal/Pages/index.aspx . Accessed: 7 April 2016.
- View Article
- Google Scholar
- 3. World Health Organization. Meningitis outbreak response in sub-Saharan Africa—WHO guidance. Available: http://apps.who.int/iris/bitstream/10665/144727/1/WHO_HSE_PED_CED_14.5_eng.pdf?ua=1&ua=1 . Accessed 7 April 2016.
- PubMed/NCBI
- 6. World Health Organization. Meningococcal meningitis Fact sheet N°141. Available: http://www.who.int/mediacentre/factsheets/fs141/en/ . Accessed 20 January 2016.
- 12. World Health Organization. Control of epidemic meningococcal disease. WHO practical guidelines. 2nd edition. WHO/EMC/BAC/98/3/EN. (1998) Available: http://www.who.int/csr/resources/publications/meningitis/whoemcbac983.pdf . Accessed 20 January 2016.
Case Control Studies
Affiliations.
- 1 University of Nebraska Medical Center
- 2 Spectrum Health/Michigan State University College of Human Medicine
- PMID: 28846237
- Bookshelf ID: NBK448143
A case-control study is a type of observational study commonly used to look at factors associated with diseases or outcomes. The case-control study starts with a group of cases, which are the individuals who have the outcome of interest. The researcher then tries to construct a second group of individuals called the controls, who are similar to the case individuals but do not have the outcome of interest. The researcher then looks at historical factors to identify if some exposure(s) is/are found more commonly in the cases than the controls. If the exposure is found more commonly in the cases than in the controls, the researcher can hypothesize that the exposure may be linked to the outcome of interest.
For example, a researcher may want to look at the rare cancer Kaposi's sarcoma. The researcher would find a group of individuals with Kaposi's sarcoma (the cases) and compare them to a group of patients who are similar to the cases in most ways but do not have Kaposi's sarcoma (controls). The researcher could then ask about various exposures to see if any exposure is more common in those with Kaposi's sarcoma (the cases) than those without Kaposi's sarcoma (the controls). The researcher might find that those with Kaposi's sarcoma are more likely to have HIV, and thus conclude that HIV may be a risk factor for the development of Kaposi's sarcoma.
There are many advantages to case-control studies. First, the case-control approach allows for the study of rare diseases. If a disease occurs very infrequently, one would have to follow a large group of people for a long period of time to accrue enough incident cases to study. Such use of resources may be impractical, so a case-control study can be useful for identifying current cases and evaluating historical associated factors. For example, if a disease developed in 1 in 1000 people per year (0.001/year) then in ten years one would expect about 10 cases of a disease to exist in a group of 1000 people. If the disease is much rarer, say 1 in 1,000,0000 per year (0.0000001/year) this would require either having to follow 1,000,0000 people for ten years or 1000 people for 1000 years to accrue ten total cases. As it may be impractical to follow 1,000,000 for ten years or to wait 1000 years for recruitment, a case-control study allows for a more feasible approach.
Second, the case-control study design makes it possible to look at multiple risk factors at once. In the example above about Kaposi's sarcoma, the researcher could ask both the cases and controls about exposures to HIV, asbestos, smoking, lead, sunburns, aniline dye, alcohol, herpes, human papillomavirus, or any number of possible exposures to identify those most likely associated with Kaposi's sarcoma.
Case-control studies can also be very helpful when disease outbreaks occur, and potential links and exposures need to be identified. This study mechanism can be commonly seen in food-related disease outbreaks associated with contaminated products, or when rare diseases start to increase in frequency, as has been seen with measles in recent years.
Because of these advantages, case-control studies are commonly used as one of the first studies to build evidence of an association between exposure and an event or disease.
In a case-control study, the investigator can include unequal numbers of cases with controls such as 2:1 or 4:1 to increase the power of the study.
Disadvantages and Limitations
The most commonly cited disadvantage in case-control studies is the potential for recall bias. Recall bias in a case-control study is the increased likelihood that those with the outcome will recall and report exposures compared to those without the outcome. In other words, even if both groups had exactly the same exposures, the participants in the cases group may report the exposure more often than the controls do. Recall bias may lead to concluding that there are associations between exposure and disease that do not, in fact, exist. It is due to subjects' imperfect memories of past exposures. If people with Kaposi's sarcoma are asked about exposure and history (e.g., HIV, asbestos, smoking, lead, sunburn, aniline dye, alcohol, herpes, human papillomavirus), the individuals with the disease are more likely to think harder about these exposures and recall having some of the exposures that the healthy controls.
Case-control studies, due to their typically retrospective nature, can be used to establish a correlation between exposures and outcomes, but cannot establish causation . These studies simply attempt to find correlations between past events and the current state.
When designing a case-control study, the researcher must find an appropriate control group. Ideally, the case group (those with the outcome) and the control group (those without the outcome) will have almost the same characteristics, such as age, gender, overall health status, and other factors. The two groups should have similar histories and live in similar environments. If, for example, our cases of Kaposi's sarcoma came from across the country but our controls were only chosen from a small community in northern latitudes where people rarely go outside or get sunburns, asking about sunburn may not be a valid exposure to investigate. Similarly, if all of the cases of Kaposi's sarcoma were found to come from a small community outside a battery factory with high levels of lead in the environment, then controls from across the country with minimal lead exposure would not provide an appropriate control group. The investigator must put a great deal of effort into creating a proper control group to bolster the strength of the case-control study as well as enhance their ability to find true and valid potential correlations between exposures and disease states.
Similarly, the researcher must recognize the potential for failing to identify confounding variables or exposures, introducing the possibility of confounding bias, which occurs when a variable that is not being accounted for that has a relationship with both the exposure and outcome. This can cause us to accidentally be studying something we are not accounting for but that may be systematically different between the groups.
Copyright © 2023, StatPearls Publishing LLC.
- Introduction
- Issues of Concern
- Clinical Significance
- Enhancing Healthcare Team Outcomes
- Review Questions
Publication types
- Study Guide

IMAGES
VIDEO
COMMENTS
Knowledge Base Methodology What Is a Case-Control Study? | Definition & Examples What Is a Case-Control Study? | Definition & Examples Published on February 4, 2023 by Tegan George . Revised on June 22, 2023.
Sounak Chakraborty, PhD DOI: https://doi.org/10.1016/j.chest.2020.03.009 A Practical Overview of Case-Control Studies in Clinical Practice Case-control studies are one of the major observational study designs for performing clinical research.
As an actual example of a case-control study, children with autism spectrum disorder (ASD) may be compared with normally developing children to determine whether a history of maternal antidepressant use during pregnancy is more frequent among cases than among controls; if it is, and if the association remains statistically significant after adju...
For example, a researcher may want to look at the rare cancer Kaposi's sarcoma. The researcher would find a group of individuals with Kaposi's sarcoma (the cases) and compare them to a group of patients who are similar to the cases in most ways but do not have Kaposi's sarcoma (controls).
The main advantages of a nested case-control study are as follows: (1) cost reduction and effort minimization, as only a fraction of the parent cohort requires the necessary outcome assessment; (2) reduced selection bias, as both case and control subjects are sampled from the same population; and (3) flexibility in analysis by allowing testing of a hypotheses in the future that is not ...
32658653 10.1016/j.chest.2020.03.009 Case-control studies are one of the major observational study designs for performing clinical research. The advantages of these study designs over other study designs are that they are relatively quick to perform, economical, and easy to design and implement.
This article describes several types of case-control designs, with simple graphical displays to help understand their differences. Study design considerations are reviewed, including sample size, power, and measures associated with risk factors for clinical outcomes.
Of the four main types of case-control studies, we will focus on the basic case-control study and the nested case-control study. Other types include the case-cohort study and the case-crossover study which are discussed elsewhere. 9 In a nested case-control study, the case-control study is embedded within a cohort of patients, and cases and controls are both selected from the same cohort.
Abstract Case-control studies are a type of observational epidemiological study that involve comparing two groups of individuals; one group with a defined outcome and the other without (normal). By doing this, one can look back in time to analyze the possible factors that may have contributed to the development of that outcome.
The following example has been fabricated to review the design of case-control study and is summarized in Fig. 37.2.The interventional radiology department at Hospital Y performed a single institution retrospective case-control study in an effort to better understand the impact of uterine leiomyomas on patients' reproductive results from June 2009 to June 2019.
An example of (1) would be a study of endophthalmitis following ocular surgery. When an outbreak is in progress, answers must be obtained quickly. An example of (2) would be a study of risk factors for uveal melanoma, or corneal ulcers.
Abstract. Case-Control study design is a type of observational study. In this design, participants are selected for the study based on their outcome status. Thus, some participants have the outcome of interest (referred to as cases), whereas others do not have the outcome of interest (referred to as controls). The investigator then assesses the ...
Cohort studies and case-control studies are two primary types of observational studies that aid in evaluating associations between diseases and exposures. In this review article, we describe these study designs, methodological issues, and provide examples from the plastic surgery literature. Keywords: observational studies, case-control study ...
8 Citations 7 Altmetric Metrics Abstract Background Healthcare Workers (HCW) are repeatedly exposed to SARS-CoV-2 infection. The aim of this study was to identify factors associated with SARS-CoV-2 infection among HCW in one of the largest cities in Colombia. Methods
Case Control. In a Case-Control study there are two groups of people: one has a health issue (Case group), and this group is "matched" to a Control group without the health issue based on characteristics like age, gender, occupation. In this study type, we can look back in the patient's histories to look for exposure to risk factors that ...
Use of control (comparison) groups is a powerful research tool. In case-control studies, controls estimate the frequency of an exposure in the population under study. Controls can be taken from known or unknown study populations. A known group consists of a defined population observed over a period, such as passengers on a cruise ship. When the study group is known, a sample of the population ...
Abstract. A pancreatic cancer case-control study was conducted in the Minneapolis-St. Paul area. Family members were interviewed about the subject's usage of cigarettes, alcohol, coffee, and other dietary factors in the two years prior to death (cases, n = 212) or prior to interview (controls, n = 220). The adjusted odds ratio for two packs or ...
The purpose of this article is to present in elementary mathematical and statistical terms a simple way to quickly and effectively teach and understand case-control studies, as they are commonly done in dynamic populations-without using the rare disease assumption. Our focus is on case-control studi …
A nested case-control study is an efficient design that can be embedded within an existing cohort study or randomised trial. It has a number of advantages compared to the conventional case-control design, and has the potential to answer important research questions using untapped prospectively collected data. We demonstrate the utility of the matched nested case-control design by applying it ...
Abstract Purpose The aim of this study was to identify environmental or genetic risk factors that are associated with invasive meningococcal disease (IMD) in children in Greece. Methods A case-control study was performed in 133 children (44 cases and 89 controls) aged between 0-14 years, who were hospitalized in a children's hospital in Athens.
For example, if obtaining exposure information is difficult or costly (e.g., if it involves lengthy interviews or collection of serum samples), then it may be more efficient to conduct a prevalence case-control study by obtaining exposure information on all of the prevalent cases of disease and a sample of controls selected at random from the ...
A case study is one of the most commonly used methodologies of social research. This article attempts to look into the various dimensions of a case study research strategy, the different epistemological strands which determine the particular case study type and approach adopted in the field, discusses the factors which can enhance the effectiveness of a case study research, and the debate ...
Step 5: Write the text. With a well-defined thesis statement, it's time to start composing your case study assignment example. Organize your paper into key sections, including the introduction, body, and conclusion. Continuing with our example of sustainable urban planning, let's explore this in more detail. Imagine you are focusing on the ...
For example, a researcher may want to look at the rare cancer Kaposi's sarcoma. The researcher would find a group of individuals with Kaposi's sarcoma (the cases) and compare them to a group of patients who are similar to the cases in most ways but do not have Kaposi's sarcoma (controls).