Abstract
Beyond calculating parameter estimates to characterize the distribution of genetic features of populations (frequencies of mutations in various regions of the genome, allele frequencies, measures of Hardy–Weinberg disequilibrium), genetic epidemiology aims to identify correlations between genetic variants and phenotypic traits, with considerable emphasis placed on finding genetic variants that increase susceptibility to disease and disease-related traits. However, determining correlation alone does not suffice: genetic variants common in an isolated ethnic group with a high burden of a given disease may show relatively high correlation with disease but, as markers of ethnicity, these may not necessarily have any functional role in disease. To establish a causal relationship between genetic variants and disease (or disease-related traits), proper statistical analyses of human data must incorporate epidemiologic approaches to examining sets of families or unrelated individuals with information available on individuals’ disease status or related traits.
Through different analytical approaches, statistical analysis of human data can answer several important questions about the relationship between genes and disease:
-
1.
Does the disease tend to cluster in families more than expected by chance alone?
-
2.
Does the disease appear to follow a particular genetic model of transmission in families?
-
3.
Do variants at a particular genetic marker tend to cosegregate with disease in families?
-
4.
Do specific genetic markers tend to be carried more frequently by those with disease than by those without, in a given population (or across families)?
The first question can be examined using studies of familial aggregation or correlation. An ancillary question: “how much of the susceptibility to disease (or variation in disease-related traits) might be accounted for by genetic factors?” is typically answered by estimating heritability, the proportion of disease susceptibility or trait variation attributable to genetics. The second question can be formally tested using pedigrees for which disease affection status or trait values are available through a modeling approach known as segregation analysis. The third question can be answered with data on pedigrees with affected members and genotype information at markers of interest, using linkage analysis. The fourth question is answerable using genotype information at markers on unrelated affected and unaffected individuals and/or families with affected and unaffected members. All of these questions can also be explored for quantitative (or continuously distributed) traits by examining variation in trait values between family members or between unrelated individuals. While each of these questions and the analytical approaches for answering them is explored extensively in subsequent chapters (heritability in Chapters 9 and 10, segregation in Chapter 12, linkage in Chapters 13–17, and association in Chapters 18–21 and 23), this chapter focuses on statistical methods to answer questions of familial aggregation.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
del Junco D, et al (1984) The familial aggregation of rheumatoid arthritis and its relationship to the HLA-DR4 association. Am J Epidemiol 119: 813–829
Nestadt G, et al (2000) A family study of obsessive-compulsive disorder. Arch Gen Psychiatry 57: 358–363
Peretz I, Cummings S, Dube MP (2007) The genetics of congenital amusia (tone deafness): a family-aggregation study. Am J Hum Genet 81: 582–588
Rice TK (2008) Familial resemblance and heritability. Adv Genet 60: 35–49
Visscher PM, Hill WG, Wray NR (2008) Heritability in the genomics era--concepts and misconceptions. Nat Rev Genet 9: 255–266
Naldi L, et al (2001) Family history of psoriasis, stressful life events, and recent infectious disease are risk factors for a first episode of acute guttate psoriasis: results of a case-control study. J Am Acad Dermatol 44: 433–438
Mantel N, Haenszel W (1959) Statistical aspects of the analysis of data from retrospective studies of disease. J Natl Cancer Inst 22: 719–748
Ponz de Leon M, et al (1989) Familial aggregation of tumors in the three-year experience of a population-based colorectal cancer registry. Cancer Res 49: 4344–4348
Hopper JL, Hannah MC, Mathews JD (1984) Genetic Analysis Workshop II: pedigree analysis of a binary trait without assuming an underlying liability. Genet Epidemiol 1: 183–188
Connolly MA, Liang KY (1988) Conditional logistic regression models for correlated binary data. Biometrika 75: 501–506
Hopper JL, Derrick PL (1986) A log-linear model for binary pedigree data. Genet Epidemiol Suppl 1: 73–82
Khoury MJ, Beaty TH, Cohen BH (1993) Fundamentals of genetic epidemiology. Monographs in epidemiology and biostatics v. 19. Oxford University Press: New York
Liang KY, Beaty TH (1991) Measuring familial aggregation by using odds-ratio regression models. Genet Epidemiol 8: 361–370
Bishop YMM, Fienberg SE, Holland PW (1975) Discrete multivariate analysis : theory and practice. MIT Press: Cambridge, MA
Cohen BH (1980) Chronic obstructive pulmonary disease: a challenge in genetic epidemiology. Am J Epidemiol 112: 274–288
Schwartz AG, Boehnke M, Moll PP (1988) Family risk index as a measure of familial heterogeneity of cancer risk. A population-based study in metropolitan Detroit. Am J Epidemiol 128: 524–535
Breslow NE, Day NE (1980) Statistical methods in cancer research. IARC scientific publications. International Agency for Research on Cancer: Lyon
Beaty TH, et al (1997) Testing for interaction between maternal smoking and TGFA genotype among oral cleft cases born in Maryland 1992–1996. Cleft Palate Craniofac J 34: 447–454
Seibold MA, Schwartz DA (2010) The Lung: The Natural Boundary Between Nature And Nurture. Annu Rev Physiol 73: 457–478
Garantziotis S, Schwartz DA (2010) Ecogenomics of respiratory diseases of public health significance. Annu Rev Public Health. 31: 37–51.
Demeo DL, et al (2007) Determinants of airflow obstruction in severe alpha-1-antitrypsin deficiency. Thorax 62: 806–813
Pare G, et al (2010) On the use of variance per genotype as a tool to identify quantitative trait interaction effects: a report from the Women’s Genome Health Study. PLoS Genet 6: e1000981
Murcray CE, Lewinger JP, Gauderman WJ (2009) Gene-environment interaction in genome-wide association studies. Am J Epidemiol 169: 219–226.
Hwang SJ, et al (1994) Minimum sample size estimation to detect gene-environment interaction in case-control designs. Am J Epidemiol 140: 1029–1037
Foppa I, Spiegelman D (1997) Power and sample size calculations for case-control studies of gene-environment interactions with a polytomous exposure variable. Am J Epidemiol 146: 596–604
Garcia-Closas M, Lubin JH (1999) Power and sample size calculations in case-control studies of gene-environment interactions: comments on different approaches. Am J Epidemiol 149: 689–692
Sturmer T, Brenner H (2000) Potential gain in efficiency and power to detect gene-environment interactions by matching in case-control studies. Genet Epidemiol 18: 63–80
Gauderman, WJ (2002) Sample size requirements for association studies of gene-gene interaction. Am J Epidemiol 155: 478–484
Claus EB, Risch NJ, Thompson WD (1990) Age at onset as an indicator of familial risk of breast cancer. Am J Epidemiol 131: 961–972
Mettlin C, et al (1990) The association of age and familial risk in a case-control study of breast cancer. Am J Epidemiol 131: 973–983
Pulver AE, Liang KY (1991) Estimating effects of proband characteristics on familial risk: II. The association between age at onset and familial risk in the Maryland schizophrenia sample. Genet Epidemiol 8: 339–350
Liang KY, Zeger SL (1986) Longitudinal data analysis using generalized linear models. Biometrika 73: 13–22
Liang KY (1987) Extended Mantel-Haenszel estimating procedure for multivariate logistic regression models. Biometrics 43: 289–299
Liang KY, Pulver AE (1996) Analysis of case-control/family sampling design. Genet Epidemiol 13: 253–270
Hsu L, Zhao LP (1996) Assessing familial aggregation of age at onset, by using estimating equations, with application to breast cancer. Am J Hum Genet 58: 1057–1071
Li H, Yang P, Schwartz AG (1998) Analysis of age of onset data from case-control family studies. Biometrics 54: 1030–1039
Liang KY (1991) Estimating effects of probands’ characteristics on familial risk: I. Adjustment for censoring and correlated ages at onset. Genet Epidemiol 8: 329–338
Liang KY, Zeger SL (1993) Regression analysis for correlated data. Annu Rev Public Health 14: 43–68
Maestri NE, et al (1988) Assessing familial aggregation of congenital cardiovascular malformations in case-control studies. Genet Epidemiol 5: 343–354
Khoury MJ, Beaty TH, Liang KY (1988) Can familial aggregation of disease be explained by familial aggregation of environmental risk factors? Am J Epidemiol 127: 674–683
Thomas DC (2004) Statistical methods in genetic epidemiology. Oxford University Press: New York
Bratt O, et al (2010) Effects of prostate-specific antigen testing on familial prostate cancer risk estimates. J Natl Cancer Inst 102: 1336–1343
Beaty TH, et al (1997) Analyzing sibship correlations in birth weight using large sibships from Norway. Genet Epidemiol, 14: 423–433
Wedderburn RWM. (1974) Quasi-likelihood functions, generalized linear models, and the Gauss—Newton method. Biometrika 61: 439–447
Hanrahan LP, et al (1990) SMRFIT: a Statistical Analysis System (SAS) program for standardized mortality ratio analyses and Poisson regression model fits in community disease cluster investigations. Am J Epidemiol, 1990. 132(Suppl): S116-122
Tokuhata GK, Lilienfeld AM (1963) Familial aggregation of lung cancer in humans. J Natl Cancer Inst 30: 289–312
Feinleib M, et al (1977) The NHLBI twin study of cardiovascular disease risk factors: methodology and summary of results. Am J Epidemiol 106: 284–285
Khoury MJ, Erickson JD, James LM (1982) Etiologic heterogeneity of neural tube defects: clues from epidemiology. Am J Epidemiol 115: 538–548
Khoury MJ, Erickson JD, James LM (1982) Etiologic heterogeneity of neural tube defects. II. Clues from family studies. Am J Hum Genet 34: 980–987
ten Kate LP, et al (1982) Familial aggregation of coronary heart disease and its relation to known genetic risk factors. Am J Cardiol 50: 945–953
Sattin RW, et al (1985) Family history and the risk of breast cancer. JAMA 253: 1908–1913
Nielsen HE, et al (1987) Risk factors and sib correlation in physiological neonatal jaundice. Acta Paediatr Scand 76: 504–511
Beaty TH, et al (1988) Effect of maternal and infant covariates on sibship correlation in birth weight. Genet Epidemiol 5: 241–253
Linet MS, et al (1989) Familial cancer history and chronic lymphocytic leukemia. A case-control study. Am J Epidemiol 130: 655–664
Lin JP, et al.(1998) Familial clustering of rheumatoid arthritis with other autoimmune diseases. Hum Genet 103: 475–482
Criswell LA, et al (2005) Analysis of families in the multiple autoimmune disease genetics consortium (MADGC) collection: the PTPN22 620W allele associates with multiple autoimmune phenotypes. Am J Hum Genet 76: 561–571
Beaty TH, et al (2006) Analysis of candidate genes on chromosome 2 in oral cleft case-parent trios from three populations. Hum Genet 120: 501–518
Raynor LA, et al (2009) Familial aggregation of age-related hearing loss in an epidemiological study of older adults. Am J Audiol 18: 114–118
Krogh C, et al (2010) Familial aggregation and heritability of pyloric stenosis. JAMA 303: 2393–2399
Xiong L, et al (2010) Family study of restless legs syndrome in Quebec, Canada: clinical characterization of 671 familial cases. Arch Neurol 67: 617–622
Saito YA, et al (2010) Familial aggregation of irritable bowel syndrome: a family case-control study. Am J Gastroenterol 105: 833–841
Lichtenstein P, et al (2010) The genetics of autism spectrum disorders and related neuropsychiatric disorders in childhood. Am J Psychiatry 167: 1357–1363
Petersen L, Andersen PK, Sorensen TI (2010) Genetic influences on incidence and case-fatality of infectious disease. PLoS One 5: e10603
Liang KY, Beaty TH. (2000) Statistical designs for familial aggregation. Stat Methods Med Res 9: 543–562
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Naj, A.C., Park, Y.S., Beaty, T.H. (2012). Detecting Familial Aggregation. In: Elston, R., Satagopan, J., Sun, S. (eds) Statistical Human Genetics. Methods in Molecular Biology, vol 850. Humana Press. https://doi.org/10.1007/978-1-61779-555-8_8
Download citation
DOI: https://doi.org/10.1007/978-1-61779-555-8_8
Published:
Publisher Name: Humana Press
Print ISBN: 978-1-61779-554-1
Online ISBN: 978-1-61779-555-8
eBook Packages: Springer Protocols