Logo image
Leveraging hierarchical population structure in discrete association studies
Journal article   Open access   Peer reviewed

Leveraging hierarchical population structure in discrete association studies

P. Awadalla, J. Carlson, C. Kadie, S. Mallal and D. Heckerman
PloS one, Vol.2(7), pp.1-13
2007
pdf
leveraging_hierarchical_population.pdfDownloadView
Published (Version of Record) Open Access
url
Free to Read *No subscription requiredView

Abstract

Population structure can confound the identification of correlations in biological data. Such confounding has been recognized in multiple biological disciplines, resulting in a disparate collection of proposed solutions. We examine several methods that correct for confounding on discrete data with hierarchical population structure and identify two distinct confounding processes, which we call coevolution and conditional influence. We describe these processes in terms of generative models and show that these generative models can be used to correct for the confounding effects. Finally, we apply the models to three applications: identification of escape mutations in HIV-1 in response to specific HLA-mediated immune pressure, prediction of coevolving residues in an HIV-1 peptide, and a search for genotypes that are associated with bacterial resistance traits in Arabidopsis thaliana. We show that coevolution is a better description of confounding in some applications and conditional influence is better in others. That is, we show that no single method is best for addressing all forms of confounding. Analysis tools based on these models are available on the internet as both web based applications and downloadable source code at http://atom.research.microsoft.com/bio/p​hylod.aspx

Details

UN Sustainable Development Goals (SDGs)

This output has contributed to the advancement of the following goals:

#3 Good Health and Well-Being

Source: InCites

Metrics

273 File views/ downloads
75 Record Views

InCites Highlights

These are selected metrics from InCites Benchmarking & Analytics tool, related to this output

Collaboration types
Industry collaboration
Domestic collaboration
International collaboration
Citation topics
1 Clinical & Life Sciences
1.66 HIV
1.66.46 HIV Pathogenesis
Web Of Science research areas
Genetics & Heredity
ESI research areas
Molecular Biology & Genetics
Logo image