2 options
Identifying complex trait-related genes via regulation-informed gene-based analyses / Binglan Li.
- Format:
- Book
- Thesis/Dissertation
- Author/Creator:
- Li, Binglan, author.
- Language:
- English
- Subjects (All):
- Bioinformatics.
- Genomics and computational biology--Penn dissertations.
- Penn dissertations--Genomics and computational biology.
- Local Subjects:
- Bioinformatics.
- Genomics and computational biology--Penn dissertations.
- Penn dissertations--Genomics and computational biology.
- Genre:
- Academic theses.
- Physical Description:
- 1 online resource (298 pages)
- Contained In:
- Dissertations Abstracts International 82-07B.
- Place of Publication:
- [Philadelphia, Pennsylvania] : University of Pennsylvania ; Ann Arbor : ProQuest Dissertations & Theses, 2020.
- Language Note:
- English
- System Details:
- Mode of access: World Wide Web.
- text file
- Summary:
- While our understanding of dysregulated genes is essential for improvement of clinical care, the majority of complex trait-associated genetic variants (>90%) are located in noncoding regions of the human genome. Also connecting noncoding genetic variants to downstream affected genes is challenging. On the other hand, noncoding elements can regulate genes. Regulatory elements such as expression quantitative trait loci (eQTLs) provides a potential means to link noncoding genetic variants to affected genes and to explore complex disease mechanisms.Transcriptome-wide association studies (TWAS) is a popular algorithm that exploits eQTLs to prioritize transcriptionally regulated genes from genome-wide association studies (GWAS). Transcriptional regulation is tissue-specific. However, it was unclear how biological properties of eQTLs and gene expression levels will affect the power of different TWAS methods. To answer this question, I designed and developed a novel data simulation framework that efficiently simulates variant, gene, and disease data according to designed relationships across multiple tissues simultaneously. The simulation showed that TWAS performance differed for tissue-specific genes and for genes that were expressed across all tissues. Thus, I put forth a tissue specificity-aware TWAS (TSA-TWAS) framework, validated its utility in clinical trials data, and promoted further suggestions for future TWAS regarding varied scenarios.Centralized biobanks, such as Penn Medicine Biobank (PMBB), and Electronic Medical Records and Genomics (eMERGE) network, have collected a plethora of biospecimen and disease diagnosis; and recruited participants of varied genetic ancestries. However, it is not clear how disease susceptibility genes are like for different genetic ancestries and categories of diseases. Based on the simulation of the thesis part one, I designed a framework that applies TWAS and other data integrative methods on multi-ancestry EHR-linked biobanks to identify ancestry-specific and cross-ancestry gene-disease associations under a discovery (eMERGE III network) and replication (PMBB) study design. This study characterized a multi-ancestry gene-disease connection landscape. This thesis contributes (1) a novel multi-tissue variant-gene-trait simulation framework, comprehensive evaluation of TWAS and (2) a multi-ancestry gene-disease connection landscape. Together, the thesis helps improve the understanding of genetically regulated genes underlying complex diseases and promote translation of basic science discoveries to clinical health care.
- Notes:
- Source: Dissertations Abstracts International, Volume: 82-07, Section: B.
- Advisors: Ritchie, Marylyn D.; Committee members: Christopher Brown; Yoseph Barash; Hae Kyung Im; Scott Damrauer.
- Department: Genomics and Computational Biology.
- Ph.D. University of Pennsylvania 2020.
- Local Notes:
- School code: 0175
- ISBN:
- 9798557060127
- Access Restriction:
- Restricted for use by site license.
- This item is not available from ProQuest Dissertations & Theses.
- This item must not be sold to any third party vendors.
The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.