1 option
Reproducible, Generalizable, and Scalable Analytic Software for Large Neuroimaging Datasets / Chenying Zhao.
- Format:
- Book
- Thesis/Dissertation
- Author/Creator:
- Zhao, Chenying, author.
- Language:
- English
- Subjects (All):
- Bioengineering.
- Bioinformatics.
- Neurosciences.
- Biostatistics.
- Medical imaging.
- Bioengineering--Penn dissertations.
- Penn dissertations--Bioengineering.
- Local Subjects:
- Bioengineering.
- Bioinformatics.
- Neurosciences.
- Biostatistics.
- Medical imaging.
- Bioengineering--Penn dissertations.
- Penn dissertations--Bioengineering.
- Physical Description:
- 1 online resource (117 pages)
- Distribution:
- Ann Arbor : ProQuest Dissertations & Theses, 2023
- Contained In:
- Dissertations Abstracts International 85-08B.
- Place of Publication:
- [Philadelphia, Pennsylvania] : University of Pennsylvania, 2022.
- Language Note:
- English
- Summary:
- Neuroimaging using Magnetic Resonance Imaging (MRI) has evolved to become one of the primary methods for understanding human brain structure and function. However, recently there have been numerous reports that together constitute a crisis of reproducibility in human neuroimaging studies. This problem can be particularly acute in large and complex neuroimaging datasets. Although researchers start to adopt standards such as Brain Imaging Data Structure (BIDS) and BIDS Apps, tools that facilitate reproducible research with large-scale datasets remain nascent. The overall goal of this thesis was to develop reproducible, generalizable, and scalable analytic software for large neuroimaging data resources. This effort yielded two novel software packages: BIDS App Bootstrap (BABS) and ModelArray. BABS is a user-friendly Python package that provides a reproducible and generalizable workflow for large-scale image processing using BIDS Apps. BABS automatically records the full audit trail of the image processing by utilizing the data version control tool DataLad and adopting the FAIRly big framework. BABS is scalable for large datasets, and supports job management at scale on high performance computing (HPC) clusters. BABS is also generalizable to different use cases, including different BIDS datasets and BIDS Apps. The user-friendly interface of BABS facilitates its application by general users. The second software package in this thesis, ModelArray, is an R package for memory-efficient and generalizable statistical analysis of large-scale datasets. Its memory efficiency allows it to be applied to large datasets even on local computers with limited resources. ModelArray supports mass-univariate statistical analysis using linear models and nonlinear, generalized additive models (GAMs). Diverse statistical models available in R can be incorporated in ModelArray by leveraging its extensibility. Furthermore, ModelArray provides a consistent workflow for large datasets with different data types, including fixel-wise, voxel-wise, and surface data, with generalizability to other data types. In addition to open-source code, ModelArray is released as a Docker container, which facilitates portability and reproducible statistical analysis. Taken together, the generalizable tools developed in this thesis facilitate reproducible neuroimaging research at scale.
- Notes:
- Source: Dissertations Abstracts International, Volume: 85-08, Section: B.
- Advisors: Satterthwaite, Theodore D.; Committee members: Cohen, Yale E.; Shinohara, Russell T.; Davis, Kathryn A.
- Department: Bioengineering.
- Ph.D. University of Pennsylvania 2023.
- Local Notes:
- School code: 0175
- ISBN:
- 9798381509847
- Access Restriction:
- Restricted for use by site license.
The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.