AI ACCELERATED COLLISIONAL CROSS SECTION PREDICTION FOR HIGH THROUGHPUT METABOLITE IDENTIFICATION
Metabolomics refers to the collective characterization of small organic molecules in a biological sample. While instrumentation and software continues to improve for metabolomics studies, the fraction of annotated signals in untargeted metabolomics experiments remain small. Translating features to metabolite identities present a major bottleneck, confounded by the lack of authentic standards to build comprehensive experimental databases. I illustrate the development of collisional cross section (CCS) prediction methods through deduction from theory and induction from available data. The theoretical CCS prediction involves multistep modeling of conformational ensemble followed by simulation of ion mobility. The advanced computational chemistry operations were automated using the AutoGraph conformational clustering protocol and implementation of the workflow in Snakemake. In a complementary approach, I applied a graph convolutional deep Bayesian neural net to predict CCS values and their uncertainty values. The quantified uncertainty was used to guide ab initio prediction of CCS values in an active learning strategy. The developed methodologies lay the foundation to a continuously refining in silico CCS library to aid in metabolite annotation.
Read
- In Collections
-
Electronic Theses & Dissertations
- Copyright Status
- Attribution-ShareAlike 4.0 International
- Material Type
-
Theses
- Authors
-
Tanemura, Kiyoto Aramis
- Thesis Advisors
-
Merz, Kenneth M.
- Committee Members
-
Wilson, Angela K.
Hong, Heedeok
Wei, Guowei
- Date
- 2022
- Subjects
-
Chemistry
- Program of Study
-
Chemistry - Doctor of Philosophy
- Degree Level
-
Doctoral
- Language
-
English
- Pages
- 110 pages
- Permalink
- https://doi.org/doi:10.25335/72hc-2g02