Statistical and computational methods for biological data
This thesis develops machine learning and statistical methods for biological data. In immunotherapy, gene-expression deconvolution is used to quantify the different cell types in a mixed population. It offers a promising way to rapidly characterize the tumor-infiltrating immune landscape and to identify cold cancers. A major challenge, however, is that gene-expression data are frequently contaminated by outliers that reduce estimation accuracy. It is therefore imperative to develop a robust deconvolution method that automatically decontaminates the data by reliably detecting and removing outliers. We developed an algorithm, adaptive Least Trimmed Square (aLTS), that identifies outliers in regression models, allowing us to detect and omit them effectively and to obtain robust estimates of the coefficients. We also provide theoretical results that guarantee convergence and parameter recovery.

Another topic of interest is the association between phenotype responses and the intricate patterns identified in transcription factor binding sites of DNA sequences. To address this problem, we developed a deep learning-based framework. We used convolution and pooling layers to capture regulatory motifs, and position embedding and multi-head self-attention layers to model the long-term dependencies among motifs. We sought to improve the model's overall efficacy by integrating transfer learning and multi-task learning. Finally, we provided interpretations of our DNA quantification model to identify confirmed and novel transcription factor binding motifs (TFBMs), along with the relationships among them.
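The outlier-trimming idea mentioned in the abstract can be illustrated with a minimal sketch of ordinary least trimmed squares, fitted by repeated concentration steps. This is not the thesis's aLTS algorithm: the number of retained observations `h` is fixed by hand here rather than chosen adaptively, and the function name, parameters, and simulated data are all hypothetical choices made for illustration.

```python
import numpy as np


def least_trimmed_squares(X, y, h, n_starts=20, n_iter=50, seed=0):
    """Plain LTS via concentration steps (illustrative; not the thesis's aLTS).

    Keeps the h observations with the smallest squared residuals and
    treats the remaining n - h observations as outliers.
    """
    rng = np.random.default_rng(seed)
    n, p = X.shape
    best_obj, best_beta, best_keep = np.inf, None, None
    for _ in range(n_starts):
        # Start from a random subset of p observations.
        keep = rng.choice(n, size=p, replace=False)
        for _ in range(n_iter):
            # Fit on the current subset, then re-select the h best-fitting points.
            beta, *_ = np.linalg.lstsq(X[keep], y[keep], rcond=None)
            resid2 = (y - X @ beta) ** 2
            new_keep = np.sort(np.argsort(resid2)[:h])
            if np.array_equal(new_keep, np.sort(keep)):
                break  # the retained subset no longer changes
            keep = new_keep
        beta, *_ = np.linalg.lstsq(X[keep], y[keep], rcond=None)
        # Trimmed objective: sum of the h smallest squared residuals.
        obj = np.sort((y - X @ beta) ** 2)[:h].sum()
        if obj < best_obj:
            best_obj, best_beta, best_keep = obj, beta, keep
    outliers = np.setdiff1d(np.arange(n), best_keep)
    return best_beta, outliers


# Simulated example (assumed data): a linear signal with 10 contaminated responses.
rng = np.random.default_rng(1)
X = np.column_stack([np.ones(100), rng.normal(size=100)])
true_beta = np.array([1.0, 2.0])
y = X @ true_beta + 0.1 * rng.normal(size=100)
y[:10] += 20.0  # shift the first 10 responses to act as outliers

beta_hat, outliers = least_trimmed_squares(X, y, h=80)
```

With `h=80`, the sketch flags 20 of the 100 observations as outliers, so it over-trims clean points whenever fewer than 20 are actually contaminated. Choosing the trimming level from the data is the kind of problem an adaptive method is meant to address.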
- In Collections
-
Electronic Theses & Dissertations
- Copyright Status
- In Copyright
- Material Type
-
Theses
- Authors
-
Hao, Yuning
- Thesis Advisors
-
Xie, Yuying
- Committee Members
-
Cui, Yuehua
Hong, Hyokyoung
Yan, Ming
- Date Published
-
2019
- Subjects
-
Diseases--Statistical methods
Biometry
- Program of Study
-
Statistics - Doctor of Philosophy
- Degree Level
-
Doctoral
- Language
-
English
- Pages
- xi, 89 pages
- ISBN
-
9781085696005
1085696006
- Permalink
- https://doi.org/doi:10.25335/5m11-nq97