Fast NCRNA identification techniques
Many noncoding RNAs (ncRNAs) function through both their sequences and secondary structures. Thus, secondary structure derivation is an important issue in today's RNA research. The state-of-the-art structure annotation tools are based on comparative analysis, which derives consensus structure of homologous ncRNAs. Despite promising results from existing ncRNA aligning and consensus structure derivation tools, there is a need for more efficient and accurate ncRNA secondary structure modeling and alignment methods.In this thesis, we introduce grammar string, a novel ncRNA secondary structure representation that encodes an ncRNA's sequence and secondary structure in the parameter space of a context-free grammar (CFG). Being a string defined on a special alphabet constructed from a CFG, it converts ncRNA alignment into sequence alignment with n square complexity. We explain how this representation is used in derivation of consensus secondary structure through multiple ncRNA alignment and also how existing clustering methods could be applied to ncRNAs represented by this model.
Read
- In Collections
-
Electronic Theses & Dissertations
- Copyright Status
- In Copyright
- Material Type
-
Theses
- Authors
-
Takyar, Seyedeh Shohreh
- Thesis Advisors
-
Sun, Yanni
- Committee Members
-
Brown, Titus
Cole, James
- Date Published
-
2012
- Subjects
-
Non-coding RNA
- Program of Study
-
Computer Science
- Degree Level
-
Masters
- Language
-
English
- Pages
- x, 71 pages
- ISBN
-
9781267315748
1267315741
- Permalink
- https://doi.org/doi:10.25335/11qy-a067