McCartney, Ann (2018) The identification and characterization of RNA-mediated gene fusions across primate genomes. PhD thesis, Dublin City University.
Abstract
New genes arise through gene duplication, retrotransposition, exon shuffling, gene fusion/fission, and de-novo genesis from noncoding DNA. Thus far, RNAmediated gene fusion (RMGFs) has been shown to introduce functional novelty, divergent selective pressures, and divergent expression profiles when compared to unfused parent genes. However, the frequency and properties of these new genes remain largely unknown. Through the application of genome-wide
networks to NGS data from Great Apes we aim to identify RMGFs, investigate their epigenetic profiles and analyse their potential mechanisms of generation, particularly through segmental duplications (SD). Subsequently, we aim to both computationally and experimentally investigate their expression and translation profiles and to characterise the cis-regulatory mechanisms behind RMGF transcription regulation. Finally, in order to enhance our understanding of the modular structure of RMGFs network based analyses were carried out to determine pFam domain usage patterns. 69 RMGFs were identified including 9 human-specific genes, their ancestry investigated across 32 high-quality vertebrate species and a significant enrichment in human SD shown. qRT-PCR and RNA-seq analyses reveal heterogeneous tissue expression with a bias towards testes specific expression in support of the ‘out-of-testis’ hypothesis. Moreover, cis-regulatory analyses of splice factor-binding sites, histone
modifications and transcription factor binding sites support this profile of expression. Ribosomal profiling of human fibroblast cell lines has uncovered translation for 3 RMGFs and these genes remain functionally unannotated. RMGF domain usage pattern does not significantly differ from non-fused protein coding genes in human or indeed across vertebrates. Our genome-wide scan for RMGFs across primates has uncovered that their occurrence is frequent, they are enriched in regions of SD, their transcriptional output and cis-motifs support the ‘out-of-testes’ hypothesis and that their domain usage does not differ significantly to that of non-fused genes.
Metadata
Item Type: | Thesis (PhD) |
---|---|
Date of Award: | November 2018 |
Refereed: | No |
Supervisor(s): | O'Connell, Mary J. and Downing, Tim |
Subjects: | Biological Sciences > Bioinformatics |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Science and Health > School of Biotechnology |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 3.0 License. View License |
Funders: | IRC |
ID Code: | 22640 |
Deposited On: | 22 Nov 2018 16:05 by Tim Downing . Last Modified 01 Feb 2023 20:48 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
37MB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record