Optimisation of Next Generation Sequencing Methodologies for RNA Modifications Detection


Book Description

Ribonucleic acid (RNA) is an essential biomolecule in the domain of life. As a mediator between the genetic information contained in deoxyribonucleic acid (DNA) and the cell, its regulation has a major impact on the homeostasis of living organisms. The discovery and identification of chemical modifications at nucleotide level in RNA has made it possible to study a new layer of regulation of these molecules. In view of the importance of these chemical alterations within the biology domain, a specific term for the study of these modifications has been coined: epitranscriptomic. More than 170 modifications have now been listed, and their analysis has led to the discovery of numerous links between these alterations and various diseases such as viral infections, cancers and neurodegenerative diseases. Epitranscriptomic therefore holds out the hope of developing innovative treatments for yet incurable diseases. However, of the large number of modifications identified to date, few methods exist that can locate them precisely in RNA sequences. In order to overcome these limitations, the work presented here concerns the development and optimisation of methods for detecting modifications through high-throughput sequencing. More specifically, the focus of this work is on the computational processing of sequencing data in order to perfect the detection of these modified nucleotides. As a first step, an application of a method for detecting 2'-O-methylations via high-throughput sequencing entitled RiboMethSeq was carried out on tRNA samples from Escherichia coli and Saccharomyces cerevisiae with the aim of studying the role of these modifications in innate immunity. This study uncovered interesting details about the regulation of immunity, but also revealed detection limitations on the part of RiboMethSeq. These limitations are mainly due to the non-optimal parameterisation of the various data processing stages, but also to the lack of transfer RNA (tRNA) sequence references suitable for epitranscriptomic studies. These limitations have been specifically addressed and a thorough optimisation of these two concepts has been implemented, enabling a more in-depth analysis of the links between 2'-O-methylations and innate immunity in a second study. Finally, with the aim of pushing back the limits of 2'-O-methylation detection, the potential of deep learning algorithms for detecting modifications is explored using RiboMethSeq ribosomal RNA (rRNA) data as a training set for Random Forest algorithm. Secondly, development of alternative methods for the detection of two modifications, N6-methlyadenosine (m6A) and pseudouridine (Psi) is carried out. Existing detection methods for these modifications have limitations due to the nature of the approach chosen. In order to fill these gaps, two alternatives processes based on the induction of chemical signatures emitted by these modifications were proposed and then applied to biological samples, proving their robustness of detection as well as their quality of quantification of these biomolecules of interest. Finally, this work concludes with the simultaneous use of three detection methods - RiboMethSeq, AlkAnilineSeq and BisulfiteSeq - on the same set of brain cell samples in the context of studies on neurodegenerative diseases. The combination of these methods enabled five human tRNA modifications to be mapped, and allowing their respective quantification according to the nature of the cell tissue and stress conditions. This work has enabled us to confirm in greater detail observations already seen in the literature, but also to highlight a still little-studied modification, Dihydrouridine, as a potential determining factor in tRNA fragmentation.




RNA Modifications


Book Description

This detailed book describes some of the most recent advances and up-to-date methodologies to detect, quantify, analyze, and elucidate the biological function of different types of RNA modifications. Importantly, the methodologies and tools described herein can be applied to a wide variety of organisms and can be used to address biological and clinical questions. Beginning with a section on bioinformatics tools, the collection continues with sections on detecting RNA modifications using Nanopore direct RNA sequencing, next-generation sequencing approaches, qPCR- and molecular biology-based methods, mass spectrometry- and NMR-based methods, as well as approaches to assess kinetics, determinants, and functions of RNA modifications. Written for the highly successful Methods in Molecular Biology series style, chapters include introduction to their respective topics, lists of the necessary materials and reagents, step-by-step, readily reproducible laboratory protocols, and tips on troubleshooting and avoiding known pitfalls. Authoritative and practical, RNA Modifications: Methods and Protocols serves as an ideal guide for those working directly in the fields of epitranscriptomics and post-transcriptional gene regulation, as well as for scientists and clinicians interested in bioinformatic tools to study RNA modifications and techniques to dissect their roles in physiology and disease. Chapter 20 is available open access under a CC BY 4.0 license.




Computational Methods for Next Generation Sequencing Data Analysis


Book Description

Introduces readers to core algorithmic techniques for next-generation sequencing (NGS) data analysis and discusses a wide range of computational techniques and applications This book provides an in-depth survey of some of the recent developments in NGS and discusses mathematical and computational challenges in various application areas of NGS technologies. The 18 chapters featured in this book have been authored by bioinformatics experts and represent the latest work in leading labs actively contributing to the fast-growing field of NGS. The book is divided into four parts: Part I focuses on computing and experimental infrastructure for NGS analysis, including chapters on cloud computing, modular pipelines for metabolic pathway reconstruction, pooling strategies for massive viral sequencing, and high-fidelity sequencing protocols. Part II concentrates on analysis of DNA sequencing data, covering the classic scaffolding problem, detection of genomic variants, including insertions and deletions, and analysis of DNA methylation sequencing data. Part III is devoted to analysis of RNA-seq data. This part discusses algorithms and compares software tools for transcriptome assembly along with methods for detection of alternative splicing and tools for transcriptome quantification and differential expression analysis. Part IV explores computational tools for NGS applications in microbiomics, including a discussion on error correction of NGS reads from viral populations, methods for viral quasispecies reconstruction, and a survey of state-of-the-art methods and future trends in microbiome analysis. Computational Methods for Next Generation Sequencing Data Analysis: Reviews computational techniques such as new combinatorial optimization methods, data structures, high performance computing, machine learning, and inference algorithms Discusses the mathematical and computational challenges in NGS technologies Covers NGS error correction, de novo genome transcriptome assembly, variant detection from NGS reads, and more This text is a reference for biomedical professionals interested in expanding their knowledge of computational techniques for NGS data analysis. The book is also useful for graduate and post-graduate students in bioinformatics.




Clinical Applications for Next-Generation Sequencing


Book Description

Clinical Applications for Next Generation Sequencing provides readers with an outstanding postgraduate resource to learn about the translational use of NGS in clinical environments. Rooted in both medical genetics and clinical medicine, the book fills the gap between state-of-the-art technology and evidence-based practice, providing an educational opportunity for users to advance patient care by transferring NGS to the needs of real-world patients. The book builds an interface between genetic laboratory staff and clinical health workers to not only improve communication, but also strengthen cooperation. Users will find valuable tactics they can use to build a systematic framework for understanding the role of NGS testing in both common and rare diseases and conditions, from prenatal care, like chromosomal abnormalities, up to advanced age problems like dementia. - Fills the gap between state-of-the-art technology and evidence-based practice - Provides an educational opportunity which advances patient care through the transfer of NGS to real-world patient assessment - Promotes a practical tool that clinicians can apply directly to patient care - Includes a systematic framework for understanding the role of NGS testing in many common and rare diseases - Presents evidence regarding the important role of NGS in current diagnostic strategies




Next Generation Sequencing


Book Description

Next generation sequencing (NGS) has surpassed the traditional Sanger sequencing method to become the main choice for large-scale, genome-wide sequencing studies with ultra-high-throughput production and a huge reduction in costs. The NGS technologies have had enormous impact on the studies of structural and functional genomics in all the life sciences. In this book, Next Generation Sequencing Advances, Applications and Challenges, the sixteen chapters written by experts cover various aspects of NGS including genomics, transcriptomics and methylomics, the sequencing platforms, and the bioinformatics challenges in processing and analysing huge amounts of sequencing data. Following an overview of the evolution of NGS in the brave new world of omics, the book examines the advances and challenges of NGS applications in basic and applied research on microorganisms, agricultural plants and humans. This book is of value to all who are interested in DNA sequencing and bioinformatics across all fields of the life sciences.







Next Generation Sequencing


Book Description

This volume covers a wide range of various fields of research, with the common thread being Next Generation Sequencing (NGS) related methods and applications, as well as analysis and interpretation of the data obtained. Chapters guide readers through the highly dynamic processes of translational and transcriptional profiling of a cell, method to detect copy number alterations (CNAs), targeted sequencing applications, method called “Hi-Plex” to characterize known polymorphic loci, single-cell of DNA or RNA, identify and characterize rare circulating CD4 T cells, and computational pipeline for RNAseq analysis. Written in the highly successful Methods in Molecular Biology series format, chapters include introductions to their respective topics, lists of the necessary materials and reagents, step-by-step, readily reproducible laboratory protocols, and tips on troubleshooting and avoiding known pitfalls. Authoritative and practical, Next Generation Sequencing: Methods and Protocols aims to be useful and informative for further study into this vital field.




Next Generation Sequencing and Sequence Assembly


Book Description

The goal of this book is to introduce the biological and technical aspects of next generation sequencing methods, as well as algorithms to assemble these sequences into whole genomes. The book is organized into two parts; part 1 introduces NGS methods and part 2 reviews assembly algorithms and gives a good insight to these methods for readers new to the field. Gathering information, about sequencing and assembly methods together, helps both biologists and computer scientists to get a clear idea about the field. Chapters will include information about new sequencing technologies such as ChIp-seq, ChIp-chip, and De Novo sequence assembly. ​




Algorithms for Next-Generation Sequencing Data


Book Description

The 14 contributed chapters in this book survey the most recent developments in high-performance algorithms for NGS data, offering fundamental insights and technical information specifically on indexing, compression and storage; error correction; alignment; and assembly. The book will be of value to researchers, practitioners and students engaged with bioinformatics, computer science, mathematics, statistics and life sciences.