Computational Methods for Multi-Omics Data Analysis in Cancer Precision Medicine


Book Description

Cancer is a complex and heterogeneous disease often caused by different alterations. The development of human cancer is due to the accumulation of genetic and epigenetic modifications that could affect the structure and function of the genome. High-throughput methods (e.g., microarray and next-generation sequencing) can investigate a tumor at multiple levels: i) DNA with genome-wide association studies (GWAS), ii) epigenetic modifications such as DNA methylation, histone changes and microRNAs (miRNAs) iii) mRNA. The availability of public datasets from different multi-omics data has been growing rapidly and could facilitate better knowledge of the biological processes of cancer. Computational approaches are essential for the analysis of big data and the identification of potential biomarkers for early and differential diagnosis, and prognosis.




Machine Learning for Clinical Trials and Precision Medicine


Book Description

Machine learning (ML) has been wildly applied in biomedicine and healthcare. The growing abundance of medical data and the advance of biological technologies (e.g. next-generation sequencing) have offered great opportunities for using ML in computational biology and health. In this thesis, I present my works contributing to this emerging field in three aspects -- using large-scale datasets to advance medical studies, developing algorithms to solve biological challenges, and building analysis tools for new technologies. In the first part, I present two works of applying ML on large-scale real-world data: one for clinical trial design and one for precision medicine. Overly restrictive eligibility criteria has been a key barrier for clinical trials. In the thesis, I introduce a powerful computational framework, Trial Pathfinder, which enables inclusive criteria and data valuation for clinical trials. A critical goal for precision medicine is to characterize how patients with specific genetic mutations respond to therapies. In the thesis, I present systematic pan-cancer analysis of mutation-treatment interactions using large real-world clinico-genomics data. In the second part, I introduce my work on developing algorithms to solve biological challenge -- aligning multiple datasets with subset correspondence information. In many biological and medical applications, we have multiple related datasets from different sources or domains, and learning efficient computational mappings between these datasets is an important problem. In the thesis, I present an end-to-end optimal transport framework that effectively leverages side information to align datasets. Finally, I present my work on developing analysis tools for new technologies -- spatial transcriptomics and RNA velocity. Recently high-throughput image-based transcriptomic methods were developed and enabled researchers to spatially resolve gene expression variation at the molecular level for the first time. In the thesis, I describe a general analysis tool to quantitatively study the spatial correlations of gene expression in fixed tissue sections. Recent development in inferring RNA velocity from single-cell RNA-seq opens up exciting new vista into developmental lineage and cellular dynamics. In the thesis, I introduce a principled computational framework that extends RNA velocity to quantify systems level dynamics and improve single-cell data analysis.




Big Data in Omics and Imaging


Book Description

Big Data in Omics and Imaging: Association Analysis addresses the recent development of association analysis and machine learning for both population and family genomic data in sequencing era. It is unique in that it presents both hypothesis testing and a data mining approach to holistically dissecting the genetic structure of complex traits and to designing efficient strategies for precision medicine. The general frameworks for association analysis and machine learning, developed in the text, can be applied to genomic, epigenomic and imaging data. FEATURES Bridges the gap between the traditional statistical methods and computational tools for small genetic and epigenetic data analysis and the modern advanced statistical methods for big data Provides tools for high dimensional data reduction Discusses searching algorithms for model and variable selection including randomization algorithms, Proximal methods and matrix subset selection Provides real-world examples and case studies Will have an accompanying website with R code The book is designed for graduate students and researchers in genomics, bioinformatics, and data science. It represents the paradigm shift of genetic studies of complex diseases– from shallow to deep genomic analysis, from low-dimensional to high dimensional, multivariate to functional data analysis with next-generation sequencing (NGS) data, and from homogeneous populations to heterogeneous population and pedigree data analysis. Topics covered are: advanced matrix theory, convex optimization algorithms, generalized low rank models, functional data analysis techniques, deep learning principle and machine learning methods for modern association, interaction, pathway and network analysis of rare and common variants, biomarker identification, disease risk and drug response prediction.




Adaptive Treatment Strategies in Practice: Planning Trials and Analyzing Data for Personalized Medicine


Book Description

Personalized medicine is a medical paradigm that emphasizes systematic use of individual patient information to optimize that patient's health care, particularly in managing chronic conditions and treating cancer. In the statistical literature, sequential decision making is known as an adaptive treatment strategy (ATS) or a dynamic treatment regime (DTR). The field of DTRs emerges at the interface of statistics, machine learning, and biomedical science to provide a data-driven framework for precision medicine. The authors provide a learning-by-seeing approach to the development of ATSs, aimed at a broad audience of health researchers. All estimation procedures used are described in sufficient heuristic and technical detail so that less quantitative readers can understand the broad principles underlying the approaches. At the same time, more quantitative readers can implement these practices. This book provides the most up-to-date summary of the current state of the statistical research in personalized medicine; contains chapters by leaders in the area from both the statistics and computer sciences fields; and also contains a range of practical advice, introductory and expository materials, and case studies.




Chinese Power and Artificial Intelligence


Book Description

This book provides a comprehensive account of Chinese AI in its various facets, based on primary Chinese-language sources. China’s rise as an AI power is an event of importance to the world and a potential challenge to liberal democracies. Filling a gap in the literature, this volume is fully documented, data-driven, and presented in a scholarly format suitable for citation and for supporting downstream research, while also remaining accessible to laypersons. It brings together 15 recognized international experts to present a full treatment of Chinese artificial intelligence. The volume contains chapters on state, commercial, and foreign sources of China’s AI power; China’s AI talent, scholarship, and global standing; the impact of AI on China’s development of cutting-edge disciplines; China’s use of AI in military, cyber, and surveillance applications; AI safety, threat mitigation, and the technology’s likely trajectory. The book ends with recommendations drawn from the authors’ interactions with policymakers and specialists worldwide, aimed at encouraging AI’s healthy development in China and preparing the rest of the world to engage with it. This book will be of much interest to students of Chinese politics, science and technology studies, security studies and international relations.




Cognitive Informatics and Soft Computing


Book Description

This book presents best selected research papers presented at the 3rd International Conference on Cognitive Informatics and Soft Computing (CISC 2020), held at Balasore College of Engineering & Technology, Balasore, Odisha, India, from 12 to 13 December 2020. It highlights, in particular, innovative research in the fields of cognitive informatics, cognitive computing, computational intelligence, advanced computing, and hybrid intelligent models and applications. New algorithms and methods in a variety of fields are presented, together with solution-based approaches. The topics addressed include various theoretical aspects and applications of computer science, artificial intelligence, cybernetics, automation control theory, and software engineering.




Artificial Intelligence for Drug Development, Precision Medicine, and Healthcare


Book Description

Artificial Intelligence for Drug Development, Precision Medicine, and Healthcare covers exciting developments at the intersection of computer science and statistics. While much of machine-learning is statistics-based, achievements in deep learning for image and language processing rely on computer science’s use of big data. Aimed at those with a statistical background who want to use their strengths in pursuing AI research, the book: · Covers broad AI topics in drug development, precision medicine, and healthcare. · Elaborates on supervised, unsupervised, reinforcement, and evolutionary learning methods. · Introduces the similarity principle and related AI methods for both big and small data problems. · Offers a balance of statistical and algorithm-based approaches to AI. · Provides examples and real-world applications with hands-on R code. · Suggests the path forward for AI in medicine and artificial general intelligence. As well as covering the history of AI and the innovative ideas, methodologies and software implementation of the field, the book offers a comprehensive review of AI applications in medical sciences. In addition, readers will benefit from hands on exercises, with included R code.




Machine Learning and Systems Biology in Genomics and Health


Book Description

This book discusses the application of machine learning in genomics. Machine Learning offers ample opportunities for Big Data to be assimilated and comprehended effectively using different frameworks. Stratification, diagnosis, classification and survival predictions encompass the different health care regimes representing unique challenges for data pre-processing, model training, refinement of the systems with clinical implications. The book discusses different models for in-depth analysis of different conditions. Machine Learning techniques have revolutionized genomic analysis. Different chapters of the book describe the role of Artificial Intelligence in clinical and genomic diagnostics. It discusses how systems biology is exploited in identifying the genetic markers for drug discovery and disease identification. Myriad number of diseases whether be infectious, metabolic, cancer can be dealt in effectively which combines the different omics data for precision medicine. Major breakthroughs in the field would help reflect more new innovations which are at their pinnacle stage. This book is useful for researchers in the fields of genomics, genetics, computational biology and bioinformatics.




Artificial Intelligence and Computational Dynamics for Biomedical Research


Book Description

This work presents the latest development in the field of computational intelligence to advance Big Data and Cloud Computing concerning applications in medical diagnosis. As forum for academia and professionals it covers state-of-the-art research challenges and issues in the digital information & knowledge management and the concerns along with the solutions adopted in these fields.