Machine Learning in Python for Visual and Acoustic Data-based Process Monitoring


Book Description

This book is designed to help readers gain quick familiarity with deep learning-based computer vision and abnormal equipment sound detection techniques. The book helps you take your first step towards learning how to use convolutional neural networks (the ANN architecture that is behind the modern revolution in computer vision) and build image sensor-based manufacturing defect detection solutions. A quick introduction is also provided to how modern predictive maintenance solutions can be built for process critical equipment by analyzing the sound generated by the equipment. Overall, this short eBook sets the foundation with which budding process data scientists can confidently navigate the world of modern computer vision and acoustic monitoring.




Machine Learning in Python for Process and Equipment Condition Monitoring, and Predictive Maintenance


Book Description

This book is designed to help readers quickly gain a working knowledge of machine learning-based techniques that are widely employed for building equipment condition monitoring, plantwide monitoring , and predictive maintenance solutions in process industry . The book covers a broad spectrum of techniques ranging from univariate control charts to deep learning-based prediction of remaining useful life. Consequently, the readers can leverage the concepts learned to build advanced solutions for fault detection, fault diagnosis, and fault prognosis. The application focused approach of the book is reader friendly and easily digestible to the practicing and aspiring process engineers and data scientists. Upon completion, readers will be able to confidently navigate the Prognostics and Health Management literature and make judicious selection of modeling approaches suitable for their problems. This book has been divided into seven parts. Part 1 lays down the basic foundations of ML-assisted process and equipment condition monitoring, and predictive maintenance. Part 2 provides in-detail presentation of classical ML techniques for univariate signal monitoring. Different types of control charts and time-series pattern matching methodologies are discussed. Part 3 is focused on the widely popular multivariate statistical process monitoring (MSPM) techniques. Emphasis is paid to both the fault detection and fault isolation/diagnosis aspects. Part 4 covers the process monitoring applications of classical machine learning techniques such as k-NN, isolation forests, support vector machines, etc. These techniques come in handy for processes that cannot be satisfactorily handled via MSPM techniques. Part 5 navigates the world of artificial neural networks (ANN) and studies the different ANN structures that are commonly employed for fault detection and diagnosis in process industry. Part 6 focusses on vibration-based monitoring of rotating machinery and Part 7 deals with prognostic techniques for predictive maintenance applications. Broadly, the book covers the following: Exploratory analysis of process data Best practices for process monitoring and predictive maintenance solutions Univariate monitoring via control charts and time series data mining Multivariate statistical process monitoring techniques (PCA, PLS, FDA, etc.) Machine learning and deep learning techniques to handle dynamic, nonlinear, and multimodal processes Fault detection and diagnosis of rotating machinery using vibration data Remaining useful life predictions for predictive maintenance




Unsupervised Process Monitoring and Fault Diagnosis with Machine Learning Methods


Book Description

This unique text/reference describes in detail the latest advances in unsupervised process monitoring and fault diagnosis with machine learning methods. Abundant case studies throughout the text demonstrate the efficacy of each method in real-world settings. The broad coverage examines such cutting-edge topics as the use of information theory to enhance unsupervised learning in tree-based methods, the extension of kernel methods to multiple kernel learning for feature extraction from data, and the incremental training of multilayer perceptrons to construct deep architectures for enhanced data projections. Topics and features: discusses machine learning frameworks based on artificial neural networks, statistical learning theory and kernel-based methods, and tree-based methods; examines the application of machine learning to steady state and dynamic operations, with a focus on unsupervised learning; describes the use of spectral methods in process fault diagnosis.




Machine Learning in Python for Process Systems Engineering


Book Description

This book provides an application-focused exposition of modern ML tools that have proven useful in process industry and hands-on illustrations on how to develop ML-based solutions for process monitoring, predictive maintenance, fault diagnosis, inferential modeling, dimensionality reduction, and process control. This book considers unique characteristics of industrial process data and uses real data from industrial systems for illustrations. With the focus on practical implementation and minimal programming or ML prerequisites, the book covers the gap in available ML resources for industrial practitioners. The authors of this book have drawn from their years of experience in developing data-driven industrial solutions to provide a guided tour along the wide range of available ML methods and declutter the world of machine learning. The readers will find all the resources they need to deal with high-dimensional, correlated, noisy, corrupted, multimode, and nonlinear process data. The book has been divided into four parts. Part 1 provides a perspective on the importance of ML in process systems engineering and lays down the basic foundations of ML. Part 2 provides in-detail presentation of classical ML techniques and has been written keeping in mind the various characteristics of industrial process systems. Part 3 is focused on artificial neural networks and deep learning. Part 4 covers the important topic of deploying ML solutions over web and shows how to build a production-ready process monitoring web application. Broadly, the book covers the following: Varied applications of ML in process industry Fundamentals of machine learning workflow Practical methodologies for pre-processing industrial data Classical ML methods and their application for process monitoring, fault diagnosis, and soft sensing Deep learning and its application for predictive maintenance Reinforcement learning and its application for process control Deployment of ML solution over web




Machine Learning in Python for Dynamic Process Systems


Book Description

This book is designed to help readers gain a working-level knowledge of machine learning-based dynamic process modeling techniques that have proven useful in process industry. Readers can leverage the concepts learned to build advanced solutions for process monitoring, soft sensing, inferential modeling, predictive maintenance, and process control for dynamic systems. The application-focused approach of the book is reader friendly and easily digestible to the practicing and aspiring process engineers, and data scientists. The authors of this book have drawn from their years of experience in developing data-driven industrial solutions to provide a guided tour along the wide range of available ML methods and declutter the world of machine learning for dynamic process modeling. Upon completion, readers will be able to confidently navigate the system identification literature and make judicious selection of modeling approaches suitable for their problems. This book has been divided into three parts. Part 1 of the book provides perspectives on the importance of ML for dynamic process modeling and lays down the basic foundations of ML-DPM (machine learning for dynamic process modeling). Part 2 provides in-detail presentation of classical ML techniques and has been written keeping in mind the different modeling requirements and process characteristics that determine a model’s suitability for a problem at hand. These include, amongst others, presence of multiple correlated outputs, process nonlinearity, need for low model bias, need to model disturbance signal accurately, etc. Part 3 is focused on artificial neural networks and deep learning. The following topics are broadly covered: · Exploratory analysis of dynamic dataset · Best practices for dynamic modeling · Linear and discrete-time classical parametric and non-parametric models · State-space models for MIMO systems · Nonlinear system identification and closed-loop identification · Neural networks-based dynamic process modeling




Applied Machine Learning Solutions with Python


Book Description

A problem-focused guide for tackling industrial machine learning issues with methods and frameworks chosen by experts. KEY FEATURES ● Popular techniques for problem formulation, data collection, and data cleaning in machine learning. ● Comprehensive and useful machine learning tools such as MLFlow, Streamlit, and many more. ● Covers numerous machine learning libraries, including Tensorflow, FastAI, Scikit-Learn, Pandas, and Numpy. DESCRIPTION This book discusses how to apply machine learning to real-world problems by utilizing real-world data. In this book, you will investigate data sources, become acquainted with data pipelines, and practice how machine learning works through numerous examples and case studies. The book begins with high-level concepts and implementation (with code!) and progresses towards the real-world of ML systems. It briefly discusses various concepts of Statistics and Linear Algebra. You will learn how to formulate a problem, collect data, build a model, and tune it. You will learn about use cases for data analytics, computer vision, and natural language processing. You will also explore nonlinear architecture, thus enabling you to build models with multiple inputs and outputs. You will get trained on creating a machine learning profile, various machine learning libraries, Statistics, and FAST API. Throughout the book, you will use Python to experiment with machine learning libraries such as Tensorflow, Scikit-learn, Spacy, and FastAI. The book will help train our models on both Kaggle and our datasets. WHAT YOU WILL LEARN ● Construct a machine learning problem, evaluate the feasibility, and gather and clean data. ● Learn to explore data first, select, and train machine learning models. ● Fine-tune the chosen model, deploy, and monitor it in production. ● Discover popular models for data analytics, computer vision, and Natural Language Processing. ● Create a machine learning profile and contribute to the community. WHO THIS BOOK IS FOR This book caters to beginners in machine learning, software engineers, and students who want to gain a good understanding of machine learning concepts and create production-ready ML systems. This book assumes you have a beginner-level understanding of Python. TABLE OF CONTENTS 1. Introduction to Machine Learning 2. Problem Formulation in Machine Learning 3. Data Acquisition and Cleaning 4. Exploratory Data Analysis 5. Model Building and Tuning 6. Taking Our Model into Production 7. Data Analytics Use Case 8. Building a Custom Image Classifier from Scratch 9. Building a News Summarization App Using Transformers 10. Multiple Inputs and Multiple Output Models 11. Contributing to the Community 12. Creating Your Project 13. Crash Course in Numpy, Matplotlib, and Pandas 14. Crash Course in Linear Algebra and Statistics 15. Crash Course in FastAPI




Practical Machine Learning with Python


Book Description

Master the essential skills needed to recognize and solve complex problems with machine learning and deep learning. Using real-world examples that leverage the popular Python machine learning ecosystem, this book is your perfect companion for learning the art and science of machine learning to become a successful practitioner. The concepts, techniques, tools, frameworks, and methodologies used in this book will teach you how to think, design, build, and execute machine learning systems and projects successfully. Practical Machine Learning with Python follows a structured and comprehensive three-tiered approach packed with hands-on examples and code. Part 1 focuses on understanding machine learning concepts and tools. This includes machine learning basics with a broad overview of algorithms, techniques, concepts and applications, followed by a tour of the entire Python machine learning ecosystem. Brief guides for useful machine learning tools, libraries and frameworks are also covered. Part 2 details standard machine learning pipelines, with an emphasis on data processing analysis, feature engineering, and modeling. You will learn how to process, wrangle, summarize and visualize data in its various forms. Feature engineering and selection methodologies will be covered in detail with real-world datasets followed by model building, tuning, interpretation and deployment. Part 3 explores multiple real-world case studies spanning diverse domains and industries like retail, transportation, movies, music, marketing, computer vision and finance. For each case study, you will learn the application of various machine learning techniques and methods. The hands-on examples will help you become familiar with state-of-the-art machine learning tools and techniques and understand what algorithms are best suited for any problem. Practical Machine Learning with Python will empower you to start solving your own problems with machine learning today! What You'll Learn Execute end-to-end machine learning projects and systems Implement hands-on examples with industry standard, open source, robust machine learning tools and frameworks Review case studies depicting applications of machine learning and deep learning on diverse domains and industries Apply a wide range of machine learning models including regression, classification, and clustering. Understand and apply the latest models and methodologies from deep learning including CNNs, RNNs, LSTMs and transfer learning. Who This Book Is For IT professionals, analysts, developers, data scientists, engineers, graduate students




PYTHON GUI PROJECTS WITH MACHINE LEARNING AND DEEP LEARNING


Book Description

PROJECT 1: THE APPLIED DATA SCIENCE WORKSHOP: Prostate Cancer Classification and Recognition Using Machine Learning and Deep Learning with Python GUI Prostate cancer is cancer that occurs in the prostate. The prostate is a small walnut-shaped gland in males that produces the seminal fluid that nourishes and transports sperm. Prostate cancer is one of the most common types of cancer. Many prostate cancers grow slowly and are confined to the prostate gland, where they may not cause serious harm. However, while some types of prostate cancer grow slowly and may need minimal or even no treatment, other types are aggressive and can spread quickly. The dataset used in this project consists of 100 patients which can be used to implement the machine learning and deep learning algorithms. The dataset consists of 100 observations and 10 variables (out of which 8 numeric variables and one categorical variable and is ID) which are as follows: Id, Radius, Texture, Perimeter, Area, Smoothness, Compactness, Diagnosis Result, Symmetry, and Fractal Dimension. The models used in this project are K-Nearest Neighbor, Random Forest, Naive Bayes, Logistic Regression, Decision Tree, Support Vector Machine, Adaboost, LGBM classifier, Gradient Boosting, XGB classifier, MLP classifier, and CNN 1D. Finally, you will develop a GUI using PyQt5 to plot boundary decision, ROC, distribution of features, feature importance, cross validation score, and predicted values versus true values, confusion matrix, learning curve, performance of the model, scalability of the model, training loss, and training accuracy. PROJECT 2: THE APPLIED DATA SCIENCE WORKSHOP: Urinary Biomarkers Based Pancreatic Cancer Classification and Prediction Using Machine Learning with Python GUI Pancreatic cancer is an extremely deadly type of cancer. Once diagnosed, the five-year survival rate is less than 10%. However, if pancreatic cancer is caught early, the odds of surviving are much better. Unfortunately, many cases of pancreatic cancer show no symptoms until the cancer has spread throughout the body. A diagnostic test to identify people with pancreatic cancer could be enormously helpful. In a paper by Silvana Debernardi and colleagues, published this year in the journal PLOS Medicine, a multi-national team of researchers sought to develop an accurate diagnostic test for the most common type of pancreatic cancer, called pancreatic ductal adenocarcinoma or PDAC. They gathered a series of biomarkers from the urine of three groups of patients: Healthy controls, Patients with non-cancerous pancreatic conditions, like chronic pancreatitis, and Patients with pancreatic ductal adenocarcinoma. When possible, these patients were age- and sex-matched. The goal was to develop an accurate way to identify patients with pancreatic cancer. The key features are four urinary biomarkers: creatinine, LYVE1, REG1B, and TFF1. Creatinine is a protein that is often used as an indicator of kidney function. YVLE1 is lymphatic vessel endothelial hyaluronan receptor 1, a protein that may play a role in tumor metastasis. REG1B is a protein that may be associated with pancreas regeneration. TFF1 is trefoil factor 1, which may be related to regeneration and repair of the urinary tract. The models used in this project are K-Nearest Neighbor, Random Forest, Naive Bayes, Logistic Regression, Decision Tree, Support Vector Machine, Adaboost, LGBM classifier, Gradient Boosting, XGB classifier, and MLP classifier. Finally, you will develop a GUI using PyQt5 to plot boundary decision, ROC, distribution of features, feature importance, cross validation score, and predicted values versus true values, confusion matrix, learning curve, performance of the model, scalability of the model, training loss, and training accuracy. PROJECT 3: DATA SCIENCE CRASH COURSE: Voice Based Gender Classification and Prediction Using Machine Learning and Deep Learning with Python GUI This dataset was created to identify a voice as male or female, based upon acoustic properties of the voice and speech. The dataset consists of 3,168 recorded voice samples, collected from male and female speakers. The voice samples are pre-processed by acoustic analysis in R using the seewave and tuneR packages, with an analyzed frequency range of 0hz-280hz (human vocal range). The following acoustic properties of each voice are measured and included within the CSV: meanfreq: mean frequency (in kHz); sd: standard deviation of frequency; median: median frequency (in kHz); Q25: first quantile (in kHz); Q75: third quantile (in kHz); IQR: interquantile range (in kHz); skew: skewness; kurt: kurtosis; sp.ent: spectral entropy; sfm: spectral flatness; mode: mode frequency; centroid: frequency centroid (see specprop); peakf: peak frequency (frequency with highest energy); meanfun: average of fundamental frequency measured across acoustic signal; minfun: minimum fundamental frequency measured across acoustic signal; maxfun: maximum fundamental frequency measured across acoustic signal; meandom: average of dominant frequency measured across acoustic signal; mindom: minimum of dominant frequency measured across acoustic signal; maxdom: maximum of dominant frequency measured across acoustic signal; dfrange: range of dominant frequency measured across acoustic signal; modindx: modulation index. Calculated as the accumulated absolute difference between adjacent measurements of fundamental frequencies divided by the frequency range; and label: male or female. The models used in this project are K-Nearest Neighbor, Random Forest, Naive Bayes, Logistic Regression, Decision Tree, Support Vector Machine, Adaboost, LGBM classifier, Gradient Boosting, XGB classifier, MLP classifier, and CNN 1D. Finally, you will develop a GUI using PyQt5 to plot boundary decision, ROC, distribution of features, feature importance, cross validation score, and predicted values versus true values, confusion matrix, learning curve, performance of the model, scalability of the model, training loss, and training accuracy. PROJECT 4: DATA SCIENCE CRASH COURSE: Thyroid Disease Classification and Prediction Using Machine Learning and Deep Learning with Python GUI Thyroid disease is a general term for a medical condition that keeps your thyroid from making the right amount of hormones. Thyroid typically makes hormones that keep body functioning normally. When the thyroid makes too much thyroid hormone, body uses energy too quickly. The two main types of thyroid disease are hypothyroidism and hyperthyroidism. Both conditions can be caused by other diseases that impact the way the thyroid gland works. Dataset used in this project was from Garavan Institute Documentation as given by Ross Quinlan 6 databases from the Garavan Institute in Sydney, Australia. Approximately the following for each database: 2800 training (data) instances and 972 test instances. This dataset contains plenty of missing data, while 29 or so attributes, either Boolean or continuously-valued. The models used in this project are K-Nearest Neighbor, Random Forest, Naive Bayes, Logistic Regression, Decision Tree, Support Vector Machine, Adaboost, LGBM classifier, Gradient Boosting, XGB classifier, MLP classifier, and CNN 1D. Finally, you will develop a GUI using PyQt5 to plot boundary decision, ROC, distribution of features, feature importance, cross validation score, and predicted values versus true values, confusion matrix, learning curve, performance of the model, scalability of the model, training loss, and training accuracy.




Building Machine Learning Systems with Python


Book Description

Get more from your data by creating practical machine learning systems with Python Key Features Develop your own Python-based machine learning system Discover how Python offers multiple algorithms for modern machine learning systems Explore key Python machine learning libraries to implement in your projects Book Description Machine learning allows systems to learn things without being explicitly programmed to do so. Python is one of the most popular languages used to develop machine learning applications, which take advantage of its extensive library support. This third edition of Building Machine Learning Systems with Python addresses recent developments in the field by covering the most-used datasets and libraries to help you build practical machine learning systems. Using machine learning to gain deeper insights from data is a key skill required by modern application developers and analysts alike. Python, being a dynamic language, allows for fast exploration and experimentation. This book shows you exactly how to find patterns in your raw data. You will start by brushing up on your Python machine learning knowledge and being introduced to libraries. You'll quickly get to grips with serious, real-world projects on datasets, using modeling and creating recommendation systems. With Building Machine Learning Systems with Python, you’ll gain the tools and understanding required to build your own systems, all tailored to solve real-world data analysis problems. By the end of this book, you will be able to build machine learning systems using techniques and methodologies such as classification, sentiment analysis, computer vision, reinforcement learning, and neural networks. What you will learn Build a classification system that can be applied to text, images, and sound Employ Amazon Web Services (AWS) to run analysis on the cloud Solve problems related to regression using scikit-learn and TensorFlow Recommend products to users based on their past purchases Understand different ways to apply deep neural networks on structured data Address recent developments in the field of computer vision and reinforcement learning Who this book is for Building Machine Learning Systems with Python is for data scientists, machine learning developers, and Python developers who want to learn how to build increasingly complex machine learning systems. You will use Python's machine learning capabilities to develop effective solutions. Prior knowledge of Python programming is expected.




ECPPM 2021 - eWork and eBusiness in Architecture, Engineering and Construction


Book Description

eWork and eBusiness in Architecture, Engineering and Construction 2021 collects the papers presented at the 13th European Conference on Product and Process Modelling (ECPPM 2021, Moscow, 5-7 May 2021). The contributions cover a wide spectrum of thematic areas that hold great promise towards the advancement of research and technological development targeted at the digitalization of the AEC/FM (Architecture, Engineering, Construction and Facilities Management) domains. High quality contributions are devoted to critically important problems that arise, including: Information and Knowledge Management Semantic Web and Linked Data Communication and Collaboration Technologies Software Interoperability BIM Servers and Product Lifecycle Management Systems Digital Twins and Cyber-Physical Systems Sensors and Internet of Things Big Data Artificial and Augmented Intelligence in AEC Construction Management 5D/nD Modelling and Planning Building Performance Simulation Contract, Cost and Risk Management Safety and Quality Sustainable Buildings and Urban Environments Smart Buildings and Cities BIM Standardization, Implementation and Adoption Regulatory and Legal Aspects BIM Education and Training Industrialized Production, Smart Products and Services Over the past quarter century, the biennial ECPPM conference series, as the oldest BIM conference, has provided researchers and practitioners with a unique platform to present and discuss the latest developments regarding emerging BIM technologies and complementary issues for their adoption in the AEC/FM industry.