Predictive Analytics and Data Mining


Book Description

Put Predictive Analytics into ActionLearn the basics of Predictive Analysis and Data Mining through an easy to understand conceptual framework and immediately practice the concepts learned using the open source RapidMiner tool. Whether you are brand new to Data Mining or working on your tenth project, this book will show you how to analyze data, uncover hidden patterns and relationships to aid important decisions and predictions. Data Mining has become an essential tool for any enterprise that collects, stores and processes data as part of its operations. This book is ideal for business users, data analysts, business analysts, business intelligence and data warehousing professionals and for anyone who wants to learn Data Mining.You’ll be able to:1. Gain the necessary knowledge of different data mining techniques, so that you can select the right technique for a given data problem and create a general purpose analytics process.2. Get up and running fast with more than two dozen commonly used powerful algorithms for predictive analytics using practical use cases.3. Implement a simple step-by-step process for predicting an outcome or discovering hidden relationships from the data using RapidMiner, an open source GUI based data mining tool Predictive analytics and Data Mining techniques covered: Exploratory Data Analysis, Visualization, Decision trees, Rule induction, k-Nearest Neighbors, Naïve Bayesian, Artificial Neural Networks, Support Vector machines, Ensemble models, Bagging, Boosting, Random Forests, Linear regression, Logistic regression, Association analysis using Apriori and FP Growth, K-Means clustering, Density based clustering, Self Organizing Maps, Text Mining, Time series forecasting, Anomaly detection and Feature selection. Implementation files can be downloaded from the book companion site at www.LearnPredictiveAnalytics.com Demystifies data mining concepts with easy to understand language Shows how to get up and running fast with 20 commonly used powerful techniques for predictive analysis Explains the process of using open source RapidMiner tools Discusses a simple 5 step process for implementing algorithms that can be used for performing predictive analytics Includes practical use cases and examples




AI dan DATA SCIENCE dengan Python GUI: Studi Kasus Covid-19 dan Stroke


Book Description

KASUS 1: COVID-19 Karena penyebaran COVID-19, pengembangan vaksin dituntut sesegera mungkin. Terlepas dari pentingnya analisis data dalam pengembangan vaksin, tidak banyak dataset sederhana yang dapat ditangani oleh pada analis data. Kumpulan data dan kode sampel telah dikumpulkan untuk prediksi epitop Bcell, salah satu topik penelitian utama dalam pengembangan vaksin, tersedia secara gratis. Dataset ini dikembangkan selama proses penelitian kami dan data yang terkandung di dalamnya diperoleh dari IEDB dan UniProt. Sel B yang menginduksi respon imun spesifik antigen in vivo menghasilkan sejumlah besar antibodi spesifik antigen dengan mengenali subregion (wilayah epitop) protein antigen. Sel B ini dapat menghambat fungsinya dengan mengikat antibodi ke protein antigen. Memprediksi daerah epitop bermanfaat untuk desain dan pengembangan vaksin yang bertujuan untuk menginduksi produksi antibodi spesifik antigen. Sel B inilah menjadi dataset utama yang dipakai pada proyek ini. Dataset ini memuat kolom: parent_protein_id, protein_seq, start_position, end_position, peptide_seq, chou_fasman, emini, kolaskar_tongaonkar, parker, hydrophobicity, isoelectric_point, aromacity, stability, dan target. Selanjutnya, Anda akan belajar menggunakan Scikit-Learn, Keras, TensorFlow, NumPy, Pandas, Seaborn, dan sejumlah Pustaka lain untuk memprediksi COVID-19 Epitope menggunakan dataset COVID-19/SARS B-cell Epitope Prediction yang disediakan di Kaggle. Model-model machine learning yang digunakan adalah K-Nearest Neighbor, Random Forest, Naive Bayes, Logistic Regression, Decision Tree, Support Vector Machine, Adaboost, Gradient Boosting, XGB classifier, dan MLP classifier. Kemudian, Anda akan mempelajari cara menerapkan model CNN sekuensial dan VGG16 untuk mendeteksi dan memprediksi Covid-19 X-RAY menggunakan COVID-19 Xray Dataset (Train & Test Sets) yang disediakan di Kaggle. Folder itu sendiri terdiri dari dua subfolder: test dan train. Terakhir, Anda akan mengembangkan GUI menggunakan PyQt5 untuk menampilkan batas-batas keputusan tiap model, ROC, distribusi fitur, keutamaan fitur, skor validasi silang, nilai-nilai prediksi versus nilai-nilai sebenarnya, matriks confusion, rugi pelatihan, dan rugi akurasi. KASUS 2: STROKE Menurut Organisasi Kesehatan Dunia (WHO), stroke adalah penyebab kematian ke-2 secara global, yang bertanggung jawab atas sekitar 11% dari total kematian. Dataset yang digunakan pada penelitian ini berguna untuk memprediksi kemungkinan seorang pasien terkena stroke berdasarkan parameter masukan seperti jenis kelamin, usia, berbagai penyakit, dan status merokok. Setiap baris dalam data memberikan informasi yang relevan tentang pasien. Informasi tiap kolom: id: Pengenal unik; gender: "Male", "Female" atau "Other"; age: Usia pasien; hypertension: 0 jika pasien tidak memiliki hipertensi, 1 jika pasien memiliki hipertensi; heart_disease: 0 jika pasien tidak memiliki penyakit jantung, 1 jika pasien memiliki penyakit jantung; ever_married: "No" atau "Yes"; work_type: "children", "Govt_jov", "Never_worked", "Private" atau "Self-employed"; Residence_type: "Rural" atau "Urban"; avg_glucose_level: Rata-rata kadar glukosa dalam darah; bmi: body mass index; smoking_status: "formerly smoked", "never smoked", "smokes" atau "Unknown"*; stroke: 1 jika pasien mengalami stroke atau 0 jika tidak. Selanjutnya, Anda akan belajar menggunakan Scikit-Learn, Keras, TensorFlow, NumPy, Pandas, Seaborn, dan sejumlah Pustaka lain untuk menganalisa dan memprediksi stroke menggunakan dataset yang disediakan di Kaggle. Model-model yang digunakan adalah K-Nearest Neighbor, Random Forest, Naive Bayes, Logistic Regression, Decision Tree, Support Vector Machine, Adaboost, Gradient Boosting, LGBM classifier, XGB classifier, MLP classifier, dan CNN 1D. Terakhir, Anda akan mengembangkan GUI menggunakan Qt Designer dan PyQt5 untuk ROC, distribusi fitur, keutamaan fitur, menampilkan batas-batas keputusan tiap model, diagram nilai-nilai prediksi versus nilai-nilai sebenarnya, matriks confusion, rugi pelatihan, rugi akurasi, kurva pembelajaran model, skalabilitas model, dan kinerja model.




Human-Centered Technology for a Better Tomorrow


Book Description

This book acts as a compilation of papers presented in the Human Engineering Symposium (HUMENS 2021). The symposium theme, “Human-centered Technology for A Better Tomorrow,” covers the following research topics: ergonomics, biomechanics, sports technology, medical device and instrumentation, artificial intelligence / machine learning, industrial design, rehabilitation, additive manufacturing, modelling and bio-simulation, and signal processing. Fifty-nine articles published in this book are divided into four parts, namely Part 1—Artificial Intelligence and Biosimulation, Part 2—Biomechanics, Safety and Sports, Part 3—Design and Instrumentation, and Part 4—Ergonomics.




Proceedings of Sixth International Congress on Information and Communication Technology


Book Description

This book gathers selected high-quality research papers presented at the Sixth International Congress on Information and Communication Technology, held at Brunel University, London, on February 25–26, 2021. It discusses emerging topics pertaining to information and communication technology (ICT) for managerial applications, e-governance, e-agriculture, e-education and computing technologies, the Internet of Things (IoT) and e-mining. Written by respected experts and researchers working on ICT, the book offers a valuable asset for young researchers involved in advanced studies. The book is presented in four volumes.




Data Science


Book Description

Learn the basics of Data Science through an easy to understand conceptual framework and immediately practice using RapidMiner platform. Whether you are brand new to data science or working on your tenth project, this book will show you how to analyze data, uncover hidden patterns and relationships to aid important decisions and predictions. Data Science has become an essential tool to extract value from data for any organization that collects, stores and processes data as part of its operations. This book is ideal for business users, data analysts, business analysts, engineers, and analytics professionals and for anyone who works with data. You'll be able to: - Gain the necessary knowledge of different data science techniques to extract value from data. - Master the concepts and inner workings of 30 commonly used powerful data science algorithms. - Implement step-by-step data science process using using RapidMiner, an open source GUI based data science platform Data Science techniques covered: Exploratory data analysis, Visualization, Decision trees, Rule induction, k-nearest neighbors, Naïve Bayesian classifiers, Artificial neural networks, Deep learning, Support vector machines, Ensemble models, Random forests, Regression, Recommendation engines, Association analysis, K-Means and Density based clustering, Self organizing maps, Text mining, Time series forecasting, Anomaly detection, Feature selection and more... - Contains fully updated content on data science, including tactics on how to mine business data for information - Presents simple explanations for over twenty powerful data science techniques - Enables the practical use of data science algorithms without the need for programming - Demonstrates processes with practical use cases - Introduces each algorithm or technique and explains the workings of a data science algorithm in plain language - Describes the commonly used setup options for the open source tool RapidMiner




Proceedings of Sixth International Congress on Information and Communication Technology


Book Description

This book gathers selected high-quality research papers presented at the Sixth International Congress on Information and Communication Technology, held at Brunel University, London, on February 25–26, 2021. It discusses emerging topics pertaining to information and communication technology (ICT) for managerial applications, e-governance, e-agriculture, e-education and computing technologies, the Internet of things (IoT) and e-mining. Written by respected experts and researchers working on ICT, the book offers a valuable asset for young researchers involved in advanced studies. The book is presented in four volumes.




Modern Educational Measurement


Book Description




Integrated Learning for ERP Success


Book Description

The results are in. The evidence has been analyzed. Research shows that the lack of enterprise-wide training is the biggest reason for ERP implementation failures. It is the single most important precursor to achieving success. Integrated Learning for ERP Success is the first resource to offer a specifically defined, comprehensive method fo




Modern Educational Measurement


Book Description

This time-honored work provides the most useful tools for accurate assessment of students and how well the goals of curricula are met in this thorough re-orientation of "Modern Educational Measurement." Overhauled to approach the topic from the perspective of the people in the trenches who must master the uses and abuses of testing methods and assessment instruments, this book offers timely, well-documented, and extremely practical information on this important subject. Further, it presents the material in a way that makes it more interesting and engaging than other texts on the market. In addition, the author's personal, engaging, and humorous writing style brings the subject matter to life and helps readers maintain their interest in the material. The book aims to help educational leaders, the administrators and the teachers who must grapple with the problems and the methods of assessment in order to improve educational practices for students everywhere. Follows a logical and developmental framework that takes readers from a general overview of the significance of assessment in education, to a discussion of how to evaluate the usefulness of different measurement strategies, to hands-on advice on how to construct accurate and effective assessment instruments, to a perceptive overview of the dos and don'ts of the field. Designed for anyone interested in Educational Measurement and Evaluation, Assessment, and Testing.




Ethnographies of Archaeological Practice


Book Description

Ethnographic perspectives are often used by archaeologists to study cultures both past and present - but what happens when the ethnographic gaze is turned back onto archaeological practices themselves? That is the question posed by this book, challenging conventional ideas about the relationship between the subject and the object, the observer and the observed, and the explainers and the explained. This book explores the production of archaeological knowledge from a range of ethnographic perspectives. Fieldwork spans large parts of the world, with sites in Turkey, the Netherlands, Mexico, Brazil, Italy, Germany, the USA and the United Kingdom being covered. They focus on excavation, inscription, heritage management, student training, the employment of hired workers and many other aspects of archaeological practice. These experimental ethnographic studies are situated right on the interface of archaeology and anthropology_on the road to a more holistic study of the present and the past.