E-Data


Book Description

Dyche presents the complete manager's briefing on what data warehousing technology can do today and how to achieve optimal results. Using real-world case studies from Charles Schwab, Bank of America, Qantas, 20th Century Fox, and others, she covers decision support, database marketing, and many industry-specific data warehouse applications.




Data Smart


Book Description

Data Science gets thrown around in the press like it'smagic. Major retailers are predicting everything from when theircustomers are pregnant to when they want a new pair of ChuckTaylors. It's a brave new world where seemingly meaningless datacan be transformed into valuable insight to drive smart businessdecisions. But how does one exactly do data science? Do you have to hireone of these priests of the dark arts, the "data scientist," toextract this gold from your data? Nope. Data science is little more than using straight-forward steps toprocess raw data into actionable insight. And in DataSmart, author and data scientist John Foreman will show you howthat's done within the familiar environment of aspreadsheet. Why a spreadsheet? It's comfortable! You get to look at the dataevery step of the way, building confidence as you learn the tricksof the trade. Plus, spreadsheets are a vendor-neutral place tolearn data science without the hype. But don't let the Excel sheets fool you. This is a book forthose serious about learning the analytic techniques, the math andthe magic, behind big data. Each chapter will cover a different technique in aspreadsheet so you can follow along: Mathematical optimization, including non-linear programming andgenetic algorithms Clustering via k-means, spherical k-means, and graphmodularity Data mining in graphs, such as outlier detection Supervised AI through logistic regression, ensemble models, andbag-of-words models Forecasting, seasonal adjustments, and prediction intervalsthrough monte carlo simulation Moving from spreadsheets into the R programming language You get your hands dirty as you work alongside John through eachtechnique. But never fear, the topics are readily applicable andthe author laces humor throughout. You'll even learnwhat a dead squirrel has to do with optimization modeling, whichyou no doubt are dying to know.




Information-Driven Business


Book Description

Information doesn't just provide a window on the business, increasingly it is the business. The global economy is moving from products to services which are described almost entirely electronically. Even those businesses that are traditionally associated with making things are less concerned with managing the manufacturing process (which is largely outsourced) than they are with maintaining their intellectual property. Information-Driven Business helps you to understand this change and find the value in your data. Hillard explains techniques that organizations can use and how businesses can apply them immediately. For example, simple changes to the way data is described will let staff support their customers much more quickly; and two simple measures let executives know whether they will be able to use the content of a database before it is even built. This book provides the foundation on which analytical and data rich organizations can be created. Innovative and revealing, this book provides a robust description of Information Management theory and how you can pragmatically apply it to real business problems, with almost instant benefits. Information-Driven Business comprehensively tackles the challenge of managing information, starting with why information has become important and how it is encoded, through to how to measure its use.




From Data and Information Analysis to Knowledge Engineering


Book Description

This volume collects revised versions of papers presented at the 29th Annual Conference of the Gesellschaft für Klassifikation, the German Classification Society, held at the Otto-von-Guericke-University of Magdeburg, Germany, in March 2005. In addition to traditional subjects like Classification, Clustering, and Data Analysis, converage extends to a wide range of topics relating to Computer Science: Text Mining, Web Mining, Fuzzy Data Analysis, IT Security, Adaptivity and Personalization, and Visualization.




Information-Theoretic Methods in Data Science


Book Description

The first unified treatment of the interface between information theory and emerging topics in data science, written in a clear, tutorial style. Covering topics such as data acquisition, representation, analysis, and communication, it is ideal for graduate students and researchers in information theory, signal processing, and machine learning.




Information Quality


Book Description

Provides an important framework for data analysts in assessing the quality of data and its potential to provide meaningful insights through analysis Analytics and statistical analysis have become pervasive topics, mainly due to the growing availability of data and analytic tools. Technology, however, fails to deliver insights with added value if the quality of the information it generates is not assured. Information Quality (InfoQ) is a tool developed by the authors to assess the potential of a dataset to achieve a goal of interest, using data analysis. Whether the information quality of a dataset is sufficient is of practical importance at many stages of the data analytics journey, from the pre-data collection stage to the post-data collection and post-analysis stages. It is also critical to various stakeholders: data collection agencies, analysts, data scientists, and management. This book: Explains how to integrate the notions of goal, data, analysis and utility that are the main building blocks of data analysis within any domain. Presents a framework for integrating domain knowledge with data analysis. Provides a combination of both methodological and practical aspects of data analysis. Discusses issues surrounding the implementation and integration of InfoQ in both academic programmes and business / industrial projects. Showcases numerous case studies in a variety of application areas such as education, healthcare, official statistics, risk management and marketing surveys. Presents a review of software tools from the InfoQ perspective along with example datasets on an accompanying website. This book will be beneficial for researchers in academia and in industry, analysts, consultants, and agencies that collect and analyse data as well as undergraduate and postgraduate courses involving data analysis.




Health Informatics Vision: From Data via Information to Knowledge


Book Description

The latest developments in data, informatics and technology continue to enable health professionals and informaticians to improve healthcare for the benefit of patients everywhere. This book presents full papers from ICIMTH 2019, the 17th International Conference on Informatics, Management and Technology in Healthcare, held in Athens, Greece from 5 to 7 July 2019. Of the 150 submissions received, 95 were selected for presentation at the conference following review and are included here. The conference focused on increasing and improving knowledge of healthcare applications spanning the entire spectrum from clinical and health informatics to public health informatics as applied in the healthcare domain. The field of biomedical and health informatics is examined in a very broad framework, presenting the research and application outcomes of informatics from cell to population and exploring a number of technologies such as imaging, sensors, and biomedical equipment, together with management and organizational aspects including legal and social issues. Setting research priorities in health informatics is also addressed. Providing an overview of the latest developments in health informatics, the book will be of interest to all those working in the field.




Information Technology and Data in Healthcare


Book Description

Healthcare transformation requires us to continually look at new and better ways to manage insights – both within and outside the organization. Increasingly, the ability to glean and operationalize new insights efficiently as a byproduct of an organization’s day-to-day operations is becoming vital for hospitals and health systems to survive and prosper. One of the long-standing challenges in healthcare informatics has been the ability to deal with the sheer variety and volume of disparate healthcare data and the increasing need to derive veracity and value out of it. This book addresses several topics important to the understanding and use of data in healthcare. First, it provides a formal explanation based on epistemology (theory of knowledge) of what data actually is, what we can know about it, and how we can reason with it. The culture of data is also covered and where it fits into healthcare. Then, data quality is addressed, with a historical appreciation, as well as new concepts and insights derived from the author’s 35 years of experience in technology. The author provides a description of what healthcare data analysis is and how it is changing in the era of abundant data. Just as important is the topic of infrastructure and how it provides capability for data use. The book also describes how healthcare information infrastructure needs to change in order to meet current and future needs. The topics of artificial intelligence (AI) and machine learning in healthcare are also addressed. The author concludes with thoughts on the evolution of the role and use of data and information going into the future.




Data and Information Quality


Book Description

This book provides a systematic and comparative description of the vast number of research issues related to the quality of data and information. It does so by delivering a sound, integrated and comprehensive overview of the state of the art and future development of data and information quality in databases and information systems. To this end, it presents an extensive description of the techniques that constitute the core of data and information quality research, including record linkage (also called object identification), data integration, error localization and correction, and examines the related techniques in a comprehensive and original methodological framework. Quality dimension definitions and adopted models are also analyzed in detail, and differences between the proposed solutions are highlighted and discussed. Furthermore, while systematically describing data and information quality as an autonomous research area, paradigms and influences deriving from other areas, such as probability theory, statistical data analysis, data mining, knowledge representation, and machine learning are also included. Last not least, the book also highlights very practical solutions, such as methodologies, benchmarks for the most effective techniques, case studies, and examples. The book has been written primarily for researchers in the fields of databases and information management or in natural sciences who are interested in investigating properties of data and information that have an impact on the quality of experiments, processes and on real life. The material presented is also sufficiently self-contained for masters or PhD-level courses, and it covers all the fundamentals and topics without the need for other textbooks. Data and information system administrators and practitioners, who deal with systems exposed to data-quality issues and as a result need a systematization of the field and practical methods in the area, will also benefit from the combination of concrete practical approaches with sound theoretical formalisms.




Legal Data and Information in Practice


Book Description

Legal Data and Information in Practice provides readers with an understanding of how to facilitate the acquisition, management, and use of legal data in organizations such as libraries, courts, governments, universities, and start-ups. Presenting a synthesis of information about legal data that will furnish readers with a thorough understanding of the topic, the book also explains why it is becoming crucial that data analysis be integrated into decision-making in the legal space. Legal organizations are looking at how to develop data-driven insights for a variety of purposes and it is, as Sutherland shows, vital that they have the necessary skills to facilitate this work. This book will assist in this endeavour by providing an international perspective on the issues affecting access to legal data and clearly describing methods of obtaining and evaluating it. Sutherland also incorporates advice about how to critically approach data analysis. Legal Data and Information in Practice will be essential reading for those in the law library community who are based in English-speaking countries with a common law tradition. The book will also be useful to those with a general interest in legal data, including students, academics engaged in the study of information science and law.