Schema Matching and Mapping


Book Description

Cooperation and communication among heterogeneous information systems have become crucial, especially in application areas like e-business, Web-based mash-ups and the life sciences. Such cooperating systems have to automatically and efficiently match, exchange, transform and integrate large data sets from different sources and of different structure. The book edited by Bellahsene, Bonifati and Rahm provides an overview of the ways in which schema and ontology matching and mapping tools have addressed these requirements, and points to the open technical challenges. The contributions from leading experts are structured into three parts: large-scale and knowledge-driven schema matching, quality-driven schema mapping and evolution, and evaluation and tuning of matching tasks. The authors describe the state of the art by discussing the latest achievements such as more effective methods for matching data, mapping transformation verification, adaptation to the context and size of the matching and mapping tasks, mapping-driven schema evolution and merging, and mapping evaluation and tuning. The overall result is a coherent, comprehensive picture of the field. With this book, the editors introduce graduate students and advanced professionals to this exciting field. For researchers, they provide an up-to-date source of reference about schema and ontology matching, schema and ontology evolution, and schema merging.




Fundamentals of Business Intelligence


Book Description

This book presents a comprehensive and systematic introduction to transforming process-oriented data into information about the underlying business process, which is essential for all kinds of decision-making. To that end, the authors develop step-by-step models and analytical tools for obtaining high-quality data structured in such a way that complex analytical tools can be applied. The main emphasis is on process mining and data mining techniques and the combination of these methods for process-oriented data. After a general introduction to the business intelligence (BI) process and its constituent tasks in chapter 1, chapter 2 discusses different approaches to modeling in BI applications. Chapter 3 gives an overview of data provisioning and covers it in detail, including a section on big data. Chapter 4 tackles data description, visualization, and reporting. Chapter 5 introduces data mining techniques for cross-sectional data. Different techniques for the analysis of temporal data are then detailed in chapter 6. Subsequently, chapter 7 explains techniques for the analysis of process data, followed by the introduction of analysis techniques for multiple BI perspectives in chapter 8. The book closes with a summary and discussion in chapter 9. Throughout the book, (mostly open source) tools are recommended, described and applied; a more detailed survey of tools can be found in the appendix, and detailed code for the solutions, together with instructions on how to install the software used, can be found on the accompanying website. All concepts presented are illustrated, and selected examples and exercises are provided. The book is suitable for graduate students in computer science, and the dedicated website with examples and solutions makes the book ideal as a textbook for a first course in business intelligence in computer science or business information systems. Additionally, practitioners and industrial developers who are interested in the concepts behind business intelligence will benefit from the clear explanations and many examples.




Machine Learning Paradigms


Book Description

This book is the inaugural volume in the new Springer series on Learning and Analytics in Intelligent Systems. The series aims at providing, in hard-copy and soft-copy form, books on all aspects of learning, analytics, advanced intelligent systems and related technologies. These disciplines are strongly related and mutually complementary; accordingly, the new series encourages an integrated approach to themes and topics in these disciplines, which will result in significant cross-fertilization, research advances and new knowledge creation. To maximize the dissemination of research findings, the series will publish edited books, monographs, handbooks, textbooks and conference proceedings. This book is intended for professors, researchers, scientists, engineers and students. An extensive list of references at the end of each chapter allows readers to probe further into those application areas that interest them most.




Introduction to CSS for Paged Media


Book Description

CSS (Cascading Style Sheets) is a style sheet language that can be used not only for Web design but also for a variety of print uses and paged media such as PDF. Using CSS in paged media design for XML and HTML is not yet common, but its use is expected to increase as the development of CSS3 progresses. This introductory guide explains CSS Paged Media formatting based on Antenna House Formatter V6 and includes examples.




Microsoft SQL Server 2008 Integration Services Unleashed


Book Description

Microsoft SQL Server Integration Services is Microsoft’s platform for building enterprise-level data integration and data transformation solutions. It’s a powerful product, but it’s also complex and can be confusing if you don’t have a clear map for the journey. Microsoft SQL Server 2008 Integration Services Unleashed will be the only book you’ll need to harness the power that Integration Services provides. Through clear, concise explanations and samples, you’ll gain a clear understanding of working in the Integration Services environment, including how to set up stock components, how to use the various designer features, and how to configure, deploy, secure, and manage packages. Sample packages are provided to reinforce the discussion and quickly help you gain hands-on experience, and more complex topics such as Data Flow Task internals and tuning, advanced transformations, and writing custom components are all illustrated in easy-to-understand graphics. In addition, there are several custom tasks and transformations and two useful utilities with full source code available for you to use and study, including an ADO.NET destination, a text file encryption task, and a data profiling transform.
Detailed information on:
Using the powerful Integration Services tools to create solutions without the need to write lines of code
Creating packages programmatically or developing custom tasks via the Integration Services object model
Building robust packages to solve common requirements
Securing packages for different environments
Using often overlooked or unknown platform features
Setting up all the stock components, including data flow components, tasks, Foreach enumerators, connection managers, and log providers
Writing robust and useful custom tasks
Building packages that seamlessly deploy to other environments
Writing custom data flow adapters and transforms
Using script tasks and components
Easily modifying configurations for multiple packages simultaneously
Writing a Task UI that looks just like the stock tasks
Tapping into the power of Integration Services for accessing heterogeneous data sources
Using expressions to make packages more responsive to the environment
Migrating your DTS packages with no stress

Kirk Haselden is the Group Program Manager for the Microsoft Master Data Management product forthcoming in the next wave of Office SharePoint Services and owns the long-term strategy, vision, planning, and development of that product. Kirk has been with Microsoft for 12 years in various groups including Hardware, eHome, Connected Home, SQL Server, and Office Business Platform. He was the development manager for Integration Services and the primary designer for the runtime, as well as many of the tasks. He has written a number of articles for SQL Server Magazine, speaks regularly at industry events, writes profusely on his personal and MSDN blogs, and holds 35 patents or patents pending.

Category: Microsoft SQL Server
Covers: Microsoft SQL Server 2008 Integration Services
User Level: Intermediate–Advanced