Enabling Real-time Analytics on IBM z Systems Platform


Book Description

Regarding online transaction processing (OLTP) workloads, IBM® z SystemsTM platform, with IBM DB2®, data sharing, Workload Manager (WLM), geoplex, and other high-end features, is the widely acknowledged leader. Most customers now integrate business analytics with OLTP by running, for example, scoring functions from transactional context for real-time analytics or by applying machine-learning algorithms on enterprise data that is kept on the mainframe. As a result, IBM adds investment so clients can keep the complete lifecycle for data analysis, modeling, and scoring on z Systems control in a cost-efficient way, keeping the qualities of services in availability, security, reliability that z Systems solutions offer. Because of the changed architecture and tighter integration, IBM has shown, in a customer proof-of-concept, that a particular client was able to achieve an orders-of-magnitude improvement in performance, allowing that client's data scientist to investigate the data in a more interactive process. Open technologies, such as Predictive Model Markup Language (PMML) can help customers update single components instead of being forced to replace everything at once. As a result, you have the possibility to combine your preferred tool for model generation (such as SAS Enterprise Miner or IBM SPSS® Modeler) with a different technology for model scoring (such as Zementis, a company focused on PMML scoring). IBM SPSS Modeler is a leading data mining workbench that can apply various algorithms in data preparation, cleansing, statistics, visualization, machine learning, and predictive analytics. It has over 20 years of experience and continued development, and is integrated with z Systems. With IBM DB2 Analytics Accelerator 5.1 and SPSS Modeler 17.1, the possibility exists to do the complete predictive model creation including data transformation within DB2 Analytics Accelerator. So, instead of moving the data to a distributed environment, algorithms can be pushed to the data, using cost-efficient DB2 Accelerator for the required resource-intensive operations. This IBM Redbooks® publication explains the overall z Systems architecture, how the components can be installed and customized, how the new IBM DB2 Analytics Accelerator loader can help efficient data loading for z Systems data and external data, how in-database transformation, in-database modeling, and in-transactional real-time scoring can be used, and what other related technologies are available. This book is intended for technical specialists and architects, and data scientists who want to use the technology on the z Systems platform. Most of the technologies described in this book require IBM DB2 for z/OS®. For acceleration of the data investigation, data transformation, and data modeling process, DB2 Analytics Accelerator is required. Most value can be achieved if most of the data already resides on z Systems platforms, although adding external data (like from social sources) poses no problem at all.




DB2 11 for z/OS Technical Overview


Book Description

IBM® DB2® Version 11.1 for z/OS® (DB2 11 for z/OS or just DB2 11 throughout this book) is the fifteenth release of DB2 for IBM MVSTM. It brings performance and synergy with the IBM System z® hardware and opportunities to drive business value in the following areas. DB2 11 can provide unmatched reliability, availability, and scalability - Improved data sharing performance and efficiency - Less downtime by removing growth limitations - Simplified management, improved autonomics, and reduced planned outages DB2 11 can save money and save time - Aggressive CPU reduction goals - Additional utilities performance and CPU improvements - Save time and resources with new autonomic and application development capabilities DB2 11 provides simpler, faster migration - SQL compatibility, divorce system migration from application migration - Access path stability improvements - Better application performance with SQL and XML enhancements DB2 11 includes enhanced business analytics - Faster, more efficient performance for query workloads - Accelerator enhancements - More efficient inline database scoring enables predictive analytics The DB2 11 environment is available either for new installations of DB2 or for migrations from DB2 10 for z/OS subsystems only. This IBM Redbooks® publication introduces the enhancements made available with DB2 11 for z/OS. The contents help database administrators to understand the new functions and performance enhancements, to plan for ways to use the key new capabilities, and to justify the investment in installing or migrating to DB2 11.




LHC Design Report


Book Description




Thirty Years with Stata


Book Description

This volume is a sometimes serious and sometimes whimsical retrospective of Stata, its development, and its use over the last 30 years. The view from the inside opens with an essay by Bill Gould, Stata's president and cofounder, that discusses the challenges and concepts that guided the design and implementation of Stata. This is followed by an interview of Bill by Joe Newton that discusses Bill's early interest in computing, his early work on a program for matching prom dates in the days when you had to purchase time on computers, and further exploration of the guiding principles behind Stata. Finally, Sean Becketti, Stata's first employee, delves into the not-to-be-missed culture of Stata in its infancy. The view from the outside comprises 14 essays by prominent researchers and members of the Stata community. Most discuss Stata's use and evolution in disciplines such as behavioral sciences, business, economics, epidemiology, time series, political science, public health, public policy, veterinary epidemiology, and statistics. Some take a sweeping overview. Others are more intimate personal recollections. Mostly, we simply wanted to celebrate the relationship between Stata users and Stata software. We hope that this volume holds something interesting for everyone.




Recent Developments in Data Science and Business Analytics


Book Description

This edited volume is brought out from the contributions of the research papers presented in the International Conference on Data Science and Business Analytics (ICDSBA- 2017), which was held during September 23-25 2017 in ChangSha, China. As we all know, the field of data science and business analytics is emerging at the intersection of the fields of mathematics, statistics, operations research, information systems, computer science and engineering. Data science and business analytics is an interdisciplinary field about processes and systems to extract knowledge or insights from data. Data science and business analytics employ techniques and theories drawn from many fields including signal processing, probability models, machine learning, statistical learning, data mining, database, data engineering, pattern recognition, visualization, descriptive analytics, predictive analytics, prescriptive analytics, uncertainty modeling, big data, data warehousing, data compression, computer programming, business intelligence, computational intelligence, and high performance computing among others. The volume contains 55 contributions from diverse areas of Data Science and Business Analytics, which has been categorized into five sections, namely: i) Marketing and Supply Chain Analytics; ii) Logistics and Operations Analytics; iii) Financial Analytics. iv) Predictive Modeling and Data Analytics; v) Communications and Information Systems Analytics. The readers shall not only receive the theoretical knowledge about this upcoming area but also cutting edge applications of this domains.




Bridge Management Systems


Book Description




Proceedings of the Third International Conference on Computing, Mathematics and Statistics (iCMS2017)


Book Description

This book is a product of the Third International Conference on Computing, Mathematics and Statistics (iCMS2017) to be held in Langkawi in November 2017. It is divided into four sections according to the thrust areas: Computer Science, Mathematics, Statistics, and Multidisciplinary Applications. All sections sought to confront current issues that society faces today. The book brings collectively quantitative, as well as qualitative, research methods that are also suitable for future research undertakings. Researchers in Computer Science, Mathematics and Statistics can use this book as a sourcebook to enrich their research works.




Contouring Human Development


Book Description

This book acquaints readers with a range of techniques to help them effectively identify, record, map, analyze and report on patterns in various dimensions of human development (HD) with spatial scales down to the village level. It is impossible to capture HD at the local and global scale with only a single index, because differences in HD at the international scale are caused by ‘general’ factors, whereas local-scale differences are influenced by ‘specific’ factors. This book offers a variety of methods for scientifically mapping HD at any spatial scale. It covers how to rationally select variables; how to test the models; how to validate the results, and how to analyze them. For this purpose, it employs a case study on an Indian district. The socio-economic factors regulating the patterns of HD are now more complex than they were only a few decades ago, making it essential to incorporate newer models in order to successfully ‘replicate’ the real-world situation. Accordingly, the book offers essential methodological tools & techniques for mapping HD. It sheds new light on a handful of statistical multivariate analysis and machine learning algorithms that are rarely used in the social sciences when dealing with HD, yet have sound mathematical and statistical bases. These techniques can be successfully used for predictive analysis in the earth & natural sciences, decision sciences and management disciplines, and are equally effective in terms of capturing, predicting and projecting the composite HD ‘landscape.’ This book will especially benefit two groups of readers: firstly, HD practitioners who want to find out ‘why some areas are doing better than others’ by exploring the complex interactions of spatially linked variables with different HD parameters. And secondly, practitioners in other branches of the social sciences who are not concerned with HD but are looking for ‘hands-on training’ with techniques they can apply in their respective field of spatial investigations.




DB2 10 for z/OS Performance Topics


Book Description

DB2® 10 for z/OS can reduce the total DB2 CPU demand from 5-20%, compared to DB2 9, when you take advantage of all the enhancements. Many CPU reductions are built in directly to DB2, requiring no application changes. Some enhancements are implemented through normal DB2 activities through rebinding, restructuring database definitions, improving applications, and utility processing. The CPU demand reduction features have the potential to provide significant total cost of ownership savings based on the application mix and transaction types. Improvements in optimization reduce costs by processing SQL automatically with more efficient data access paths. Improvements through a range-list index scan access method, list prefetch for IN-list, more parallelism for select and index insert processing, better work file usage, better record identifier (RID) pool overflow management, improved sequential detection, faster log I/O, access path certainty evaluation for static SQL, and improved distributed data facility (DDF) transaction flow all provide more efficiency without changes to applications. These enhancements can reduce total CPU enterprise costs because of improved efficiency in the DB2 10 for z/OS. DB2 10 includes numerous performance enhancements for Large Objects (LOBs) that save disk space for small LOBs and that provide dramatically better performance for LOB retrieval, inserts, load, and import/export using DB2 utilities. DB210 can also more effectively REORG partitions that contain LOBs. This IBM Redbooks® publication® provides an overview of the performance impact of DB2 10 for z/OS discussing the overall performance and possible impacts when moving from version to version. We include performance measurements that were made in the laboratory and provide some estimates. Keep in mind that your results are likely to vary, as the conditions and work will differ. In this book, we assume that you are somewhat familiar with DB2 10 for z/OS. See DB2 10 for z/OS Technical Overview, SG24-7892-00, for an introduction to the new functions.




Web Mining


Book Description

Web Mining is moving the World Wide Web toward a more useful environment in which users can quickly and easily find the information they need. Web Mining uses document content, hyperlink structure, and usage statistics to assist users in meeting their needed information. This book provides a record of current research and practical applications in Web searching. It includes techniques that will improve the utilization of the Web by the design of Web sites, as well as the design and application of search agents. This book presents research and related applications in a manner that encourages additional work toward improving the reduction of information overflow, which is so common today in Web search results.