Working with grep, sed, and awk Pocket Primer


Book Description

The goal of this book is to introduce readers to three powerful command line utilities that can be combined to create simple yet powerful shell scripts for performing a multitude of tasks. The code samples and scripts use the bash shell, and typically involve very small text files so that you can focus on understanding the features of grep, sed, and awk. In the simplest terms, grep (global regular expression print) will search input files in data for certain words or word patterns and print the lines that match it. sed is useful for changing or modifying data. Awk is a programming language also used for searching a data file for certain patterns, but can also perform certain tasks on the pattern matches it finds. Aimed at a reader relatively new to working in a bash environment, the book is comprehensive enough to be a good reference and teach a few new tricks to those who already have some experience with these command line utilities.




Linux Shell Programming Pocket Primer


Book Description

The goal of this book is to introduce readers to an assortment of powerful command line utilities that can be combined to create simple, yet powerful shell scripts. While all examples and scripts use the “bash” command set, many of the concepts translate into other forms of shell scripting (ksh, sh, csh), including the concept of piping data between commands, regular expression substitution and the sed and awk commands. Aimed at a reader relatively new to working in a bash environment, the book is comprehensive enough to be a good reference and teach a few new tricks to those who already have some experience with creating shell scripts. FEATURES: Covers extensive topics, code samples, and scripting utilities Includes material on piping data between commands, regular expression substitution, cleaning datasets, and the sed and awk commands Features companion files with code samples from the book (available for downloading from the publisher)




Data Cleaning Pocket Primer


Book Description

As part of the best selling Pocket Primer series, this book is an effort to give programmers sufficient knowledge of data cleaning to be able to work on their own projects. It is designed as a practical introduction to using flexible, powerful (and free) Unix / Linux shell commands to perform common data cleaning tasks. The book is packed with realistic examples and numerous commands that illustrate both the syntax and how the commands work together. Companion files with source code are available for downloading from the publisher. Features: - A practical introduction to using flexible, powerful (and free) Unix / Linux shell commands to perform common data cleaning tasks - Includes the concept of piping data between commands, regular expression substitution, and the sed and awk commands - Packed with realistic examples and numerous commands that illustrate both the syntax and how the commands work together - Assumes the reader has no prior experience, but the topic is covered comprehensively enough to teach a pro some new tricks - Includes companion files with all of the source code examples (download from the publisher).




Bash Command Line and Shell Scripts Pocket Primer


Book Description

As part of the best-selling Pocket Primer series, this book is designed to introduce readers to an assortment of useful command-line utilities that can be combined to create simple, yet powerful shell scripts. While all examples and scripts use the “bash” command set, many of the concepts translate into other command shells (such as sh, ksh, zsh, and csh), including the concept of piping data between commands and the highly versatile sed and awk commands. Aimed at a reader relatively new to working in a bash environment, the book is comprehensive enough to be a good reference and teach a few new techniques to those who already have some experience with creating shell scripts. It contains a variety of code fragments and shell scripts for data scientists, data analysts, and other people who want shell-based solutions to “clean” various types of text files. In addition, the concepts and code samples in this book are useful for people who want to simplify routine tasks. Includes companion files with all of the source code examples (download from the publisher by writing to [email protected]). Features: Takes introductory concepts and commands in bash, and then demonstrates their uses in simple, yet powerful shell scripts Contains an assortment of shell scripts for data scientists, data analysts, and other people who want shell-based solutions to “clean” various types of text files Includes companion files with all of the source code examples (available for download from the publisher)




Sed & Awk


Book Description

In Sed & Awk, Dale Dougherty and Arnold Robbins describe two text manipulation programs that are mainstays of the UNIX programmer's toolbox. This new edition covers the Sed and Awk systems as they are now mandated by the POSIX standard.




Python Tools for Data Scientists Pocket Primer


Book Description

As part of the best-selling Pocket Primer series, this book is designed to provide a thorough introduction to numerous Python tools for data scientists. The book covers features of NumPy and Pandas, how to write regular expressions, and how to perform data cleaning tasks. It includes separate chapters on data visualization and working with Sklearn and SciPy. Companion files with source code are available. FEATURES: Introduces Python, NumPy, Sklearn, SciPy, and awk Covers data cleaning tasks and data visualization Features numerous code samples throughout Includes companion files with source code




Python for TensorFlow Pocket Primer


Book Description

As part of the best-selling Pocket Primer series, this book is designed to prepare programmers for machine learning and deep learning/TensorFlow topics. It begins with a quick introduction to Python, followed by chapters that discuss NumPy, Pandas, Matplotlib, and scikit-learn. The final two chapters contain an assortment of TensorFlow 1.x code samples, including detailed code samples for TensorFlow Dataset (which is used heavily in TensorFlow 2 as well). A TensorFlow Dataset refers to the classes in the tf.data.Dataset namespace that enables programmers to construct a pipeline of data by means of method chaining so-called lazy operators, e.g., map(), filter(), batch(), and so forth, based on data from one or more data sources. Companion files with source code are available for downloading from the publisher by writing [email protected]. Features: A practical introduction to Python, NumPy, Pandas, Matplotlib, and introductory aspects of TensorFlow 1.x Contains relevant NumPy/Pandas code samples that are typical in machine learning topics, and also useful TensorFlow 1.x code samples for deep learning/TensorFlow topics Includes many examples of TensorFlow Dataset APIs with lazy operators, e.g., map(), filter(), batch(), take() and also method chaining such operators Assumes the reader has very limited experience Companion files with all of the source code examples (download from the publisher)




Regular Expressions


Book Description

As part of the bestselling Pocket Primer series, the goal of this book is to introduce readers to regular expressions in several technologies. It is intended for data scientists, data analysts, and others who want to understand regular expressions to perform various tasks. You will acquire an understanding of how to create an assortment of regular expressions, such as filtering data for strings containing uppercase or lowercase letters; matching integers, decimals, hexadecimal, and scientific numbers; and context-dependent pattern matching expressions. It includes REs with Python, R, bash, Perl, Java, and more. Companion files with source code are available for downloading from the publisher. Features: • Uses REs with Python, R, bash, Java, and more • Packed with realistic examples and numerous commands • Assumes the reader has no prior experience, but the topic is covered comprehensively enough to teach a pro some new tricks • Includes companion files with all of the source code examples (download from the publisher) ON THE COMPANION FILES (available from the publisher for downloading) • Source code samples




Effective awk Programming


Book Description

Effective awk Programming,3rd Edition, focuses entirely on awk, exploring it in the greatest depth of the three awk titles we carry. It's an excellent companion piece to the more broadly focused second edition. This book provides complete coverage of the gawk 3.1 language as well as the most up-to-date coverage of the POSIX standard for awk available anywhere. Author Arnold Robbins clearly distinguishes standard awk features from GNU awk (gawk)-specific features, shines light into many of the "dark corners" of the language (areas to watch out for when programming), and devotes two full chapters to example programs. A brand new chapter is devoted to TCP/IP networking with gawk. He includes a summary of how the awk language evolved. The book also covers: Internationalization of gawk Interfacing to i18n at the awk level Two-way pipes TCP/IP networking via the two-way pipe interface The new PROCINFO array, which provides information about running gawk Profiling and pretty-printing awk programs In addition to covering the awk language, this book serves as the official "User's Guide" for the GNU implementation of awk (gawk), describing in an integrated fashion the extensions available to the System V Release 4 version of awk that are also available in gawk. As the official gawk User's Guide, this book will also be available electronically, and can be freely copied and distributed under the terms of the Free Software Foundation's Free Documentation License (FDL). A portion of the proceeds from sales of this book will go to the Free Software Foundation to support further development of free and open source software. The third edition of Effective awk Programming is a GNU Manual and is published by O'Reilly & Associates under the Free Software Foundation's Free Documentation License (FDL). A portion of the proceeds from the sale of this book is donated to the Free Software Foundation to further development of GNU software. This book is also available in electronic form; you have the freedom to modify this GNU Manual, like GNU software. Copies published by the Free Software Foundation raise funds for GNU development.




Bash Guide for Beginners (Second Edition)


Book Description

The Bash Guide for Beginners (Second Edition) discusses concepts useful in the daily life of the serious Bash user. While a basic knowledge of shell usage is required, it starts with a discussion of shell building blocks and common practices. Then it presents the grep, awk and sed tools that will later be used to create more interesting examples. The second half of the course is about shell constructs such as loops, conditional tests, functions and traps, and a number of ways to make interactive scripts. All chapters come with examples and exercises that will help you become familiar with the theory.