Information Theory and Language


Book Description

“Information Theory and Language” is a collection of 12 articles that recently appeared in Entropy as part of a Special Issue of the same title. These contributions represent state-of-the-art interdisciplinary research at the interface of information theory and language studies. They concern in particular:

• Applications of information-theoretic concepts such as Shannon and Rényi entropies, mutual information, and rate–distortion curves to research on natural languages;
• Mathematical work in information theory inspired by natural language phenomena, such as deriving moments of subword complexity or proving continuity of mutual information;
• Empirical and theoretical investigation of quantitative laws of natural language such as Zipf’s law, Herdan’s law, and the Menzerath–Altmann law;
• Empirical and theoretical investigations of statistical language models, including recently developed neural language models, their entropies, and other parameters;
• Standardizing language resources for the statistical investigation of natural language;
• Other topics concerning semantics, syntax, and critical phenomena.

As the traditional divide between probabilistic and formal approaches to human language, cultivated in the disjoint scholarships of the natural sciences and the humanities, has blurred in recent years, this book can help point out potential areas for future cross-fertilization of research.
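
To give a flavor of the kind of analysis these articles describe, the minimal Python sketch below (a hypothetical illustration, not code from the collection) checks Zipf's rank-frequency relation and estimates the unigram Shannon entropy of a small text sample; real studies would use large corpora and careful tokenization.

```python
from collections import Counter
import math

def zipf_and_entropy(text):
    """Rank-frequency statistics and unigram Shannon entropy for a text sample."""
    words = text.lower().split()          # naive whitespace tokenization (assumption)
    counts = Counter(words)
    total = sum(counts.values())

    # Zipf's law predicts frequency roughly proportional to 1/rank,
    # so rank * frequency should stay roughly constant for top words.
    for rank, (word, freq) in enumerate(counts.most_common(5), start=1):
        print(f"rank {rank}: {word!r} freq {freq} rank*freq = {rank * freq}")

    # Unigram Shannon entropy in bits per word: H = -sum p(w) * log2 p(w).
    entropy = -sum((c / total) * math.log2(c / total) for c in counts.values())
    print(f"unigram entropy: {entropy:.2f} bits per word")

sample = "the cat sat on the mat and the dog sat on the log"
zipf_and_entropy(sample)
```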




Information Theory Meets Power Laws


Book Description

Discover new theoretical connections between stochastic phenomena and the structure of natural language with this powerful volume! Information Theory Meets Power Laws: Stochastic Processes and Language Models presents readers with a novel subtype of a probabilistic approach to language, which is based on statistical laws of texts and their analysis by means of information theory. The distinguished author insightfully and rigorously examines the linguistic and mathematical subject matter while eschewing needlessly abstract and superfluous constructions. The book begins with a less formal treatment of its subjects in the first chapter, introducing its concepts to readers without mathematical training and allowing those unfamiliar with linguistics to learn the book’s motivations. Despite its inherent complexity, Information Theory Meets Power Laws: Stochastic Processes and Language Models is a surprisingly approachable treatment of idealized mathematical models of human language. The author succeeds in developing some of the theory underlying fundamental stochastic and semantic phenomena, like strong nonergodicity, in a way that has not previously been seriously attempted. In doing so, he covers topics including:

• Zipf’s and Herdan’s laws for natural language
• Power laws for information, repetitions, and correlations
• Markov, finite-state, and Santa Fe processes
• Bayesian and frequentist interpretations of probability
• Ergodic decomposition, Kolmogorov complexity, and universal coding
• Theorems about facts and words
• Information measures for fields
• Rényi entropies, recurrence times, and subword complexity
• Asymptotically mean stationary processes

Written primarily for mathematics graduate students and professionals interested in information theory or discrete stochastic processes, Information Theory Meets Power Laws: Stochastic Processes and Language Models also belongs on the bookshelves of doctoral students and researchers in artificial intelligence, computational and quantitative linguistics, as well as the physics of complex systems.
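
To make one of the listed laws concrete: Herdan's law (often called Heaps' law) states that vocabulary size grows roughly as a power of text length, V(n) ≈ K·n^β with β < 1. The Python sketch below, assuming simple whitespace tokenization, traces vocabulary growth and fits the exponent by least squares on a log-log scale; it is an illustration of the idea, not code or data from the book.

```python
import math

def herdan_exponent(words):
    """Estimate the Herdan/Heaps exponent beta from V(n) ~ K * n**beta.

    Rough least-squares slope of log V(n) against log n; a sketch only,
    since real estimates require large corpora and better estimators.
    """
    seen = set()
    log_n, log_v = [], []
    for n, w in enumerate(words, start=1):
        seen.add(w)
        log_n.append(math.log(n))
        log_v.append(math.log(len(seen)))

    # Ordinary least-squares slope of log V on log n gives beta.
    mean_n = sum(log_n) / len(log_n)
    mean_v = sum(log_v) / len(log_v)
    cov = sum((x - mean_n) * (y - mean_v) for x, y in zip(log_n, log_v))
    var = sum((x - mean_n) ** 2 for x in log_n)
    return cov / var

words = ("the cat sat on the mat and the dog sat on the log "
         "while the cat watched the dog").split()
print(f"estimated Herdan exponent: {herdan_exponent(words):.2f}")
```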




A Theory of Language and Information


Book Description

Written by one of the most respected figures in American linguistics, this book develops an approach to the analysis of language based on a mathematical model. Harris presents a formal theory of language structure, in which syntax is characterized as an orderly system of departures from random combinings of sounds, words, and all the elements of language. He argues that the combining of words in a sentence constitutes a mathematical object, and that each departure from randomness is a contribution to the structure and meaning of a sentence. Discussing the differences in the structure and content of language, mathematics, and music, Harris shows that the use of language in a science constitutes a distinguishable sub-language. Remarkable and compelling, Harris's magnum opus will be regarded as the classic analysis of the structuring of information and the development of language.







Grammatical Man


Book Description




The Information


Book Description

From the bestselling author of the acclaimed Chaos and Genius comes a thoughtful and provocative exploration of the big ideas of the modern era: information, communication, and information theory. Acclaimed science writer James Gleick presents an eye-opening vision of how our relationship to information has transformed the very nature of human consciousness. He takes readers on a fascinating intellectual journey through the history of communication and information, from the language of Africa’s talking drums to the invention of written alphabets, from the electronic transmission of code to the origins of information theory, and into the new information age and the current deluge of news, tweets, images, and blogs. Along the way, Gleick profiles key innovators, including Charles Babbage, Ada Lovelace, Samuel Morse, and Claude Shannon, and reveals how our understanding of information is transforming not only how we look at the world, but how we live.

A New York Times Notable Book
A Los Angeles Times and Cleveland Plain Dealer Best Book of the Year
Winner of the PEN/E. O. Wilson Literary Science Writing Award




Information Theory


Book Description

Originally developed by Claude Shannon in the 1940s, information theory laid the foundations for the digital revolution, and is now an essential tool in telecommunications, genetics, linguistics, brain sciences, and deep space communication. In this richly illustrated book, accessible examples are used to introduce information theory in terms of everyday games like ‘20 questions’ before more advanced topics are explored. Online MATLAB and Python computer programs provide hands-on experience of information theory in action, and PowerPoint slides give support for teaching. Written in an informal style, with a comprehensive glossary and tutorial appendices, this text is an ideal primer for novices who wish to learn the essential principles and applications of information theory.
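
The ‘20 questions’ framing reflects a standard interpretation of entropy: the Shannon entropy of a distribution is, up to rounding, the average number of well-chosen yes/no questions needed to identify an outcome. As a small hedged illustration (not taken from the book's own materials), the snippet below shows that twenty binary questions suffice to single out one of about a million equally likely items, since the entropy of a uniform choice among N items is log2(N) bits.

```python
import math

# Entropy of a uniform choice among N equally likely items, in bits:
# H = log2(N). Each bit corresponds to one well-chosen yes/no question.
for n_items in (2, 1000, 2**20):
    print(f"{n_items} items -> {math.log2(n_items):.1f} bits "
          f"(~ that many yes/no questions)")
```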




The Mathematical Theory of Communication


Book Description

Scientific knowledge grows at a phenomenal pace, but few books have had as lasting an impact or played as important a role in our modern world as The Mathematical Theory of Communication, published originally as a paper on communication theory more than fifty years ago. Republished in book form shortly thereafter, it has since gone through four hardcover and sixteen paperback printings. It is a revolutionary work, astounding in its foresight and contemporaneity. The University of Illinois Press is pleased and honored to issue this commemorative reprinting of a classic.




An Introduction to Information Theory


Book Description

Behind the familiar surfaces of the telephone, radio, and television lies a sophisticated and intriguing body of knowledge known as information theory. This is the theory that has permeated the rapid development of all sorts of communication, from color television to the clear transmission of photographs from the vicinity of Jupiter. Even more revolutionary progress is expected in the future. To give a solid introduction to this burgeoning field, J. R. Pierce has revised his well-received 1961 study of information theory for an up-to-date second edition. Beginning with the origins of the field, Dr. Pierce follows the brilliant formulations of Claude Shannon and describes such aspects of the subject as encoding and binary digits, entropy, language and meaning, efficient encoding, and the noisy channel. He then goes beyond the strict confines of the topic to explore the ways in which information theory relates to physics, cybernetics, psychology, and art. Mathematical formulas are introduced at the appropriate points for the benefit of serious students. A glossary of terms and an appendix on mathematical notation are provided to help the less mathematically sophisticated. J. R. Pierce worked for many years at the Bell Telephone Laboratories, where he became Director of Research in Communications Principles. He is currently affiliated with the engineering department of the California Institute of Technology. While his background is impeccable, Dr. Pierce also possesses an engaging writing style that makes his book all the more welcome. An Introduction to Information Theory continues to be the most impressive non-technical account available and a fascinating introduction to the subject for laymen. "An uncommonly good study. . . . Pierce's volume presents the most satisfying discussion to be found." (Scientific American)




Information Theory and Coding


Book Description