Computer Processing of Sanskrit Nominal Inflections


Book Description

Computer Processing of Sanskrit Nominal Inflections: Methods and Implementation is the result of Research and Development (R&D) at the Master of Philosophy (MPhil) level at Jawaharlal Nehru University, New Delhi, India. The title of the dissertation was “Machine Recognition and Morphological Analysis of Subanta-Padas.” The work, which is based on the reverse engineering implementation of Panini’s Sanskrit Grammar, brings together new and original studies in the area of computational linguistics, language technology and natural language processing with reference to parsing Sanskrit nominal inflections. On the surface level, Panini has defined rules in a forward looking generative fashion which makes reverse analysis necessary for parsing. Since parsing inflections is the first basic step towards complete analysis, the present work has relevance for any larger system that may evolve in future.




Human Language Technology. Challenges for Computer Science and Linguistics


Book Description

This book constitutes the refereed proceedings of the 4th Language and Technology Conference: Challenges for Computer Science and Linguistics, LTC 2009, held in Poznan, Poland, in November 2009. The 52 revised and in many cases substantially extended papers presented in this volume were carefully reviewed and selected from 103 submissions. The contributions are organized in topical sections on speech processing, computational morphology/lexicography, parsing, computational semantics, dialogue modeling and processing, digital language resources, WordNet, document processing, information processing, and machine translation.




Sanskrit Parsing


Book Description

About the Book India has a rich grammatical tradition, still extant in the form of PÀõini’s grammar as well as the theories of verbal cognition. These two together provide a formal theory of language communication. The formal nature of the theory makes it directly relevant to the new technology called Natural Language Processing. This book, first presents the key concepts from the Indian Grammatical Tradition (IGT) that are necessary for understanding the information flow in a language string and its dynamics. A fresh look at these concepts from the perspective of Natural Language Processing is provided. This is then followed by a concrete application of building a parser for Sanskrit using the framework of Indian Grammatical Tradition. This book not only documents the salient pieces of work carried out over the last quarter century under Computational Paninian Grammar, but provides the first comprehensive exposition of the ideas involved. It fills a gap for students of Computational Linguistics/Natural Language Processing who are working on Indian languages using PÀõinian Grammatical Framework for developing their computational models and do not have direct access to the texts in Sanskrit. Similarly for the Sanskrit scholars and the students it provides an example of concrete application of the Indian theories to solve a contemporary problem. About the Author Amba Kulkarni is a computational linguist. Since 1991 she has been engaged in showing the relevance of Indian Grammatical Tradition to the field of computational linguistics. She has contributed towards the building of Anusaarakas (language accessors) among English and Indian languages. She is the founder head of the Department of Sanskrit Studies, University of Hyderabad established in 2006. Since then her focus of research is on use of Indian grammatical theories for computational processing of Sanskrit texts. Under her leadership, a consortium of institutes developed several computational tools for Sanskrit and also a prototype of Sanskrit–Hindi Machine Translation system. In 2015, she was awarded a “Vishishta Sanskrit Sevavrati Sammana” by the Rashtriya Sanskrit Sansthan, New Delhi for her contribution to the studies and research on Sanskrit-based knowledge system. She was a fellow at the Indian Institute of Advanced Study, Shimla during 2015-17.







Science and Scientification in South Asia and Europe


Book Description

This volume critically examines the role of science in the humanities and social sciences. It studies how cultures and societies in South Asia and Europe underwent a transformation with the adoption or adaptation of scientific methods, turning ancient cultural processes and phenomena into an enhanced scientific structure. The chapters in this book Discuss the development of science as a method in modern and historical contexts and the differences between modern science, scientification and pseudoscience. Study the interactions between bodies of knowledge such as Sanskrit and computer science; mathematics and Vedic mathematics; science and philosophy. Drawing on textual material, extensive fieldwork and in-depth interviews, this book will be of great interest to scholars and researchers of philosophy, Indology, history, linguistics, history and philosophy of science and social science.




Sanskrit Computational Linguistics


Book Description

This volume constitutes the refereed proceedings of the 4th International Symposium on Sanskrit Computational Linguistics, held in New Delhi, India, in December 2010. The 18 revised full papers presented were carefully reviewed and selected from numerous submissions. The papers can be categorized under following broad areas such as phonology and speech technology; morphology and shallow parsing; syntax, semantics and parsing; lexical resources, annotation and search; machine translation and ambiguity resolution.







Sanskrit Computational Linguistics


Book Description

This volume constitutes the thoroughly refereed post-conference proceedings of the First and Second International Symposia on Sanskrit Computational Linguistics, held in Rocquencourt, France, in October 2007 and in Providence, RI, USA, in May 2008 respectively. The 11 revised full papers of the first and the 12 revised papers of the second symposium presented with an introduction and a keynote talk were carefully reviewed and selected from the lectures given at both events. The papers address several topics such as the structure of the Paninian grammatical system, computational linguistics, lexicography, lexical databases, formal description of sanskrit grammar, phonology and morphology, machine translation, philology, and OCR.




Understanding Morphology


Book Description

This new edition of Understanding Morphology has been fully revised in line with the latest research. It now includes 'big picture' questions to highlight central themes in morphology, as well as research exercises for each chapter. Understanding Morphology presents an introduction to the study of word structure that starts at the very beginning. Assuming no knowledge of the field of morphology on the part of the reader, the book presents a broad range of morphological phenomena from a wide variety of languages. Starting with the core areas of inflection and derivation, the book presents the interfaces between morphology and syntax and between morphology and phonology. The synchronic study of word structure is covered, as are the phenomena of diachronic change, such as analogy and grammaticalization. Theories are presented clearly in accessible language with the main purpose of shedding light on the data, rather than as a goal in themselves. The authors consistently draw on the best research available, thus utilizing and discussing both functionalist and generative theoretical approaches. Each chapter includes a summary, suggestions for further reading, and exercises. As such this is the ideal book for both beginning students of linguistics, or anyone in a related discipline looking for a first introduction to morphology.




Multilingual Natural Language Processing Applications


Book Description

Multilingual Natural Language Processing Applications is the first comprehensive single-source guide to building robust and accurate multilingual NLP systems. Edited by two leading experts, it integrates cutting-edge advances with practical solutions drawn from extensive field experience. Part I introduces the core concepts and theoretical foundations of modern multilingual natural language processing, presenting today’s best practices for understanding word and document structure, analyzing syntax, modeling language, recognizing entailment, and detecting redundancy. Part II thoroughly addresses the practical considerations associated with building real-world applications, including information extraction, machine translation, information retrieval/search, summarization, question answering, distillation, processing pipelines, and more. This book contains important new contributions from leading researchers at IBM, Google, Microsoft, Thomson Reuters, BBN, CMU, University of Edinburgh, University of Washington, University of North Texas, and others. Coverage includes Core NLP problems, and today’s best algorithms for attacking them Processing the diverse morphologies present in the world’s languages Uncovering syntactical structure, parsing semantics, using semantic role labeling, and scoring grammaticality Recognizing inferences, subjectivity, and opinion polarity Managing key algorithmic and design tradeoffs in real-world applications Extracting information via mention detection, coreference resolution, and events Building large-scale systems for machine translation, information retrieval, and summarization Answering complex questions through distillation and other advanced techniques Creating dialog systems that leverage advances in speech recognition, synthesis, and dialog management Constructing common infrastructure for multiple multilingual text processing applications This book will be invaluable for all engineers, software developers, researchers, and graduate students who want to process large quantities of text in multiple languages, in any environment: government, corporate, or academic.