Sanskrit Parsing


Book Description

About the Book India has a rich grammatical tradition, still extant in the form of PÀõini’s grammar as well as the theories of verbal cognition. These two together provide a formal theory of language communication. The formal nature of the theory makes it directly relevant to the new technology called Natural Language Processing. This book, first presents the key concepts from the Indian Grammatical Tradition (IGT) that are necessary for understanding the information flow in a language string and its dynamics. A fresh look at these concepts from the perspective of Natural Language Processing is provided. This is then followed by a concrete application of building a parser for Sanskrit using the framework of Indian Grammatical Tradition. This book not only documents the salient pieces of work carried out over the last quarter century under Computational Paninian Grammar, but provides the first comprehensive exposition of the ideas involved. It fills a gap for students of Computational Linguistics/Natural Language Processing who are working on Indian languages using PÀõinian Grammatical Framework for developing their computational models and do not have direct access to the texts in Sanskrit. Similarly for the Sanskrit scholars and the students it provides an example of concrete application of the Indian theories to solve a contemporary problem. About the Author Amba Kulkarni is a computational linguist. Since 1991 she has been engaged in showing the relevance of Indian Grammatical Tradition to the field of computational linguistics. She has contributed towards the building of Anusaarakas (language accessors) among English and Indian languages. She is the founder head of the Department of Sanskrit Studies, University of Hyderabad established in 2006. Since then her focus of research is on use of Indian grammatical theories for computational processing of Sanskrit texts. Under her leadership, a consortium of institutes developed several computational tools for Sanskrit and also a prototype of Sanskrit–Hindi Machine Translation system. In 2015, she was awarded a “Vishishta Sanskrit Sevavrati Sammana” by the Rashtriya Sanskrit Sansthan, New Delhi for her contribution to the studies and research on Sanskrit-based knowledge system. She was a fellow at the Indian Institute of Advanced Study, Shimla during 2015-17.




Sanskrit Computational Linguistics


Book Description

Sanskrit is the primary culture-bearing language of India, with a continuous production of literature in all ?elds of human endeavor over the course of four millennia. Precededbyastrongoraltraditionofknowledgetransmission,records of written Sanskrit remain in the form of inscriptions dating back to the ?rst centuryB. C. E. Extantmanuscriptsin Sanskritnumber over30million,one h- dred times those in Greek and Latin combined, constituting the largest cultural heritage that any civilization has produced prior to the invention of the printing press. Sanskrit works include extensive epics; subtle and intricate philosophical, mathematical, medical, legal, and scienti?c treatises; and imaginative and rich literary,poetic,anddramatictexts. WhiletheSanskritlanguageisofpreeminent importance to the intellectual and cultural heritage of India, the importance of the intellectual and cultural heritage of India to the rest of the world during the pastfewmillennia andinthe presenteracanhardlybe overestimated. The int- lectualandculturalheritageofIndia hasbeen amajorfactor inthedevelopment of the world's religions, languages, literature, arts, sciences, and history. Sanskritdocumentsaremovingintothedigitalmedium. Recentdecadeshave witnessed the growth of machine-readable Sanskrit texts in archives such as 1 the Thesaurus Indogermanischer Text- und Sprachmaterialien (TITUS), Ky- 2 3 oto University, Indology, the Gottingen ¨ Register of Electronic Texts in In- 4 dian Languages. The last few years have witnessed a burgeoning of digital images of Sanskrit manuscripts and books hosted on-line.







Digital Humanities and Buddhism


Book Description

IDH Religion provides a series of short introductions to specific areas of study at the intersections of digital humanities and religion, offering an overview of current methodologies, techniques, tools, and projects as well as defining challenges and opportunities for further research. This volume explores DH and Buddhism in four sections: Theory and Method; Digital Conservation, Preservation and Archiving; Digital Analysis; Digital Resources. It covers themes such as language processing, digital libraries, online lexicography, and ethnographic methods. Erratum: Unfortunately there is a mistake in the print version in the last paragraph of page 14. READ is an open-source software system developed by a team consisting of Stefan Baums at the Bavarian Academy of Sciences and Humanities, Andrew Glass in Seattle, Ian McCrabb at the University of Sydney and Stephen White in Venice (https://github.com/readsoftware/read).




Advanced Communication and Intelligent Systems


Book Description

This book constitutes the refereed proceedings of the Second International Conference on Advanced Communication and Intelligent Systems, ICACIS 2023, held in Warsaw, Poland, during June 16–17, 2023 The 22 full papers included in this book were carefully reviewed and selected from 221 submissions. They were organized in topical sections as follows: Wireless Communication, Artificial Intelligence and Machine Learning, Robotics & Automation, Data Science, IoT and Smart Applications




Computer Processing of Sanskrit Nominal Inflections


Book Description

Computer Processing of Sanskrit Nominal Inflections: Methods and Implementation is the result of Research and Development (R&D) at the Master of Philosophy (MPhil) level at Jawaharlal Nehru University, New Delhi, India. The title of the dissertation was “Machine Recognition and Morphological Analysis of Subanta-Padas.” The work, which is based on the reverse engineering implementation of Panini’s Sanskrit Grammar, brings together new and original studies in the area of computational linguistics, language technology and natural language processing with reference to parsing Sanskrit nominal inflections. On the surface level, Panini has defined rules in a forward looking generative fashion which makes reverse analysis necessary for parsing. Since parsing inflections is the first basic step towards complete analysis, the present work has relevance for any larger system that may evolve in future.




Sanskrit Computational Linguistics


Book Description

This volume constitutes the thoroughly refereed post-conference proceedings of the First and Second International Symposia on Sanskrit Computational Linguistics, held in Rocquencourt, France, in October 2007 and in Providence, RI, USA, in May 2008 respectively. The 11 revised full papers of the first and the 12 revised papers of the second symposium presented with an introduction and a keynote talk were carefully reviewed and selected from the lectures given at both events. The papers address several topics such as the structure of the Paninian grammatical system, computational linguistics, lexicography, lexical databases, formal description of sanskrit grammar, phonology and morphology, machine translation, philology, and OCR.




A Companion to Digital Humanities


Book Description

This Companion offers a thorough, concise overview of the emerging field of humanities computing. Contains 37 original articles written by leaders in the field. Addresses the central concerns shared by those interested in the subject. Major sections focus on the experience of particular disciplines in applying computational methods to research problems; the basic principles of humanities computing; specific applications and methods; and production, dissemination and archiving. Accompanied by a website featuring supplementary materials, standard readings in the field and essays to be included in future editions of the Companion.




Digital Humanities and Research Methods in Religious Studies


Book Description

This volume provides practical, but provocative, case studies of exemplary projects that apply digital technology or methods to the study of religion. An introduction and 16 essays are organized by the kinds of sources digital humanities scholars use – texts, images, and places – with a final section on the professional and pedagogical issues digital scholarship raises for the study of religion.




Computational Humanities


Book Description

The first book to intervene in debates on computation in the digital humanities Bringing together leading experts from across North America and Europe, Computational Humanities redirects debates around computation and humanities digital scholarship from dualistic arguments to nuanced discourse centered around theories of knowledge and power. This volume is organized around four questions: Why or why not pursue computational humanities? How do we engage in computational humanities? What can we study using these methods? Who are the stakeholders? Recent advances in technologies for image and sound processing have expanded computational approaches to cultural forms beyond text, and new forms of data, from listservs and code repositories to tweets and other social media content, have enlivened debates about what counts as digital humanities scholarship. Providing case studies of collaborations between humanities-centered and computation-centered researchers, this volume highlights both opportunities and frictions, showing that data and computation are as much about power, prestige, and precarity as they are about p-values. Contributors: Mark Algee-Hewitt, Stanford U; David Bamman, U of California, Berkeley; Kaspar Beelen, U of London; Peter Bell, Philipps U of Marburg; Tobias Blanke, U of Amsterdam; Julia Damerow, Arizona State U; Quinn Dombrowski, Stanford U; Crystal Nicole Eddins, U of Pittsburgh; Abraham Gibson, U of Texas at San Antonio; Tassie Gniady; Crystal Hall, Bowdoin College; Vanessa M. Holden, U of Kentucky; David Kloster, Indiana U; Manfred D. Laubichler, Arizona State U; Katherine McDonough, Lancaster U; Barbara McGillivray, King’s College London; Megan Meredith-Lobay, Simon Fraser U; Federico Nanni, Alan Turing Institute; Fabian Offert, U of California, Santa Barbara; Hannah Ringler, Illinois Institute of Technology; Roopika Risam, Dartmouth College; Joshua D. Rothman, U of Alabama; Benjamin M. Schmidt; Lisa Tagliaferri, Rutgers U; Jeffrey Tharsen, U of Chicago; Marieke van Erp, Royal Netherlands Academy of Arts and Sciences; Lee Zickel, Case Western Reserve U.