Development of Linguistic Linked Open Data Resources for Collaborative Data-Intensive Research in the Language Sciences


Book Description

Making diverse data in linguistics and the language sciences open, distributed, and accessible: perspectives from language/language acquistiion researchers and technical LOD (linked open data) researchers. This volume examines the challenges inherent in making diverse data in linguistics and the language sciences open, distributed, integrated, and accessible, thus fostering wide data sharing and collaboration. It is unique in integrating the perspectives of language researchers and technical LOD (linked open data) researchers. Reporting on both active research needs in the field of language acquisition and technical advances in the development of data interoperability, the book demonstrates the advantages of an international infrastructure for scholarship in the field of language sciences. With contributions by researchers who produce complex data content and scholars involved in both the technology and the conceptual foundations of LLOD (linguistics linked open data), the book focuses on the area of language acquisition because it involves complex and diverse data sets, cross-linguistic analyses, and urgent collaborative research. The contributors discuss a variety of research methods, resources, and infrastructures. Contributors Isabelle Barrière, Nan Bernstein Ratner, Steven Bird, Maria Blume, Ted Caldwell, Christian Chiarcos, Cristina Dye, Suzanne Flynn, Claire Foley, Nancy Ide, Carissa Kang, D. Terence Langendoen, Barbara Lust, Brian MacWhinney, Jonathan Masci, Steven Moran, Antonio Pareja-Lora, Jim Reidy, Oya Y. Rieger, Gary F. Simons, Thorsten Trippel, Kara Warburton, Sue Ellen Wright, Claus Zinn




Development of Linguistic Linked Open Data Resources for Collaborative Data-intensive Research in the Language Sciences


Book Description

"This volume examines the challenges inherent in making diverse data in linguistics and the language sciences open, distributed, integrated, and accessible, thus fostering wide data sharing and collaboration. It is unique in integrating the perspectives of language researchers and technical LOD (linked open data) researchers. Reporting on both active research needs in the field of language acquisition and technical advances in the development of data interoperability, the book demonstrates the advantages of an international infrastructure for scholarship in the field of language sciences. With contributions by researchers who produce complex data content and scholars involved in both the technology and the conceptual foundations of LLOD (linguistics linked open data), the book focuses on the area of language acquisition because it involves complex and diverse data sets, cross-linguistic analyses, and urgent collaborative research. The contributors discuss a variety of research methods, resources, and infrastructures"--Publisher's description.




Research Methods in Language Acquisition


Book Description

Language acquisition research is challenging—the intricate behavioral and cognitive foundations of speech are difficult to measure objectively. The audible components of speech, however, are quantifiable and thus provide crucial data. This practical guide synthesizes the authors’ decades of experience into a comprehensive set of tools that will allow students and early career researchers in the field to design and conduct rigorous studies that produce reliable and valid speech data and interpretations. The authors thoroughly review specific techniques for obtaining qualitative and quantitative speech data, including how to tailor the testing environments for optimal results. They explore observational tasks for collecting natural speech and experimental tasks for eliciting specific types of speech. Language comprehension tasks are also reviewed so researchers can study participants’ interpretations of speech and conceptualizations of grammar. Most tasks are oriented towards children, but special considerations for infants are also reviewed, as well as multilingual children. Chapters also provide strategies for transcribing and coding raw speech data into reliable data sets that can be scientifically analyzed. Furthermore, they investigate the intricacies of interpretation so that researchers can make empirically sound inferences from their data and avoid common pitfalls that can lead to unscientific conclusions.




Linguistic Linked Data


Book Description

This is the first monograph on the emerging area of linguistic linked data. Presenting a combination of background information on linguistic linked data and concrete implementation advice, it introduces and discusses the main benefits of applying linked data (LD) principles to the representation and publication of linguistic resources, arguing that LD does not look at a single resource in isolation but seeks to create a large network of resources that can be used together and uniformly, and so making more of the single resource. The book describes how the LD principles can be applied to modelling language resources. The first part provides the foundation for understanding the remainder of the book, introducing the data models, ontology and query languages used as the basis of the Semantic Web and LD and offering a more detailed overview of the Linguistic Linked Data Cloud. The second part of the book focuses on modelling language resources using LD principles, describing how to model lexical resources using Ontolex-lemon, the lexicon model for ontologies, and how to annotate and address elements of text represented in RDF. It also demonstrates how to model annotations, and how to capture the metadata of language resources. Further, it includes a chapter on representing linguistic categories. In the third part of the book, the authors describe how language resources can be transformed into LD and how links can be inferred and added to the data to increase connectivity and linking between different datasets. They also discuss using LD resources for natural language processing. The last part describes concrete applications of the technologies: representing and linking multilingual wordnets, applications in digital humanities and the discovery of language resources. Given its scope, the book is relevant for researchers and graduate students interested in topics at the crossroads of natural language processing / computational linguistics and the Semantic Web / linked data. It appeals to Semantic Web experts who are not proficient in applying the Semantic Web and LD principles to linguistic data, as well as to computational linguists who are used to working with lexical and linguistic resources wanting to learn about a new paradigm for modelling, publishing and exploiting linguistic resources.




Discourse Analysis in Adults With and Without Communication Disorders


Book Description

Discourse Analysis in Adults With and Without Communication Disorders: A Resource for Clinicians and Researchers provides state-of-the-art information about discourse analysis with sections on Aging, Aphasia, Cognitive Communication Disorders, and Neurodegenerative Diseases. The three renowned editors are actively engaged in the area of discourse. Expert clinical researchers introduce and organize each section, and chapters are authored by leaders involved in discourse research worldwide. Discourse is considered the most natural unit of language. Effective production of discourse requires complex interactions among linguistic, cognitive, and social abilities that are sensitive to even mild disruption in any one of these elements. This book covers the examination of discourse in adults with acquired communication disorders, including selecting elicitation tasks, streamlining transcription processes, expanding analysis methods, and translating findings for treatment application. Key Features * Provides a global perspective on discourse assessment for clinicians * Dedicated chapters on aging, aphasia, traumatic brain injury, right hemisphere disorder, primary progressive aphasia, Alzheimer’s dementia, Parkinson’s disease, and amyotrophic lateral sclerosis Disclaimer: Please note that ancillary content (such as documents, audio, and video, etc.) may not be included as published in the original print version of this book.




CLARIN


Book Description

CLARIN, the "Common Language Resources and Technology Infrastructure", has established itself as a major player in the field of research infrastructures for the humanities. This volume provides a comprehensive overview of the organization, its members, its goals and its functioning, as well as of the tools and resources hosted by the infrastructure. The many contributors representing various fields, from computer science to law to psychology, analyse a wide range of topics, such as the technology behind the CLARIN infrastructure, the use of CLARIN resources in diverse research projects, the achievements of selected national CLARIN consortia, and the challenges that CLARIN has faced and will face in the future. The book will be published in 2022, 10 years after the establishment of CLARIN as a European Research Infrastructure Consortium by the European Commission (Decision 2012/136/EU). Watch our talk with the editors Darja Fišer and Andreas Witt here: https://youtu.be/ZOoiGbmMbxI




Remote Online Language Assessment: Eliciting Discourse from Children and Adults


Book Description

Being able to collect valid data is crucial for empirical science disciplines such as linguistics, developmental psycholinguistics, clinical psycholinguistics and speech and hearing sciences. In recent years there has been an increasing use of digital devices for remote language assessments, such as online elicitation of language samples, apps for eliciting expressive and productive lexical abilities, and online questionnaires. With the ongoing COVID-19 pandemic still affecting many lives globally, there have been numerous disruptions of face-to-face, in-person language assessments, leading many researchers to conduct their language assessments online. Despite the necessity of remote language assessments and the convenience they may bring to both assessors and assessees, the potential merits, limits, and problems of remote testing have not yet been systematically explored and understood. This timely Research Topic seeks contributions that mobilize new evidence and/or insightful and nuanced discussions to address questions such as: can we control online testing so that it is as good as face-to-face, in-person testing, and, if so, how? Do we have evaluative evidence of such practices, and if so, how robust is the evidence? What adaptations and concerns can and cannot be accommodated at the present time? What opportunities are offered by recent technological advances? Are there certain conditions in which online testing works better or worse? Last but not least, how do differences between offline, in-person language assessments and online, remote assessments affect the results of testing? The current topic has two main foci: the first deals with the assessment of conversational discourse in general and narrative discourse in particular, in both children and young adults. Communicative competence at the discourse level has been considered an essential and ecologically valid component in language assessments of children and adults, for three key reasons: 1) this competence is crucial for an individual’s everyday functioning and academic and social life, 2) it provides information about an individual’s socio-cognitive and linguistic abilities, and 3) it is a versatile test of language skills at the levels of content, form, use and their integration. The second focus is on comparing the results elicited via in-person assessments and remote, online assessments. This Research Topic welcomes empirical articles discussing new evidence, perspective and opinion papers on issues at the conceptual-methodological interface, and methods articles presenting approaches that can offer opportunities for remote testing of developmental discourse supported by recent technological advances. Potential themes may include, but are not limited to: • comparisons of remote versus in-person testing modes using a within-participants research design • learner variables such as age, gender, language status (monolingual, multilingual), and clinical status (typically-developing children and adults, children and adults with clinical conditions such as (developmental) language disorder, autism spectrum disorder) which may affect the efficacy of remote testing • linguistic variables such as the use of referential and relational devices and mental state language which may be subject to more variations when being assessed remotely • new methods that offer opportunities for the remote testing of developmental and adult discourse, supported by recent technological advances • articles addressing the same research question within developmental narrative discourse but using different (i.e. either online or offline) research methods.




Semantic Web


Book Description

What Is Semantic Web The World Wide Web Consortium is responsible for establishing the standards that will be used in the development of the Semantic Web, which is also referred to as Web 3.0 in certain circles (W3C). Making the data on the Internet understandable by machines is the objective of the Semantic Web. How You Will Benefit (I) Insights, and validations about the following topics: Chapter 1: Semantic Web Chapter 2: DARPA Agent Markup Language Chapter 3: Resource Description Framework Chapter 4: MPEG-7 Chapter 5: Web Ontology Language Chapter 6: RDF Schema Chapter 7: Semantic spectrum Chapter 8: SPARQL Chapter 9: FOAF (ontology) Chapter 10: Semantic wiki Chapter 11: RDFa Chapter 12: Semantic technology Chapter 13: RDF query language Chapter 14: Semantic publishing Chapter 15: Semantic HTML Chapter 16: Semantic Web Stack Chapter 17: Ontology engineering Chapter 18: XHTML+RDFa Chapter 19: Knowledge extraction Chapter 20: Open Semantic Framework Chapter 21: Linguistic Linked Open Data (II) Answering the public top questions about semantic web. (III) Real world examples for the usage of semantic web in many fields. (IV) 17 appendices to explain, briefly, 266 emerging technologies in each industry to have 360-degree full understanding of semantic web' technologies. Who This Book Is For Professionals, undergraduate and graduate students, enthusiasts, hobbyists, and those who want to go beyond basic knowledge or information for any kind of semantic web.




Translation Tools and Technologies


Book Description

To trainee translators and established professionals alike, the range of tools and technologies now available, and the speed with which they change, can seem bewildering. This state-of-the-art, copiously illustrated textbook offers a straightforward and practical guide to translation tools and technologies. Demystifying the workings of computer-assisted translation (CAT) and machine translation (MT) technologies, Translation Tools and Technologies offers clear step-by-step guidance on how to choose suitable tools (free or commercial) for the task in hand and quickly get up to speed with them, using examples from a wide range of languages. Translator trainers will also find it invaluable when constructing or updating their courses. This unique book covers many topics in addition to text translation. These include the history of the technologies, project management, terminology research and corpora, audiovisual translation, website, software and games localisation, and quality assurance. Professional workflows are at the heart of the narrative, and due consideration is also given to the legal and ethical questions arising from the reuse of translation data. With targeted suggestions for further reading at the end of each chapter to guide users in deepening their knowledge, this is the essential textbook for all courses in translation and technology within translation studies and translator training. Additional resources are available on the Routledge Translation Studies Portal.




Quantitative Historical Linguistics


Book Description

This book is an innovative guide to quantitative, corpus-based research in historical and diachronic linguistics. Gard B. Jenset and Barbara McGillivray argue that, although historical linguistics has been successful in using the comparative method, the field lags behind other branches of linguistics with respect to adopting quantitative methods. Here they provide a theoretically agnostic description of a new framework for quantitatively assessing models and hypotheses in historical linguistics, based on corpus data and using case studies to illustrate how this framework can answer research questions in historical linguistics. The authors offer an in-depth explanation and discussion of the benefits of working with quantitative methods, corpus data, and corpus annotation, and the advantages of open and reproducible research. The book will be a valuable resource for graduate students and researchers in historical linguistics, as well as for all those working with linguistic corpora.