Automatic Indexing and Abstracting of Document Texts


Book Description

Automatic Indexing and Abstracting of Document Texts summarizes the latest techniques of automatic indexing and abstracting, and the results of their application. It also places the techniques in the context of the study of text, manual indexing and abstracting, and the use of the indexing descriptions and abstracts in systems that select documents or information from large collections. Important sections of the book consider the development of new techniques for indexing and abstracting. The techniques involve the following: using text grammars, learning of the themes of the texts including the identification of representative sentences or paragraphs by means of adequate cluster algorithms, and learning of classification patterns of texts. In addition, the book is an attempt to illuminate new avenues for future research. Automatic Indexing and Abstracting of Document Texts is an excellent reference for researchers and professionals working in the field of content management and information retrieval.




Automatic Indexing and Abstracting of Document Texts


Book Description

Automatic Indexing and Abstracting of Document Texts summarizes the latest techniques of automatic indexing and abstracting, and the results of their application. It also places the techniques in the context of the study of text, manual indexing and abstracting, and the use of the indexing descriptions and abstracts in systems that select documents or information from large collections. Important sections of the book consider the development of new techniques for indexing and abstracting. The techniques involve the following: using text grammars, learning of the themes of the texts including the identification of representative sentences or paragraphs by means of adequate cluster algorithms, and learning of classification patterns of texts. In addition, the book is an attempt to illuminate new avenues for future research. Automatic Indexing and Abstracting of Document Texts is an excellent reference for researchers and professionals working in the field of content management and information retrieval.




Automatic Text Summarization


Book Description

Textual information in the form of digital documents quickly accumulates to create huge amounts of data. The majority of these documents are unstructured: it is unrestricted text and has not been organized into traditional databases. Processing documents is therefore a perfunctory task, mostly due to a lack of standards. It has thus become extremely difficult to implement automatic text analysis tasks. Automatic Text Summarization (ATS), by condensing the text while maintaining relevant information, can help to process this ever-increasing, difficult-to-handle, mass of information. This book examines the motivations and different algorithms for ATS. The author presents the recent state of the art before describing the main problems of ATS, as well as the difficulties and solutions provided by the community. The book provides recent advances in ATS, as well as current applications and trends. The approaches are statistical, linguistic and symbolic. Several examples are also included in order to clarify the theoretical concepts.




Encyclopedia of Language and Linguistics


Book Description

The first edition of ELL (1993, Ron Asher, Editor) was hailed as "the field's standard reference work for a generation". Now the all-new second edition matches ELL's comprehensiveness and high quality, expanded for a new generation, while being the first encyclopedia to really exploit the multimedia potential of linguistics. * The most authoritative, up-to-date, comprehensive, and international reference source in its field * An entirely new work, with new editors, new authors, new topics and newly commissioned articles with a handful of classic articles * The first Encyclopedia to exploit the multimedia potential of linguistics through the online edition * Ground-breaking and International in scope and approach * Alphabetically arranged with extensive cross-referencing * Available in print and online, priced separately. The online version will include updates as subjects develop ELL2 includes: * c. 7,500,000 words * c. 11,000 pages * c. 3,000 articles * c. 1,500 figures: 130 halftones and 150 colour * Supplementary audio, video and text files online * c. 3,500 glossary definitions * c. 39,000 references * Extensive list of commonly used abbreviations * List of languages of the world (including information on no. of speakers, language family, etc.) * Approximately 700 biographical entries (now includes contemporary linguists) * 200 language maps in print and online Also available online via ScienceDirect – featuring extensive browsing, searching, and internal cross-referencing between articles in the work, plus dynamic linking to journal articles and abstract databases, making navigation flexible and easy. For more information, pricing options and availability visit www.info.sciencedirect.com. The first Encyclopedia to exploit the multimedia potential of linguistics Ground-breaking in scope - wider than any predecessor An invaluable resource for researchers, academics, students and professionals in the fields of: linguistics, anthropology, education, psychology, language acquisition, language pathology, cognitive science, sociology, the law, the media, medicine & computer science. The most authoritative, up-to-date, comprehensive, and international reference source in its field




Natural Language Processing for Online Applications


Book Description

This text covers the technologies of document retrieval, information extraction, and text categorization in a way which highlights commonalities in terms of both general principles and practical concerns. It assumes some mathematical background on the part of the reader, but the chapters typically begin with a non-mathematical account of the key issues. Current research topics are covered only to the extent that they are informing current applications; detailed coverage of longer term research and more theoretical treatments should be sought elsewhere. There are many pointers at the ends of the chapters that the reader can follow to explore the literature. However, the book does maintain a strong emphasis on evaluation in every chapter both in terms of methodology and the results of controlled experimentation.




Automatic Text Summarization


Book Description

Textual information in the form of digital documents quickly accumulates to create huge amounts of data. The majority of these documents are unstructured: it is unrestricted text and has not been organized into traditional databases. Processing documents is therefore a perfunctory task, mostly due to a lack of standards. It has thus become extremely difficult to implement automatic text analysis tasks. Automatic Text Summarization (ATS), by condensing the text while maintaining relevant information, can help to process this ever-increasing, difficult-to-handle, mass of information. This book examines the motivations and different algorithms for ATS. The author presents the recent state of the art before describing the main problems of ATS, as well as the difficulties and solutions provided by the community. The book provides recent advances in ATS, as well as current applications and trends. The approaches are statistical, linguistic and symbolic. Several examples are also included in order to clarify the theoretical concepts.




Indexing


Book Description

Indexing consists of both novel and more traditional techniques. Cutting-edge indexing techniques, such as automatic indexing, ontologies, and topic maps, were developed independently of older techniques such as thesauri, but it is now recognized that these older methods also hold expertise.Indexing describes various traditional and novel indexing techniques, giving information professionals and students of library and information sciences a broad and comprehensible introduction to indexing. This title consists of twelve chapters: an Introduction to subject readings and theasauri; Automatic indexing versus manual indexing; Techniques applied in automatic indexing of text material; Automatic indexing of images; The black art of indexing moving images; Automatic indexing of music; Taxonomies and ontologies; Metadata formats and indexing; Tagging; Topic maps; Indexing the web; and The Semantic Web. - Makes difficult and complex techniques understandable - Contains may links to and illustrations from websites where new indexing techniques can be experienced - Provides references for further reading




Knowledge-Based Information Retrieval and Filtering from the Web


Book Description

Knowledge-Based Information Retrieval and Filtering from the Web contains fifteen chapters, contributed by leading international researchers, addressing the matter of information retrieval, filtering and management of the information on the Internet. The research presented deals with the need to find proper solutions for the description of the information found on the Internet, the description of the information consumers need, the algorithms for retrieving documents (and indirectly, the information embedded in them), and the presentation of the information found. The chapters include: -Ontological representation of knowledge on the WWW; -Information extraction; -Information retrieval and administration of distributed documents; -Hard and soft modeling based knowledge capture; -Summarization of texts found on the WWW; -User profiles and personalization for web-based information retrieval system; -Information retrieval under constricted bandwidth; -Multilingual WWW; -Generic hierarchical classification using the single-link clustering; -Clustering of documents on the basis of text fuzzy similarity; -Intelligent agents for document categorization and adaptive filtering; -Multimedia retrieval and data mining for E-commerce and E-business; -A Web-based approach to competitive intelligence; -Learning ontologies for domain-specific information retrieval; -An open, decentralized architecture for searching for, and publishing information in distributed systems.




Text, Speech and Dialogue


Book Description

This book constitutes the refereed proceedings of the 6th International Conference on Text, Speech and Dialogue, TSD 2003, held in Ceské Budejovice, Czech Republic in September 2003. The 60 revised full papers presented together with 2 invited contributions were carefully reviewed and selected from 121 submissions. The papers present a wealth of state-of-the-art research and development results in the field of natural language processing with an emphasis on text, speech, and spoken language ranging from theoretical and methodological issues to applications in various fields, such as web information retrieval, the semantic web, algorithmic learning, and dialogue systems.




Multidimensional Journal Evaluation


Book Description

Scientific communication depends primarily on publishing in journals. The most important indicator to determine the influence of a journal is the Impact Factor. Since this factor only measures the average number of citations per article in a certain time window, it can be argued that it does not reflect the actual value of a periodical. This book defines five dimensions, which build a framework for a multidimensional method of journal evaluation. The author is winner of the Eugene Garfield Doctoral Dissertation Scholarship 2011.