Explorations in Automatic Thesaurus Discovery


Book Description

Explorations in Automatic Thesaurus Discovery presents an automated method for creating a first-draft thesaurus from raw text. It describes natural processing steps of tokenization, surface syntactic analysis, and syntactic attribute extraction. From these attributes, word and term similarity is calculated and a thesaurus is created showing important common terms and their relation to each other, common verb--noun pairings, common expressions, and word family members. The techniques are tested on twenty different corpora ranging from baseball newsgroups, assassination archives, medical X-ray reports, abstracts on AIDS, to encyclopedia articles on animals, even on the text of the book itself. The corpora range from 40,000 to 6 million characters of text, and results are presented for each in the Appendix. The methods described in the book have undergone extensive evaluation. Their time and space complexity are shown to be modest. The results are shown to converge to a stable state as the corpus grows. The similarities calculated are compared to those produced by psychological testing. A method of evaluation using Artificial Synonyms is tested. Gold Standards evaluation show that techniques significantly outperform non-linguistic-based techniques for the most important words in corpora. Explorations in Automatic Thesaurus Discovery includes applications to the fields of information retrieval using established testbeds, existing thesaural enrichment, semantic analysis. Also included are applications showing how to create, implement, and test a first-draft thesaurus.







Introduction to Controlled Vocabularies


Book Description

This detailed book is a “how-to” guide to building controlled vocabulary tools, cataloging and indexing cultural materials with terms and names from controlled vocabularies, and using vocabularies in search engines and databases to enhance discovery and retrieval online. Also covered are the following: What are controlled vocabularies and why are they useful? Which vocabularies exist for cataloging art and cultural objects? How should they be integrated in a cataloging system? How should they be used for indexing and for retrieval? How should an institution construct a local authority file? The links in a controlled vocabulary ensure that relationships are defined and maintained for both cataloging and retrieval, clarifying whether a rose window and a Catherine wheel are the same thing, or how pot-metal glass is related to the more general term stained glass. The book provides organizations and individuals with a practical tool for creating and implementing vocabularies as reference tools, sources of documentation, and powerful enhancements for online searching.




The Dictionary of Obscure Sorrows


Book Description

NEW YORK TIMES BESTSELLER “It’s undeniably thrilling to find words for our strangest feelings…Koenig casts light into lonely corners of human experience…An enchanting book. “ —The Washington Post A truly original book in every sense of the word, The Dictionary of Obscure Sorrows poetically defines emotions that we all feel but don’t have the words to express—until now. Have you ever wondered about the lives of each person you pass on the street, realizing that everyone is the main character in their own story, each living a life as vivid and complex as your own? That feeling has a name: “sonder.” Or maybe you’ve watched a thunderstorm roll in and felt a primal hunger for disaster, hoping it would shake up your life. That’s called “lachesism.” Or you were looking through old photos and felt a pang of nostalgia for a time you’ve never actually experienced. That’s “anemoia.” If you’ve never heard of these terms before, that’s because they didn’t exist until John Koenig set out to fill the gaps in our language of emotion. The Dictionary of Obscure Sorrows “creates beautiful new words that we need but do not yet have,” says John Green, bestselling author of The Fault in Our Stars. By turns poignant, relatable, and mind-bending, the definitions include whimsical etymologies drawn from languages around the world, interspersed with otherworldly collages and lyrical essays that explore forgotten corners of the human condition—from “astrophe,” the longing to explore beyond the planet Earth, to “zenosyne,” the sense that time keeps getting faster. The Dictionary of Obscure Sorrows is for anyone who enjoys a shift in perspective, pondering the ineffable feelings that make up our lives. With a gorgeous package and beautiful illustrations throughout, this is the perfect gift for creatives, word nerds, and human beings everywhere.




The Merriam-Webster Thesaurus


Book Description

Find the right word fast! This indispensable guide from America's Language Experts is the perfect tool for readers and writers! This all new edition of The Merriam-Webster Thesaurus features more than 150,000 word choices, including related words, antonyms, and near antonyms. Each main entry provides the meaning shared by the synonyms listed and abundant usage examples show words used in context. Words alphabetically organized for ease of use. A great complement to The Merriam-Webster Dictionary and perfect for school, home, or office.




The Devil’s Dictionary


Book Description

“Dictionary, n: A malevolent literary device for cramping the growth of a language and making it hard and inelastic. This dictionary, however, is a most useful work.” Bierce’s groundbreaking Devil’s Dictionary had a complex publication history. Started in the mid-1800s as an irregular column in Californian newspapers under various titles, he gradually refined the new-at-the-time idea of an irreverent set of glossary-like definitions. The final name, as we see it titled in this work, did not appear until an 1881 column published in the periodical The San Francisco Illustrated Wasp. There were no publications of the complete glossary in the 1800s. Not until 1906 did a portion of Bierce’s collection get published by Doubleday, under the name The Cynic’s Word Book—the publisher not wanting to use the word “Devil” in the title, to the great disappointment of the author. The 1906 word book only went from A to L, however, and the remainder was never released under the compromised title. In 1911 the Devil’s Dictionary as we know it was published in complete form as part of Bierce’s collected works (volume 7 of 12), including the remainder of the definitions from M to Z. It has been republished a number of times, including more recent efforts where older definitions from his columns that never made it into the original book were included. Due to the complex nature of copyright, some of those found definitions have unclear public domain status and were not included. This edition of the book includes, however, a set of definitions attributed to his one-and-only “Demon’s Dictionary” column, including Bierce’s classic definition of A: “the first letter in every properly constructed alphabet.” Bierce enjoyed “quoting” his pseudonyms in his work. Most of the poetry, dramatic scenes and stories in this book attributed to others were self-authored and do not exist outside of this work. This includes the prolific Father Gassalasca Jape, whom he thanks in the preface—“jape” of course having the definition: “a practical joke.” This book is a product of its time and must be approached as such. Many of the definitions hold up well today, but some might be considered less palatable by modern readers. Regardless, the book’s humorous style is a valuable snapshot of American culture from past centuries. This book is part of the Standard Ebooks project, which produces free public domain ebooks.




Shakespeare's Words


Book Description

A vital resource for scholars, students and actors, this book contains glosses and quotes for over 14,000 words that could be misunderstood by or are unknown to a modern audience. Displayed panels look at such areas of Shakespeare's language as greetings, swear-words and terms of address. Plot summaries are included for all Shakespeare's plays and on the facing page is a unique diagramatic representation of the relationships within each play.




The Highly Selective Dictionary for the Extraordinarily Literate


Book Description

Between TV talk shows, radio call-in programs, email and the Internet, spontaneous-talk media has skyrocketed in the '90s. People are interacting more frequently and more fervently than ever before, turning the English language into an indecipherable mess. Now, this unique and concise compendium presents the most confused and misused words in the language today -- words misused by careless speakers and writers everywhere. It defines, discerns and distinguishes the finer points of sense and meaning. Was it fortuitous or only fortunate? Are you trying to remember, or more fully recollect? Is he uninterested or disinterested? Is it healthful or healthy, regretful or regrettable, notorious or infamous? The answers to these and many more fascinating etymological questions can be found within the pages of this invaluable (or is it valuable?) reference.




Cambridge Advanced Learner's Dictionary


Book Description

The Cambridge Advanced Learner's Dictionary is the ideal dictionary for advanced EFL/ESL learners. Easy to use and with a great CD-ROM - the perfect learner's dictionary for exam success. First published as the Cambridge International Dictionary of English, this new edition has been completely updated and redesigned. - References to over 170,000 words, phrases and examples explained in clear and natural English - All the important new words that have come into the language (e.g. dirty bomb, lairy, 9/11, clickable) - Over 200 'Common Learner Error' notes, based on the Cambridge Learner Corpus from Cambridge ESOL exams Plus, on the CD-ROM: - SMART thesaurus - lets you find all the words with the same meaning - QUICKfind - automatically looks up words while you are working on-screen - SUPERwrite - tools for advanced writing, giving help with grammar and collocation - Hear and practise all the words.




Webster's New World College Dictionary


Book Description

Webster's Fourth has been adopted by many magazines and newspapers as the definitive guide to the English language as spoken in America. Acclaimed for its 7000+ new words reflecting lifestyle changes, technology, and popular culture, the fourth edition contains 163,000 entries, with synonyms, so that it also functions as a thesaurus. Many entries put words into context as a further guide to understanding, and the dictionary includes 850 illustrations and maps and a world atlas. It's an excellent gift for students, and certainly for anyone who wants an up-to-date and easy-to-use reference for good writing and speaking.