Taming text : how to find, organize, and manipulate it / Grant S. Ingersoll, Thomas S. Morton, Andrew L. Farris.

O'Reilly Online Learning: Academic/Public Library Edition Available online

Format:
Book
Author/Creator:
Ingersoll, Grant S., author.
Morton, Thomas S., author.
Farris, Andrew L., author.
Language:
English
Subjects (All):
Text processing (Computer science).
Physical Description:
1 online resource (1 v.) : ill.
Edition:
1st edition
Place of Publication:
Shelter Island, NY : Manning, c2013.
Shelter Island, New York : Manning, [2013]
Language Note:
English
System Details:
text file
Summary:
Taming Text is a hands-on, example-driven guide to working with unstructured text in the context of real-world applications. This book explores how to automatically organize text using approaches such as full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. The book guides you through examples illustrating each of these topics, as well as the foundations upon which they are built.
About this Book
There is so much text in our lives, we are practically drowning in it. Fortunately, there are innovative tools and techniques for managing unstructured information that can throw the smart developer a much-needed lifeline. You’ll find them in this book. Taming Text is a practical, example-driven guide to working with text in real applications. This book introduces you to useful techniques like full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. You’ll explore real use cases as you systematically absorb the foundations upon which they are built. Written in a clear and concise style, this book avoids jargon, explaining the subject in terms you can understand without a background in statistics or natural language processing. Examples are in Java, but the concepts can be applied in any language.
What's Inside
When to use text-taming techniques
Important open-source libraries like Solr and Mahout
How to build text-processing applications
About the Authors
Grant Ingersoll is an engineer, speaker, and trainer, a Lucene committer, and a cofounder of the Mahout machine-learning project. Thomas Morton is the primary developer of OpenNLP and Maximum Entropy. Drew Farris is a technology consultant, software developer, and contributor to Mahout, Lucene, and Solr.
Contents:
Getting started taming text
Foundations of taming text
Searching
Fuzzy string matching
Identifying people, places, and things
Clustering text
Classification, categorization, and tagging
Building an example question answering system
Untamed text : exploring the next frontier.
Notes:
Bibliographic Level Mode of Issuance: Monograph
Includes bibliographical references and index.
Description based on print version record.
ISBN:
9781638353867
1638353867
9781933988382
193398838X
OCLC:
828691427
