Ferret / David Balmain.
- Format:
- Book
- Author/Creator:
- Balmain, David.
- Language:
- English
- Subjects (All):
- Database management.
- Query languages (Computer science).
- Relational databases.
- Ruby (Computer program language).
- Physical Description:
- 1 online resource (114 p.)
- Edition:
- First edition.
- Other Title:
- At head of cover title: Indexed searching for Ruby applications
- Place of Publication:
- Beijing : O'Reilly, 2008.
- Language Note:
- English
- System Details:
- text file
- Summary:
- With the introduction of Ferret, Ruby users now have one of the fastest and most flexible search libraries available. And it's surprisingly easy to use. This book will show you how to quickly get up and running with Ferret. You'll learn how to index different document types such as PDF, Microsoft Word, and HTML, as well as how to deal with foreign languages and different character encodings. Ferret describes the Ferret Query Language in detail along with the object-oriented approach to building queries. You will also be introduced to sorting, filtering, and highligh
- Contents:
- Table of Contents; Preface; Conventions Used in This Book; Using Code Examples; Safari® Enabled; How to Contact Us; Chapter 1. Getting Started; Installing Ferret; A Quick Example: Indexing the Filesystem; Summary; Chapter 2. Indexing; Index Storage; Documents, Fields, and Boosts; Documents; Fields; Boosts; Setting Up the Index; FieldInfo; :store; :index; :term_vector; FieldInfos; Basic Indexing Operations; Add; Get; Delete; Update; Indexing Non-String Datatypes; Number Fields; Date Fields; Sort Fields; Summary; Chapter 3. Advanced Indexing; How the Indexing Process Works
- Tuning Indexing Performance; In-Memory Indexing; Indexing Parameters; :max_buffer_memory and :chunk_size; :merge_factor; :max_buffered_docs; :max_merged_docs; :max_field_length; :use_compound_file; :index_skip_interval; :doc_skip_interval; Indexing parameter testing; Parallel Indexing; Optimizing the Index; Index Locking and Concurrency Issues; Multithreaded Environment; Multiprocess Environment; Summary; Chapter 4. Search; Overview of Searching Classes; IndexSearcher; Query; QueryParser; Filter; Sort; Building Queries; TermQuery; BooleanQuery; PhraseQuery; RangeQuery; MultiTermQuery
- PrefixQuery; WildcardQuery; FuzzyQuery; MatchAllQuery; ConstantScoreQuery; FilteredQuery; Span Queries; SpanTermQuery; SpanFirstQuery; SpanOrQuery; SpanNotQuery; SpanNearQuery; Boosting Queries; QueryParser; Setting Up the QueryParser; Ferret Query Language; TermQuery; BooleanQuery; PhraseQuery; RangeQuery; WildcardQuery; FuzzyQuery; Boosting a query in FQL; Filtering Search Results; Using the RangeFilter; Using the QueryFilter; Writing Your Own Filter; :filter_proc, the New Filter; Sorting Search Results; SortField; Sort; Sorting by Date; Highlighting Query Results; Summary
- Chapter 5. Analysis; Token; TokenStream; Tokenizer; WhiteSpaceTokenizer; LetterTokenizer; StandardTokenizer; RegExpTokenizer; TokenFilter; LowerCaseFilter; StopFilter; StemFilter; HyphenFilter; Analyzer; StandardAnalyzer; PerFieldAnalyzer; Custom Analysis; Chapter 6. Ferret in Practice; Indexing Multiple Document Types; TextReader; HtmlReader; OOoReader (OpenOffice.org Reader); JpegReader; Mp3Reader; PdfReader; Other Indexing Improvements; Search Improvements; Putting It All Together; Summary; Index
- Notes:
- Includes index.
- Description based on print version record.
- ISBN:
- 9781306815222
- 1306815223
- 9780596522544
- 0596522541
- 9780596554712
- 0596554710
- OCLC:
- 609841022