My Account Log in

4 options

Mastering Apache Solr 7.x : an expert guide to advancing, optimizing, and scaling your enterprise search. / Sandeep Nair, Chintan Mehta, Dharmesh Vasoya.

EBSCOhost Academic eBook Collection (North America) Available online

View online

EBSCOhost Ebook Business Collection Available online

View online

Ebook Central Academic Complete Available online

View online

O'Reilly Online Learning: Academic/Public Library Edition Available online

View online
Format:
Book
Author/Creator:
Nair, Sandeep, author.
Mehta, Chintan, author.
Vasoya, Dharmesh, author.
Language:
English
Subjects (All):
Lucene (Electronic resource).
Electronic information resource searching.
Physical Description:
1 online resource (289 pages) : illustrations (some color)
Edition:
1st edition
Place of Publication:
Birmingham, England ; Mumbai, [India] : Packt Publishing, 2018.
System Details:
text file
Biography/History:
Nair Sandeep: Sandeep has been working in Liferay technology for more than 8 years and has more than 10 years' of overall experience in Java and Java EE technologies. He has executed projects using Liferay across various verticals such as construction, financial, and medical domains, providing solutions for collaboration, enterprise content management, and Web content Management systems. He has created a free and open source Google Chartlet plugin for Liferay which has been downloaded and used by people across 90 countries according to sourceforge statistics. Besides development, consulting, and implementing solutions he has also been involved in giving training on Liferay in other countries. Before he jumped into Liferay he had experience in Java and Java EE Technologies. He has authored "Liferay Beginner's Guide" and "Instant Liferay Portal 6 Starter" with Packt Publishing. When he is not coding, he loves to read books and travel. Mehta Chintan: Chintan Mehta is a co-founder of KNOWARTH Technologies and heads the cloud/RIMS/DevOps team. He has rich, progressive experience in server administration of Linux, AWS Cloud, DevOps, RIMS, and on open source technologies. He is also an AWS Certified Solutions Architect. Chintan has authored MySQL 8 for Big Data, Mastering Apache Solr 7. x, MySQL 8 Administrator's Guide, and Hadoop Backup and Recovery Solutions. Also, he has reviewed Liferay Portal Performance Best Practices and Building Serverless Web Applications. Vasoya Dharmesh: Dharmesh Vasoya is a Liferay 6. 2 certified developer. He has 5. 5 years of experience in application development with technologies such as Java, Liferay, Spring, Hibernate, Portlet, and JSF. He has successfully delivered projects in various domains, such as healthcare, collaboration, communication, and enterprise CMS, using Liferay. Dharmesh has good command of the configuration setup of servers such as Solr, Tomcat, JBOSS, and Apache Web Server. He has good experience of clustering, load balancing and performance tuning. He completed his MCA at Ahmedabad University.
Summary:
Accelerate your enterprise search engine and bring relevancy in your search analytics About This Book A practical guide in building expertise with Indexing, Faceting, Clustering and Pagination Master the management and administration of Enterprise Search Applications and services seamlessly Handle multiple data inputs such as JSON, xml, pdf, doc, xls,ppt, csv and much more. Who This Book Is For The book would rightly appeal to developers, software engineers, data engineers and database architects who are building or seeking to build enterprise-wide effective search engines for business intelligence. Prior experience of Apache Solr or Java programming is must to take the best of this book. What You Will Learn Design schema using schema API to access data in the database Advance querying and fine-tuning techniques for better performance Get to grips with indexing using Client API Set up a fault tolerant and highly available server with newer distributed capabilities, SolrCloud Explore Apache Tika to upload data with Solr Cell Understand different data operations that can be done while indexing Master advanced querying through Velocity Search UI, faceting and Query Re-ranking, pagination and spatial search Learn to use JavaScript, Python, SolrJ and Ruby for interacting with Solr In Detail Apache Solr is the only standalone enterprise search server with a REST-like application interface. providing highly scalable, distributed search and index replication for many of the world's largest internet sites. To begin with, you would be introduced to how you perform full text search, multiple filter search, perform dynamic clustering and so on helping you to brush up the basics of Apache Solr. You will also explore the new features and advanced options released in Apache Solr 7.x which will get you numerous performance aspects and making data investigation simpler, easier and powerful. You will learn to build complex queries, extensive filters and how are they compiled in your system to bring relevance in your search tools. You will learn to carry out Solr scoring, elements affecting the document score and how you can optimize or tune the score for the application at hand. You will learn to extract features of documents, writing complex queries in re-ranking the documents. You will also learn advanced options helping you to know what content is indexed and how the extracted content is indexed. Throughout the book, you would go through complex problems wi...
Contents:
Cover
Title Page
Copyright and Credits
Packt Upsell
Contributors
Table of Contents
Preface
Chapter 1: Introduction to Solr 7
Introduction to Solr
History of Solr
Lucene - the backbone of Solr
Why choose Solr?
Benefits of keyword search
Benefits of ranked results
Solr use cases
Social media
Science and research
Search engine
E-commerce
Media and entertainment
Government
Education
What's new in Solr 7?
Replication for SolrCloud
TLOG replicas
PULL replicas
Schemaless improvements
Autoscaling
Default numeric types
Spatial fields
SolrJ
JMX and MBeans
Other changes
Summary
Chapter 2: Getting Started
Solr installation
Understanding various files and the folder structure
bin
Solr script
Post script
contrib
DataImportHandler
ContentExtractionLibrary
LanguageIdentifier
Clustering
VelocityIntegration
dist and docs
example
core.properties
zoo.cfg
solr.xml
server
Running Solr
Running basic Solr commands
Production Solr setup
Loading sample data
Loading data from MySQL
Understanding the browse interface
Using the Solr admin interface
Dashboard
Logging
Cloud screens
Tree view
Graph view
Collections or core admin
Java properties
Thread dump
Collection-specific tools
Overview
Analysis
DataImport
Documents
Files
Query
Stream
Schema
Core-specific tools
Chapter 3: Designing Schemas
How Solr works
Getting started with Solr's basics
The schema file of Solr
Understanding field types
Definitions and properties of field types
Field type properties
Field types available in Solr
Understanding date fields
Understanding currencies and exchange rates
Understanding enum fields
Field management.
Field properties
Copying fields
Dynamic fields
Mastering Schema API
Schema API in detail
Schema operations
Listing fields, field types, DynamicFields, and CopyField rules
Deciphering schemaless mode
Creating a schemaless example
Schemaless mode configuration
Managed schema
Field guessing
Chapter 4: Mastering Text Analysis Methodologies
Understanding text analysis
What is text analysis?
How text analysis works
Understanding analyzer
What is an analyzer?
How an analyzer works
Understanding tokenizers
What is a tokenizer?
Available tokenizers in Solr
Standard tokenizer
White space tokenizer
Classic tokenizer
Keyword tokenizer
Lower case tokenizer
Letter tokenizer
N-gram tokenizer
Edge n-gram tokenizer
Understanding filters
What is a filter?
Available filters in Solr
Stop filter
Classic filter
Synonym filter
Synonym graph filter
ASCII folding filter
Keep word filter
KStem filter
KeywordMarkerFilterFactory
Word delimiter graph filter
Understanding CharFilter
Understanding PatternReplaceCharFilterFactor
Understanding multilingual analysis
Language identification
Configuring Solr for multiple language search
Creating separate fields per language
Creating separate indexes per language
Understanding phonetic matching
Understanding Beider-Morse phonetic matching
Chapter 5: Data Indexing and Operations
Basics of Solr indexing
Installing Postman
Exploring the post tool
Understanding index handlers
Working with an index handler with the XML format
Index handler with JSON
Apache Tika and indexing
Solr Cell basics
Indexing a binary using Tika
Language detection
Language detection configuration
Client APIs
Chapter 6: Advanced Queries - Part I.
Search relevance
Velocity search UI
Query parsing and syntax
Common query parameters
Standard query parser
Advantage
Disadvantage
Searching terms for standard query parser
Term modifiers
Wildcard searches
Fuzzy searches
Proximity searching
Range searches
Boolean operators
Escaping special characters
Grouping terms
Dates and times in query strings
Adding comments to the query string
The DisMax Query Parser
Advantages
DisMax query parser parameters
eDisMax Query Parser
Response writer
JSON
Standard XML
CSV
Velocity
Faceting
Common parameters
Field-value faceting parameters
Range faceting
Pivot faceting
Interval faceting
Highlighting
Highlighting parameters
Highlighter
Unified highlighter (hl.method=unified)
Original highlighter (hl.method=original)
FastVector highlighter (hl.method=fastVector)
Boundary scanners
The breakIterator boundary scanner
The simple boundary scanner
Chapter 7: Advanced Queries - Part II
Spellchecking
Spellcheck parameters
Implementation approaches
IndexBasedSpellChecker
DirectSolrSpellChecker
FileBasedSpellChecker
WordBreakSolrSpellChecker
Distributed spellcheck
Suggester
Suggester parameters
Running suggestions
Pagination
How to implement pagination
Cursor pagination
Result grouping
Result grouping parameters
Running result grouping
Result clustering
Result clustering parameters
Result clustering implementation
Install the clustering contrib
Declare the cluster search component
Declare the request handler and include the cluster search component
Spatial search
Spatial search implementation
Field types
Query parser
Spatial search query parser parameters
Function queries
Summary.
Chapter 8: Managing and Fine-Tuning Solr
JVM configuration
Managing the memory heap
Managing solrconfig.xml
User-defined properties
Implicit Solr core properties
Managing backups
Backup in SolrCloud
Standalone mode backups
Backup API
Backup status
API to restore
Restore status API
Snapshot API
JMX with Solr
JMX configuration
Logging configuration
Log settings using the admin web interface
Log level at startup
Setting the environment variable
Passing parameters in the startup script
Configuring Log4J for logging
SolrCloud overview
SolrCloud in interactive mode
SolrCloud - core concepts
Routing documents
Splitting shards
Setting up ignore commits from client applications
Enabling SSL - Solr security
Prerequisites
Generating a key and self-signed certificate
Starting Solr with SSL system properties
Performance statistics
Statistics for request handlers
Chapter 9: Client APIs - An Overview
Client API overview
JavaScript Client API
SolrJ Client API
Ruby Client API
Python Client API
Index.
Notes:
Includes index.
Description based on online resource; title from PDF title page (EBC, viewed March 14, 2018).
ISBN:
9781788837385
178883738X
9781788831550
1788831551
OCLC:
1027196146

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

Find

Home Release notes

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Find catalog Using Articles+ Using your account