Solr 101
Learn the basics of Solr (pronounced “solar”), an open source enterprise search platform, written in Java, from the Apache Lucene project.
Solr is a standalone full-text search server that uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it usable from most popular programming languages.
Learn about Solr’s major features, including full-text search, hit highlighting, faceted search, real-time indexing, dynamic clustering, database integration, NoSQL features and rich document (e.g., Word, PDF) handling.
Learn how Solr achieves high scalability and fault tolerance through distributed search and index replication.
Learn why Solr is one of the most popular enterprise search engines.
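As a small illustration of the REST-like HTTP API mentioned above, the sketch below builds a query URL for Solr's standard /select request handler. It only constructs the URL and does not contact a server. The host and port (localhost:8983), the collection name (techproducts), the example query, and the helper name build_select_url are assumptions made for illustration, not part of the course material.

```python
from urllib.parse import urlencode


def build_select_url(base_url, collection, query, rows=10):
    """Build a URL for the /select handler of a Solr collection.

    base_url and collection are illustrative assumptions; adjust them
    to match your own Solr installation.
    """
    params = {
        "q": query,    # the search query, e.g. field:value
        "rows": rows,  # maximum number of documents to return
        "wt": "json",  # ask for a JSON response
    }
    return f"{base_url.rstrip('/')}/{collection}/select?{urlencode(params)}"


# Hypothetical local Solr instance and collection name.
url = build_select_url("http://localhost:8983/solr", "techproducts", "name:solr")
print(url)
```

Any HTTP client, in any language, can then issue a GET request to a URL of this shape and parse the JSON response, which is what makes Solr usable from most programming languages.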
COURSE SYLLABUS
Module 1 – Search Engines
Understand the importance of text search engines
Understand the Solr search procedure
Identify Solr components
Module 2 – Configure and Add Documents to Solr
Identify the important files in a Solr installation
Define the schema for documents in the index
Understand the various ways to add documents to Solr
Module 3 – Analyzers and Queries
Use analyzers, tokenizers, and filters
Construct queries
Module 4 – SolrJ and Customization
Create SolrJ applications
Understand the customization options available in Solr
GENERAL INFORMATION
This course is self-paced.
It can be taken at any time.
It can be audited as many times as you wish.
Labs can be performed in the cloud or on a 64-bit system. If you use a 64-bit system, you can either install the required software (Linux only) or use the supplied VMware image. More details are provided in the “Labs setup” section.
RECOMMENDED SKILLS PRIOR TO TAKING THIS COURSE
Basic knowledge of operating systems (UNIX/Linux).
Basic understanding of SQL and Java would be helpful.