Course Outline

Overall Goal

To provide experienced web developers and technical staff with a comprehensive introduction to the Solr search platform, equipping software developers with advanced skills for creating effective search solutions for government.

I. Fundamentals

  • Solr Overview
  • Installing and Running Solr
  • Adding Content to Solr
  • Interpreting a Solr XML Response
  • Modifying Parameters in the URL
  • Utilizing the Browse Interface
  • Labs: Installing Solr, Executing Queries

II. Searching

  • Sorting Search Results
  • Query Parsers
  • Advanced Query Techniques
  • Hardwiring Request Parameters
  • Incorporating Fields into Default Searches
  • Faceting for Enhanced Search Filters
  • Result Grouping
  • Labs: Advanced Queries, Experimenting with Faceted Search

III. Indexing

  • Incorporating Custom Content into Solr
  • Deleting Data from Solr
  • Building a Bookstore Search Application
  • Adding and Exploring Book Data
  • Deduplication Using the Update Processor
  • Labs: Indexing Various Document Collections for Government

IV. Schema Updating

  • Adding Fields to the Schema
  • Analyzing Text for Enhanced Search
  • Labs: Customizing the Solr Schema for Specific Needs

V. Relevance

  • Field Weighting for Improved Results
  • Phrase Queries for Precise Matching
  • Function Queries for Complex Scenarios
  • Fuzzy Search for Partial Matches
  • Sounds-Like Queries for Phonetic Searches
  • Labs: Implementing Relevance Techniques in Queries

VI. Extended Features

  • More-Like-This for Similar Results
  • Geospatial Search for Location-Based Data
  • Spell Checking to Enhance User Experience
  • Suggestions for Autocomplete and Recommendations
  • Highlighting Key Terms in Results
  • Pseudo-Fields for Derived Information
  • Pseudo-Joins for Complex Data Relationships
  • Multilingual Support for Diverse Users
  • Labs: Implementing Spell Checking and Suggestions for Government Applications

VII. Multicore

  • Managing Multiple Types of Data with Solr Cores
  • Labs: Creating and Administering Solr Cores for Diverse Data Sets

VIII. SolrCloud

  • Introduction to SolrCloud Architecture
  • Understanding How SolrCloud Functions
  • Commit Strategies for Data Consistency
  • Utilizing ZooKeeper for Cluster Management
  • Managing Solr Configuration Files in a Distributed Environment
  • Labs: Administering SolrCloud for Scalable and Resilient Search Solutions

IX. Developing with Solr API

  • Interfacing with Solr via RESTful APIs
  • Configuration Management for Integration
  • Indexing and Searching Data Programmatically
  • Integrating Solr with Spring Framework
  • Labs: Coding to Read and Write Solr Index, Practicing Spring Integration with Solr

X. Developing with Lucene API

  • Building a Lucene Index for Custom Applications
  • Searching, Viewing, and Debugging Indexed Data
  • Extracting Text from Documents Using Tika
  • Scaling Lucene Indices on Clustered Environments
  • Tuning Lucene Performance for Optimal Results
  • Labs: Coding with the Lucene API for Advanced Search Capabilities

XI. Conclusion

  • Alternative Approaches to Search Solutions
    • ElasticSearch for Highly Scalable Deployments
    • DataStax Enterprise: Integrating Solr and Cassandra for Big Data
    • Cloudera Solr Integration for Hadoop Ecosystems
    • Blur for Real-Time Search Applications
  • Future Directions in Search Technology for Government Operations

Requirements

All attendees should be experienced technical staff with a background in web application operations and, preferably, development for government.

Lab Environment:

Amazon EC2 servers will be provided to students for installation, administration, and lab work. Students will need an SSH client and a browser to access the cluster.

Zero Install: There is no need to install Solr software on students’ machines! (Although it is possible)

 21 Hours

Number of participants


Price per participant

Testimonials (5)

Upcoming Courses

Related Categories