Solr and Lucene at Etsy
Presented by Gregg Donovan, Etsy at Lucene Revolution 2011
Etsy uses Solr and Lucene to serve more than 8 billion queries per year (and growing). In this case study, we will describe how we have integrated Solr/Lucene into our continuous deployment infrastructure (see: http://codeascraft.etsy.com/2010/05/20/quantum-of-deployment/), allowing Solr configuration, Java-based indexers, and query parsing logic to go from passing tests to production in minutes. We'll also discuss how we're leveraging Solr's new geo-search to power both local item search and GeoIP-personalized location autosuggest.
In addition, we'll share how we've extended Solr, adding personalized faceting and filtering as well as multi-currency sorting and filtering that accounts for real-time currency fluctuation (contributed in SOLR-2202). Code for both of these features will be open-sourced or contributed. We will share our real-time monitoring techniques, including how we track Solr replication, query, and GC times in Ganglia. Finally, we'll discuss how we've used Hadoop-based user analytics to improve relevance and power data-driven spelling corrections, autocomplete suggestions, and related searches.
