Blog
Trends: Know your relevance
By Grant Ingersoll, July 23, 2009
“Control, exploration, flexibility, tunability,” were the answers expounded by representatives of Microsoft, Endeca, and Vivisimo. Relevance is in the eye of the beholder, but relevance ranking is driven by the search engine. Know what criteria are driving the ranking of the results you’re looking at, or at least, be skeptical of them.
I couldn’t agree more with Theresa Regli’s excellent discussion of relevance, especially the point to be “skeptical” …
Training: Up and to the right
By David M. Fishman, July 22, 2009
As the Great Recession tests all of our economic patience, many people I know, myself included, have gotten into the habit of looking at graphs of economic indicators. Stock prices, petroleum, unemployment, store closings, health care costs: it’s usually not good news these days. Particularly if you look back at some deep historical horizon, say, since last November. Lots of valleys, plains, with the peaks retreating in the distance.
Then I saw this little gem…
The SpanQuery
By Mark Miller, July 18, 2009
SpanQuerys allow for nested, positional restrictions when matching documents in Lucene. SpanQuerys are much like PhraseQuerys or MultiPhraseQuerys in that they all restrict term matches by position, but SpanQuerys can be much more expressive.
The basic SpanQuery units are the SpanTermQuery and the SpanNearQuery.
A SpanTermQuery is the most basic SpanQuery, and simply lets you specify a field, term, and boost by passing in a Term, just like a TermQuery. SpanTermQuery is used as a …
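To make "restricting term matches by position" concrete, here is a toy sketch in plain Python. It is not Lucene code and not how Lucene implements spans; the helper names `term_positions` and `ordered_near` are made up for illustration, and "slop" is assumed here to mean the number of tokens allowed between the two terms.

```python
def term_positions(tokens, term):
    """Positions at which `term` occurs (a rough analogue of a single-term span)."""
    return [i for i, tok in enumerate(tokens) if tok == term]


def ordered_near(tokens, first, second, slop):
    """Find (start, end) spans where `first` comes before `second`
    with at most `slop` tokens in between. `end` is exclusive."""
    spans = []
    for a in term_positions(tokens, first):
        for b in term_positions(tokens, second):
            # b - a - 1 is the number of tokens sitting between the two terms
            if a < b and (b - a - 1) <= slop:
                spans.append((a, b + 1))
    return spans


tokens = "the quick brown fox jumps over the lazy dog".split()
print(ordered_near(tokens, "quick", "fox", slop=1))
print(ordered_near(tokens, "quick", "fox", slop=0))
```

With one token of slop, "quick" and "fox" match because only "brown" sits between them; with zero slop the two terms would have to be adjacent, so nothing matches.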
Thoughts on Efficiency of Enterprise Search on eWeek.com
By Grant Ingersoll, July 16, 2009
eWeek.com recently posted a nice article by Dr. Yves Schabes, founder of Teragram, on how to make enterprise search better through some higher order processing techniques like metadata generation, applying taxonomies, etc. and doing relevance testing on a regular basis. Naturally, this got me thinking about all the different ways this relates to the Apache Lucene ecosystem (Lucene, Solr, Mahout, Tika, etc.) and Lucid Imagination.
First, by choosing an …
NYC Apache Lucene/Solr Meetup, Sponsored by Lucid Imagination and MTV Networks
By David M. Fishman, July 7, 2009
July 22, 2009, 6:30 pm – 9:00 pm Eastern. Register here.
Hosted at MTV Networks Flagship Building
1515 Broadway, Times Square, New York, NY 10036
RSVP deadline: July 20, 2009 12:00 PM
Presentations and discussion of innovations and applications with Lucene & Solr, the Apache Open Source Search Engine/Platform for the NYC Area. Now available: LIGHTNING TALKS
Agenda:
1. “Faster. Better. Solr! What to look for in …
Ranges over Functions in Solr 1.4
By yonik, July 6, 2009
Solr 1.4 contains a new feature that allows range queries or range filters over arbitrary functions. It’s implemented as a standard Solr QParser plugin, so it is available anywhere the standard Solr query syntax is accepted, by specifying the frange query type. Here’s an example of a filter specifying the lower and upper bounds for a function:
fq={!frange l=0 u=2.2}log(sum(user_ranking,editor_ranking))
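If you are sending that filter from client code, the braces, spaces and parentheses need to be URL-encoded. A minimal Python sketch of one way to do it follows; the helper name `build_frange_params` and the `q=*:*` default are assumptions for illustration, not part of Solr, and only the `l` and `u` local params shown above are used.

```python
from urllib.parse import urlencode, parse_qs


def build_frange_params(function, lower=None, upper=None, query="*:*"):
    """Build request parameters with an frange filter over `function`.
    Hypothetical helper: it only assembles the fq string shown above."""
    local_params = ["!frange"]
    if lower is not None:
        local_params.append(f"l={lower}")
    if upper is not None:
        local_params.append(f"u={upper}")
    fq = "{" + " ".join(local_params) + "}" + function
    return {"q": query, "fq": fq}


params = build_frange_params(
    "log(sum(user_ranking,editor_ranking))", lower=0, upper=2.2
)
# urlencode percent-encodes the braces, spaces and parentheses
query_string = urlencode(params)
print(params["fq"])
print(query_string)
```

The resulting query string can be appended to whatever select URL your Solr instance exposes.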
The other interesting use for frange is to trade off memory …
Virtual words, real data
By David M. Fishman, July 1, 2009
As virtualization and cloud computing buzz louder, Lucene/Solr open source search is adding a vibe of its own — most recently, with our announcement of our strategic partnership with ISYS technologies. A couple of weeks ago, Business Week wrote up how cloud computing will change business; and in between discussions of VMWare and Amazon’s EC2, tucked in a reference to Xoopit, “[a startup that] has built a specialized search engine capable of finding …