Apache Solr 1.5 on the move with more “functionality”

The paint is barely dry on Apache Solr 1.4 and the community is already on the move for Solr 1.5 (which may actually be Solr 2.0, but for now let’s call it 1.5).

I’m particularly excited about a few things:

  1. Massive scalability capabilities via distributed search, indexing and shard management – Up until now, Solr scales pretty well on the search side (I’ve seen billion+ document instances and we’ve benchmarked it at that level too), but the work…

Read more...

Getting Started with Payloads

Mark Miller recently posted a brief intro to Span Queries, so I thought I would piggyback on top of his work and show how to get started with Payloads (see also [1]).

Introduction

Like Spans, payloads involve the position of terms, but go one step further.  Namely, a Payload in Apache Lucene is an arbitrary byte array stored at a specific position (i.e. a specific token/term) in the index.  A payload can be used to store weights for specific…

Read more...