One of the most common questions I field these days is what is the state of spatial in Apache Solr? As of today, I believe I have backported all the functionality that exists on trunk to the 3.x branch. For details on this, see the Solr wiki page on spatial search. I would encourage people who are interested in spatial search to try it out and report any issues to the mailing list. For …
Read more
A week and a day later, I’ve finally got a chance to put up my thoughts/notes on the first ever RTP Apache Lucene/Solr Meetup hosted by Lulu Press and co-sponsored by Lucid Imagination.
First off, hats off to Lulu for the excellent hosting, coordination and marketing of the event. You could definitely see the evidence of Lulu’s “Be Remarkable” philosophy in the event. I’d say we had roughly 30-40 people for the first time event, …
Read more
Lucene Revolution may very well be the birthplace of the next generation of open source enterprise search. Don’t get left out of this historic gathering! Early bird pricing has been extended to September 17, so register today to join the Lucene Revolution, and meet attendees and speakers from all walks of life – developer and strategist, visionary and pragmatist, grizzled Apache Lucene/Solr veteran and newcomer alike.
More than just theory and academic hypothesis, Lucene Revolution will leave …
Read more
In my preparation for my upcoming talk on Apache Hadoop and Search, I thought I would try out using Nutch (the genesis for Hadoop) to index some content to Solr. I started off by referencing Sami Siren’s excellent post on Nutch and Solr (which worked flawlessly for 1.1 for me on OS X) to get up and going, but quickly hoped there is a much easier way to do this than typing in all …
Read more
Introduction
Recently, I did some minor work on improving the usability of the Lucene spell checker (see LUCENE-2479, LUCENE-2608 and the associated Solr work) and it got me thinking that a post on spell checking in Solr would be useful.
For those who aren’t familiar, the notion of spell checking in search (often called Did You Mean?) is slightly different from the notion of simply correcting spelling errors. It’s not that we don’t want …
Read more
Do you remember this scenario from days of yore?
- Company A buys a software license from Company B, a startup.
- Company A crosses its fingers that Company B doesn’t go bankrupt and disappear, along with the source code for Company A’s mission-critical software.
- Company B goes kaput.
- Company A is left with some machine-readable binary code that it is powerless to develop or use.
Source code escrow has changed the outcome of this sticky situation …
Read more
If you missed the SF Bay Area Lucene meetup last night, I thought I would give a recap of some of the highlights. First off, thanks to salesforce.com for the use of their space on the 42nd floor of 1 Market St. in downtown S.F. The views of the bay and the city were especially stunning at night with what appeared to be a full moo
n rising over the Bay Bridge. Salesforce also …
Read more
As some of you may know, I blog regularly on Network World’s Open Source Subnet. Watch weekly for more of my musings on trends, news and any number of topics that catch my interest. In my most recent post, I ask readers for their take on the legal maze associated with open source. In my opinion, Apache is the most liberal open source package today, the one most true to form. Everybody can use it, …
Read more
Here are my slides from the talk I gave last night at the RTP Semantic Web Group:
Read more
Back from Berlin Buzzwords and finally over the jet lag, so I thought I would put up some feedback. First off, it was a well organized conference with a nice focus on searching, storage and scaling. Kudos to Isabel, Simon and Jan for all their hard work. It also had great wi-fi coverage, which is always a struggle at every conference I’ve ever been too.
As for the talks, I gave the Keynote on using …
Read more