Intro to Mahout at Triangle Java User Group on Feb. 15

Monday, 15 February 2010
18:00 to 21:00

I will be giving an introduction to Apache Mahout at the Triangle Java User Group on Feb. 15.  See http://trijug.org/ for more details.  Hope to see you there!

Apache Lucene Connector Framework now in Incubation at the ASF

Short Version

The Apache Lucene Connector Framework project has officially entered incubation.  LCF, for short, is going to be a framework for connecting to content repositories like SharePoint, Documentum, etc. It will make it easy to hook into Lucene, Solr, Nutch, Mahout, and Tika while, of course, remaining agnostic of the final destination of the data.  See the Connectors website and the original proposal for more info.  Help wanted!

Long Version

Background

A while back, MetaCarta, a spatial search company, approached us…

The Apache Lucene Ecosystem: My view of 2009

It’s that time of year, so I thought I would take a look back at the year that was for the Lucene Ecosystem and maybe look ahead just a little bit too.

First and foremost, it should be obvious to even the most casual observer that the Apache Lucene communities are thriving.  Not only is it a great time to be involved in open source, it’s a great time to be involved in Lucene.  Both as a…

Apache Mahout 0.2 Released

I just sent out the Apache Mahout 0.2 release announcement.  Here’s a copy:

Apache Mahout 0.2 has been released and is now available for public
download at http://www.apache.org/dyn/closer.cgi/lucene/mahout

Apache Mahout is a subproject of Apache Lucene with the goal
of delivering scalable machine learning algorithm implementations
under the Apache license. http://www.apache.org/licenses/LICENSE-2.0
Scale in terms of computation to the size of data you manage today.
Scale in terms of community to support anyone interested in using
machine learning. Scale in terms of business by providing…

Webinar: Get Started Faster, Get Better Results

Thursday, 13 August 2009
11:00 to 12:00

In case you missed the webinar on August 13, you can download the slides or view the recorded presentation! Here’s what we covered:

Find what you’re looking for? LucidWorks for Solr lets you quickly build faster, smarter open source search applications.

  • Open source search with Solr/Lucene gives you the power to turn a wide range of information into fast, useful, relevant results!
  • LucidWorks for Solr gives you a tested, release-stable certified distribution of open source search with enhanced tools and installation…

Thoughts on Efficiency of Enterprise Search on eWeek.com

eWeek.com recently posted a nice article by Dr. Yves Schabes, founder of Teragram, on how to make enterprise search better through higher-order processing techniques such as metadata generation and applying taxonomies, and through relevance testing on a regular basis.  Naturally, this got me thinking about all the different ways this relates to the Apache Lucene ecosystem (Lucene, Solr, Mahout, Tika, etc.) and Lucid Imagination.

First, by choosing an open backbone like Lucene and Solr, you are free…

SF Bay Area Meetup Slides Available

Slides from the first Lucene/Solr SF Bay Area meetup are now available here.

Thanks to everyone who participated.

Apache Mahout 0.1 Released

I’m pleased to announce the first release of the Apache Mahout project.  Apache Mahout is a suite of machine learning algorithm implementations.  Here’s the release notice I just sent to various mailing lists:

The Apache Lucene project is pleased to announce the release of Apache Mahout 0.1.
Apache Mahout is a subproject of Apache Lucene with the goal of delivering scalable
machine learning algorithm implementations under the Apache license.  The first public
release includes implementations for clustering, classification,
collaborative filtering…

ApacheCon Europe Follow Up

Another year, another successful ApacheCon Europe, at least as far as Lucene, Solr and I are concerned.  This year, like last, Erik Hatcher and I gave trainings on Lucene and Solr.  Both were well attended, despite the economy, showing once again the power of open source and the fact that people are still invested in search.  (If you missed the training, see here for alternatives.)

During the conference, there were several talks on Lucene, Solr, Mahout and…
