Apache Mahout 0.2 Released

I just sent out the Apache Mahout 0.2 release announcement.  Here’s a copy:

Apache Mahout 0.2 has been released and is now available for public
download at http://www.apache.org/dyn/closer.cgi/lucene/mahout

Apache Mahout is a subproject of Apache Lucene with the goal
of delivering scalable machine learning algorithm implementations
under the Apache license (http://www.apache.org/licenses/LICENSE-2.0).
Scale in terms of computation to the size of data you manage today.
Scale in terms of community to support anyone interested in using
machine learning. Scale in terms of business by providing…

Thoughts on Efficiency of Enterprise Search on eWeek.com

eWeek.com recently posted a nice article by Dr. Yves Schabes, founder of Teragram, on how to make enterprise search better through higher-order processing techniques such as metadata generation and applying taxonomies, and through regular relevance testing.  Naturally, this got me thinking about the different ways this relates to the Apache Lucene ecosystem (Lucene, Solr, Mahout, Tika, etc.) and Lucid Imagination.

First, by choosing an open backbone like Lucene and Solr, you are free…
