My most recent article on Mahout is up at IBM developerWorks. It is titled Apache Mahout: Scalable machine learning for everyone and is designed to walk you through using Mahout with a real email data set using Hadoop and EC2. It also gets you up to speed on some of the new things in Mahout since I last wrote on the subject for developerWorks.
Note, I will also be giving a talk …
Read more
| Tuesday, 29 November 2011 |
| 18:30 |
to |
21:30 |
For all of those interested in Apache Mahout and scalable machine learning, Lucid Imagination is hosting a Mahout Users Meeting at it’s new office in Redwood City on Nov. 29th. Doors open at 6:30 pm. The night will feature two speakers, Ted Dunning of MapR Technologies and Grant Ingersoll of Lucid Imagination, along with a social gathering with food and drinks.
For more details and to RSVP, …
Read more
With another Lucene Eurocon successfully behind us (thanks Barcelona, you’ve been awesome!), it’s time to say hello to Vancouver for ApacheCon. I’ll leave it to others to fill in the blanks on the Barcelona conference other than to say that I am continually amazed by the vibrancy of the Lucene/Solr community and especially grateful to all the committers and contributors who take the time to show up and give talks about how they leverage …
Read more
You know your (technical) baby is (almost) grown up when the book on the project finally comes out. Such is the case for Apache Mahout, thanks to Manning Publications shipping Mahout in Action this week.
So, before I start into my review, let me first say congratulations to Sean, Robin, Ted, Ellen and Manning for producing such an excellent product. The simplest praise I can give it is to put it on the same …
Read more
After a week off to enjoy time with my family, I thought I would kick off the last week of 2010 with a look back at the year as it relates to the Apache Lucene ecosystem. For anyone who follows the amalgamation of projects that I like to call the Lucene Ecosystem (the Apache projects: Lucene, Solr, Nutch, Mahout, Tika, PyLucene, Lucy, Lucene.NET, Droids, ManifoldCF — Lucene Connector Framework, OpenNLP and UIMA) you know it …
Read more
Here are my slides from the talk I gave last night at the RTP Semantic Web Group:
Read more
Back from Berlin Buzzwords and finally over the jet lag, so I thought I would put up some feedback. First off, it was a well organized conference with a nice focus on searching, storage and scaling. Kudos to Isabel, Simon and Jan for all their hard work. It also had great wi-fi coverage, which is always a struggle at every conference I’ve ever been too.
As for the talks, I gave the Keynote on using …
Read more
After reviewing a lot of great talk proposals, we’ve announced the agenda for Apache Lucene Eurocon: Apache Lucene EuroCon – Europe’s Premier Lucene and Solr Search User Conference.
One of the things I really like about this agenda is it is a great mix of basics, use cases from all over the search map (CMS, news, social media, advertising), business decisions (see last list and next list) and advanced topics (NLP, collab filtering, machine …
Read more
Apache Lucene (the Lucene top level project, not Lucene the Java search API. I know, it’s confusing sometimes) has once again proved to be a fertile area for innovation (having already given birth to Apache Hadoop a few years back), as it once again has given birth, this time to three new Apache Top Level Projects (just approved by the Board at Apache): Apache Mahout, Apache Nutch and Apache Tika (never mind the URLs, …
Read more
Here’s the announcement:
Apache Mahout <http://lucene.apache.org/mahout> 0.3 has been released and is
now available for public
download at http://www.apache.org/dyn/closer.cgi/lucene/mahout
Up-to-date maven artifacts can be found in the Apache repository at
https://repository.apache.org/content/repositories/releases/org/apache/mahout/
Apache Mahout is a subproject of Apache Lucene with the goal of
delivering scalable machine learning algorithm implementations under
the Apache license. http://www.apache.org/licenses/LICENSE-2.0
Mahout is a machine learning library meant to scale: Scale in terms of
community to support anyone interested in using machine
…
Read more