Bet You Didn’t Know Lucene Can…
Here are my ApacheCon 2011 slides for my talk “Bet You Didn’t Know Lucene Can…” :
By Grant IngersollNovember 14, 2011
Here are my ApacheCon 2011 slides for my talk “Bet You Didn’t Know Lucene Can…” :
By Grant IngersollNovember 9, 2011
| Tuesday, 15 November 2011 | ||
| 18:30 | to | 23:30 |
For those of you in the Raleigh, Durham, Chapel Hill NC area, Lucid Imagination is sponsoring the next (and ongoing) Triangle Hadoop Users Group Meeting, November 15, 2011 @ Bronto Software in Durham, NC. The next meeting will feature Alan Gates of Hortonworks. Alan will be speaking on Apache Pig and HCatalog. To RSVP and find out more, visit www.trihug.org.…
By Grant IngersollNovember 8, 2011
My most recent article on Mahout is up at IBM developerWorks. It is titled Apache Mahout: Scalable machine learning for everyone and is designed to walk you through using Mahout with a real email data set using Hadoop and EC2. It also gets you up to speed on some of the new things in Mahout since I last wrote on the subject for developerWorks.
Note, I will also be giving a talk …
By Grant IngersollNovember 5, 2011
| Tuesday, 29 November 2011 | ||
| 18:30 | to | 21:30 |
For all of those interested in Apache Mahout and scalable machine learning, Lucid Imagination is hosting a Mahout Users Meeting at it’s new office in Redwood City on Nov. 29th. Doors open at 6:30 pm. The night will feature two speakers, Ted Dunning of MapR Technologies and Grant Ingersoll of Lucid Imagination, along with a social gathering with food and drinks.
For more details and to RSVP, …
By Grant IngersollOctober 22, 2011
With another Lucene Eurocon successfully behind us (thanks Barcelona, you’ve been awesome!), it’s time to say hello to Vancouver for ApacheCon. I’ll leave it to others to fill in the blanks on the Barcelona conference other than to say that I am continually amazed by the vibrancy of the Lucene/Solr community and especially grateful to all the committers and contributors who take the time to show up and give talks about how they leverage …
By Grant IngersollOctober 15, 2011
You know your (technical) baby is (almost) grown up when the book on the project finally comes out. Such is the case for Apache Mahout, thanks to Manning Publications shipping Mahout in Action this week.
So, before I start into my review, let me first say congratulations to Sean, Robin, Ted, Ellen and Manning for producing such an excellent product. The simplest praise I can give it is to put it on the same …
By Grant IngersollSeptember 18, 2011
From a quiet start as a pet project to a giant in the industry, Apache Lucene is definitely the little (search) engine that could. On September 18th, 2001 (at 16:29:48 UTC) Jason Van Zyl made the first official import of Doug Cutting’s Lucene project (which started in 1997 and was hosted on SourceForge) into Apache’s Jakarta project (check out the Wayback machine).
And while I wasn’t around in the beginning, I thought I would …
By Grant IngersollSeptember 14, 2011
Many times, clients ask us to help them estimate memory usage or disk space usage or to share benchmarks as they build out there search system. Doing so is always an interesting process, as I’ve always been wary of claims about benchmarks (for instance, one of the old tricks of performance benchmark hacking is to “cat XXX > /dev/null” to load everything into memory first, which isn’t what most people do when running their system) …
By Grant IngersollMarch 31, 2011
It’s official, Apache Lucene 3.1.0 and Apache Solr 3.1.0 are officially released. Keep an eye here for more on the new features and functionality.
Here’s the release announcements as just sent to the mailing lists:
March 2011, Apache Lucene 3.1 available The Lucene PMC is pleased to announce the release of Apache Lucene 3.1. This release contains numerous bug fixes, optimizations, and improvements, some of which are highlighted below. The release is available for immediate…
By Grant IngersollMarch 24, 2011
Changing Bits: Lucene’s FuzzyQuery is 100 times faster in 4.0.
So cool… I’m in awe daily of what happens in Lucene and Solr open source. Mike’s post is just a small example of what goes on. Perhaps Mike or Muir or someone will writeup on how Lucene has improved it’s Unit Testing by several orders of magnitude by some incredibly cool randomization techniques and the use of Jenkins/Hudson.…