Blog

Bet You Didn’t Know Lucene Can…

By Grant Ingersoll, November 14, 2011

Here are my ApacheCon 2011 slides for my talk “Bet You Didn’t Know Lucene Can…”:

…

Read more

Triangle Hadoop Users Group, Next Meeting: November 15, 2011 @ Bronto Software

By Grant Ingersoll, November 9, 2011

Tuesday, 15 November 2011
18:30 to 23:30

For those of you in the Raleigh, Durham, Chapel Hill, NC area: Lucid Imagination is sponsoring the next (and ongoing) Triangle Hadoop Users Group meeting on November 15, 2011 at Bronto Software in Durham, NC. The meeting will feature Alan Gates of Hortonworks, who will speak on Apache Pig and HCatalog. To RSVP and find out more, visit www.trihug.org.…

Read more

Apache Mahout: Scalable machine learning for everyone

By Grant Ingersoll, November 8, 2011

My most recent article on Mahout is up at IBM developerWorks. Titled “Apache Mahout: Scalable machine learning for everyone”, it walks you through using Mahout with a real email data set on Hadoop and EC2. It also gets you up to speed on some of the new things in Mahout since I last wrote on the subject for developerWorks.

Note, I will also be giving a talk …

Read more

SF Bay Area Apache Mahout User Meeting on Nov. 29

By Grant Ingersoll, November 5, 2011

Tuesday, 29 November 2011
18:30 to 21:30

For all of those interested in Apache Mahout and scalable machine learning, Lucid Imagination is hosting a Mahout Users Meeting at its new office in Redwood City on Nov. 29th. Doors open at 6:30 pm. The night will feature two speakers, Ted Dunning of MapR Technologies and Grant Ingersoll of Lucid Imagination, along with a social gathering with food and drinks.

For more details and to RSVP, …

Read more

From Barcelona to Vancouver with Lucene and Solr

By Grant Ingersoll, October 22, 2011

With another Lucene Eurocon successfully behind us (thanks Barcelona, you’ve been awesome!), it’s time to say hello to Vancouver for ApacheCon.  I’ll leave it to others to fill in the blanks on the Barcelona conference other than to say that I am continually amazed by the vibrancy of the Lucene/Solr community and especially grateful to all the committers and contributors who take the time to show up and give talks about how they leverage …

Read more

Mahout in Action Review

By Grant Ingersoll, October 15, 2011

You know your (technical) baby is (almost) grown up when the book on the project finally comes out.  Such is the case for Apache Mahout, thanks to Manning Publications shipping Mahout in Action this week.

So, before I start into my review, let me first say congratulations to Sean, Robin, Ted, Ellen and Manning for producing such an excellent product.   The simplest praise I can give it is to put it on the same …

Read more

Happy Anniversary, Lucene! 10 years at the ASF

By Grant Ingersoll, September 18, 2011

From a quiet start as a pet project to a giant in the industry, Apache Lucene is definitely the little (search) engine that could. On September 18th, 2001 (at 16:29:48 UTC), Jason Van Zyl made the first official import of Doug Cutting’s Lucene project (which started in 1997 and was hosted on SourceForge) into Apache’s Jakarta project (check out the Wayback Machine).

And while I wasn’t around in the beginning, I thought I would …

Read more

Estimating Memory and Storage for Lucene/Solr

By Grant Ingersoll, September 14, 2011

Many times, clients ask us to help them estimate memory usage or disk space usage, or to share benchmarks, as they build out their search system. Doing so is always an interesting process, as I’ve always been wary of claims about benchmarks (for instance, one of the old tricks of performance benchmark hacking is to “cat XXX > /dev/null” to load everything into memory first, which isn’t what most people do when running their system) …

Read more

Apache Lucene 3.1.0 and Apache Solr 3.1.0

By Grant Ingersoll, March 31, 2011

It’s official: Apache Lucene 3.1.0 and Apache Solr 3.1.0 have been released. Keep an eye here for more on the new features and functionality.

Here are the release announcements as just sent to the mailing lists:

March 2011, Apache Lucene 3.1 available
The Lucene PMC is pleased to announce the release of Apache Lucene 3.1.

This release contains numerous bug fixes, optimizations, and
improvements, some of which are highlighted below.  The release
is available for immediate 

…

Read more

Changing Bits: Lucene’s FuzzyQuery is 100 times faster in 4.0

By Grant Ingersoll, March 24, 2011

Changing Bits: Lucene’s FuzzyQuery is 100 times faster in 4.0.

So cool… I’m in awe daily of what happens in Lucene and Solr open source. Mike’s post is just a small example of what goes on. Perhaps Mike or Muir or someone will write up how Lucene has improved its unit testing by several orders of magnitude through some incredibly cool randomization techniques and the use of Jenkins/Hudson.…

Read more

Apache Solr, Solr, Apache Lucene, Lucene and their logos are trademarks of the Apache Software Foundation.

© 2011 Lucid Imagination. All Rights reserved.