<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Lucid Imagination &#187; ApacheCon</title>
	<atom:link href="http://www.lucidimagination.com/blog/category/apachecon/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.lucidimagination.com/blog</link>
	<description>Exclusively dedicated to Apache Lucene/Solr open source search technology</description>
	<lastBuildDate>Sat, 04 Feb 2012 01:12:03 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3</generator>
		<item>
		<title>Bet You Didn&#8217;t Know Lucene Can&#8230;</title>
		<link>http://www.lucidimagination.com/blog/2011/11/14/bet-you-didnt-know-lucene-can/</link>
		<comments>http://www.lucidimagination.com/blog/2011/11/14/bet-you-didnt-know-lucene-can/#comments</comments>
		<pubDate>Mon, 14 Nov 2011 15:43:36 +0000</pubDate>
		<dc:creator>Grant Ingersoll</dc:creator>
				<category><![CDATA[apache]]></category>
		<category><![CDATA[ApacheCon]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[Solr]]></category>

		<guid isPermaLink="false">http://www.lucidimagination.com/blog/?p=4418</guid>
		<description><![CDATA[<p>Here are my ApacheCon 2011 slides for my talk &#8220;Bet You Didn&#8217;t Know Lucene Can&#8230;&#8221; :</p>
<p>&#160;</p>
<div id="__ss_10155480" style="width: 425px;"><strong style="display: block; margin: 12px 0 4px;"><a title="Bet you didn't know Lucene can..." href="http://www.slideshare.net/gsingers/bet-you-didnt-know-lucene-can">Bet you didn&#8217;t know Lucene can&#8230;</a></strong>
<div style="padding: 5px 0 12px;">View more <a href="http://www.slideshare.net/">presentations</a> from <a href="http://www.slideshare.net/gsingers">gsingers</a>.</div>
&#8230;</div>]]></description>
			<content:encoded><![CDATA[<p>Here are my ApacheCon 2011 slides for my talk &#8220;Bet You Didn&#8217;t Know Lucene Can&#8230;&#8221; :</p>
<p>&nbsp;</p>
<div id="__ss_10155480" style="width: 425px;"><strong style="display: block; margin: 12px 0 4px;"><a title="Bet you didn't know Lucene can..." href="http://www.slideshare.net/gsingers/bet-you-didnt-know-lucene-can">Bet you didn&#8217;t know Lucene can&#8230;</a></strong><object id="__sse10155480" width="425" height="355" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=lucenecan-111114094003-phpapp01&amp;stripped_title=bet-you-didnt-know-lucene-can&amp;userName=gsingers" /><param name="allowscriptaccess" value="always" /><param name="allowfullscreen" value="true" /><embed id="__sse10155480" width="425" height="355" type="application/x-shockwave-flash" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=lucenecan-111114094003-phpapp01&amp;stripped_title=bet-you-didnt-know-lucene-can&amp;userName=gsingers" allowFullScreen="true" allowScriptAccess="always" allowscriptaccess="always" allowfullscreen="true" /></object></p>
<div style="padding: 5px 0 12px;">View more <a href="http://www.slideshare.net/">presentations</a> from <a href="http://www.slideshare.net/gsingers">gsingers</a>.</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.lucidimagination.com/blog/2011/11/14/bet-you-didnt-know-lucene-can/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>From Barcelona to Vancouver with Lucene and Solr</title>
		<link>http://www.lucidimagination.com/blog/2011/10/22/barcelona-vancouver/</link>
		<comments>http://www.lucidimagination.com/blog/2011/10/22/barcelona-vancouver/#comments</comments>
		<pubDate>Sat, 22 Oct 2011 10:14:36 +0000</pubDate>
		<dc:creator>Grant Ingersoll</dc:creator>
				<category><![CDATA[apache]]></category>
		<category><![CDATA[ApacheCon]]></category>
		<category><![CDATA[Mahout]]></category>
		<category><![CDATA[Solr]]></category>

		<guid isPermaLink="false">http://www.lucidimagination.com/blog/?p=4364</guid>
		<description><![CDATA[<p>With another <a href="http://lucene-eurocon.com/">Lucene Eurocon</a> successfully behind us (thanks Barcelona, you&#8217;ve been awesome!), it&#8217;s time to say hello to Vancouver for <a href="http://na11.apachecon.com/">ApacheCon</a>.  I&#8217;ll leave it to others to fill in the blanks on the Barcelona conference other than to say that I am continually amazed by the vibrancy of the Lucene/Solr community and especially grateful to all the committers and contributors who take the time to show up and give talks about how they leverage &#8230;</p>]]></description>
			<content:encoded><![CDATA[<p>With another <a href="http://lucene-eurocon.com/">Lucene Eurocon</a> successfully behind us (thanks Barcelona, you&#8217;ve been awesome!), it&#8217;s time to say hello to Vancouver for <a href="http://na11.apachecon.com/">ApacheCon</a>.  I&#8217;ll leave it to others to fill in the blanks on the Barcelona conference other than to say that I am continually amazed by the vibrancy of the Lucene/Solr community and especially grateful to all the committers and contributors who take the time to show up and give talks about how they leverage the world&#8217;s premier open source search engine.</p>
<p>For me personally, I&#8217;m on to Vancouver and ApacheCon for two primary things, besides of course the community bits that go with every ApacheCon:</p>
<ol>
<li>Providing the ApacheCon&#8217;s first ever <a href="http://na11.apachecon.com/talks/18395">Apache Mahout training on Monday, November 7th</a>.  It&#8217;s still not too late to sign up!</li>
<li>Giving a talk on alternative uses of Lucene/Solr other than traditional free text search (things like recommendation engines, classification, etc.)</li>
</ol>
<p>For the 2nd item, I&#8217;m also interested in hearing from you, the user, about interesting things you&#8217;ve done with Lucene/Solr that fall outside the norm of free text search.  If you care to share, please leave a comment on this post.</p>
<p>I&#8217;d be remiss if I didn&#8217;t also plug several other Lucid Imagination employees who are speaking at ApacheCon as well:</p>
<ol>
<li><a href="http://na11.apachecon.com/talks/19453">Solr Flair</a> by Erik Hatcher.  Erik will also be doing a <a href="http://na11.apachecon.com/talks/19454">2 day Solr training class</a>.  Registration is still open for this class as well.</li>
<li><a href="http://na11.apachecon.com/talks/19346">Apache Solr: Out of the Box</a> by Chris Hostetter</li>
</ol>
<p>Lucid Imagination is also sponsoring the Lucene/Solr <a href="https://wiki.apache.org/lucene-java/ApacheCon2011NaMeetup">meetup</a> on Wed. November 9th, so if you are in town, please feel free to drop by for a drink and a chat.</p>
<p>With that, I&#8217;ll simply say, I hope to see you in Vancouver in a few weeks!</p>
<p>-Grant</p>
]]></content:encoded>
			<wfw:commentRss>http://www.lucidimagination.com/blog/2011/10/22/barcelona-vancouver/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data.gov on Solr</title>
		<link>http://www.lucidimagination.com/blog/2010/11/05/data-gov-on-solr/</link>
		<comments>http://www.lucidimagination.com/blog/2010/11/05/data-gov-on-solr/#comments</comments>
		<pubDate>Fri, 05 Nov 2010 21:43:44 +0000</pubDate>
		<dc:creator>Erik Hatcher</dc:creator>
				<category><![CDATA[ApacheCon]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[LucidWorks]]></category>
		<category><![CDATA[Solr]]></category>
		<category><![CDATA[apache]]></category>
		<category><![CDATA[Erik Hatcher]]></category>
		<category><![CDATA[Open Source]]></category>

		<guid isPermaLink="false">http://www.lucidimagination.com/blog/?p=2604</guid>
		<description><![CDATA[<p>At <a href="http://apachecon.com">ApacheCon</a> this week I presented <a href="http://na.apachecon.com/c/acna2010/sessions/571">&#8220;Rapid Prototyping with Solr&#8221;</a>.  This is the third time I&#8217;ve given a presentation with the same title.  In the spirit of the rapid prototyping theme, each time I&#8217;ve created a new prototype just a day or so prior to presenting it.  At <a href="http://lucene-eurocon.org/sessions-track2-day2.html#4">Lucene EuroCon</a> the prototype used attendee data, a treemap visualization, and a cute little Solr-powered &#8220;app&#8221; for picking attendees at random for the conference giveaways.  For &#8230;</p>]]></description>
			<content:encoded><![CDATA[<p>At <a href="http://apachecon.com">ApacheCon</a> this week I presented <a href="http://na.apachecon.com/c/acna2010/sessions/571">&#8220;Rapid Prototyping with Solr&#8221;</a>.  This is the third time I&#8217;ve given a presentation with the same title.  In the spirit of the rapid prototyping theme, each time I&#8217;ve created a new prototype just a day or so prior to presenting it.  At <a href="http://lucene-eurocon.org/sessions-track2-day2.html#4">Lucene EuroCon</a> the prototype used attendee data, a treemap visualization, and a cute little Solr-powered &#8220;app&#8221; for picking attendees at random for the conference giveaways.  For a recent <a href="http://www.lucidimagination.com/blog/2010/06/10/rapid-prototyping-search-applications-with-solr/">Lucid webinar</a> the prototype was more general purpose, bringing in and making searchable rich documents and faceting on file types with a pie chart visualization.</p>
<p>This time around, the data set I chose was <a href="http://www.data.gov/raw/92">Data.gov&#8217;s catalog of datasets</a>, which fit with the ApacheCon open source aura, and Lucid Imagination&#8217;s support of <a href="http://opensourceforamerica.org/awards/2010-recipients">Open Source for America</a>, which helps to advocate for open source in the US Federal Government.  The prototype built includes faceting browsing, query term suggest, hit highlighting, result clustering, spell checking, document detail, and a bonus Venn diagram visualization.</p>
<p><span id="more-2604"></span></p>
<p>The prototype was built with these steps:</p>
<ol>
<li>Install LucidWorks for Solr</li>
<li>Grab the Data.gov catalog CSV file</li>
<li>Iterate a bit with Solr&#8217;s CSV update handler (the funnest way to get data into Solr) and a little Solr schema tinkering</li>
<li>Adjusted the Solr configuration and UI templates to get a nice look and feel, adding in a document detail page and a Venn diagram visualization comparing query overlaps</li>
</ol>
<p>Voilà (click the images for large view):</p>
<table class="plain" style="width: 100%;" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td width="60%"><a href="http://www.lucidimagination.com/blog/wp-content/uploads/2010/11/datagov_search.png"><img class="alignnone size-thumbnail wp-image-2617" title="Data.gov on Solr" src="http://www.lucidimagination.com/blog/wp-content/uploads/2010/11/datagov_search-150x150.png" alt="" width="150" height="150" /></a></td>
<td><a href="http://www.lucidimagination.com/blog/wp-content/uploads/2010/11/datagov_compare.png"><img class="size-thumbnail wp-image-2627" title="query comparison Venn diagram" src="http://www.lucidimagination.com/blog/wp-content/uploads/2010/11/datagov_compare-150x150.png" alt="" width="150" height="150" /></a></td>
</tr>
</tbody>
</table>
<p>This isn&#8217;t the first time we&#8217;ve toyed with Data.gov data&#8230; earlier this year, <a href="../../../../../../blog/2010/05/07/data-mining-data-dot-gov/">Hoss demonstrated Solr&#8217;s stats component</a> on another of Data.gov&#8217;s data sets.</p>
<p>My ApacheCon slides are published at Slideshare and embedded here:</p>
<div id="__ss_5675936" style="width: 425px;"><strong><a title="Rapid prototyping with solr" href="http://www.slideshare.net/erikhatcher/rapid-prototyping-with-solr-5675936">Rapid prototyping with solr</a></strong><object id="__sse5675936" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="425" height="355" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=rapidprototypingwithsolr-101105050018-phpapp01&amp;stripped_title=rapid-prototyping-with-solr-5675936&amp;userName=erikhatcher" /><param name="name" value="__sse5675936" /><param name="allowfullscreen" value="true" /><embed id="__sse5675936" type="application/x-shockwave-flash" width="425" height="355" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=rapidprototypingwithsolr-101105050018-phpapp01&amp;stripped_title=rapid-prototyping-with-solr-5675936&amp;userName=erikhatcher" name="__sse5675936" allowscriptaccess="always" allowfullscreen="true"></embed></object></div>
<p>All the code and instructions for running the entire prototype yourself can be found here: <a href="https://github.com/erikhatcher/solr-rapid-prototyping/tree/master/ApacheCon2010">https://github.com/erikhatcher/solr-rapid-prototyping/tree/master/ApacheCon2010</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.lucidimagination.com/blog/2010/11/05/data-gov-on-solr/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The Apache Lucene Ecosystem: My view of 2009</title>
		<link>http://www.lucidimagination.com/blog/2009/12/24/the-apache-lucene-ecosystem-my-view-of-2009/</link>
		<comments>http://www.lucidimagination.com/blog/2009/12/24/the-apache-lucene-ecosystem-my-view-of-2009/#comments</comments>
		<pubDate>Thu, 24 Dec 2009 15:53:02 +0000</pubDate>
		<dc:creator>Grant Ingersoll</dc:creator>
				<category><![CDATA[apache]]></category>
		<category><![CDATA[ApacheCon]]></category>
		<category><![CDATA[Droids]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[Lucy]]></category>
		<category><![CDATA[Mahout]]></category>
		<category><![CDATA[nutch]]></category>
		<category><![CDATA[Open Relevance Project]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[PyLucene]]></category>
		<category><![CDATA[Solr]]></category>
		<category><![CDATA[Tika]]></category>
		<category><![CDATA[ZooKeeper]]></category>

		<guid isPermaLink="false">http://www.lucidimagination.com/blog/?p=1429</guid>
		<description><![CDATA[<p>It&#8217;s that time of year, so I thought I would take a look back at the year that was for the <a href="http://lucene.apache.org">Lucene Ecosystem</a> and maybe look ahead just a little bit too.</p>
<p>First and foremost, it should be obvious to even the most casual observer that the Apache Lucene communities are thriving.  Not only is it a great time to be involved in open source, it&#8217;s a great time to be involved in Lucene.  Both &#8230;</p>]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s that time of year, so I thought I would take a look back at the year that was for the <a href="http://lucene.apache.org">Lucene Ecosystem</a> and maybe look ahead just a little bit too.</p>
<p>First and foremost, it should be obvious to even the most casual observer that the Apache Lucene communities are thriving.  Not only is it a great time to be involved in open source, it&#8217;s a great time to be involved in Lucene.  Both as a committer and as an employee of Lucid Imagination, I&#8217;m continuously amazed at the vibe produced by the people using the Lucene suite of libraries, tools and applications.  People are routinely solving both large scale and really hard problems using the Lucene ecosystem and they are doing it on time and on budget.  For instance, this year alone, I&#8217;ve seen companies and individuals using Lucene and Solr to provide search in production environments with document counts ranging from the few tens of thousands all the way up to 5-10 billion plus and query rates that barely register a blip to 1000+ QPS.  I&#8217;ve also seen many people using Lucene to power recommendation engines, content management systems, machine learning/NLP applications and log analysis tools.</p>
<p>Much to my initial surprise, the number one reason I hear for why they chose Lucene: flexibility. (I thought it would be the fact that they are free to use, but that is just icing on the proverbial cake, I guess)  Namely, Lucene gives them the flexibility to build what they want or simply to use it out of the box.  It gives them the flexibility to bring in other tools from other open source projects or other commercial vendors, all without compromising speed or scale.</p>
<p>With that in mind, I thought I would give some highlights of both the top level project (TLP &#8212; http://lucene.apache.org &#8212; the ASF project that &#8220;houses&#8221; all of the Lucene related subprojects) that is Lucene as well as the individual projects.  (I&#8217;m not involved in all them, so please correct me if I&#8217;m wrong!)</p>
<h1>Lucene TLP</h1>
<p>Whew!  It&#8217;s been a busy year for the Lucene TLP.  We started the <a href="http://lucene.apache.org/openrelevance">Open Relevance Project</a> (ORP), added <a href="http://lucene.apache.org/pylucene">PyLucene</a> (a Python port of Lucene) and successfully graduated a .NET version of Lucene from ASF incubation, not to mention the fact that the Lucene PMC is responsible for overseeing the release of all the various bits and bytes for each and every subproject (which is a lot of releases!)  We also, for the first time ever, organized two days of Lucene related talks at <a href="http://www.us.apachecon.org">ApacheCon US</a> plus two days of training and meetups. (In the past, organization was always handled by the ASF Conference Committee).</p>
<p>In looking ahead for the TLP, I see a continued focus on providing quality software across all the projects.  Additionally, keep your ears open, as there is a new sub project brewing that I think will really make it even easier for people to deploy Lucene based solutions.  Finally, just as Lucene gave birth to Apache Hadoop and is happy to see it doing so well, there is <a href="http://www.lucidimagination.com/search/document/5a41be454d503779/possible_contribution_at_somewhat_of_a_tangent_to_mahout">growing talk</a> that Lucene will look to see Apache Mahout off as it&#8217;s own TLP.  Of course, none of that is in stone yet!</p>
<p>For those looking for more on the big picture that is Lucene, see my <a href="http://www.us.apachecon.com/c/acus2009/sessions/428">talk</a> at ApacheCon US for more details on the ecosystem.  Not sure why the slides aren&#8217;t there, so I put them <a href="http://people.apache.org/~gsingers/apacheconUS09/luceneEcosystem.pptx">here</a>.</p>
<h1>Lucene Java</h1>
<p><a href="http://lucene.apache.org/java">Lucene Java</a> (i.e. what everyone knows as &#8220;Lucene&#8221;) continues to not only provide a rock solid indexing and search API, it continues to push forward with new capabilities.  In 2009, Lucene did 4 releases (2.4.1, 2.9.0, 2.9.1 and 3.0.0).  2.9.0 was probably the most interesting, as it significantly improved performance in a number of areas, while 3.0.0 removed all of the deprecated APIs and finally, officially, dropped support for Java JDK 1.4.  I&#8217;ll leave it to the reader to go look up all the features and changes as they are numerous.</p>
<p>Looking ahead, the phrase of the year appears to be &#8220;flexible indexing&#8221;.  Flex Indexing looks to make it even easier for people to custom craft what is in their index, whether that is rich token attributes (aka &#8220;typed&#8221; payloads), alternative scoring models (like Okapi BM25) or a bare bones index designed for blazing fast speed.</p>
<h1>Solr</h1>
<p>With Lucene as the engine, <a href="http://lucene.apache.org/solr">Solr</a> has evolved into quite the car.   Building on all of the goodness that is Lucene, Solr, in 2009, released version 1.4 with a whole slew of new features, faster implementations and bug fixes.  Highlights for 1.4 include: improved filtering and faceting performance, support for clustering, rich document indexing via Apache Tika, multi-select faceting (see Lucid&#8217;s very own <a href="http://search.lucidimagination.com">search.lucidimagination.com</a> for a demo), many new Query capabilities and a whole bevy of new Components (Terms, Term Vectors, Auto-suggest, deduplication and Statistics on result sets) that truly make Solr an incredible search platform.</p>
<p>Looking ahead, Solr 1.5 (2.0?) is already in the works and looks to have even more <a href="http://www.lucidimagination.com/blog/2009/12/12/apache-solr-1-5-on-the-move-with-more-functionality/">functionality</a>.  For instance, a lot of work is underway to integrate Apache ZooKeeper and other distributed capabilities, which will help make deploying Solr at scale even easier.  Meanwhile, many are hard at work adding &#8220;field collapsing&#8221; (search result grouping/deduplication) and spatial (local/geo) search.</p>
<h1>Mahout</h1>
<p>It&#8217;s been a very exciting year (in my completely biased opinion!) for <a href="http://lucene.apache.org/mahout">Mahout</a>, the scalable machine learning project under Lucene.  In 2009, Mahout shepherded through it&#8217;s very first release (0.1) built on the strength of a few dedicated volunteers working to add capabilities for clustering, categorization and collaborative filtering.  Next came 0.2 with many new features (frequent patternset mining, Latent Dirichlet Allocation, Random Decision Forests, new recommendation capabilities) API and performance improvements and a growing list of people who stopped lurking and stepped up to help out.  Towards the end of the year, Mahout is already reaching a list volume that I find difficult to keep up with if I miss a day or two.  For starters, we have taken on the task of integrating/transforming the <a href="http://acs.lbl.gov/~hoschek/colt/">Colt</a> matrix library for our needs.  We are also working on adding truly large scale recommendation capabilities plus adding in a Support Vector Machine implementation and Logistic Regression.  Not only that, but the mahout-user@lucene.apache.org mailing list continues to be a valuable resources for people seeking practical advice on deploying machine learning in production environments regardless of the choice of Mahout or not.</p>
<p>In 2010, I suspect Mahout will become it&#8217;s own TLP, with several sub projects roughly divided as: core/utilities, recommendations (Taste) and NLP.  Of course, until it happens, this is just speculation.  I also think Mahout will look to finalize its APIs for a 1.0 release.</p>
<h1>Nutch</h1>
<p>In 2009, <a href="http://lucene.apache.org/nutch">Apache Nutch</a> released the long awaited version 1.0.  This release contained many new indexing and scoring capabilities, as well as integration with Solr.  The community continues to be focused on providing large scale crawling and search capabilities by leveraging Apache Hadoop and Lucene/Solr.  Currently, the community is actively looking at modularizing Nutch to allow it to more easily plug in other ecosystem components like Tika and Solr while focusing on the primary task of obtaining and managing content via crawling.</p>
<h1>Tika</h1>
<p><a href="http://lucene.apache.org/tika/">Apache Tika</a> is a content extraction framework for &#8220;rich&#8221; documents like Adobe PDF and Microsoft Office.  In 2009, Tika released versions 0.3, 0.4 and 0.5, all with incremental improvements designed to make it more stable and easier to use.  Each release also seemed to carry with it a new list of supported file formats as more and more people join the project to lend a hand.</p>
<p>Coming up, I suspect Tika will look to finalize a 1.0 release at some point in 2009 as well as focus in on standardizing, if such a thing is possible, on the metadata artifacts produced by Tika.</p>
<h1>Open Relevance Project</h1>
<p>The <a href="http://lucene.apache.org/openrelevance">ORP</a> is a project that has been in my brain for several years now and finally got off the ground in 2009.  The goal of ORP is to provide corpora, queries, judgments and other tools to help search and machine learning projects discuss relevance in a completely open way.  While the project is really young, it is slowly but surely building up steam by adding some basic tools and collections thanks to the hard work of several individuals.  In 2010, look for ORP to build out a more complete toolset while attracting more users and contributors.  It will also be vital for the ORP to create its own versioned corpora for download (free!) so that all experiments can be reliably reproduced.</p>
<h1>Droids</h1>
<p><a href="http://incubator.apache.org/droids">Droids</a> is a standalone crawler framework currently in incubation at the ASF.  Development was active in 2009, but has not yet had a release.  For now, it is a Spring based framework that allows one to quickly build out agents that can go and crawl and process content.</p>
<h1>Lucene.NET</h1>
<p>In 2009, <a href="http://incubator.apache.org/lucene.net/">Lucene.NET</a> graduated (some infrastructure changes still need to happen) from ASF incubation and became a full-fledged member of the Lucene ecosystem.  While I&#8217;m not closely involved with Lucene.NET, the community continues to provide value to those looking for a solid search library in .NET.  Since the project is mostly autogenerated from the Java sources, the .NET version has tracked the Lucene Java releases fairly closely.</p>
<p>Looking forward, I expect the .NET version will strive to maintain a lockstep march with Lucene releases.</p>
<h1>PyLucene</h1>
<p>Similar to .NET, <a href="http://lucene.apache.org/pylucene">PyLucene</a> produces a Python port of Lucene Java.  In 2009, PyLucene was formerly welcomed into the Lucene fold via a software donation by Andi Vajda.  It continues to produce releases of PyLucene in lockstep with Lucene Java.</p>
<h1>Lucy</h1>
<p><a href="http://lucene.apache.org/lucy">Lucy</a> is a &#8220;loose&#8221; &#8216;C&#8217; port of Lucene.  Lucy finally got off the ground in 2009 and is steadily working on building out a core search library that provides fast search capabilities for languages like Perl, C and Ruby.</p>
<p>For 2010, look for Lucy to continue to grow its community while adding capabilities.</p>
<h1>Moving Forward</h1>
<p>While the past is, of course, no prediction of the future, I think it&#8217;s safe to say Lucene is looking to continue to provide significant capabilities and value to both well established and new communities alike.  With open source, you never know where the next good idea is coming from, so make sure to stay tuned both here and on the mailing lists for more insight and more cool new capabilities.</p>
<p>Happy Holidays and here&#8217;s to an Open Source 2010!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.lucidimagination.com/blog/2009/12/24/the-apache-lucene-ecosystem-my-view-of-2009/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Solr Flair at ApacheCon US 09 (Lucene Meetup)</title>
		<link>http://www.lucidimagination.com/blog/2009/11/04/solr-flair-at-apachecon-us-09-lucene-meetup/</link>
		<comments>http://www.lucidimagination.com/blog/2009/11/04/solr-flair-at-apachecon-us-09-lucene-meetup/#comments</comments>
		<pubDate>Wed, 04 Nov 2009 10:12:04 +0000</pubDate>
		<dc:creator>Erik Hatcher</dc:creator>
				<category><![CDATA[ApacheCon]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Solr]]></category>

		<guid isPermaLink="false">http://www.lucidimagination.com/blog/?p=1275</guid>
		<description><![CDATA[<div style="width:425px;text-align:left" id="__ss_2418882"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" href="http://www.slideshare.net/erikhatcher/solr-flair" title="Solr Flair: Search User Interfaces Powered by Apache Solr (ApacheCon US 2009, Lucene Meetup)">Solr Flair: Search User Interfaces Powered by Apache Solr (ApacheCon US 2009, Lucene Meetup)</a>
<div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">documents</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/erikhatcher">Erik Hatcher</a>.</div>
&#8230;</div>]]></description>
			<content:encoded><![CDATA[<div style="width:425px;text-align:left" id="__ss_2418882"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" href="http://www.slideshare.net/erikhatcher/solr-flair" title="Solr Flair: Search User Interfaces Powered by Apache Solr (ApacheCon US 2009, Lucene Meetup)">Solr Flair: Search User Interfaces Powered by Apache Solr (ApacheCon US 2009, Lucene Meetup)</a><object style="margin:0px" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=solrflair-091104035813-phpapp02&#038;stripped_title=solr-flair" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=solrflair-091104035813-phpapp02&#038;stripped_title=solr-flair" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object>
<div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">documents</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/erikhatcher">Erik Hatcher</a>.</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.lucidimagination.com/blog/2009/11/04/solr-flair-at-apachecon-us-09-lucene-meetup/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Come to the Lucene Meetup at ApacheCon in Oakland!</title>
		<link>http://www.lucidimagination.com/blog/2009/11/02/come-to-the-lucene-meetup-at-apachecon-in-oakland/</link>
		<comments>http://www.lucidimagination.com/blog/2009/11/02/come-to-the-lucene-meetup-at-apachecon-in-oakland/#comments</comments>
		<pubDate>Mon, 02 Nov 2009 16:06:23 +0000</pubDate>
		<dc:creator>Ken Hoyle</dc:creator>
				<category><![CDATA[apache]]></category>
		<category><![CDATA[ApacheCon]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[Lucid Imagination]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Solr]]></category>

		<guid isPermaLink="false">http://www.lucidimagination.com/blog/?p=1271</guid>
		<description><![CDATA[[ Tuesday, 3 November 2009; 20:00 to 22:00. ] <p>Come visit us at the Lucene Meetup at ApacheCon on Tuesday 11/3 from 8-10pm. All are welcome to come &#8211; there is no cost for this event. Come meet many of the key contributors to Lucene and Solr. Sponsored by Lucid Imagination.</p>
<p>Location: Marriott Oakland City Center, Rooms 1&#38;2</p>
<p>For more information about the meetup, visit<br />
<a href="http://wiki.apache.org/lucene-java/LuceneAtApacheConUs2009">http://wiki.apache.org/lucene-java/LuceneAtApacheConUs2009</a>&#8230;</p>]]></description>
			<content:encoded><![CDATA[[ Tuesday, 3 November 2009; 20:00 to 22:00. ] <p>Come visit us at the Lucene Meetup at ApacheCon on Tuesday 11/3 from 8-10pm. All are welcome to come &#8211; there is no cost for this event. Come meet many of the key contributors to Lucene and Solr. Sponsored by Lucid Imagination.</p>
<p>Location: Marriott Oakland City Center, Rooms 1&amp;2</p>
<p>For more information about the meetup, visit<br />
<a href="http://wiki.apache.org/lucene-java/LuceneAtApacheConUs2009">http://wiki.apache.org/lucene-java/LuceneAtApacheConUs2009</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.lucidimagination.com/blog/2009/11/02/come-to-the-lucene-meetup-at-apachecon-in-oakland/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ApacheCon Europe Follow Up</title>
		<link>http://www.lucidimagination.com/blog/2009/04/01/apachecon-europe-follow-up/</link>
		<comments>http://www.lucidimagination.com/blog/2009/04/01/apachecon-europe-follow-up/#comments</comments>
		<pubDate>Wed, 01 Apr 2009 11:16:08 +0000</pubDate>
		<dc:creator>Grant Ingersoll</dc:creator>
				<category><![CDATA[ApacheCon]]></category>
		<category><![CDATA[Droids]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[Mahout]]></category>
		<category><![CDATA[Solr]]></category>
		<category><![CDATA[Tika]]></category>

		<guid isPermaLink="false">http://www.lucidimagination.com/blog/?p=601</guid>
		<description><![CDATA[<p>Another year, another successful <a href="http://www.eu.apachecon.com/">ApacheCon Europe</a>, at least as far as Lucene, Solr and I are concerned.  This year, like last, Erik Hatcher and I had trainings on Lucene and Solr.  Both were well attended, despite the economy, showing once again the power of open source and the fact that people are still invested in search.  (If you missed the training, see <a href="http://www.lucidimagination.com/How-We-Can-Help/Training">here</a> for alternatives.)</p>
<p>During the conference, there were several talks on Lucene, &#8230;</p>]]></description>
			<content:encoded><![CDATA[<p>Another year, another successful <a href="http://www.eu.apachecon.com/">ApacheCon Europe</a>, at least as far as Lucene, Solr and I are concerned.  This year, like last, Erik Hatcher and I had trainings on Lucene and Solr.  Both were well attended, despite the economy, showing once again the power of open source and the fact that people are still invested in search.  (If you missed the training, see <a href="http://www.lucidimagination.com/How-We-Can-Help/Training">here</a> for alternatives.)</p>
<p>During the conference, there were several talks on Lucene, Solr,  Mahout and Droids.  Slides are available at:</p>
<ul>
<li><a href="http://www.eu.apachecon.com/c/aceu2009/sessions/136">Introducing Mahout</a></li>
<li><a href="http://www.eu.apachecon.com/c/aceu2009/sessions/137">Lucene/Solr Case Studies</a></li>
<li><a href="http://www.eu.apachecon.com/c/aceu2009/sessions/138">Advanced Indexing</a> (slides are missing, but should be up sometime soon)</li>
<li><a href="http://www.eu.apachecon.com/c/aceu2009/sessions/165">Apache Droids</a></li>
<li><a href="http://www.eu.apachecon.com/c/aceu2009/sessions/250">Best of Breed: HTTP Server, Forrest, Solr and Droids</a></li>
<li><a href="http://www.eu.apachecon.com/c/aceu2009/sessions/251">Apache Solr: A Case Study</a></li>
</ul>
<p>Additionally, for the first time,  we had a <a href="http://wiki.apache.org/lucene-java/LuceneMeetupMarch2009">Lucene Meetup</a> (sponsored by Lucid).  I&#8217;d estimate there were around 60 people there and we had some good discussions on Tika, Lucene, Solr and Mahout.    Also, Uwe Schindler presented his new TrieRange Query capabilities.  Slides are available <a href="http://www.thetaphi.de/share/Schindler-TrieRange.ppt">here</a>.</p>
<p>Finally, my favorite part of the conference is always the individual conversations with people using the Lucene ecosystem to solve their problems.  Each year, it seems, people have more and more new ideas about how to use Lucene and Solr, many of which go beyond &#8220;traditional&#8221; search.  Over the coming months, I think you will see more and more of Lucid highlighting all Lucene ecosystem users through our <a href="http://www.lucidimagination.com/Community/Hear-from-the-Experts/Podcasts-and-Videos">Podcasts</a>, <a href="http://www.lucidimagination.com/Community/Marketplace/Application-Showcase-Wiki">Wiki Showcase</a> and other features coming soon.  So, if you think you&#8217;ve got something cool using the Lucene ecosystem, add a comment below or drop us a line at feedback@lucidimagination.com</p>
<p>UPDATE: 4/9/09:  Uri has sent me his slides and they can be downloaded at: <a href="http://www.lucidimagination.com/blog/wp-content/uploads/2009/04/apache-conference-2009.pdf">Apache Solr Case Study</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.lucidimagination.com/blog/2009/04/01/apachecon-europe-follow-up/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Lucene and Solr training at ApacheCon Europe</title>
		<link>http://www.lucidimagination.com/blog/2009/03/16/lucene-and-solr-training-at-apachecon-europe/</link>
		<comments>http://www.lucidimagination.com/blog/2009/03/16/lucene-and-solr-training-at-apachecon-europe/#comments</comments>
		<pubDate>Mon, 16 Mar 2009 21:16:06 +0000</pubDate>
		<dc:creator>Grant Ingersoll</dc:creator>
				<category><![CDATA[ApacheCon]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[Solr]]></category>

		<guid isPermaLink="false">http://www.lucidimagination.com/blog/?p=397</guid>
		<description><![CDATA[<p>Just a reminder that Erik and Grant are offering Lucene and Solr training at <a href="http://www.eu.apachecon.com/c/aceu2009/">ApacheCon Europe</a> next week.  Grant&#8217;s class is a <a href="http://www.eu.apachecon.com/c/aceu2009/sessions/197">2-day hands-on training</a> on Lucene designed to get you up and working with Lucene and provide  information about where to go next.  Erik&#8217;s class is a <a href="http://www.eu.apachecon.com/c/aceu2009/sessions/201">1-day session</a> on getting up and running with Solr.</p>
<p>Also,  note both Erik and I will be at the <a href="http://www.eu.apachecon.com/c/aceu2009/schedule/events">Lucene meetup</a> on Tuesday night!&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p>Just a reminder that Erik and Grant are offering Lucene and Solr training at <a href="http://www.eu.apachecon.com/c/aceu2009/">ApacheCon Europe</a> next week.  Grant&#8217;s class is a <a href="http://www.eu.apachecon.com/c/aceu2009/sessions/197">2-day hands-on training</a> on Lucene designed to get you up and working with Lucene and provide  information about where to go next.  Erik&#8217;s class is a <a href="http://www.eu.apachecon.com/c/aceu2009/sessions/201">1-day session</a> on getting up and running with Solr.</p>
<p>Also,  note both Erik and I will be at the <a href="http://www.eu.apachecon.com/c/aceu2009/schedule/events">Lucene meetup</a> on Tuesday night!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.lucidimagination.com/blog/2009/03/16/lucene-and-solr-training-at-apachecon-europe/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Lucene Meetup March 2009</title>
		<link>http://www.lucidimagination.com/blog/2009/02/27/lucene-meetup-march-2009/</link>
		<comments>http://www.lucidimagination.com/blog/2009/02/27/lucene-meetup-march-2009/#comments</comments>
		<pubDate>Fri, 27 Feb 2009 12:59:07 +0000</pubDate>
		<dc:creator>Grant Ingersoll</dc:creator>
				<category><![CDATA[ApacheCon]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[Lucid Imagination]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Solr]]></category>

		<guid isPermaLink="false">http://www.lucidimagination.com/blog/?p=219</guid>
		<description><![CDATA[[ Tuesday, 24 March 2009; 08:00; ] <p><a href="http://wiki.apache.org/lucene-java/LuceneMeetupMarch2009#preview">LuceneMeetupMarch2009 &#8211; Lucene-java Wiki</a>.</p>
<p>Lucene users (Solr, Nutch, Mahout, Tika, etc.) are all invited to attend a Lucene Meetup on March 24th in Amsterdam, Netherlands.  Looks like Lucene creator Doug Cutting will be there, as well as some of the other Lucene committers.  I always like these informal gatherings, as they are a great way to share ideas and meet fellow coders.&#8230;</p>]]></description>
			<content:encoded><![CDATA[[ Tuesday, 24 March 2009; 08:00; ] <p><a href="http://wiki.apache.org/lucene-java/LuceneMeetupMarch2009#preview">LuceneMeetupMarch2009 &#8211; Lucene-java Wiki</a>.</p>
<p>Lucene users (Solr, Nutch, Mahout, Tika, etc.) are all invited to attend a Lucene Meetup on March 24th in Amsterdam, Netherlands.  Looks like Lucene creator Doug Cutting will be there, as well as some of the other Lucene committers.  I always like these informal gatherings, as they are a great way to share ideas and meet fellow coders.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.lucidimagination.com/blog/2009/02/27/lucene-meetup-march-2009/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

