Lucid Imagination

Secondary links

  • Contact Us
  • Log out
  • Downloads
  • Solutions
    • Partners |
    • Blog |
    • Software |
    • Services |
    • Training |
    • Case Studies |
    • Webinars |
  • Developers
    • Blog |
    • Tech Articles |
    • Community |
    • Docs |
    • Downloads |
    • Whitepapers |
    • Podcasts |
  • About
    • Market Overview |
    • Management |
    • Company News |
    • In the Media |
    • Contact |

beta

Start new search

Options

  • results per page

Clear all facets

  • Project clear projects

  • Source clear sources

  • Author clear authors

Search Results for

Results loading...

Found 36,204 results in 0.133 seconds. Displaying page 3 of 3,621, sorted by

  1. [nutch-user] Write plugin in my own package with Nutch as a jar

    Sent 2010-09-01 by jitendra rajput <jeet.loves@...>

    Hi, I have gone through the tutorial about writing plugin in Nutch source code itself. But I want to write a nutch plugin in my own package with Nutch jar in its build path. Is it possible to do so. Can any one lead me to right direction for same. Any help would be appreciated. -- Thanks and...

  2. [nutch-dev] Re: crawling webpage results

    Sent 2010-09-01 by Alex McLintock <alex.mclintock@...>

    This should really be a user type question, not a dev question. But what the heck. The first thing which comes to mind is to do the search yourself and provide the results of that search as seed pages. But since you asked on the dev mailing list, you could possibly write something which actuall...

  3. [nutch-dev] crawling webpage results

    Sent 2010-09-01 by Shanthoosh PV <shanthoosh@...>

    Hi , I want to crawl a result obtained based upon a user defined keyword search in a search engine . Is it possible to do it in nutch . Please provide useful insights , i tried searching in this forum and google but found nothing helpful . The user may p...

  4. [nutch-user] Re: performance for small cluster

    Sent 2010-08-31 by AJ Chen <ajchen@...>

    Thanks for suggesting multiple segments approach - it's the way to go for further increasing crawling throughput. I tried the -maxNumSegments 3 option in local mode, but it did not generate 3 segments. Does the option work? It may be only work in distributed mode. I also observe that, when fet...

  5. [nutch-user] Re: Help: Extracted Links with characters like ?,= are getting filtered out.

    Sent 2010-08-31 by Jitendra <jeet.loves@...>

    Thanks a ton volli. I wasted 2 days trying to figure this out, never noticed crawl-urifilter.txt also contains regex expressions for filtering urls. Volli wrote: > > Did you try already to switch off the regexp in > crawl-urlfilter.txt? > > if you use > bin/nutch crawl... > for crawling cra...

  6. [nutch-user] Re: Help: Extracted Links with characters like ?,= are getting filtered out.

    Sent 2010-08-31 by Volli <illov@...>

    Did you try already to switch off the regexp in crawl-urlfilter.txt? if you use bin/nutch crawl... for crawling crawl-urlfilter.txt must be changed. compare other lines, too. see "# skip everything else" and "# accept anything else" Am 31.08.2010 10:32, schrieb jitendra rajput: > Hi, > > I a...

  7. [nutch-user] Help: Extracted Links with characters like ?,= are getting filtered out.

    Sent 2010-08-31 by jitendra rajput <jeet.loves@...>

    Hi, I am trying to write XpathBasedLinkExtractor which extracts links out of html page using xpaths. But all the extracted links which contains characters like [? , = ] are being filtered out. I am not able to nail it down where it is happening. They are not going into segments. I have also comm...

  8. [nutch-dev] Re: Alternative search box for Nutch site

    Sent 2010-08-30 by Andrzej Bialecki <ab@...>

    On 2010-08-30 12:21, Otis Gospodnetic wrote: > Hello peeps, > > We've created a patch for Tika and got some good and constructive feedback (see > https://issues.apache.org/jira/browse/TIKA-488 ). > > Should we follow the same functionality pattern for nutch.apache.org as seen in > TIKA-488? Sure...

  9. [nutch-dev] Re: Alternative search box for Nutch site

    Sent 2010-08-30 by Otis Gospodnetic <ogjunk-nutch@...>

    Hello peeps, We've created a patch for Tika and got some good and constructive feedback (see https://issues.apache.org/jira/browse/TIKA-488 ). Should we follow the same functionality pattern for nutch.apache.org as seen in TIKA-488? Thanks, Otis ---- Sematext :: http://sematext.com/ :: Solr ...

  10. [nutch-user] Re: bug in custom-fields.xml?

    Sent 2010-08-28 by Savannah Beckett <savannah_beckett30@...>

    one more thing, in code CustomFieldQueryFilter.java, it doesn't loop through same key more than once.  It looks like it never expect more than one custom field in the xml.  ________________________________ From: Savannah Beckett To: user@nutch.apache.org S...

  1. <<
  2. 1
  3. 2
  4. 3
  5. 4
  6. 5
  7. 6
  8. 7
  9. 8
  10. 9
  11. 10
  12. >>

Solr Powered

Give us your feedback

  • Lucene
  • Solr
  • Nutch
  • Tika
  • Mahout
  • Droids
  • PyLucene
  • Lucene.Net
  • Lucy
  • Lucene4c
  • Open Relevance Project
  • How We Can Help:
    • Getting Started |
    • Support Subscriptions |
    • White Papers |
    • Training |
    • Consulting |
    • Contact Us |
  • Developers:
    • Blog |
    • Documentation |
    • Tech Articles |
    • Podcasts and Videos |
    • Community |
  • Downloads:
    • LucidWorks for Solr |
    • LucidWorks for Lucene |
    • LucidGaze for Solr |
    • LucidGaze for Lucene |
  • Products:
  • Services:

Contact | Privacy Policy | Legal Terms of Use | Copyrights and Disclaimers | Logout

Apache Solr, Apache Lucene, ApacheCon and their logos are trademarks of the Apache Software Foundation.

© 2010 Lucid Imagination. All Right reserved.