Found 121,742 results in 0.163 seconds. Displaying page 8 of 12,175, sorted by
Sent 2010-08-26 by Li Li <fancyerii@...>
" It is strange that it should take 20 second to gather fields,"
20s including search and gather fields, it's the total time
2010/8/27 Karl Wettin :
> My mail client died while sending this mail.. Sorry for any duplicate.
>
> It is strange that it should take 20 second to ...
Sent 2010-08-26 by Shelly_Singh <Shelly_Singh@...>
Hi,
I have a lucene index of 100 million documents. But the document size is very small - 5 fields with 1 or 2 terms each. Only 1 field is analyzed and others are just simply indexed. The index is optimized to 2 segments and the total index size is 7GB.
I open a searcher with a termsInfoDivisor...
Sent 2010-08-26 by Li Li <fancyerii@...>
if I index only 7k documents, the time comparison:
time1: 7602331019 time2: 4246878035 total1: 10736 total2: 7393
it seems II is faster than RAMDirectory.
My indexed texts are all hotel names (chinese and english, litter french).
it has about 100k terms. terms such as hotel is very frequent and
...
Sent 2010-08-26 by Arun Rangarajan <arunrangarajan@...>
Thanks, Lance. After exploring for a while, I used lucene's ShingleFilter
followed by the SynonymFilter in Lucene in Action book. Then using the type
attribute, I removed all the shingles which did not belong to any category.
On Wed, Aug 18, 2010 at 10:28 PM, Lance Norskog wr...
Sent 2010-08-26 by Grant Ingersoll <gsingers@...>
I think that fits with its design goals. It wasn't intended for a large number of docs like RAMDir, more for the case where one might use MemoryIndex, but has more than one doc, but not so many that you need a more compact representation.
-Grant
On Aug 26, 2010, at 4:11 PM, Earwin Burrfoot wro...
Sent 2010-08-26 by Robert Muir <rcmuir@...>
On Thu, Aug 26, 2010 at 6:06 PM, Chris Hostetter
wrote:
>
> we could concievable support having LimitTokenCountFilter added implicitly
> even if that option isn't used, via some syntax like you suggest -- but
> honestly i think it's better to just let the user declare i...
Sent 2010-08-26 by Chris Hostetter <hossman_lucene@...>
This can be dealt with in a lot of differnet ways in Solr -- even if
Lucene removes all suppport for the IndexWriter.maxFieldLength, Solr can
still support it by wrapping every analyzer with a LimitTokenCountFilter
if that config option is used.
we could concievable support having LimitToke...
Sent 2010-08-26 by Stefan Nikolic <stefan.nikolic@...>
Hi everyone,
I'm trying to figure out the effects on search performance of using the
non-CFS format and spreading the various underlying files to different
disks/media types. For example, I'm considering moving a segment's various
.t* term-related files onto a solid-state drive, the .fdx/.fdt
st...
Sent 2010-08-26 by Earwin Burrfoot <earwin@...>
I meant exactly what I said. My opinion is that Instantiated puts more
strain on GC compared to RAMDir. If that aspect is negligible for you
(less docs, smaller heap) it quite probably runs faster, if not -
after some point GC costs will dwarf any gains.
2010/8/26 Li Li :
> ...
Sent 2010-08-26 by Amin Mohammed-Coleman <aminmc@...>
Hi Erick
Thanks for your response. I used the Lucene in Action 1st edition as a reference for batch indexing. I've just got my copy of the 2nd edition which mentions that there is no point in using RAM directory. Not saying I don't trust you :).
I'll update my code to use the normal fs direct...