You are viewing a plain text version of this content. The canonical link for it is here.

Posted to java-user@lucene.apache.org by Chantal Ackermann <ch...@biomax.de> on 2001/12/05 09:22:04 UTC

FileNotFoundException

hello all,

I am still trying to find the best way to index a really big amount of data. 
at the moment I am trying to index each of the 29 textfiles in a single 
thread using for each an own IndexWriter and an own directory where to place 
the index. there are always six threads working the same time.

the problem that occures now is that every second thread stops due to a 
FileNotFoundException or an ArrayIndexOutOfBoundsException (the latter only 
once) while the other half finishes fine. the file's name is different for 
each thread but has always the extension ".fnm".

for example:
java.io.FileNotFoundException: 
/lucenetest/medlineIndex/1976-1977/_2zfj.fnm (Datei oder 
Verzeichnis nicht gefunden)
        at java.io.RandomAccessFile.open(Native Method)
        at java.io.RandomAccessFile.<init>(RandomAccessFile.java(Compiled 
Code))        at 
java.io.RandomAccessFile.<init>(RandomAccessFile.java(Compiled Code))        
at org.apache.lucene.store.FSInputStream.<init>(Unknown Source)
        at org.apache.lucene.store.FSInputStream.<init>(Unknown Source)
        at org.apache.lucene.store.FSDirectory.openFile(Unknown Source)
        at org.apache.lucene.index.FieldInfos.<init>(Unknown Source)
        at org.apache.lucene.index.SegmentReader.<init>(Unknown Source)
        at org.apache.lucene.index.IndexWriter.mergeSegments(Unknown Source)
        at org.apache.lucene.index.IndexWriter.maybeMergeSegments(Unknown 
Source)
        at org.apache.lucene.index.IndexWriter.maybeMergeSegments(Unknown 
Source)
        at org.apache.lucene.index.IndexWriter.addDocument(Unknown Source)
        at 
de.biomax.lucenetest.MedlineRecordIndexer.indexDocs(MedlineRecordIndexer.java(Compiled 
Code))

since half of the files are indexed without throwing that kind of exception 
I'm at a loss where to start debugging. any ideas?

thanks a lot
chantal

--
To unsubscribe, e-mail:   <ma...@jakarta.apache.org>
For additional commands, e-mail: <ma...@jakarta.apache.org>

Re: FileNotFoundException

Posted by Ian Lea <ia...@blackwell.co.uk>.

I don't have an explanation for this, but if it was me indexing
this large amount of data I'd be running each of the 6 in a
completely separate process.  More control, less damage when
one bit fails, perhaps better performance on a multi-processor
machine.  And perhaps you wouldn't get this problem!


--
Ian.
ian.lea@blackwell.co.uk


Chantal Ackermann wrote:
> 
> hello all,
> 
> I am still trying to find the best way to index a really big amount of data.
> at the moment I am trying to index each of the 29 textfiles in a single
> thread using for each an own IndexWriter and an own directory where to place
> the index. there are always six threads working the same time.
> 
> the problem that occures now is that every second thread stops due to a
> FileNotFoundException or an ArrayIndexOutOfBoundsException (the latter only
> once) while the other half finishes fine. the file's name is different for
> each thread but has always the extension ".fnm".
> 
> for example:
> java.io.FileNotFoundException:
> /lucenetest/medlineIndex/1976-1977/_2zfj.fnm (Datei oder
> Verzeichnis nicht gefunden)
>         at java.io.RandomAccessFile.open(Native Method)
>         at java.io.RandomAccessFile.<init>(RandomAccessFile.java(Compiled
> Code))        at
> java.io.RandomAccessFile.<init>(RandomAccessFile.java(Compiled Code))
> at org.apache.lucene.store.FSInputStream.<init>(Unknown Source)
>         at org.apache.lucene.store.FSInputStream.<init>(Unknown Source)
>         at org.apache.lucene.store.FSDirectory.openFile(Unknown Source)
>         at org.apache.lucene.index.FieldInfos.<init>(Unknown Source)
>         at org.apache.lucene.index.SegmentReader.<init>(Unknown Source)
>         at org.apache.lucene.index.IndexWriter.mergeSegments(Unknown Source)
>         at org.apache.lucene.index.IndexWriter.maybeMergeSegments(Unknown
> Source)
>         at org.apache.lucene.index.IndexWriter.maybeMergeSegments(Unknown
> Source)
>         at org.apache.lucene.index.IndexWriter.addDocument(Unknown Source)
>         at
> de.biomax.lucenetest.MedlineRecordIndexer.indexDocs(MedlineRecordIndexer.java(Compiled
> Code))
> 
> since half of the files are indexed without throwing that kind of exception
> I'm at a loss where to start debugging. any ideas?
> 
> thanks a lot
> chantal

--
To unsubscribe, e-mail:   <ma...@jakarta.apache.org>
For additional commands, e-mail: <ma...@jakarta.apache.org>