You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by suneethad <su...@india.adventnet.com> on 2002/02/12 04:50:14 UTC

Remove Duplicates

Hello team,
    I have indexed  a set of files based on some categories but I find
the urls to crawl that I've given has a lot of duplication . How
can I remove them .I want to refine the hit results too.
    Secondly can I index also some database values along with the file
contents.
Regards,
Suneetha


IOException

Posted by Pradeep Kumar K <pr...@robosoftin.com>.
Hello Lucene friends

     I am encountering  a serious problem while I am reading the indexes 
I stored   in the database using

      IndexSearcher  tempSearcher = new IndexSearcher(new IndexStore());
      where "IndexStore.java" is the java file I used to
      track my indexing and searching.
      Indexed files I am storing in Openbase database.

When I went deeply it is found that the function

     InputStream openFile(String name) is giving the exception

The error is
Could not perform search because:
class java.io.IOException - read past EOF
java.io.IOException: read past EOF

I am using Lucene2 cadidate3(latest downloaded). Any solutions for this 
will be greatfully appreciated.

regards

Pradeep


--------------------------------------------------------------
Robosoft Technologies, Mangalore, India



--
To unsubscribe, e-mail:   <ma...@jakarta.apache.org>
For additional commands, e-mail: <ma...@jakarta.apache.org>