You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by revati joshi <re...@yahoo.com> on 2006/02/18 12:11:00 UTC

Indexing problem

Hi,
       i'm facing problem while indexing the files .There r some files which r not in normal ASCII format having ASCII characters but r in some arabic,french format which i don't want to index those files.
  Because of these files my indexing process gets halted in between.so is there any class in Lucene to ignore such files.
  plz suggest some solution to this.
  Thnkx for ur cooperation in advance.
                                                         regards,
                                                          Revati.

		
---------------------------------
 
 What are the most popular cars? Find out at Yahoo! Autos 

Re: Indexing problem

Posted by Erik Hatcher <er...@ehatchersolutions.com>.
On Feb 18, 2006, at 6:11 AM, revati joshi wrote:
>        i'm facing problem while indexing the files .There r some  
> files which r not in normal ASCII format having ASCII characters  
> but r in some arabic,french format which i don't want to index  
> those files.
>   Because of these files my indexing process gets halted in  
> between.so is there any class in Lucene to ignore such files.
>   plz suggest some solution to this.

No, there is nothing in Lucene to help you here.  Lucene does not  
deal with files, only text.  It is your application that needs to  
deal with files appropriately.  Perhaps you could simply catch and  
ignore exceptions to deal with this situation?

	Erik


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org