You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucenenet.apache.org by "Satish P Wani (JIRA)" <ji...@apache.org> on 2010/12/18 11:24:01 UTC
[jira] Created: (LUCENENET-386) Stop words in Unicode/UTF8 format
is not working with Lucene .Net 2.9.2
Stop words in Unicode/UTF8 format is not working with Lucene .Net 2.9.2
-----------------------------------------------------------------------
Key: LUCENENET-386
URL: https://issues.apache.org/jira/browse/LUCENENET-386
Project: Lucene.Net
Issue Type: Wish
Environment: Windows XP, Lucene .Net
Reporter: Satish P Wani
We are using Lucene .Net version 2.9.2. We tried giving Unicode/UTF8 stop words (words in Devnagari) using following 3 cases. All below cases are not working. Please let us know whether any one has tried this.
// Case 1
string StopWordsFilePath="D:\\lucene\stopword.txt"
FileInfo StopWordsList = new FileInfo(StopWordsFilePath);
StandardAnalyzer _oStandardAnalyzer = new StandardAnalyzer(Lucene.Net.Util.Version.LUCENE_CURRENT, StopWordsList);
IndexWriter indexWriter = new IndexWriter(IndexFolderPath, _oStandardAnalyzer, UpdateIndexFlag);
// Case 2
string StopWordsFilePath="D:\\lucene\stopword.txt" // File type of this file is UTF-8
string[] stopWords = File.ReadAllLines(StopWordsFilePath, Encoding.UTF8);
StandardAnalyzer _oStandardAnalyzer = new StandardAnalyzer(stopWords);
IndexWriter indexWriter = new IndexWriter(IndexFolderPath, _oStandardAnalyzer, UpdateIndexFlag);
// Case 3
string StopWordsFilePath="D:\\lucene\stopword.txt" // File type of this file is Unicode
string[] stopWords = File.ReadAllLines(StopWordsFilePath, Encoding.Unicode);
StandardAnalyzer _oStandardAnalyzer = new StandardAnalyzer(stopWords);
IndexWriter indexWriter = new IndexWriter(IndexFolderPath, _oStandardAnalyzer, UpdateIndexFlag);
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Closed: (LUCENENET-386) Stop words in Unicode/UTF8 format is
not working with Lucene .Net 2.9.2
Posted by "Digy (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/LUCENENET-386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Digy closed LUCENENET-386.
--------------------------
Resolution: Invalid
JIRA isn't the right place to ask questions. Please use mailing lists.
DIGY
> Stop words in Unicode/UTF8 format is not working with Lucene .Net 2.9.2
> -----------------------------------------------------------------------
>
> Key: LUCENENET-386
> URL: https://issues.apache.org/jira/browse/LUCENENET-386
> Project: Lucene.Net
> Issue Type: Wish
> Environment: Windows XP, Lucene .Net
> Reporter: Satish P Wani
> Original Estimate: 336h
> Remaining Estimate: 336h
>
> We are using Lucene .Net version 2.9.2. We tried giving Unicode/UTF8 stop words (words in Devnagari) using following 3 cases. All below cases are not working. Please let us know whether any one has tried this.
> // Case 1
> string StopWordsFilePath="D:\\lucene\stopword.txt"
> FileInfo StopWordsList = new FileInfo(StopWordsFilePath);
> StandardAnalyzer _oStandardAnalyzer = new StandardAnalyzer(Lucene.Net.Util.Version.LUCENE_CURRENT, StopWordsList);
> IndexWriter indexWriter = new IndexWriter(IndexFolderPath, _oStandardAnalyzer, UpdateIndexFlag);
> // Case 2
> string StopWordsFilePath="D:\\lucene\stopword.txt" // File type of this file is UTF-8
> string[] stopWords = File.ReadAllLines(StopWordsFilePath, Encoding.UTF8);
> StandardAnalyzer _oStandardAnalyzer = new StandardAnalyzer(stopWords);
> IndexWriter indexWriter = new IndexWriter(IndexFolderPath, _oStandardAnalyzer, UpdateIndexFlag);
> // Case 3
> string StopWordsFilePath="D:\\lucene\stopword.txt" // File type of this file is Unicode
> string[] stopWords = File.ReadAllLines(StopWordsFilePath, Encoding.Unicode);
> StandardAnalyzer _oStandardAnalyzer = new StandardAnalyzer(stopWords);
> IndexWriter indexWriter = new IndexWriter(IndexFolderPath, _oStandardAnalyzer, UpdateIndexFlag);
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.