Posted to dev@lucene.apache.org by Andrzej Bialecki <ab...@getopt.org> on 2004/06/22 14:10:50 UTC

ANN: Luke v. 0.5 released

Hello fellow Luceners,

I'm pleased to announce that a new release of Luke is now available. You 
can download it from:

	http://www.getopt.org/luke/

This release uses Lucene 1.4-rc4.

This release also represents a major step forward - many new exciting 
features have been added. The feature I consider the most important in 
this release is extensibility - there is a plugin framework, and a 
sample plugin is provided in the distribution - I encourage you to write 
more.

Here's a short summary of changes in this release:

     * NEW: Added support for Term Vectors.
     * NEW: Added a plugin framework - plugins found on the classpath are
	detected automatically and added to the new "Plugins" tab.
	Note however that for now plugin autoloading doesn't quite
	work when using Java WebStart - an alternative mechanism is also
	provided. Plugins have full access to the application context.
	Please read the JavaDoc for LukePlugin.java for more information.
     * NEW: A sample plugin is provided, based on Mark Harwood's "tool
	for analyzing analyzers".
     * NEW: all tables support resizable columns now. Some dialogs are
	also resizable.
     * NEW: Added Reconstruct functionality. Using this function users
	can reconstruct the content of all fields of a document,
	including unstored ones. This function uses a brute-force
	approach, so it may be slow for larger indexes (> 500,000 docs).
     * NEW: Added "pseudo-edit" functionality. The new document editor
	dialog lets you modify reconstructed documents, and add them or
	replace the original ones.
     * FIX: problems with MRU list solved, and a framework for handling
	preferences introduced.
     * FIX: the list of available Analyzers is now dynamically populated
	from the classpath, using the same method as in the AnalyzerTool
	plugin. This also doesn't work in WebStart, so a fallback to a
	static list is provided.
     * FIX: restructured source repository and added Ant build script.
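
To illustrate why the Reconstruct function listed above can be slow on
large indexes, here is a minimal sketch of the brute-force idea. It is not
Luke's actual code: the ReconstructSketch class and the in-memory Map that
stands in for the index are assumptions made purely for illustration. For
one document, it walks every term and puts the term back at each position
it occupies.

```java
import java.util.Map;
import java.util.TreeMap;

// Hypothetical stand-in for an inverted index: term -> (docId -> positions).
// This is NOT Luke's or Lucene's API, only an illustration of the idea.
class ReconstructSketch {

    // Rebuild the token sequence of one document by scanning every term.
    static String reconstruct(Map<String, Map<Integer, int[]>> index, int docId) {
        TreeMap<Integer, String> byPosition = new TreeMap<>();
        for (Map.Entry<String, Map<Integer, int[]>> e : index.entrySet()) {
            int[] positions = e.getValue().get(docId);
            if (positions == null) continue; // term does not occur in this doc
            for (int p : positions) byPosition.put(p, e.getKey());
        }
        return String.join(" ", byPosition.values());
    }

    public static void main(String[] args) {
        Map<String, Map<Integer, int[]>> index = new TreeMap<>();
        index.put("index", Map.of(0, new int[] {1}));
        index.put("lucene", Map.of(0, new int[] {0}, 1, new int[] {0}));
        index.put("toolbox", Map.of(0, new int[] {2}));
        System.out.println(reconstruct(index, 0)); // prints "lucene index toolbox"
    }
}
```

Every term has to be visited for each document, so the cost grows with the
size of the whole index. What comes back is the sequence of indexed tokens,
which may differ from the original text.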

Please note that as a result of the package name changes, the main class 
is now org.getopt.luke.Luke, and NOT as before luke.Luke.

I felt that all these changes merited a slight change in name, from 
"Lucene Index Browser" to "Lucene Index Toolbox", as this seems to 
better reflect the current functionality of the tool.

Any feedback, patches for enhancements or bugfixes are welcome! If you 
want to provide a patch, please use "diff -bdruN" - this will help me to 
integrate it. Thank you!

-- 
Best regards,
Andrzej Bialecki

-------------------------------------------------
Software Architect, System Integration Specialist
CEN/ISSS EC Workshop, ECIMF project chair
EU FP6 E-Commerce Expert/Evaluator
-------------------------------------------------
FreeBSD developer (http://www.freebsd.org)



---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org

Re: Lucene Error

Posted by Don Vaillancourt <do...@webimpact.com>.

Hello,

Someone replied to me off-list and suggested the same thing, and it 
did resolve the problem.

Thanks

At 12:37 PM 25/06/2004, you wrote:
>Hello Don,
>
>I've never seen this, and I'm pretty sure it's not really Lucene's
>fault.  Lucene is aware only of segments/files listed in segments file.
>  Could it have anything to do with that ColdFusion code that wraps
>Lucene by any chance?
>
>Regardless, it's probably not the best practise to store non-Lucene
>files in a Lucene index directory.  I suggest you store that log
>elsewhere, and avoid any issues that way. :)
>
>Otis

Don Vaillancourt
Director of Software Development

WEB IMPACT INC.
416-815-2000 ext. 245
email: donv@web-impact.com
web: http://www.web-impact.com




This email message is intended only for the addressee(s)
and contains information that may be confidential and/or
copyright.  If you are not the intended recipient please
notify the sender by reply email and immediately delete
this email. Use, disclosure or reproduction of this email
by anyone other than the intended recipient(s) is strictly
prohibited. No representation is made that this email or
any attachments are free of viruses. Virus scanning is
recommended and is the responsibility of the recipient.

Re: Lucene Error

Posted by Otis Gospodnetic <ot...@yahoo.com>.

Hello Don,

I've never seen this, and I'm pretty sure it's not really Lucene's
fault.  Lucene is aware only of the segments/files listed in the segments
file.  Could it have anything to do with that ColdFusion code that wraps
Lucene, by any chance?

Regardless, it's probably not the best practise to store non-Lucene
files in a Lucene index directory.  I suggest you store that log
elsewhere, and avoid any issues that way. :)
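
A minimal sketch of that suggestion, using only the Java standard library.
The LogLocation class, the logDirFor helper and the "-logs" naming are all
hypothetical. The stack trace in the original report shows the exception
coming from FSDirectory.create while a new IndexWriter is opened, which
suggests that files placed inside the index directory are at risk when an
index is created there.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper: keeps application logs in a sibling directory
// so the index directory holds only files that Lucene itself manages.
class LogLocation {

    // For an index at <parent>/<name>, returns <parent>/<name>-logs,
    // creating it if missing. Assumes indexDir is not a filesystem root.
    static Path logDirFor(Path indexDir) throws IOException {
        Path abs = indexDir.toAbsolutePath().normalize();
        Path logDir = abs.resolveSibling(abs.getFileName() + "-logs");
        return Files.createDirectories(logDir);
    }

    public static void main(String[] args) throws IOException {
        Path indexDir = Files.createTempDirectory("collection");
        Path log = logDirFor(indexDir)
                .resolve("query_" + System.currentTimeMillis() + ".log");
        Files.write(log, "query log entry\n".getBytes(StandardCharsets.UTF_8));
        System.out.println("index: " + indexDir + ", log: " + log);
    }
}
```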

Otis


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org

Lucene Error

Posted by Don Vaillancourt <do...@webimpact.com>.

Hello All,

I'm using Lucene to build collections from ColdFusion, which I've got 
working pretty well so far.  But I am now getting the following exception, 
which I can't understand and never used to get before.

Below is the stack trace for that error.  Lucene is telling me that it 
cannot delete a file named 'query_1088174609733.log', which I create in the 
same folder as the Lucene collections.  I don't understand why Lucene is 
trying to delete this file.  I have verified that I am creating a new 
collection and not updating an existing one.

Does anyone have any ideas?

Thanks

java.io.IOException: couldn't delete query_1088174609733.log
         at org.apache.lucene.store.FSDirectory.create(FSDirectory.java:166)
         at org.apache.lucene.store.FSDirectory.<init>(FSDirectory.java:151)
         at org.apache.lucene.store.FSDirectory.getDirectory(FSDirectory.java:132)
         at org.apache.lucene.index.IndexWriter.<init>(IndexWriter.java:160)
         at Index.getIndexWriter(Index.java:154)
         at Index.update(Index.java:83)
         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
         at java.lang.reflect.Method.invoke(Method.java:324)
         at coldfusion.runtime.StructBean.invoke(StructBean.java:326)
         at coldfusion.runtime.CfJspPage._invoke(CfJspPage.java:1650)
         at cfindex2ecfm778374439.runPage(C:\Inetpub\wwwroot\prism\prism376\Lucene\index.cfm:58)
         at coldfusion.runtime.CfJspPage.invoke(CfJspPage.java:147)
         at coldfusion.tagext.lang.IncludeTag.doStartTag(IncludeTag.java:357)
         at coldfusion.filter.CfincludeFilter.invoke(CfincludeFilter.java:62)
         at coldfusion.filter.ApplicationFilter.invoke(ApplicationFilter.java:107)
         at coldfusion.filter.PathFilter.invoke(PathFilter.java:80)
         at coldfusion.filter.LicenseFilter.invoke(LicenseFilter.java:24)
         at coldfusion.filter.ExceptionFilter.invoke(ExceptionFilter.java:47)
         at coldfusion.filter.ClientScopePersistenceFilter.invoke(ClientScopePersistenceFilter.java:28)
         at coldfusion.filter.BrowserFilter.invoke(BrowserFilter.java:35)
         at coldfusion.filter.GlobalsFilter.invoke(GlobalsFilter.java:43)
         at coldfusion.filter.DatasourceFilter.invoke(DatasourceFilter.java:22)
         at coldfusion.CfmServlet.service(CfmServlet.java:105)
         at jrun.servlet.ServletInvoker.invoke(ServletInvoker.java:91)
         at jrun.servlet.JRunInvokerChain.invokeNext(JRunInvokerChain.java:42)
         at jrun.servlet.JRunRequestDispatcher.invoke(JRunRequestDispatcher.java:252)
         at jrun.servlet.ServletEngineService.dispatch(ServletEngineService.java:527)
         at jrun.servlet.jrpp.JRunProxyService.invokeRunnable(JRunProxyService.java:192)
         at jrunx.scheduler.ThreadPool$DownstreamMetrics.invokeRunnable(ThreadPool.java:348)
         at jrunx.scheduler.ThreadPool$ThreadThrottle.invokeRunnable(ThreadPool.java:451)
         at jrunx.scheduler.ThreadPool$UpstreamMetrics.invokeRunnable(ThreadPool.java:294)
         at jrunx.scheduler.WorkerThread.run(WorkerThread.java:66)






Don Vaillancourt
Director of Software Development


Re: ANN: Luke v. 0.5 released

Posted by Vladimir Yuryev <vy...@rambler.ru>.

On Thu, 24 Jun 2004 12:34:35 +0200
  Andrzej Bialecki <ab...@getopt.org> wrote:
>Time to refresh my russian... :-) Ok, the problem seems to be in the 
>RussianAnalyzer - it uses RussianLetterTokenizer, which filters out 
>anything which is a non-letter - I'm afraid it filters out also the 
>wildcard at the end. Not only that, it then passes the tokens through 
>a RussianStemmer, which further mutilates the tokens.
>
>Please try the "Parsed query view" on the "Search" tab to see what is 
>the result of your query, or paste your query into the text area on 
>the AnalyzerTool plugin ("Plugins"), and see what tokens you get 
>using RussianAnalyzer.
>
>I just did it, and the result for "высказал*" was "высказа" - clearly 
>not what you wanted.

Hi Andrzej!

The full text is at the address:
"http://www.agnuz.info/result.php?year=2004&mounth1=March&day=26&files=v02.txt&print=news" 
In it I searched for the phrase "...Pontiff has expressed 
importance...", in Russian "Понтифик высказался о важности". 

>Please try the "Parsed query view" on the "Search" tab to see what is the result of your query

On the "Search" tab the phrase was not found. The problem seems (for 
some reason?!) to be in the second and third words: searching for the 
words separately (as simple terms) shows a problem with these last two 
words. Here are the results with "Analyzer to use for query parsing": 
org.apache.lucene.analysis.ru.RussianAnalyzer
and "Enter search expression here": [text in Cp1251 encoding]:

1. "Enter search expression here": "Понтифик высказался о важности"
     "Parsed query view": contents:"понтифик высказа важност"
- No Results

2. "Enter search expression here": Понтифик
     "Parsed query view": contents:понтифик 
- 2 doc(s)
URLs:
"http://www.agnuz.info/result.php?year=2004&mounth1=March&day=26&files=v01.txt&print=news" 
"http://www.agnuz.info/result.php?year=2004&mounth1=March&day=26&files=v02.txt&print=news" 

3. "Enter search expression here": высказался
     "Parsed query view": contents:высказа 
- No Results

4. "Enter search expression here": важности
     "Parsed query view": contents:важност
- No Results

5. "Enter search expression here": Понтифик высказался о важности
     "Parsed query view": contents:понтифик contents:высказа 
contents:важност
- 2 doc(s), the same documents as in point 2.

>.., or paste your query into the text area on the AnalyzerTool plugin ("Plugins"), and see what tokens you get using RussianAnalyzer.

On the "Plugins" tab, in the "Text to be analyzed" field, I tested the 
same three words as a phrase: "Понтифик высказался о важности". The 
"Tokens found" field showed three stems: "понтифик", "высказа" and 
"важност". The "hilite->" action gave positive results for all three 
words. (So it seems the problem is not in the filters? :-)

Best regards,
Vladimir.


Re: ANN: Luke v. 0.5 released

Posted by Andrzej Bialecki <ab...@getopt.org>.

Vladimir Yuryev wrote:

> Hi Andrzej!
> 
> I am sorry for my English :-(
> I with pleasure shall tell about the test and I shall try to state 
> conditions of the test in detail.
> 
>>    I don't quite understand what you are saying... Do you suspect 
>> there is a bug in Luke somewhere on the Search tab? If that's the 
>> case, please provide an example.
> 
> 1. Search was made on an index with coding Cp1251.
> 2. Conditions of search:
>      Analyzer to use for query parsing: org.apache.lucene.analysis.ru.RussianAnalyzer
>      Default field is:contents
> 
>      2.1. Enter search expression here:высказался (the coding windows-1251)
>             Result: No Results
>      2.2. Enter search expression here:высказал* (the coding windows-1251)
>             Result: 1 doc (s), url: 
> http://www.agnuz.info/result.php?year=2004&mounth1=March&day=26&files=v02.txt&print=news 

Time to refresh my russian... :-) Ok, the problem seems to be in the 
RussianAnalyzer - it uses RussianLetterTokenizer, which filters out 
anything which is a non-letter - I'm afraid it filters out also the 
wildcard at the end. Not only that, it then passes the tokens through a 
RussianStemmer, which further mutilates the tokens.
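
As a rough illustration of the first half of that explanation, here is a
simplified stand-in for a letter-only tokenizer. It is not Lucene's
RussianLetterTokenizer and it does no stemming; the LetterOnlySketch class
is an assumption made for this example. It only shows how a non-letter
character such as the trailing '*' disappears when text is split on
letters.

```java
import java.util.ArrayList;
import java.util.List;

// Simplified stand-in for a letter-only tokenizer. It is NOT Lucene's
// RussianLetterTokenizer and applies no stemming; it only shows how
// non-letter characters such as '*' disappear during analysis.
class LetterOnlySketch {

    static List<String> tokens(String text) {
        List<String> out = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (char c : text.toCharArray()) {
            if (Character.isLetter(c)) {
                current.append(Character.toLowerCase(c));
            } else if (current.length() > 0) {
                out.add(current.toString()); // a non-letter ends the token
                current.setLength(0);
            }
        }
        if (current.length() > 0) out.add(current.toString());
        return out;
    }

    public static void main(String[] args) {
        System.out.println(tokens("Stat* of the index")); // prints [stat, of, the, index]
    }
}
```

Character.isLetter also accepts Cyrillic letters, so the same thing would
happen to a Russian term with a trailing wildcard before any stemming is
applied.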

Please try the "Parsed query view" on the "Search" tab to see what is 
the result of your query, or paste your query into the text area on the 
AnalyzerTool plugin ("Plugins"), and see what tokens you get using 
RussianAnalyzer.

I just did it, and the result for "высказал*" was "высказа" - clearly 
not what you wanted.

-- 
Best regards,
Andrzej Bialecki


Re: ANN: Luke v. 0.5 released

Posted by Vladimir Yuryev <vy...@rambler.ru>.

Hi Andrzej!

I am sorry for my English :-(
I will gladly describe the test, and I will try to state its 
conditions in detail.

>    I don't quite understand what you are saying... Do you suspect there is a bug in Luke somewhere on the Search tab? If that's the case, please provide an example.


1. The search was run on an index in Cp1251 encoding.
2. Search conditions:
      Analyzer to use for query parsing: 
org.apache.lucene.analysis.ru.RussianAnalyzer
      Default field is: contents

      2.1. Enter search expression here: высказался (windows-1251 encoding)
             Result: No Results
      2.2. Enter search expression here: высказал* (windows-1251 encoding)
             Result: 1 doc(s), 
url: 
http://www.agnuz.info/result.php?year=2004&mounth1=March&day=26&files=v02.txt&print=news

To which address should I send an index file?

Regards,
Vladimir.


Re: ANN: Luke v. 0.5 released

Posted by Andrzej Bialecki <ab...@getopt.org>.

Vladimir Yuryev wrote:

> Hi Andrzej!
> 
> I congratulate on the successful version. RussianAnalyzer works with my 
> indexes, but there are problems with some words. These problem words are 
> found only WildCard a method.

I don't quite understand what you are saying... Do you suspect there is 
a bug in Luke somewhere on the Search tab? If that's the case, please 
provide an example.

> Besides AnalizerTool works with these 
> words without problems.
> 
> There is one more small discrepancy on webpage http://www.getopt.org/luke/
> - Remember to put both JARs on your classpath, e.g.: java -classpath 
> luke.jar;lucene.jar org.getopt.luke.Luke
> + Remember to put both JARs on your classpath, e.g.: java -classpath 
> luke.jar:lucene.jar org.getopt.luke.Luke

Well, both versions are correct - just the platform is different :-). 
I'll make a clarification. Thank you!
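
For anyone scripting the launch, a small sketch of how the separator
difference can be handled from Java itself. File.pathSeparator is ";" on
Windows and ":" on Unix-like systems; the ClasspathSketch class and its
join helper are hypothetical names used only for this example.

```java
import java.io.File;

// Builds a -classpath value with the separator of the current platform
// (";" on Windows, ":" on Unix-like systems).
class ClasspathSketch {

    static String join(String... jars) {
        return String.join(File.pathSeparator, jars);
    }

    public static void main(String[] args) {
        String cp = join("luke.jar", "lucene.jar");
        System.out.println("java -classpath " + cp + " org.getopt.luke.Luke");
    }
}
```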

-- 
Best regards,
Andrzej Bialecki


Re: ANN: Luke v. 0.5 released

Posted by Vladimir Yuryev <vy...@rambler.ru>.

Hi Andrzej!

Congratulations on the successful release. RussianAnalyzer works with 
my indexes, but there are problems with some words: these words are 
found only by a wildcard search. Meanwhile, AnalyzerTool handles 
these words without problems.

There is also one small discrepancy on the web page 
http://www.getopt.org/luke/
- Remember to put both JARs on your classpath, e.g.: java -classpath 
luke.jar;lucene.jar org.getopt.luke.Luke
+ Remember to put both JARs on your classpath, e.g.: java -classpath 
luke.jar:lucene.jar org.getopt.luke.Luke

Regards,
Vladimir.
