Posted to dev@lucene.apache.org by Andrzej Bialecki <ab...@getopt.org> on 2004/06/22 14:10:50 UTC
ANN: Luke v. 0.5 released
Hello fellow Luceners,
I'm pleased to announce that a new release of Luke is now available. You
can download it from:
http://www.getopt.org/luke/
This release uses Lucene 1.4-rc4.
This release also represents a major step forward - many new exciting
features have been added. The feature I consider the most important in
this release is extensibility - there is a plugin framework, and a
sample plugin is provided in the distribution - I encourage you to write
more.
Here's a short summary of changes in this release:
* NEW: Added support for Term Vectors.
* NEW: Added a plugin framework - plugins found on the classpath are
  detected automatically and added to the new "Plugins" tab.
  Note, however, that for now plugin autoloading doesn't quite
  work when using Java WebStart - an alternative mechanism is also
  provided. Plugins have full access to the application context.
  Please read the JavaDoc for LukePlugin.java for more information.
* NEW: A sample plugin is provided, based on Mark Harwood's "tool
  for analyzing analyzers".
* NEW: All tables now support resizable columns. Some dialogs are
  also resizable.
* NEW: Added Reconstruct functionality. Using this function, users
  can reconstruct the content of all fields of a document, including
  unstored ones. This function uses a brute-force approach, so it may
  be slow for larger indexes (> 500,000 docs).
* NEW: Added "pseudo-edit" functionality. A new document editor dialog
  lets you modify reconstructed documents, and add them or replace the
  original ones.
* FIX: Problems with the MRU list solved, and a framework for handling
  preferences introduced.
* FIX: The list of available Analyzers is now dynamically populated
  from the classpath, using the same method as in the AnalyzerTool
  plugin. This also doesn't work in WebStart, so a fallback to a
  static list is provided.
* FIX: Restructured the source repository and added an Ant build script.
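To illustrate the brute-force idea behind Reconstruct, here is a minimal
sketch in plain Java. It is not Luke's code and it does not use the Lucene
API: the postings map, the class ReconstructSketch and the helpers
addPosting and reconstruct are hypothetical stand-ins. It assumes an index
that records term positions, scans every term, and puts the terms that
occur in the chosen document back in position order.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.TreeMap;

public class ReconstructSketch {

    // Hypothetical stand-in for an inverted index:
    // term -> (document id -> positions of the term in that document).
    static void addPosting(Map<String, Map<Integer, int[]>> postings,
                           String term, int docId, int... positions) {
        postings.computeIfAbsent(term, t -> new HashMap<>()).put(docId, positions);
    }

    // Brute force: visit every term in the index, keep the ones that occur
    // in docId, and order them by position to rebuild the token stream.
    static String reconstruct(Map<String, Map<Integer, int[]>> postings, int docId) {
        TreeMap<Integer, String> byPosition = new TreeMap<>();
        for (Map.Entry<String, Map<Integer, int[]>> entry : postings.entrySet()) {
            int[] positions = entry.getValue().get(docId);
            if (positions == null) {
                continue; // this term does not occur in the document
            }
            for (int position : positions) {
                byPosition.put(position, entry.getKey());
            }
        }
        return String.join(" ", byPosition.values());
    }

    public static void main(String[] args) {
        Map<String, Map<Integer, int[]>> postings = new HashMap<>();
        // Document 0: "lucene index toolbox"
        addPosting(postings, "lucene", 0, 0);
        addPosting(postings, "index", 0, 1);
        addPosting(postings, "toolbox", 0, 2);
        // Document 1: "index browser"
        addPosting(postings, "index", 1, 0);
        addPosting(postings, "browser", 1, 1);

        System.out.println(reconstruct(postings, 0)); // prints: lucene index toolbox
    }
}
```

In this sketch every term has to be visited to rebuild a single document,
and only the indexed tokens come back, not the original text.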
Please note that as a result of the package name changes, the main class
is now org.getopt.luke.Luke, and NOT as before luke.Luke.
I felt that all these changes merited a slight change in name, from
"Lucene Index Browser" to "Lucene Index Toolbox", as this seems to
better reflect the current functionality of the tool.
Any feedback, patches for enhancements or bugfixes are welcome! If you
want to provide a patch, please use "diff -bdruN" - this will help me to
integrate it. Thank you!
--
Best regards,
Andrzej Bialecki
-------------------------------------------------
Software Architect, System Integration Specialist
CEN/ISSS EC Workshop, ECIMF project chair
EU FP6 E-Commerce Expert/Evaluator
-------------------------------------------------
FreeBSD developer (http://www.freebsd.org)
---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org
Re: Lucene Error
Posted by Don Vaillancourt <do...@webimpact.com>.
Hello,
Someone replied to me off-list and suggested the same thing, and it
did resolve the problem.
Thanks
At 12:37 PM 25/06/2004, you wrote:
>Hello Don,
>
>I've never seen this, and I'm pretty sure it's not really Lucene's
>fault. Lucene is aware only of segments/files listed in the segments file.
>Could it have anything to do with that ColdFusion code that wraps
>Lucene by any chance?
>
>Regardless, it's probably not the best practice to store non-Lucene
>files in a Lucene index directory. I suggest you store that log
>elsewhere, and avoid any issues that way. :)
>
>Otis
>
Don Vaillancourt
Director of Software Development
WEB IMPACT INC.
416-815-2000 ext. 245
email: donv@web-impact.com
web: http://www.web-impact.com
This email message is intended only for the addressee(s)
and contains information that may be confidential and/or
copyright. If you are not the intended recipient please
notify the sender by reply email and immediately delete
this email. Use, disclosure or reproduction of this email
by anyone other than the intended recipient(s) is strictly
prohibited. No representation is made that this email or
any attachments are free of viruses. Virus scanning is
recommended and is the responsibility of the recipient.
Re: Lucene Error
Posted by Otis Gospodnetic <ot...@yahoo.com>.
Hello Don,
I've never seen this, and I'm pretty sure it's not really Lucene's
fault. Lucene is aware only of segments/files listed in the segments file.
Could it have anything to do with that ColdFusion code that wraps
Lucene by any chance?
Regardless, it's probably not the best practice to store non-Lucene
files in a Lucene index directory. I suggest you store that log
elsewhere, and avoid any issues that way. :)
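As a rough sketch of "store that log elsewhere", the plain Java below
derives a log location that sits next to the index directory instead of
inside it. The class QueryLogLocation, the method logFileFor, the "logs"
directory name and the "collections/demo" default path are all
assumptions made for illustration; only the query_<timestamp>.log naming
is taken from the report.

```java
import java.io.File;

public class QueryLogLocation {

    // Returns a log file in a "logs" directory that is a sibling of the
    // index directory, so the index directory holds only index files.
    // The directory name "logs" is an assumption for this sketch.
    static File logFileFor(File indexDir, long timestamp) {
        File parent = indexDir.getAbsoluteFile().getParentFile();
        if (parent == null) {
            throw new IllegalArgumentException("index directory has no parent: " + indexDir);
        }
        return new File(new File(parent, "logs"), "query_" + timestamp + ".log");
    }

    public static void main(String[] args) {
        // Hypothetical index location; pass a real one as the first argument.
        File indexDir = new File(args.length > 0 ? args[0] : "collections/demo");
        File log = logFileFor(indexDir, System.currentTimeMillis());
        System.out.println("index directory: " + indexDir.getAbsolutePath());
        System.out.println("query log:       " + log.getPath());
    }
}
```

The sketch only computes the path; the log directory still has to be
created before the file is written.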
Otis
Lucene Error
Posted by Don Vaillancourt <do...@webimpact.com>.
Hello All,
I'm using Lucene to build collections from ColdFusion, which I've got
working pretty well so far. But I am now getting the following exception,
which I can't understand and never used to get before.
Below is the stack trace for that error. Lucene is telling me that it
cannot delete a file named 'query_1088174609733.log', which I create in the
same folder as the Lucene collections. I don't understand why Lucene is
trying to delete this file. I have verified that I am creating a new
collection and not updating an existing one.
Does anyone have any ideas?
Thanks
java.io.IOException: couldn't delete query_1088174609733.log
    at org.apache.lucene.store.FSDirectory.create(FSDirectory.java:166)
    at org.apache.lucene.store.FSDirectory.<init>(FSDirectory.java:151)
    at org.apache.lucene.store.FSDirectory.getDirectory(FSDirectory.java:132)
    at org.apache.lucene.index.IndexWriter.<init>(IndexWriter.java:160)
    at Index.getIndexWriter(Index.java:154)
    at Index.update(Index.java:83)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:324)
    at coldfusion.runtime.StructBean.invoke(StructBean.java:326)
    at coldfusion.runtime.CfJspPage._invoke(CfJspPage.java:1650)
    at cfindex2ecfm778374439.runPage(C:\Inetpub\wwwroot\prism\prism376\Lucene\index.cfm:58)
    at coldfusion.runtime.CfJspPage.invoke(CfJspPage.java:147)
    at coldfusion.tagext.lang.IncludeTag.doStartTag(IncludeTag.java:357)
    at coldfusion.filter.CfincludeFilter.invoke(CfincludeFilter.java:62)
    at coldfusion.filter.ApplicationFilter.invoke(ApplicationFilter.java:107)
    at coldfusion.filter.PathFilter.invoke(PathFilter.java:80)
    at coldfusion.filter.LicenseFilter.invoke(LicenseFilter.java:24)
    at coldfusion.filter.ExceptionFilter.invoke(ExceptionFilter.java:47)
    at coldfusion.filter.ClientScopePersistenceFilter.invoke(ClientScopePersistenceFilter.java:28)
    at coldfusion.filter.BrowserFilter.invoke(BrowserFilter.java:35)
    at coldfusion.filter.GlobalsFilter.invoke(GlobalsFilter.java:43)
    at coldfusion.filter.DatasourceFilter.invoke(DatasourceFilter.java:22)
    at coldfusion.CfmServlet.service(CfmServlet.java:105)
    at jrun.servlet.ServletInvoker.invoke(ServletInvoker.java:91)
    at jrun.servlet.JRunInvokerChain.invokeNext(JRunInvokerChain.java:42)
    at jrun.servlet.JRunRequestDispatcher.invoke(JRunRequestDispatcher.java:252)
    at jrun.servlet.ServletEngineService.dispatch(ServletEngineService.java:527)
    at jrun.servlet.jrpp.JRunProxyService.invokeRunnable(JRunProxyService.java:192)
    at jrunx.scheduler.ThreadPool$DownstreamMetrics.invokeRunnable(ThreadPool.java:348)
    at jrunx.scheduler.ThreadPool$ThreadThrottle.invokeRunnable(ThreadPool.java:451)
    at jrunx.scheduler.ThreadPool$UpstreamMetrics.invokeRunnable(ThreadPool.java:294)
    at jrunx.scheduler.WorkerThread.run(WorkerThread.java:66)
Don Vaillancourt
Re: ANN: Luke v. 0.5 released
Posted by Vladimir Yuryev <vy...@rambler.ru>.
Hi Andrzej!
Well, the full text is at the address
"http://www.agnuz.info/result.php?year=2004&mounth1=March&day=26&files=v02.txt&print=news".
In it I searched for the phrase "...Pontiff has expressed
importance...", in Russian "Понтифик высказался о важности".
>Please try the "Parsed query view" on the "Search" tab to see what is the result of your query
On the "Search" tab the phrase was not found. For some reason the
problem seems to be in the second and third words: searching for the
words separately (as simple terms) showed a problem with these last two
words.
Here are the results with "Analyzer to use for query parsing" set to
org.apache.lucene.analysis.ru.RussianAnalyzer and the text for
"Enter search expression here" entered in the Cp1251 encoding:
1. "Enter search expression here": "Понтифик высказался о важности"
   "Parsed query view": contents:"понтифик высказа важност"
   - No Results
2. "Enter search expression here": Понтифик
   "Parsed query view": contents:понтифик
   - 2 doc(s)
   URLs:
   "http://www.agnuz.info/result.php?year=2004&mounth1=March&day=26&files=v01.txt&print=news"
   "http://www.agnuz.info/result.php?year=2004&mounth1=March&day=26&files=v02.txt&print=news"
3. "Enter search expression here": высказался
   "Parsed query view": contents:высказа
   - No Results
4. "Enter search expression here": важности
   "Parsed query view": contents:важност
   - No Results
5. "Enter search expression here": Понтифик высказался о важности
   "Parsed query view": contents:понтифик contents:высказа
   contents:важност
   - 2 doc(s), the same documents as in point 2.
>... or paste your query into the text area on the AnalyzerTool plugin ("Plugins"), and see what tokens you get using RussianAnalyzer.
On the "Plugins" tab, in the "Text to be analyzed" field, I tested the
same three words as a phrase - "Понтифик высказался о важности". As a
result of the analysis, the "Tokens found" field showed three
stems - "понтифик", "высказа" and "важност". The " hilite-> " action
gave positive results for all three words. (So it seems the problem is
not in the filters?) :-)
Best regards,
Vladimir.
Re: ANN: Luke v. 0.5 released
Posted by Andrzej Bialecki <ab...@getopt.org>.
Vladimir Yuryev wrote:
> Hi Andrzej!
>
> I am sorry for my English :-(
> I will gladly describe the test, and I will try to state its
> conditions in detail.
>
>> I don't quite understand what you are saying... Do you suspect
>> there is a bug in Luke somewhere on the Search tab? If that's the
>> case, please provide an example.
>
> 1. The search was made on an index in the Cp1251 encoding.
> 2. Search conditions:
> Analyzer to use for query parsing:
> org.apache.lucene.analysis.ru.RussianAnalyzer
> Default field is: contents
>
> 2.1. Enter search expression here: высказался (windows-1251 encoding)
> Result: No Results
> 2.2. Enter search expression here: высказал* (windows-1251 encoding)
> Result: 1 doc(s), url:
> http://www.agnuz.info/result.php?year=2004&mounth1=March&day=26&files=v02.txt&print=news
Time to refresh my Russian... :-) Ok, the problem seems to be in the
RussianAnalyzer - it uses RussianLetterTokenizer, which filters out
anything which is a non-letter - I'm afraid it also filters out the
wildcard at the end. Not only that, it then passes the tokens through a
RussianStemmer, which further mutilates the tokens.
Please try the "Parsed query view" on the "Search" tab to see what
your query is parsed into, or paste your query into the text area on the
AnalyzerTool plugin ("Plugins"), and see what tokens you get using
RussianAnalyzer.
I just did it, and the result for "высказал*" was "высказа" - clearly
not what you wanted.
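A minimal sketch of the letter-only tokenizing described above, in plain
Java. This is not the RussianLetterTokenizer source and it does no
stemming: the class LetterTokenizerSketch and its tokenize method are
hypothetical, and the lower-casing is an added simplification. It only
shows that when every non-letter is treated as a separator, a trailing
"*" cannot survive. The escaped default in main spells the query
высказал* from this thread.

```java
import java.util.ArrayList;
import java.util.List;

public class LetterTokenizerSketch {

    // Splits text into lower-cased runs of letters. Every non-letter
    // character, including the wildcard characters '*' and '?', acts as a
    // separator and is dropped from the output.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<String>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetter(c)) {
                current.append(Character.toLowerCase(c));
            } else if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // The default query is written with escapes so the source stays ASCII-only.
        String query = args.length > 0
                ? args[0]
                : "\u0432\u044b\u0441\u043a\u0430\u0437\u0430\u043b*";
        System.out.println(tokenize(query));
    }
}
```

The single token printed for the default query has lost the "*"; the
further shortening to "высказа" reported above would come from the
stemmer, which this sketch leaves out.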
--
Best regards,
Andrzej Bialecki
Re: ANN: Luke v. 0.5 released
Posted by Vladimir Yuryev <vy...@rambler.ru>.
Hi Andrzej!
I am sorry for my English :-(
I will gladly describe the test, and I will try to state its
conditions in detail.
> I don't quite understand what you are saying... Do you suspect there is a bug in Luke somewhere on the Search tab? If that's the case, please provide an example.
1. The search was made on an index in the Cp1251 encoding.
2. Search conditions:
Analyzer to use for query parsing:
org.apache.lucene.analysis.ru.RussianAnalyzer
Default field is: contents
2.1. Enter search expression here: высказался (windows-1251 encoding)
Result: No Results
2.2. Enter search expression here: высказал* (windows-1251 encoding)
Result: 1 doc(s),
url:
http://www.agnuz.info/result.php?year=2004&mounth1=March&day=26&files=v02.txt&print=news
To what address should I send an index file?
Regards,
Vladimir.
Re: ANN: Luke v. 0.5 released
Posted by Andrzej Bialecki <ab...@getopt.org>.
Vladimir Yuryev wrote:
> Hi Andrzej!
>
> Congratulations on the successful release. RussianAnalyzer works with my
> indexes, but there are problems with some words. These problem words are
> found only by the wildcard method.
I don't quite understand what you are saying... Do you suspect there is
a bug in Luke somewhere on the Search tab? If that's the case, please
provide an example.
> Besides, AnalyzerTool works with these
> words without problems.
>
> There is one more small discrepancy on the webpage http://www.getopt.org/luke/
> - Remember to put both JARs on your classpath, e.g.: java -classpath
> luke.jar;lucene.jar org.getopt.luke.Luke
> + Remember to put both JARs on your classpath, e.g.: java -classpath
> luke.jar:lucene.jar org.getopt.luke.Luke
Well, both versions are correct - just the platform is different :-).
I'll make a clarification. Thank you!
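To make the platform difference concrete, here is a small sketch in plain
Java. The class LukeClasspath and the method launchCommand are made up
for illustration; File.pathSeparator is the standard constant that holds
";" on Windows and ":" on Unix-like systems.

```java
import java.io.File;

public class LukeClasspath {

    // Builds the launch command using the given classpath separator.
    static String launchCommand(String pathSeparator) {
        return "java -classpath luke.jar" + pathSeparator
                + "lucene.jar org.getopt.luke.Luke";
    }

    public static void main(String[] args) {
        // The form that is correct for the platform running this program.
        System.out.println(launchCommand(File.pathSeparator));
    }
}
```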
--
Best regards,
Andrzej Bialecki
Re: ANN: Luke v. 0.5 released
Posted by Vladimir Yuryev <vy...@rambler.ru>.
Hi Andrzej!
Congratulations on the successful release. RussianAnalyzer works with
my indexes, but there are problems with some words. These problem
words are found only by the wildcard method. Besides, AnalyzerTool works
with these words without problems.
There is one more small discrepancy on the webpage
http://www.getopt.org/luke/
- Remember to put both JARs on your classpath, e.g.: java -classpath
luke.jar;lucene.jar org.getopt.luke.Luke
+ Remember to put both JARs on your classpath, e.g.: java -classpath
luke.jar:lucene.jar org.getopt.luke.Luke
Regards,
Vladimir.