You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Ivan Provalov (JIRA)" <ji...@apache.org> on 2010/08/22 05:31:16 UTC
[jira] Updated: (LUCENE-474) High Frequency Terms/Phrases at the
Index level
[ https://issues.apache.org/jira/browse/LUCENE-474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ivan Provalov updated LUCENE-474:
---------------------------------
Attachment: collocations.zip
I saw some activity on the term collocations in the lucene user forum recently and decided to make a few changes to the colloc.zip package which Mark worked on. I used it before and it worked well for my project.
I ended up doing some fixes and refactoring and adding couple of unit tests, as well as a new class which will search the collocated terms if provided with a term. This version works with Lucene 3.0.2. Also, I changed package names, added the license verbage, as well as added maven and ant for contrib packaging.
If Mark is OK with these changes, it could be published as a contrib.
> High Frequency Terms/Phrases at the Index level
> -----------------------------------------------
>
> Key: LUCENE-474
> URL: https://issues.apache.org/jira/browse/LUCENE-474
> Project: Lucene - Java
> Issue Type: New Feature
> Components: contrib/*
> Affects Versions: 1.4
> Reporter: Suri Babu B
> Assignee: Otis Gospodnetic
> Priority: Minor
> Attachments: colloc.zip, collocations.zip
>
>
> We should be able to find the all the high frequency terms/phrases ( where frequency is the search criteria / benchmark)
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org