You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Ivan Provalov (JIRA)" <ji...@apache.org> on 2010/08/22 05:31:16 UTC

[jira] Updated: (LUCENE-474) High Frequency Terms/Phrases at the Index level

     [ https://issues.apache.org/jira/browse/LUCENE-474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ivan Provalov updated LUCENE-474:
---------------------------------

    Attachment: collocations.zip

I saw some activity on the term collocations in the lucene user forum recently and decided to make a few changes to the colloc.zip package which Mark worked on.  I used it before and it worked well for my project.  

I ended up doing some fixes and refactoring and adding couple of unit tests, as well as a new class which will search the collocated terms if provided with a term.  This version works with Lucene 3.0.2.  Also, I changed package names, added the license verbage, as well as added maven and ant for contrib packaging.  

If Mark is OK with these changes, it could be published as a contrib.

> High Frequency Terms/Phrases at the Index level
> -----------------------------------------------
>
>                 Key: LUCENE-474
>                 URL: https://issues.apache.org/jira/browse/LUCENE-474
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: contrib/*
>    Affects Versions: 1.4
>            Reporter: Suri Babu B
>            Assignee: Otis Gospodnetic
>            Priority: Minor
>         Attachments: colloc.zip, collocations.zip
>
>
> We should be able to find the all the high frequency terms/phrases ( where frequency  is the search criteria / benchmark)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org