Posted to dev@lucene.apache.org by "Michael McCandless (JIRA)" <ji...@apache.org> on 2011/01/12 12:43:45 UTC

[jira] Created: (LUCENE-2862) Track total term freq per term

Track total term freq per term
------------------------------

                 Key: LUCENE-2862
                 URL: https://issues.apache.org/jira/browse/LUCENE-2862
             Project: Lucene - Java
          Issue Type: Improvement
          Components: Index
            Reporter: Michael McCandless
            Assignee: Michael McCandless
             Fix For: 4.0


Right now we track docFreq for each term (how many docs contain the
term), but totalTermFreq (the total number of occurrences of the
term, i.e. the sum of freq() over every doc that contains the term) is
also a useful stat (for flex scoring, PulsingCodec, etc.).
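
To make the difference between the two statistics concrete, here is a minimal, self-contained sketch. It does not use the Lucene API: the class TermStatsSketch and the map of docID to per-document freq are hypothetical stand-ins for one term's postings, chosen only for illustration.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class TermStatsSketch {

    /** docFreq: number of documents in which the term occurs at least once. */
    public static int docFreq(Map<Integer, Integer> freqByDoc) {
        int docs = 0;
        for (int freq : freqByDoc.values()) {
            if (freq > 0) {
                docs++;
            }
        }
        return docs;
    }

    /** totalTermFreq: sum of the per-document freq over all docs that contain the term. */
    public static long totalTermFreq(Map<Integer, Integer> freqByDoc) {
        long total = 0;
        for (int freq : freqByDoc.values()) {
            total += freq;
        }
        return total;
    }

    public static void main(String[] args) {
        // Hypothetical postings for one term: docID -> freq within that doc.
        Map<Integer, Integer> postings = new LinkedHashMap<>();
        postings.put(0, 3);
        postings.put(4, 1);
        postings.put(7, 2);

        System.out.println("docFreq=" + docFreq(postings));
        System.out.println("totalTermFreq=" + totalTermFreq(postings));
    }
}
```

For these postings the term appears in three docs but occurs six times in total, so docFreq alone understates how common the term is in the index.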


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] Updated: (LUCENE-2862) Track total term freq per term

Posted by "Michael McCandless (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-2862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Michael McCandless updated LUCENE-2862:
---------------------------------------

    Attachment: LUCENE-2862.patch

Patch: adds TermsEnum.totalTermFreq (returns -1 if the codec doesn't implement it, or if omitTFAP, i.e. omitTermFreqAndPositions, is on) and Terms.getSumTotalTermFreq (the sum of totalTermFreq across all terms in this field).
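
The -1 convention described above can be sketched as follows. This is not the code from the patch: FieldStatsSketch, addOccurrences and the freqsTracked flag are hypothetical names, and returning -1 from the field-level sum when freqs are not tracked is an assumption made for this illustration only.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class FieldStatsSketch {
    /** Sentinel meaning "statistic not available", mirroring the -1 in the comment above. */
    public static final long UNKNOWN = -1;

    // Hypothetical per-field store: term text -> total occurrences across all docs.
    private final Map<String, Long> totalTermFreqByTerm = new LinkedHashMap<>();
    // False models a field indexed with term freqs omitted (or an unsupporting codec).
    private final boolean freqsTracked;

    public FieldStatsSketch(boolean freqsTracked) {
        this.freqsTracked = freqsTracked;
    }

    /** Record 'count' more occurrences of 'term' (e.g. one doc's freq). */
    public void addOccurrences(String term, long count) {
        totalTermFreqByTerm.merge(term, count, Long::sum);
    }

    /** Per-term statistic; -1 when freqs are not tracked. */
    public long totalTermFreq(String term) {
        if (!freqsTracked) {
            return UNKNOWN;
        }
        return totalTermFreqByTerm.getOrDefault(term, 0L);
    }

    /** Per-field statistic: sum of totalTermFreq over all terms (assumed -1 when untracked). */
    public long sumTotalTermFreq() {
        if (!freqsTracked) {
            return UNKNOWN;
        }
        long sum = 0;
        for (long ttf : totalTermFreqByTerm.values()) {
            sum += ttf;
        }
        return sum;
    }

    public static void main(String[] args) {
        FieldStatsSketch body = new FieldStatsSketch(true);
        body.addOccurrences("lucene", 3);
        body.addOccurrences("lucene", 2);
        body.addOccurrences("index", 1);
        System.out.println(body.totalTermFreq("lucene"));
        System.out.println(body.sumTotalTermFreq());
    }
}
```

A caller would check for -1 before using either value, for example falling back to docFreq when the per-term total is unavailable.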


[jira] Resolved: (LUCENE-2862) Track total term freq per term

Posted by "Michael McCandless (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-2862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Michael McCandless resolved LUCENE-2862.
----------------------------------------

    Resolution: Fixed
