You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Michael McCandless (JIRA)" <ji...@apache.org> on 2011/01/12 12:43:45 UTC
[jira] Created: (LUCENE-2862) Track total term freq per term
Track total term freq per term
------------------------------
Key: LUCENE-2862
URL: https://issues.apache.org/jira/browse/LUCENE-2862
Project: Lucene - Java
Issue Type: Improvement
Components: Index
Reporter: Michael McCandless
Assignee: Michael McCandless
Fix For: 4.0
Right now we track docFreq for each term (how many docs have the
term), but the totalTermFreq (total number of occurrences of this
term, ie sum of freq() for each doc that has the term) is also a
useful stat (for flex scoring, PulsingCodec, etc.).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org
[jira] Updated: (LUCENE-2862) Track total term freq per term
Posted by "Michael McCandless (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/LUCENE-2862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michael McCandless updated LUCENE-2862:
---------------------------------------
Attachment: LUCENE-2862.patch
Patch, adds TermsEnum.totalTermFreq (returns -1 if codec doesn't impl it, or if omitTFAP is on) and Terms.getSumTotalTermFreq (= sum across all terms in this field).
> Track total term freq per term
> ------------------------------
>
> Key: LUCENE-2862
> URL: https://issues.apache.org/jira/browse/LUCENE-2862
> Project: Lucene - Java
> Issue Type: Improvement
> Components: Index
> Reporter: Michael McCandless
> Assignee: Michael McCandless
> Fix For: 4.0
>
> Attachments: LUCENE-2862.patch
>
>
> Right now we track docFreq for each term (how many docs have the
> term), but the totalTermFreq (total number of occurrences of this
> term, ie sum of freq() for each doc that has the term) is also a
> useful stat (for flex scoring, PulsingCodec, etc.).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org
[jira] Resolved: (LUCENE-2862) Track total term freq per term
Posted by "Michael McCandless (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/LUCENE-2862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michael McCandless resolved LUCENE-2862.
----------------------------------------
Resolution: Fixed
> Track total term freq per term
> ------------------------------
>
> Key: LUCENE-2862
> URL: https://issues.apache.org/jira/browse/LUCENE-2862
> Project: Lucene - Java
> Issue Type: Improvement
> Components: Index
> Reporter: Michael McCandless
> Assignee: Michael McCandless
> Fix For: 4.0
>
> Attachments: LUCENE-2862.patch
>
>
> Right now we track docFreq for each term (how many docs have the
> term), but the totalTermFreq (total number of occurrences of this
> term, ie sum of freq() for each doc that has the term) is also a
> useful stat (for flex scoring, PulsingCodec, etc.).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org