You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@datafu.apache.org by "Eyal Allweil (JIRA)" <ji...@apache.org> on 2017/09/11 11:43:00 UTC

[jira] [Commented] (DATAFU-61) Add TF-IDF Macro to DataFu

    [ https://issues.apache.org/jira/browse/DATAFU-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16161118#comment-16161118 ] 

Eyal Allweil commented on DATAFU-61:
------------------------------------

Came back to this today and tried a little experiment - I verified (calculating manually) that the Russell's code produces the same results as the "augmented TF" IDF flavor for the sample I took from the wikipedia page. Is that good enough for us?

> Add TF-IDF Macro to DataFu
> --------------------------
>
>                 Key: DATAFU-61
>                 URL: https://issues.apache.org/jira/browse/DATAFU-61
>             Project: DataFu
>          Issue Type: New Feature
>    Affects Versions: 1.3.0
>            Reporter: Russell Jurney
>         Attachments: DATAFU-61-2.patch, DATAFU-61.patch, DATAFU-61.patch
>
>
> The first macro I would like to add is a Term Frequency, Inverse Document Frequency implementation.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)