Posted to issues@madlib.apache.org by "Frank McQuillan (JIRA)" <ji...@apache.org> on 2016/01/08 20:57:39 UTC

[jira] [Closed] (MADLIB-934) MADlib LDA should allow users to supply names of input columns

     [ https://issues.apache.org/jira/browse/MADLIB-934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Frank McQuillan closed MADLIB-934.
----------------------------------
    Resolution: Won't Fix

Closing as "won't fix" since the INT4 type is by design, for memory management reasons.

> MADlib LDA should allow users to supply names of input columns
> --------------------------------------------------------------
>
>                 Key: MADLIB-934
>                 URL: https://issues.apache.org/jira/browse/MADLIB-934
>             Project: Apache MADlib
>          Issue Type: Bug
>          Components: Module: Parallel Latent Dirichlet Allocation
>            Reporter: Srivatsan
>            Assignee: Rahul Iyer
>             Fix For: v1.9
>
>
> When I attempted to create my own input table for LDA, it had four columns: "docid", "wordid" and "count", plus a fourth column "word" (corresponding to the raw token). Of these, the "count" column was of type bigint rather than int. The lda_train function threw an error saying the input table did not contain docid, wordid and count columns. I am not sure what prompted it: I did not check whether it was because of the data type mismatch on the count column or because of the additional column. Can you confirm which one it is?
> If it is just a bigint vs int issue, can we allow users to supply the names of the docid, wordid and count columns (instead of hard-coding them)?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)