You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@metamodel.apache.org by "Kasper Sørensen (JIRA)" <ji...@apache.org> on 2014/09/29 21:06:34 UTC
[jira] [Commented] (METAMODEL-82) Metadata type is always VARCHAR
for CSV & Excel modules
[ https://issues.apache.org/jira/browse/METAMODEL-82?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152082#comment-14152082 ]
Kasper Sørensen commented on METAMODEL-82:
------------------------------------------
Well yes and no. It's a tough one...
The trouble is that the data here can never be guaranteed as any other type. If you put a number in a CSV file or an Excel sheet, it can only be treated as a number if all the other rows have a number in this column. And how do we guarantee that (except for obviously eager-reading all records, which isn't acceptable IMO).
If we change this I would suggest to make a "type detection" optional (and default disabled for compatibility). Then we could eager-read up to 1000 records or so and use the schema inference utils to determine a pretty precise type.
> Metadata type is always VARCHAR for CSV & Excel modules
> -------------------------------------------------------
>
> Key: METAMODEL-82
> URL: https://issues.apache.org/jira/browse/METAMODEL-82
> Project: Metamodel
> Issue Type: Bug
> Reporter: Alberto Rodriguez
>
> The metadata types fetched from CSV and Excel DataContexts are always VARCHAR although the data of some columns are not a string.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)