Posted to issues@tajo.apache.org by "Jinho Kim (JIRA)" <ji...@apache.org> on 2014/11/03 11:04:34 UTC
[jira] [Updated] (TAJO-1149) Implement direct read of DelimitedTextFile
[ https://issues.apache.org/jira/browse/TAJO-1149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jinho Kim updated TAJO-1149:
----------------------------
Description:
If a text file is uncompressed, the scanner can read it through a direct buffer.
In all other cases, it reads from an input stream.
This patch also includes the following refactoring:
* Rename properties
** {{csvfile.delimiter}} to {{text.delimiter}}
** {{csvfile.null}} to {{text.null}}
* Add DelimitedLineReader
** This makes it easier to implement text-format variations.
* Use unsafe splitting of lines and fields
** This should give better performance.
* Remove the seekable interface
** CSVFile is used only for debugging.
was:
if a text file is uncompressed, scanner is able to read from direct buffer
Summary: Implement direct read of DelimitedTextFile (was: Implement direct read of TextFile scanner)
> Implement direct read of DelimitedTextFile
> ------------------------------------------
>
> Key: TAJO-1149
> URL: https://issues.apache.org/jira/browse/TAJO-1149
> Project: Tajo
> Issue Type: Sub-task
> Components: storage
> Reporter: Jinho Kim
> Assignee: Jinho Kim
> Labels: performance
>
> If a text file is uncompressed, the scanner can read it through a direct buffer.
> In all other cases, it reads from an input stream.
> This patch also includes the following refactoring:
> * Rename properties
> ** {{csvfile.delimiter}} to {{text.delimiter}}
> ** {{csvfile.null}} to {{text.null}}
> * Add DelimitedLineReader
> ** This makes it easier to implement text-format variations.
> * Use unsafe splitting of lines and fields
> ** This should give better performance.
> * Remove the seekable interface
> ** CSVFile is used only for debugging.
>
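The two read paths in the description can be illustrated with a minimal Java sketch. It is not the code in this patch: the class DelimitedReadSketch and all of its methods are hypothetical names, gzip stands in for "any compressed input", lines are assumed to end with '\n' and to be UTF-8 encoded, and the field split is a plain character scan rather than the unsafe split the patch refers to. The delimiter argument plays the role of the value that {{text.delimiter}} would supply.

```java
import java.io.BufferedReader;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.ByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.GZIPInputStream;

// Hypothetical illustration only; this is not Tajo's DelimitedLineReader.
class DelimitedReadSketch {

  // Picks the read path: direct buffer when uncompressed, input stream otherwise.
  static List<String> readLines(Path file, boolean gzipCompressed) throws IOException {
    if (!gzipCompressed) {
      return readLinesDirect(file, 64 * 1024);
    }
    try (InputStream raw = Files.newInputStream(file);
         InputStream in = new GZIPInputStream(raw)) {
      return readLinesStream(in);
    }
  }

  // Uncompressed path: fill a direct ByteBuffer from the channel and cut at '\n'.
  // bufferSize is assumed to be greater than zero.
  static List<String> readLinesDirect(Path file, int bufferSize) throws IOException {
    List<String> lines = new ArrayList<>();
    ByteArrayOutputStream pending = new ByteArrayOutputStream(); // bytes of the current line
    try (FileChannel channel = FileChannel.open(file, StandardOpenOption.READ)) {
      ByteBuffer buffer = ByteBuffer.allocateDirect(bufferSize);
      while (channel.read(buffer) != -1) {
        buffer.flip();
        while (buffer.hasRemaining()) {
          byte b = buffer.get();
          if (b == '\n') {
            lines.add(new String(pending.toByteArray(), StandardCharsets.UTF_8));
            pending.reset();
          } else {
            pending.write(b);
          }
        }
        buffer.clear();
      }
    }
    if (pending.size() > 0) { // last line without a trailing newline
      lines.add(new String(pending.toByteArray(), StandardCharsets.UTF_8));
    }
    return lines;
  }

  // Fallback path: read lines from any (for example, decompressing) input stream.
  static List<String> readLinesStream(InputStream in) throws IOException {
    List<String> lines = new ArrayList<>();
    try (BufferedReader reader =
             new BufferedReader(new InputStreamReader(in, StandardCharsets.UTF_8))) {
      String line;
      while ((line = reader.readLine()) != null) {
        lines.add(line);
      }
    }
    return lines;
  }

  // Splits one line on a single-character delimiter, keeping empty fields.
  static List<String> splitFields(String line, char delimiter) {
    List<String> fields = new ArrayList<>();
    int start = 0;
    for (int i = 0; i < line.length(); i++) {
      if (line.charAt(i) == delimiter) {
        fields.add(line.substring(start, i));
        start = i + 1;
      }
    }
    fields.add(line.substring(start));
    return fields;
  }
}
```

In this sketch both paths return the same list of lines, so a caller only has to decide once, from whether the file is compressed, which path to take. Empty fields are kept as empty strings; mapping them to a null marker such as the one configured by {{text.null}} is left out.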