You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@accumulo.apache.org by "Christopher Tubbs (JIRA)" <ji...@apache.org> on 2016/01/28 18:37:40 UTC
[jira] [Comment Edited] (ACCUMULO-4119) Consistently use
DataFileValue to parse as UTF-8
[ https://issues.apache.org/jira/browse/ACCUMULO-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15121941#comment-15121941 ]
Christopher Tubbs edited comment on ACCUMULO-4119 at 1/28/16 5:37 PM:
----------------------------------------------------------------------
bq. That does seem weird. I can't think of a reason why timestamps on the bulk files would influence our size estimation.
The original code basically did {code}if ((a = string.split(",")).length == 2) { tabletSize += a[0]; }{code}. It looks like it was just a mistake in avoiding ArrayIndexOutOfBoundsException, which only stood out when it was translated to its equivalent DataFileValue class usage.
was (Author: ctubbsii):
bq. That does seem weird. I can't think of a reason why timestamps on the bulk files would influence our size estimation.
The original code basically did {{ if ((a = string.split(",").length) == 2) \{ tabletSize += a[0]; \} }}. It looks like it was just a mistake in avoiding ArrayIndexOutOfBoundsException, which only stood out when it was translated to its equivalent DataFileValue class usage.
> Consistently use DataFileValue to parse as UTF-8
> ------------------------------------------------
>
> Key: ACCUMULO-4119
> URL: https://issues.apache.org/jira/browse/ACCUMULO-4119
> Project: Accumulo
> Issue Type: Bug
> Reporter: Christopher Tubbs
> Assignee: Christopher Tubbs
> Fix For: 1.6.5, 1.7.1, 1.8.0
>
>
> Noticed that we weren't always using DataFileValue to parse file values from the metadata tables, and also that DataFileValue doesn't specify UTF-8 encoding explicitly.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)