You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Edward Yoon (JIRA)" <ji...@apache.org> on 2007/12/02 09:31:43 UTC

[jira] Created: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

[Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
--------------------------------------------------------------------------------------------

                 Key: HADOOP-2329
                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
             Project: Hadoop
          Issue Type: New Feature
          Components: contrib/hbase
    Affects Versions: 0.16.0
            Reporter: Edward Yoon
            Assignee: Edward Yoon
             Fix For: 0.16.0


A built-in data type is a fundamental data type that the hbase shell defines.
(character strings, scalars, ranges, arrays, ... , etc)

If you need a specialized data type that is not currently provided as a built-in type, 
you are encouraged to write your own user-defined data type using UDF(not yet implemented).
(or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Edward Yoon updated HADOOP-2329:
--------------------------------

    Description: 
A built-in data type is a fundamental data type that the hbase shell defines.
(character strings, scalars, ranges, arrays, ... , etc)

If you need a specialized data type that is not currently provided as a built-in type, 
you are encouraged to write your own user-defined data type using UDC(not yet implemented).
(or contribute it for distribution in a future release of hbase shell)

  was:
A built-in data type is a fundamental data type that the hbase shell defines.
(character strings, scalars, ranges, arrays, ... , etc)

If you need a specialized data type that is not currently provided as a built-in type, 
you are encouraged to write your own user-defined data type using UDF(not yet implemented).
(or contribute it for distribution in a future release of hbase shell)


> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Issue Comment Edited: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548558 ] 

udanax edited comment on HADOOP-2329 at 12/4/07 11:16 PM:
---------------------------------------------------------------

Above comment means... "I didn't said that Hbase should provides built-in data type."

Let's see the example. 

Some DB solutions can be provide only byte[] data type.
And then, ... listen to their slogan.. "you can stores own arbitrary data type. look!! this is awesome flexible DB solution. Let's make your own datum and data types. "..... 
We can think about why they didn't...

Also, i think it is not makes any problem. 
It's just a function for the convenience of users.

# only byte[] VS { integer or character or byte[] or .... , etc}

It means that each user needn't develop for the provided built-in data type in everytime.

      was (Author: udanax):
    Above comment means... "I didn't said that Hbase should provides built-in data type."

Let's see the example. 

Some DB solutions can be provide only byte[] data type.
And then, ... listen to their slogan.. "you can stores own arbitrary data type. look!! this is awesome flexible DB solution. Let's make your own datum and data types. "..... 
We can think about why they didn't...

Also, i think it is not makes any problem. 
It just makes user can choice, so we can provides convenience.

only byte[] VS { integer or character or byte[] or .... , etc}

It means that each user needn't develop the exist built-in data type in everytime.
  
> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12547710 ] 

Edward Yoon commented on HADOOP-2329:
-------------------------------------

I'd like to add the type information to column descriptor.
Let folks add either arbitrary values or built-in data types to Cell.

Any Comments are welcome.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12547728 ] 

Edward Yoon commented on HADOOP-2329:
-------------------------------------

I think this option effects all columns in columnfamily.
It make generalizations about columns in columnfamily.
Is it problem?
I don't think so, We can just use the current byte[] type.

{code}
Insert into table (String, String) values ('stirng value', '3');
Insert into table (String, Integer) values ('stirng value', 3);
{code}

So, It doen't matter.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Issue Comment Edited: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12547597 ] 

udanax edited comment on HADOOP-2329 at 12/2/07 4:49 PM:
--------------------------------------------------------------

- { Create, Insert, Select } Syntax and Codes will be changed.

Built-In Data Types
||Data Type||Explanation|| 
|BOOLEAN|Stores the Boolean values for true and false|
|BYTE|Stores binary data that are not random access|
|CHAR(*n*)|Stores single-byte or multibyte sequences of characters, including letters, numbers, and symbols of fixed length Collation is code-set dependent|
|VARCHAR(*m*,*r*)|Stores single-byte or multibyte strings of letters, numbers, and symbols of varying length to a maximum of 255 bytes Collation is code-set dependent|
|DECIMAL|Stores numbers with definable scale and precision|
|INTEGER or INT|Stores whole numbers from -(2^31^-1) to 2^31^-1|
|SMALLINT|Stores whole numbers from -(2^15^-1) to 2^15^-1|
|INT8|Stores an 8-byte integer value These whole numbers can be in the range -(2^63^-1) to 2^63^-1|
|SERIAL|Stores sequential integers; has the same range of values as INTEGER|
|SERIAL8|Stores large sequential integers; has the same range of values as INT8|
|TEXT|Stores text data in chunks that are not random access|

      was (Author: udanax):
    - { Create, Insert, Update } Syntax will be changed.

Built-In Data Types
||Data Type||Explanation|| 
|BOOLEAN|Stores the Boolean values for true and false|
|BYTE|Stores binary data that are not random access|
|CHAR(*n*)|Stores single-byte or multibyte sequences of characters, including letters, numbers, and symbols of fixed length Collation is code-set dependent|
|VARCHAR(*m*,*r*)|Stores single-byte or multibyte strings of letters, numbers, and symbols of varying length to a maximum of 255 bytes Collation is code-set dependent|
|DECIMAL|Stores numbers with definable scale and precision|
|INTEGER or INT|Stores whole numbers from -(231-1) to 231-1|
|SMALLINT|Stores whole numbers from -(215-1) to 215-1|
|INT8|Stores an 8-byte integer value These whole numbers can be in the range -(263-1) to 263-1|
|SERIAL|Stores sequential integers; has the same range of values as INTEGER|
|SERIAL8|Stores large sequential integers; has the same range of values as INT8|
|TEXT|Stores text data in chunks that are not random access|
  
> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548017 ] 

Edward Yoon commented on HADOOP-2329:
-------------------------------------

>> HADOOP-2197 would permit applications to tag columns with arbitrary key/value pairs. Thus an application could store family member/type information using this mechanism.

ugh... ok!

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548546 ] 

Edward Yoon commented on HADOOP-2329:
-------------------------------------

I don't think so.
Shell can't handle type even if there's a good alternative already in the works.

This issue is exist for shell.


> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548881 ] 

Edward Yoon commented on HADOOP-2329:
-------------------------------------

OK, i see jim.
But, i don't know the movements opposed to shell operations. :)

I think there can be no cause for complaint.
The shell tool isn't threatening a pure Hbase.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>            Priority: Trivial
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548547 ] 

Edward Yoon commented on HADOOP-2329:
-------------------------------------

It will be clarify the scope of various operations.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Jim Kellerman (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12549225 ] 

Jim Kellerman commented on HADOOP-2329:
---------------------------------------

One more thought with respect to shell operations. If relational and matrix operations, etc., were packaged in a separate jar, it would be easier to use in a map/reduce job wouldn't it? And if they supported different input sources, they could be potentially used in a generic (that is, non-hbase) map/reduce operation.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>            Priority: Trivial
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12550255 ] 

Edward Yoon commented on HADOOP-2329:
-------------------------------------

>> Since you are proposing the former rather than the latter, I would say go for it. 

Is it okay?
I'm not sure.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>            Priority: Trivial
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Work started: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Work on HADOOP-2329 started by Edward Yoon.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDF(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Issue Comment Edited: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12547597 ] 

udanax edited comment on HADOOP-2329 at 12/2/07 2:28 AM:
--------------------------------------------------------------

- { Create, Insert, Update } Syntax will be changed.

Built-In Data Types
||Data Type||Explanation|| 
|BOOLEAN|Stores the Boolean values for true and false|
|BYTE|Stores binary data that are not random access|
|CHAR(*n*)|Stores single-byte or multibyte sequences of characters, including letters, numbers, and symbols of fixed length Collation is code-set dependent|
|VARCHAR(*m*,*r*)|Stores single-byte or multibyte strings of letters, numbers, and symbols of varying length to a maximum of 255 bytes Collation is code-set dependent|
|DECIMAL|Stores numbers with definable scale and precision|
|INTEGER or INT|Stores a span of time|
|SMALLINT|Stores whole numbers from -(215-1) to 215-1|
|INT8|Stores an 8-byte integer value These whole numbers can be in the range -(263-1) to 263-1|
|SERIAL|Stores sequential integers; has the same range of values as INTEGER|
|SERIAL8|Stores large sequential integers; has the same range of values as INT8|
|TEXT|Stores text data in chunks that are not random access|

      was (Author: udanax):
    - { Create, Insert, Update } Syntax will be changed.

Built-In Data Types
||Data Type||Explanation|| 
|BOOLEAN|Stores the Boolean values for true and false|
|BYTE|Stores binary data that are not random access|
|DECIMAL|Stores numbers with definable scale and precision|
|INT|Is a synonym for INTEGER|
|INT8|Stores an 8-byte integer value These whole numbers can be in the range -(263-1) to 263-1|
|INTEGER|Stores a span of time|
|REAL|Is a synonym for SMALLFLOAT|
|SERIAL|Stores sequential integers; has the same range of values as INTEGER|
|SERIAL8|Stores large sequential integers; has the same range of values as INT8|
|TEXT|Stores text data in chunks that are not random access|
  
> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Issue Comment Edited: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548558 ] 

udanax edited comment on HADOOP-2329 at 12/4/07 11:06 PM:
---------------------------------------------------------------

Above comment means... "I didn't said that Hbase should provides built-in data type."

Let's see the example. 

Some DB solutions can be provide only byte[] data type.
And then, ... listen to their slogan.. "you can stores own arbitrary data type. look!! this is awesome flexible DB solution. Let's make your own datum and data types. "..... 
We can think about why they didn't...

Also, i think it is not makes any problem. 
It just makes user can choice, so we can provides convenience.

only byte[] VS { integer or character or byte[] or .... , etc}

It means that each user needn't develop the exist built-in data type in everytime.

      was (Author: udanax):
    Above comment means... "I didn't said that Hbase should provides built-in data type."

Let's see the example. 

Some DB solutions can be provide only byte[] data type.
And then, ... listen to their slogan.. "you can stores own arbitrary data type. look!! this is awesome flexible DB solution. Let's make your own datum and data types. "..... 
We can think about why they didn't...

Also, i think it is not makes any problem. 
It just makes user can choice, so we can provides convenience.

only byte[] VS { integer or character or byte[] or .... , etc}

It means that each user shouldn't make general built-in data type at everytime.
  
> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Jim Kellerman (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548877 ] 

Jim Kellerman commented on HADOOP-2329:
---------------------------------------

One of the stated goals of the HBase project is to produce a system as similar to Bigtable as possible (see http://wiki.apache.org/lucene-hadoop/Hbase#goals). In this spirit, HBase will remain typeless and it is likely that we will go ahead with HADOOP-2334 (making row keys WritableComparable instead of Text) once we get a chance to breathe after getting out from under the major bugs.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>            Priority: Trivial
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548558 ] 

Edward Yoon commented on HADOOP-2329:
-------------------------------------

Above comment means... "I didn't said that Hbase should provides built-in data type."

Let's see the example. 

Some DB solutions can be provide only byte[] data type.
And then, ... listen to their slogan.. "you can stores own arbitrary data type. look!! this is awesome flexible DB solution. Let's make your own datum and data types. "..... 
We can think about why they didn't...

Also, i think it is not makes any problem. 
It just makes user can choice, so we can provides convenience.

only byte[] VS { integer or character or byte[] or .... , etc}

It means that each user shouldn't make general built-in data type at everytime.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Work stopped: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Work on HADOOP-2329 stopped by Edward Yoon.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>            Priority: Trivial
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548573 ] 

Edward Yoon commented on HADOOP-2329:
-------------------------------------

{code}
row      bike                                        car
row1     bike:name           <Harley davidson>       ...   
         bike:cc             <800>                   ...
         bike:price          <23,000> 
         bike:price_currency <U.S dollar>
         ...
{code}

For this case, i'm think some different method for each Type-definition.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Issue Comment Edited: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12547597 ] 

udanax edited comment on HADOOP-2329 at 12/2/07 4:50 PM:
--------------------------------------------------------------

- { Create, Insert, Select } Syntax and Codes will be changed.

Built-In Data Types
||Data Type||Explanation|| 
|BOOLEAN|Stores the Boolean values for true and false|
|BYTE|Stores binary data that are not random access|
|CHAR(*n*)|Stores single-byte or multibyte sequences of characters, including letters, numbers, and symbols of fixed length Collation is code-set dependent|
|VARCHAR(*m*,*r*)|Stores single-byte or multibyte strings of letters, numbers, and symbols of varying length to a maximum of 255 bytes Collation is code-set dependent|
|DECIMAL|Stores numbers with definable scale and precision|
|INTEGER or INT|Stores whole numbers from -(2 ^31^ -1) to 2 ^31^ -1|
|SMALLINT|Stores whole numbers from -(2 ^15^ -1) to 2 ^15^ -1|
|INT8|Stores an 8-byte integer value These whole numbers can be in the range -(2 ^63^ -1) to 2 ^63^ -1|
|SERIAL|Stores sequential integers; has the same range of values as INTEGER|
|SERIAL8|Stores large sequential integers; has the same range of values as INT8|
|TEXT|Stores text data in chunks that are not random access|

      was (Author: udanax):
    - { Create, Insert, Select } Syntax and Codes will be changed.

Built-In Data Types
||Data Type||Explanation|| 
|BOOLEAN|Stores the Boolean values for true and false|
|BYTE|Stores binary data that are not random access|
|CHAR(*n*)|Stores single-byte or multibyte sequences of characters, including letters, numbers, and symbols of fixed length Collation is code-set dependent|
|VARCHAR(*m*,*r*)|Stores single-byte or multibyte strings of letters, numbers, and symbols of varying length to a maximum of 255 bytes Collation is code-set dependent|
|DECIMAL|Stores numbers with definable scale and precision|
|INTEGER or INT|Stores whole numbers from -(2^31^-1) to 2^31^-1|
|SMALLINT|Stores whole numbers from -(2^15^-1) to 2^15^-1|
|INT8|Stores an 8-byte integer value These whole numbers can be in the range -(2^63^-1) to 2^63^-1|
|SERIAL|Stores sequential integers; has the same range of values as INTEGER|
|SERIAL8|Stores large sequential integers; has the same range of values as INT8|
|TEXT|Stores text data in chunks that are not random access|
  
> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Bryan Duxbury (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548855 ] 

Bryan Duxbury commented on HADOOP-2329:
---------------------------------------

I don't think there should be a type field. That's up to the application to deal with. It would add a ton of overhead to everything in HBase and require a huge overhaul of how stuff works. It would also take away a good deal of flexibility. 

The fact that the shell cannot understand user-supplied key/value based data types is not a good motivation for adding it. The shell should really only be a administrative utility anyway, just enough to be able create and drop tables and to peek at a row here or there. I doubt that people who write their applications to use HBase are going to be limited by the lack of built-in data types.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548869 ] 

Edward Yoon commented on HADOOP-2329:
-------------------------------------

>> The shell should really only be a administrative utility anyway, just enough to be able create and drop tables and to peek at a row here or there.

I don't think so.
What do you think about this ment.

- The shell should really only be a administrative utility anyway, just enough to be *reboot* and *dir* and to peek at a *file name* here or there.

>> It would add a ton of overhead to everything in HBase and require a huge overhaul of how stuff works.
>> It would also take away a good deal of flexibility.

I don't think so. you can just use the byte[]. ok?
Also, applications developers need to modeling capacity on Hbase. (It's very difficult in my experience, so shell's guide will be very useful.)

>> I doubt that people who write their applications to use HBase are going to be limited by the lack of built-in data types.

I don't think so.
If you are studied Database and Math, you can really powerful use the Some DB solutions.
But, many peoples(application developers) can't.....

Why...?

More think please.








> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>            Priority: Trivial
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Bryan Duxbury (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548543 ] 

Bryan Duxbury commented on HADOOP-2329:
---------------------------------------

If there's a good alternative already in the works, can we close this issue?

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12549236 ] 

Edward Yoon commented on HADOOP-2329:
-------------------------------------

Thanks for your advice.

Then,...  

- We can just use the MapFiles on HDFS.
- Of course, batch scheduler can be managing data structure as a "bigTable".
- And you can work using the mapreduce parallel model.

How do you think about scheduler deamonizing?

- No, users can be misunderstand that this structure system is all of HDFS.
- If we don't it, users could be potentially ....

I think your logic is like above.
Automation has revolutionized the industrial world.

In my opinion, we can discuss this after something done.


I has just began........  ........ ....... ... .. .. .. . .. . .






> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>            Priority: Trivial
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Jim Kellerman (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548888 ] 

Jim Kellerman commented on HADOOP-2329:
---------------------------------------

Edward,

> But, i don't know the movements opposed to shell operations.

I don't think there is opposition to what you are doing, other than some people feel that the advanced shell operations are not necessary in a basic shell that can do simple queries and administrative functions. If the advanced features could be packaged in a separate jar and loaded via some command line option, I think it would gain higher acceptance.

> I think there can be no cause for complaint.
> The shell tool isn't threatening a pure Hbase.

I think I am misunderstanding something here. Are you proposing to do the data types entirely outside of HBase or leveraging HADOOP-2197 ? Or do you want internal support for data types?

If you are thinking of the former, that's fine. But I don't think support for data types should be in the core of HBase.


> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>            Priority: Trivial
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12549238 ] 

Edward Yoon commented on HADOOP-2329:
-------------------------------------

What do you think about { RDF storage on Hbase on Hadoop,  Matrix storage on Hbase on Hadoop }?


> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>            Priority: Trivial
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548893 ] 

Edward Yoon commented on HADOOP-2329:
-------------------------------------

>> Are you proposing to do the data types entirely outside of HBase or leveraging HADOOP-2197 ? Or do you want internal support for data types?

Yes, I'm thinking the former.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>            Priority: Trivial
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548908 ] 

Edward Yoon commented on HADOOP-2329:
-------------------------------------

Thanks for your advice.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>            Priority: Trivial
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12547703 ] 

Edward Yoon commented on HADOOP-2329:
-------------------------------------

Let's see the integer value example.
{code}
int integer = 123456789;

ByteBuffer buff = ByteBuffer.allocate(Integer.SIZE / 8);
buff.putInt(integer);
buff.order(ByteOrder.BIG_ENDIAN);

1: table.put(lockId, column, buff.array());
2: table.put(lockId, column, "123456789".getBytes());
{code}

4 bytes vs 9 bytes.
Nine is the almost double of four.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12547597 ] 

Edward Yoon commented on HADOOP-2329:
-------------------------------------

- { Create, Insert, Update } Syntax will be changed.

Built-In Data Types
||Data Type||Explanation|| 
|BOOLEAN|Stores the Boolean values for true and false|
|BYTE|Stores binary data that are not random access|
|DECIMAL|Stores numbers with definable scale and precision|
|INT|Is a synonym for INTEGER|
|INT8|Stores an 8-byte integer value These whole numbers can be in the range -(263-1) to 263-1|
|INTEGER|Stores a span of time|
|REAL|Is a synonym for SMALLFLOAT|
|SERIAL|Stores sequential integers; has the same range of values as INTEGER|
|SERIAL8|Stores large sequential integers; has the same range of values as INT8|
|TEXT|Stores text data in chunks that are not random access|

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDF(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Issue Comment Edited: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12547597 ] 

udanax edited comment on HADOOP-2329 at 12/2/07 4:47 PM:
--------------------------------------------------------------

- { Create, Insert, Update } Syntax will be changed.

Built-In Data Types
||Data Type||Explanation|| 
|BOOLEAN|Stores the Boolean values for true and false|
|BYTE|Stores binary data that are not random access|
|CHAR(*n*)|Stores single-byte or multibyte sequences of characters, including letters, numbers, and symbols of fixed length Collation is code-set dependent|
|VARCHAR(*m*,*r*)|Stores single-byte or multibyte strings of letters, numbers, and symbols of varying length to a maximum of 255 bytes Collation is code-set dependent|
|DECIMAL|Stores numbers with definable scale and precision|
|INTEGER or INT|Stores whole numbers from -(231-1) to 231-1|
|SMALLINT|Stores whole numbers from -(215-1) to 215-1|
|INT8|Stores an 8-byte integer value These whole numbers can be in the range -(263-1) to 263-1|
|SERIAL|Stores sequential integers; has the same range of values as INTEGER|
|SERIAL8|Stores large sequential integers; has the same range of values as INT8|
|TEXT|Stores text data in chunks that are not random access|

      was (Author: udanax):
    - { Create, Insert, Update } Syntax will be changed.

Built-In Data Types
||Data Type||Explanation|| 
|BOOLEAN|Stores the Boolean values for true and false|
|BYTE|Stores binary data that are not random access|
|CHAR(*n*)|Stores single-byte or multibyte sequences of characters, including letters, numbers, and symbols of fixed length Collation is code-set dependent|
|VARCHAR(*m*,*r*)|Stores single-byte or multibyte strings of letters, numbers, and symbols of varying length to a maximum of 255 bytes Collation is code-set dependent|
|DECIMAL|Stores numbers with definable scale and precision|
|INTEGER or INT|Stores a span of time|
|SMALLINT|Stores whole numbers from -(215-1) to 215-1|
|INT8|Stores an 8-byte integer value These whole numbers can be in the range -(263-1) to 263-1|
|SERIAL|Stores sequential integers; has the same range of values as INTEGER|
|SERIAL8|Stores large sequential integers; has the same range of values as INT8|
|TEXT|Stores text data in chunks that are not random access|
  
> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Bryan Duxbury (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Bryan Duxbury updated HADOOP-2329:
----------------------------------

    Priority: Trivial  (was: Major)

Not a major issue.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>            Priority: Trivial
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Jim Kellerman (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548012 ] 

Jim Kellerman commented on HADOOP-2329:
---------------------------------------

-1

Adding value types to the HBase server side is not a good idea. Since members of a column family can be created on an ad-hoc basis, there would be a lot of bookkeeping to do to determine if the family member should be of a particular type. And if there were no data about a particular family member, what type should it be? 

It would be unacceptable to force all members of a column family to be the same type.

Additionally there have been requests to loosen the restrictions on the row key being a Text and instead, accepting any WritableComparable as the row key.

HADOOP-2197 would permit applications to tag columns with arbitrary key/value pairs. Thus an application could store family member/type information using this mechanism.

Bigtable is typeless and I think HBase should be as well.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Issue Comment Edited: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Edward Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548558 ] 

udanax edited comment on HADOOP-2329 at 12/4/07 11:47 PM:
---------------------------------------------------------------

Some DB solutions can be provide only byte[] data type.
And then, ... listen to their slogan.. "you can stores own arbitrary data type. look!! this is awesome flexible DB solution. Let's make your own datum and data types. "..... 
We can think about why they didn't...

Let's view the matter.
Built-in Data Types does not interrupt the frexible because it's just a function for the convenience of users.

{code}
only byte[] VS { integer or character or byte[] or .... , etc}
{code}

It means that each time users needn't tiresome of develop the provided built-in data types.

      was (Author: udanax):
    Above comment means... "I didn't said that Hbase should provides built-in data type."

Let's see the example. 

Some DB solutions can be provide only byte[] data type.
And then, ... listen to their slogan.. "you can stores own arbitrary data type. look!! this is awesome flexible DB solution. Let's make your own datum and data types. "..... 
We can think about why they didn't...

Also, i think it is not makes any problem. 
It's just a function for the convenience of users.

# only byte[] VS { integer or character or byte[] or .... , etc}

It means that each user needn't develop for the provided built-in data type in everytime.
  
> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-2329) [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data

Posted by "Jim Kellerman (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548906 ] 

Jim Kellerman commented on HADOOP-2329:
---------------------------------------

Since you are proposing the former rather than the latter, I would say go for it.

> [Hbase Shell] Addition of Built-In Value Data Types for efficient accessing and stroing data
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2329
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2329
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>            Priority: Trivial
>             Fix For: 0.16.0
>
>
> A built-in data type is a fundamental data type that the hbase shell defines.
> (character strings, scalars, ranges, arrays, ... , etc)
> If you need a specialized data type that is not currently provided as a built-in type, 
> you are encouraged to write your own user-defined data type using UDC(not yet implemented).
> (or contribute it for distribution in a future release of hbase shell)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.