You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hbase.apache.org by Nicolas Paris <ni...@gmail.com> on 2018/05/19 07:23:33 UTC

MOB integration

Hi


I am using hbase 1.1 and hive 1.2

I created an hbase table with a mob column with the default
threshold (100K)
I mapped the table into hive with a binary format, and loaded
20M of pdf of size between 50k and 20mb

Apparently the mob is not populated because when I look into
hdfs hbase/data/mlob, it is a nearly empty folder.

Does it mean hive cannot populate hbase mob columns  ?

Thanks

Re: MOB integration

Posted by Ted Yu <yu...@gmail.com>.
regionserver logs on your server would tell you what happened during the
ingestion.

BTW Mob feature is not in 1.1.x releases. You're likely using a vendor's
backport.

There have been continuous improvements to Mob feature since its initial
checkin.
So there may be some difference in the details between the release you use
and hbase 2.0 (which I used to generate the logs I quoted).

On Sat, May 19, 2018 at 11:29 AM, Nicolas Paris <ni...@gmail.com> wrote:

> 2018-05-19 20:08 GMT+02:00 Ted Yu <yu...@gmail.com>:
>
> > Mob store file is renamed from /apps/hbase/data/mobdir to the final
> > location under region directory for the table.
> >
> > This explains why you don't see much data under mobdir since data
> ingestion
> > has finished.
> >
>
>
> ​Well, I monitored the mobdir folder during ingestion. Nothing happened in
> it.​
> Data were directly going under the table in the region.
>
> BTW, thats why I was thinking the pdf were treated as regular binary files.
> Certainly reading the regionserver logs will help.
>
> Finallly, if hive is able to load mob columns, that's a good news for me.
>
>
>
>
> >
> > Cheers
> >
> > On Sat, May 19, 2018 at 9:58 AM, Nicolas Paris <ni...@gmail.com>
> > wrote:
> >
> > > Not having access cluster for few days, but I will be looking
> > > to logs.
> > >
> > > However, when looking at your logs, it seems that I mispell
> > > my mlob dir in the first post. It was "mobdir".
> > > The /apps/hbase/data/mobdir/ is nearly empty, sizing 4 or 10 kb
> > >
> > > Would this confirm the mob flushing process wouldn't be activated ?
> > >
> > >
> > >
> > > 2018-05-19 18:38 GMT+02:00 Ted Yu <yu...@gmail.com>:
> > >
> > > > If you have a chance to look at region server log, you would see some
> > > line
> > > > such as the following:
> > > >
> > > > 2018-05-19 16:31:23,548 INFO  [MemStoreFlusher.0]
> > regionserver.HMobStore:
> > > > Renaming flushed file from
> > > > hdfs://mycluster/apps/hbase/data/mobdir/.tmp/
> > > > 28e252d7f013973174750d483d358fa020180519dd8e7c3d67814eb0b5fb
> > 06fb9e800377
> > > > to
> > > > hdfs://mycluster/apps/hbase/data/mobdir/data/default/
> > > > IntegrationTestIngestWithMOB/e9b5d936e7f55a4f1c3246a8d5ce53
> c2/test_cf/
> > > > 28e252d7f013973174750d483d358fa020180519dd8e7c3d67814eb0b5fb
> > 06fb9e800377
> > > >
> > > > Meaning Mob store file is first saved under
> > > /apps/hbase/data/mobdir/.tmp/ ,
> > > > then renamed to under the usual location under region directory for
> the
> > > > table.
> > > >
> > > > From high level, as long as you can query what you ingested, you can
> be
> > > > assured that Mob data is persisted.
> > > >
> > > > Cheers
> > > >
> > > > On Sat, May 19, 2018 at 8:43 AM, Nicolas Paris <ni...@gmail.com>
> > > > wrote:
> > > >
> > > > > Hi
> > > > >
> > > > > ​Yes the data comes back as expected.
> > > > > My table is not called "mlob" however since I found such folder
> > > > > I thought it was storing mob objects.
> > > > >
> > > > > I do have 500 folder hashed as you mentionned. They contains the
> > > > > whole dataset (2TO)
> > > > > However, how beeing sure the data is actually stored as MOB (and
> not
> > > > > as traditional binary)
> > > > >
> > > > > Thanks
> > > > >
> > > > >
> > > > > 2018-05-19 15:59 GMT+02:00 Ted Yu <yu...@gmail.com>:
> > > > >
> > > > > > bq. look into hdfs hbase/data/mlob
> > > > > >
> > > > > > Is 'mlob' name of your table ?
> > > > > >
> > > > > > bq. nearly empty folder
> > > > > >
> > > > > > Here is listing under a one region table:
> > > > > >
> > > > > > drwxr-xr-x   - hbase hdfs          0 2018-05-16 23:51
> > > > > > /apps/hbase/data/data/default/atlas_janus/.tabledesc
> > > > > > drwxr-xr-x   - hbase hdfs          0 2018-05-16 23:51
> > > > > > /apps/hbase/data/data/default/atlas_janus/.tmp
> > > > > > drwxr-xr-x   - hbase hdfs          0 2018-05-17 00:55
> > > > > > /apps/hbase/data/data/default/atlas_janus/
> > > > 8033ea259cb7272d43bc137ca0ab29
> > > > > 06
> > > > > >
> > > > > > Not sure if the above matches your description of being nearly
> > empty.
> > > > > > Here data is stored under 8033ea259cb7272d43bc137ca0ab2906
> > > > > >
> > > > > > If you query the table, does the data come back as expected ?
> > > > > >
> > > > > > Thanks
> > > > > >
> > > > > > On Sat, May 19, 2018 at 12:23 AM, Nicolas Paris <
> > niparisco@gmail.com
> > > >
> > > > > > wrote:
> > > > > >
> > > > > > > Hi
> > > > > > >
> > > > > > >
> > > > > > > I am using hbase 1.1 and hive 1.2
> > > > > > >
> > > > > > > I created an hbase table with a mob column with the default
> > > > > > > threshold (100K)
> > > > > > > I mapped the table into hive with a binary format, and loaded
> > > > > > > 20M of pdf of size between 50k and 20mb
> > > > > > >
> > > > > > > Apparently the mob is not populated because when I look into
> > > > > > > hdfs hbase/data/mlob, it is a nearly empty folder.
> > > > > > >
> > > > > > > Does it mean hive cannot populate hbase mob columns  ?
> > > > > > >
> > > > > > > Thanks
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
>

Re: MOB integration

Posted by Nicolas Paris <ni...@gmail.com>.
2018-05-19 20:08 GMT+02:00 Ted Yu <yu...@gmail.com>:

> Mob store file is renamed from /apps/hbase/data/mobdir to the final
> location under region directory for the table.
>
> This explains why you don't see much data under mobdir since data ingestion
> has finished.
>


​Well, I monitored the mobdir folder during ingestion. Nothing happened in
it.​
Data were directly going under the table in the region.

BTW, thats why I was thinking the pdf were treated as regular binary files.
Certainly reading the regionserver logs will help.

Finallly, if hive is able to load mob columns, that's a good news for me.




>
> Cheers
>
> On Sat, May 19, 2018 at 9:58 AM, Nicolas Paris <ni...@gmail.com>
> wrote:
>
> > Not having access cluster for few days, but I will be looking
> > to logs.
> >
> > However, when looking at your logs, it seems that I mispell
> > my mlob dir in the first post. It was "mobdir".
> > The /apps/hbase/data/mobdir/ is nearly empty, sizing 4 or 10 kb
> >
> > Would this confirm the mob flushing process wouldn't be activated ?
> >
> >
> >
> > 2018-05-19 18:38 GMT+02:00 Ted Yu <yu...@gmail.com>:
> >
> > > If you have a chance to look at region server log, you would see some
> > line
> > > such as the following:
> > >
> > > 2018-05-19 16:31:23,548 INFO  [MemStoreFlusher.0]
> regionserver.HMobStore:
> > > Renaming flushed file from
> > > hdfs://mycluster/apps/hbase/data/mobdir/.tmp/
> > > 28e252d7f013973174750d483d358fa020180519dd8e7c3d67814eb0b5fb
> 06fb9e800377
> > > to
> > > hdfs://mycluster/apps/hbase/data/mobdir/data/default/
> > > IntegrationTestIngestWithMOB/e9b5d936e7f55a4f1c3246a8d5ce53c2/test_cf/
> > > 28e252d7f013973174750d483d358fa020180519dd8e7c3d67814eb0b5fb
> 06fb9e800377
> > >
> > > Meaning Mob store file is first saved under
> > /apps/hbase/data/mobdir/.tmp/ ,
> > > then renamed to under the usual location under region directory for the
> > > table.
> > >
> > > From high level, as long as you can query what you ingested, you can be
> > > assured that Mob data is persisted.
> > >
> > > Cheers
> > >
> > > On Sat, May 19, 2018 at 8:43 AM, Nicolas Paris <ni...@gmail.com>
> > > wrote:
> > >
> > > > Hi
> > > >
> > > > ​Yes the data comes back as expected.
> > > > My table is not called "mlob" however since I found such folder
> > > > I thought it was storing mob objects.
> > > >
> > > > I do have 500 folder hashed as you mentionned. They contains the
> > > > whole dataset (2TO)
> > > > However, how beeing sure the data is actually stored as MOB (and not
> > > > as traditional binary)
> > > >
> > > > Thanks
> > > >
> > > >
> > > > 2018-05-19 15:59 GMT+02:00 Ted Yu <yu...@gmail.com>:
> > > >
> > > > > bq. look into hdfs hbase/data/mlob
> > > > >
> > > > > Is 'mlob' name of your table ?
> > > > >
> > > > > bq. nearly empty folder
> > > > >
> > > > > Here is listing under a one region table:
> > > > >
> > > > > drwxr-xr-x   - hbase hdfs          0 2018-05-16 23:51
> > > > > /apps/hbase/data/data/default/atlas_janus/.tabledesc
> > > > > drwxr-xr-x   - hbase hdfs          0 2018-05-16 23:51
> > > > > /apps/hbase/data/data/default/atlas_janus/.tmp
> > > > > drwxr-xr-x   - hbase hdfs          0 2018-05-17 00:55
> > > > > /apps/hbase/data/data/default/atlas_janus/
> > > 8033ea259cb7272d43bc137ca0ab29
> > > > 06
> > > > >
> > > > > Not sure if the above matches your description of being nearly
> empty.
> > > > > Here data is stored under 8033ea259cb7272d43bc137ca0ab2906
> > > > >
> > > > > If you query the table, does the data come back as expected ?
> > > > >
> > > > > Thanks
> > > > >
> > > > > On Sat, May 19, 2018 at 12:23 AM, Nicolas Paris <
> niparisco@gmail.com
> > >
> > > > > wrote:
> > > > >
> > > > > > Hi
> > > > > >
> > > > > >
> > > > > > I am using hbase 1.1 and hive 1.2
> > > > > >
> > > > > > I created an hbase table with a mob column with the default
> > > > > > threshold (100K)
> > > > > > I mapped the table into hive with a binary format, and loaded
> > > > > > 20M of pdf of size between 50k and 20mb
> > > > > >
> > > > > > Apparently the mob is not populated because when I look into
> > > > > > hdfs hbase/data/mlob, it is a nearly empty folder.
> > > > > >
> > > > > > Does it mean hive cannot populate hbase mob columns  ?
> > > > > >
> > > > > > Thanks
> > > > > >
> > > > >
> > > >
> > >
> >
>

Re: MOB integration

Posted by Ted Yu <yu...@gmail.com>.
Mob store file is renamed from /apps/hbase/data/mobdir to the final
location under region directory for the table.

This explains why you don't see much data under mobdir since data ingestion
has finished.

Cheers

On Sat, May 19, 2018 at 9:58 AM, Nicolas Paris <ni...@gmail.com> wrote:

> Not having access cluster for few days, but I will be looking
> to logs.
>
> However, when looking at your logs, it seems that I mispell
> my mlob dir in the first post. It was "mobdir".
> The /apps/hbase/data/mobdir/ is nearly empty, sizing 4 or 10 kb
>
> Would this confirm the mob flushing process wouldn't be activated ?
>
>
>
> 2018-05-19 18:38 GMT+02:00 Ted Yu <yu...@gmail.com>:
>
> > If you have a chance to look at region server log, you would see some
> line
> > such as the following:
> >
> > 2018-05-19 16:31:23,548 INFO  [MemStoreFlusher.0] regionserver.HMobStore:
> > Renaming flushed file from
> > hdfs://mycluster/apps/hbase/data/mobdir/.tmp/
> > 28e252d7f013973174750d483d358fa020180519dd8e7c3d67814eb0b5fb06fb9e800377
> > to
> > hdfs://mycluster/apps/hbase/data/mobdir/data/default/
> > IntegrationTestIngestWithMOB/e9b5d936e7f55a4f1c3246a8d5ce53c2/test_cf/
> > 28e252d7f013973174750d483d358fa020180519dd8e7c3d67814eb0b5fb06fb9e800377
> >
> > Meaning Mob store file is first saved under
> /apps/hbase/data/mobdir/.tmp/ ,
> > then renamed to under the usual location under region directory for the
> > table.
> >
> > From high level, as long as you can query what you ingested, you can be
> > assured that Mob data is persisted.
> >
> > Cheers
> >
> > On Sat, May 19, 2018 at 8:43 AM, Nicolas Paris <ni...@gmail.com>
> > wrote:
> >
> > > Hi
> > >
> > > ​Yes the data comes back as expected.
> > > My table is not called "mlob" however since I found such folder
> > > I thought it was storing mob objects.
> > >
> > > I do have 500 folder hashed as you mentionned. They contains the
> > > whole dataset (2TO)
> > > However, how beeing sure the data is actually stored as MOB (and not
> > > as traditional binary)
> > >
> > > Thanks
> > >
> > >
> > > 2018-05-19 15:59 GMT+02:00 Ted Yu <yu...@gmail.com>:
> > >
> > > > bq. look into hdfs hbase/data/mlob
> > > >
> > > > Is 'mlob' name of your table ?
> > > >
> > > > bq. nearly empty folder
> > > >
> > > > Here is listing under a one region table:
> > > >
> > > > drwxr-xr-x   - hbase hdfs          0 2018-05-16 23:51
> > > > /apps/hbase/data/data/default/atlas_janus/.tabledesc
> > > > drwxr-xr-x   - hbase hdfs          0 2018-05-16 23:51
> > > > /apps/hbase/data/data/default/atlas_janus/.tmp
> > > > drwxr-xr-x   - hbase hdfs          0 2018-05-17 00:55
> > > > /apps/hbase/data/data/default/atlas_janus/
> > 8033ea259cb7272d43bc137ca0ab29
> > > 06
> > > >
> > > > Not sure if the above matches your description of being nearly empty.
> > > > Here data is stored under 8033ea259cb7272d43bc137ca0ab2906
> > > >
> > > > If you query the table, does the data come back as expected ?
> > > >
> > > > Thanks
> > > >
> > > > On Sat, May 19, 2018 at 12:23 AM, Nicolas Paris <niparisco@gmail.com
> >
> > > > wrote:
> > > >
> > > > > Hi
> > > > >
> > > > >
> > > > > I am using hbase 1.1 and hive 1.2
> > > > >
> > > > > I created an hbase table with a mob column with the default
> > > > > threshold (100K)
> > > > > I mapped the table into hive with a binary format, and loaded
> > > > > 20M of pdf of size between 50k and 20mb
> > > > >
> > > > > Apparently the mob is not populated because when I look into
> > > > > hdfs hbase/data/mlob, it is a nearly empty folder.
> > > > >
> > > > > Does it mean hive cannot populate hbase mob columns  ?
> > > > >
> > > > > Thanks
> > > > >
> > > >
> > >
> >
>

Re: MOB integration

Posted by Nicolas Paris <ni...@gmail.com>.
Not having access cluster for few days, but I will be looking
to logs.

However, when looking at your logs, it seems that I mispell
my mlob dir in the first post. It was "mobdir".
The /apps/hbase/data/mobdir/ is nearly empty, sizing 4 or 10 kb

Would this confirm the mob flushing process wouldn't be activated ?



2018-05-19 18:38 GMT+02:00 Ted Yu <yu...@gmail.com>:

> If you have a chance to look at region server log, you would see some line
> such as the following:
>
> 2018-05-19 16:31:23,548 INFO  [MemStoreFlusher.0] regionserver.HMobStore:
> Renaming flushed file from
> hdfs://mycluster/apps/hbase/data/mobdir/.tmp/
> 28e252d7f013973174750d483d358fa020180519dd8e7c3d67814eb0b5fb06fb9e800377
> to
> hdfs://mycluster/apps/hbase/data/mobdir/data/default/
> IntegrationTestIngestWithMOB/e9b5d936e7f55a4f1c3246a8d5ce53c2/test_cf/
> 28e252d7f013973174750d483d358fa020180519dd8e7c3d67814eb0b5fb06fb9e800377
>
> Meaning Mob store file is first saved under /apps/hbase/data/mobdir/.tmp/ ,
> then renamed to under the usual location under region directory for the
> table.
>
> From high level, as long as you can query what you ingested, you can be
> assured that Mob data is persisted.
>
> Cheers
>
> On Sat, May 19, 2018 at 8:43 AM, Nicolas Paris <ni...@gmail.com>
> wrote:
>
> > Hi
> >
> > ​Yes the data comes back as expected.
> > My table is not called "mlob" however since I found such folder
> > I thought it was storing mob objects.
> >
> > I do have 500 folder hashed as you mentionned. They contains the
> > whole dataset (2TO)
> > However, how beeing sure the data is actually stored as MOB (and not
> > as traditional binary)
> >
> > Thanks
> >
> >
> > 2018-05-19 15:59 GMT+02:00 Ted Yu <yu...@gmail.com>:
> >
> > > bq. look into hdfs hbase/data/mlob
> > >
> > > Is 'mlob' name of your table ?
> > >
> > > bq. nearly empty folder
> > >
> > > Here is listing under a one region table:
> > >
> > > drwxr-xr-x   - hbase hdfs          0 2018-05-16 23:51
> > > /apps/hbase/data/data/default/atlas_janus/.tabledesc
> > > drwxr-xr-x   - hbase hdfs          0 2018-05-16 23:51
> > > /apps/hbase/data/data/default/atlas_janus/.tmp
> > > drwxr-xr-x   - hbase hdfs          0 2018-05-17 00:55
> > > /apps/hbase/data/data/default/atlas_janus/
> 8033ea259cb7272d43bc137ca0ab29
> > 06
> > >
> > > Not sure if the above matches your description of being nearly empty.
> > > Here data is stored under 8033ea259cb7272d43bc137ca0ab2906
> > >
> > > If you query the table, does the data come back as expected ?
> > >
> > > Thanks
> > >
> > > On Sat, May 19, 2018 at 12:23 AM, Nicolas Paris <ni...@gmail.com>
> > > wrote:
> > >
> > > > Hi
> > > >
> > > >
> > > > I am using hbase 1.1 and hive 1.2
> > > >
> > > > I created an hbase table with a mob column with the default
> > > > threshold (100K)
> > > > I mapped the table into hive with a binary format, and loaded
> > > > 20M of pdf of size between 50k and 20mb
> > > >
> > > > Apparently the mob is not populated because when I look into
> > > > hdfs hbase/data/mlob, it is a nearly empty folder.
> > > >
> > > > Does it mean hive cannot populate hbase mob columns  ?
> > > >
> > > > Thanks
> > > >
> > >
> >
>

Re: MOB integration

Posted by Ted Yu <yu...@gmail.com>.
If you have a chance to look at region server log, you would see some line
such as the following:

2018-05-19 16:31:23,548 INFO  [MemStoreFlusher.0] regionserver.HMobStore:
Renaming flushed file from
hdfs://mycluster/apps/hbase/data/mobdir/.tmp/28e252d7f013973174750d483d358fa020180519dd8e7c3d67814eb0b5fb06fb9e800377
to
hdfs://mycluster/apps/hbase/data/mobdir/data/default/IntegrationTestIngestWithMOB/e9b5d936e7f55a4f1c3246a8d5ce53c2/test_cf/28e252d7f013973174750d483d358fa020180519dd8e7c3d67814eb0b5fb06fb9e800377

Meaning Mob store file is first saved under /apps/hbase/data/mobdir/.tmp/ ,
then renamed to under the usual location under region directory for the
table.

From high level, as long as you can query what you ingested, you can be
assured that Mob data is persisted.

Cheers

On Sat, May 19, 2018 at 8:43 AM, Nicolas Paris <ni...@gmail.com> wrote:

> Hi
>
> ​Yes the data comes back as expected.
> My table is not called "mlob" however since I found such folder
> I thought it was storing mob objects.
>
> I do have 500 folder hashed as you mentionned. They contains the
> whole dataset (2TO)
> However, how beeing sure the data is actually stored as MOB (and not
> as traditional binary)
>
> Thanks
>
>
> 2018-05-19 15:59 GMT+02:00 Ted Yu <yu...@gmail.com>:
>
> > bq. look into hdfs hbase/data/mlob
> >
> > Is 'mlob' name of your table ?
> >
> > bq. nearly empty folder
> >
> > Here is listing under a one region table:
> >
> > drwxr-xr-x   - hbase hdfs          0 2018-05-16 23:51
> > /apps/hbase/data/data/default/atlas_janus/.tabledesc
> > drwxr-xr-x   - hbase hdfs          0 2018-05-16 23:51
> > /apps/hbase/data/data/default/atlas_janus/.tmp
> > drwxr-xr-x   - hbase hdfs          0 2018-05-17 00:55
> > /apps/hbase/data/data/default/atlas_janus/8033ea259cb7272d43bc137ca0ab29
> 06
> >
> > Not sure if the above matches your description of being nearly empty.
> > Here data is stored under 8033ea259cb7272d43bc137ca0ab2906
> >
> > If you query the table, does the data come back as expected ?
> >
> > Thanks
> >
> > On Sat, May 19, 2018 at 12:23 AM, Nicolas Paris <ni...@gmail.com>
> > wrote:
> >
> > > Hi
> > >
> > >
> > > I am using hbase 1.1 and hive 1.2
> > >
> > > I created an hbase table with a mob column with the default
> > > threshold (100K)
> > > I mapped the table into hive with a binary format, and loaded
> > > 20M of pdf of size between 50k and 20mb
> > >
> > > Apparently the mob is not populated because when I look into
> > > hdfs hbase/data/mlob, it is a nearly empty folder.
> > >
> > > Does it mean hive cannot populate hbase mob columns  ?
> > >
> > > Thanks
> > >
> >
>

Re: MOB integration

Posted by Nicolas Paris <ni...@gmail.com>.
Hi

​Yes the data comes back as expected.
My table is not called "mlob" however since I found such folder
I thought it was storing mob objects.

I do have 500 folder hashed as you mentionned. They contains the
whole dataset (2TO)
However, how beeing sure the data is actually stored as MOB (and not
as traditional binary)

Thanks


2018-05-19 15:59 GMT+02:00 Ted Yu <yu...@gmail.com>:

> bq. look into hdfs hbase/data/mlob
>
> Is 'mlob' name of your table ?
>
> bq. nearly empty folder
>
> Here is listing under a one region table:
>
> drwxr-xr-x   - hbase hdfs          0 2018-05-16 23:51
> /apps/hbase/data/data/default/atlas_janus/.tabledesc
> drwxr-xr-x   - hbase hdfs          0 2018-05-16 23:51
> /apps/hbase/data/data/default/atlas_janus/.tmp
> drwxr-xr-x   - hbase hdfs          0 2018-05-17 00:55
> /apps/hbase/data/data/default/atlas_janus/8033ea259cb7272d43bc137ca0ab2906
>
> Not sure if the above matches your description of being nearly empty.
> Here data is stored under 8033ea259cb7272d43bc137ca0ab2906
>
> If you query the table, does the data come back as expected ?
>
> Thanks
>
> On Sat, May 19, 2018 at 12:23 AM, Nicolas Paris <ni...@gmail.com>
> wrote:
>
> > Hi
> >
> >
> > I am using hbase 1.1 and hive 1.2
> >
> > I created an hbase table with a mob column with the default
> > threshold (100K)
> > I mapped the table into hive with a binary format, and loaded
> > 20M of pdf of size between 50k and 20mb
> >
> > Apparently the mob is not populated because when I look into
> > hdfs hbase/data/mlob, it is a nearly empty folder.
> >
> > Does it mean hive cannot populate hbase mob columns  ?
> >
> > Thanks
> >
>

Re: MOB integration

Posted by Ted Yu <yu...@gmail.com>.
bq. look into hdfs hbase/data/mlob

Is 'mlob' name of your table ?

bq. nearly empty folder

Here is listing under a one region table:

drwxr-xr-x   - hbase hdfs          0 2018-05-16 23:51
/apps/hbase/data/data/default/atlas_janus/.tabledesc
drwxr-xr-x   - hbase hdfs          0 2018-05-16 23:51
/apps/hbase/data/data/default/atlas_janus/.tmp
drwxr-xr-x   - hbase hdfs          0 2018-05-17 00:55
/apps/hbase/data/data/default/atlas_janus/8033ea259cb7272d43bc137ca0ab2906

Not sure if the above matches your description of being nearly empty.
Here data is stored under 8033ea259cb7272d43bc137ca0ab2906

If you query the table, does the data come back as expected ?

Thanks

On Sat, May 19, 2018 at 12:23 AM, Nicolas Paris <ni...@gmail.com> wrote:

> Hi
>
>
> I am using hbase 1.1 and hive 1.2
>
> I created an hbase table with a mob column with the default
> threshold (100K)
> I mapped the table into hive with a binary format, and loaded
> 20M of pdf of size between 50k and 20mb
>
> Apparently the mob is not populated because when I look into
> hdfs hbase/data/mlob, it is a nearly empty folder.
>
> Does it mean hive cannot populate hbase mob columns  ?
>
> Thanks
>