You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@sqoop.apache.org by Ken Krugler <kk...@transpac.com> on 2011/09/01 00:47:44 UTC

Support for partitioning during export into HDFS

Hi there,

For maximum performance when pulling data, it seems like we'd want to run multiple Sqoops in parallel against the available partitions in a table.

That would require adding 'PARTITION <partition_name> to the select statement, something like:

select * from <table_name> PARTITION <partition_name> where <condition>;

1. Does this make sense, both for general Sqoop and specifically OraOop?

2. Is there a way to do this now, or would Sqoop (and OraOop) need to be extended?

Thanks,

-- Ken

--------------------------
Ken Krugler
+1 530-210-6378
http://bixolabs.com
custom data mining solutions