You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@sqoop.apache.org by Ken Krugler <kk...@transpac.com> on 2011/09/20 22:35:40 UTC
Controlling both transaction size and load during
During an import from a large table, we want to avoid using too many mappers, as that would put too much load on the database.
However that winds up generating very large transactions, e.g. 30M+ rows per request.
Which in turn can cause a transaction timeout, if it takes longer than about 3000 seconds.
Is there any way to control both the load (number of parallel requests) and the size of each request?
Thanks,
-- Ken
--------------------------
Ken Krugler
+1 530-210-6378
http://bixolabs.com
custom big data solutions & training
Hadoop, Cascading, Mahout & Solr