Posted to commits@nutch.apache.org by sn...@apache.org on 2014/12/08 20:44:28 UTC

svn commit: r1643899 - in /nutch/branches/2.x: CHANGES.txt src/java/org/apache/nutch/crawl/GeneratorJob.java

Author: snagel
Date: Mon Dec  8 19:44:27 2014
New Revision: 1643899

URL: http://svn.apache.org/r1643899
Log:
NUTCH-1829 Generator : unable to distinguish real errors

Modified:
    nutch/branches/2.x/CHANGES.txt
    nutch/branches/2.x/src/java/org/apache/nutch/crawl/GeneratorJob.java

Modified: nutch/branches/2.x/CHANGES.txt
URL: http://svn.apache.org/viewvc/nutch/branches/2.x/CHANGES.txt?rev=1643899&r1=1643898&r2=1643899&view=diff
==============================================================================
--- nutch/branches/2.x/CHANGES.txt (original)
+++ nutch/branches/2.x/CHANGES.txt Mon Dec  8 19:44:27 2014
@@ -2,6 +2,8 @@ Nutch Change Log
 
 Current Development 2.3-SNAPSHOT
 
+* NUTCH-1829 Generator : unable to distinguish real errors (Mathieu Bouchard, jnioche, snagel)
+
 * NUTCH-1778 Generator not logging number of URLs in batch correctly (jnioche via snagel)
 
 * NUTCH-1877 Suffix URL filter to ignore query string by default (markus via snagel)

Modified: nutch/branches/2.x/src/java/org/apache/nutch/crawl/GeneratorJob.java
URL: http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/crawl/GeneratorJob.java?rev=1643899&r1=1643898&r2=1643899&view=diff
==============================================================================
--- nutch/branches/2.x/src/java/org/apache/nutch/crawl/GeneratorJob.java (original)
+++ nutch/branches/2.x/src/java/org/apache/nutch/crawl/GeneratorJob.java Mon Dec  8 19:44:27 2014
@@ -239,6 +239,9 @@ public class GeneratorJob extends NutchT
     long generateCount = (Long) results.get(GENERATE_COUNT);
     LOG.info("GeneratorJob: finished at " + sdf.format(finish) + ", time elapsed: " + TimingUtil.elapsedTime(start, finish));
     LOG.info("GeneratorJob: generated batch id: " + batchId + " containing " + generateCount + " URLs");
+    if (generateCount == 0) {
+      return null;
+    }
     return batchId;
   }
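
With this change, the method shown in the hunk returns null instead of a batch id when generateCount is 0. The following is a minimal sketch of how a caller might use that null to tell "nothing to generate" apart from a real failure. It is not the code from this commit: the class name GeneratorExitCodeSketch, the stand-in generate() method, the simulateFailure flag, and the exit codes 0 / 1 / -1 are all assumptions made for illustration.

```java
import java.util.UUID;

public class GeneratorExitCodeSketch {

  /**
   * Hypothetical stand-in for the patched method: it yields a batch id,
   * or null when no URLs were generated (mirrors the added null check).
   */
  static String generate(long generateCount) {
    String batchId = UUID.randomUUID().toString();
    if (generateCount == 0) {
      return null;
    }
    return batchId;
  }

  /**
   * Assumed exit-code mapping (not taken from the commit):
   *   0  = a batch was generated
   *   1  = the job ran fine but there was nothing to generate
   *   -1 = a real error occurred
   */
  static int exitCode(long generateCount, boolean simulateFailure) {
    try {
      if (simulateFailure) {
        // Stands in for any genuine failure, e.g. the storage backend being down.
        throw new IllegalStateException("simulated failure");
      }
      return generate(generateCount) != null ? 0 : 1;
    } catch (RuntimeException e) {
      return -1;
    }
  }

  public static void main(String[] args) {
    System.out.println(exitCode(25, false)); // batch generated
    System.out.println(exitCode(0, false));  // nothing to generate
    System.out.println(exitCode(0, true));   // real error
  }
}
```

A calling script could then stop the crawl loop cleanly on the "nothing to generate" code and only raise an alert on the error code.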