You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flume.apache.org by Muhammad Ehsan ul Haque <m....@gmail.com> on 2014/02/08 12:42:30 UTC

Re: Review Request 17867: FLUME-2309 Adds consume order feature in the spooling directory source to allow files to be consumed in the order of oldest, youngest or random.

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/17867/
-----------------------------------------------------------

(Updated Feb. 8, 2014, 11:42 a.m.)


Review request for Flume and Hari Shreedharan.


Changes
-------

Updated summary to include JIRA Issue ID.
Updated diff to become visible in the show diff.


Summary (updated)
-----------------

FLUME-2309 Adds consume order feature in the spooling directory source to allow files to be consumed in the order of oldest, youngest or random.


Repository: flume-git


Description
-------

This patch provides.
* A consume order feature in the Spooling directory source, which will allow the users to explicitly state in which order; oldest, youngest or randomly files should be consumed from the spooling directory.
* Fixes the old implementation of selecting the file from spooling directory. Previously, each file to be consumed was selected by sorting, which might become extremly time consuming if there are many files (of the order of 10K or more). The new implementation instead do a linear scan in case when the consume order is oldest or youngest.
* Updates the Flume user guide accordingly.


Diffs (updated)
-----

  flume-ng-core/src/main/java/org/apache/flume/client/avro/ReliableSpoolingFileEventReader.java a88ed6e 
  flume-ng-core/src/main/java/org/apache/flume/source/SpoolDirectorySource.java f42ed2d 
  flume-ng-core/src/main/java/org/apache/flume/source/SpoolDirectorySourceConfigurationConstants.java 83522c0 
  flume-ng-core/src/test/java/org/apache/flume/client/avro/TestReliableSpoolingFileEventReader.java 9d708c1 
  flume-ng-core/src/test/java/org/apache/flume/source/TestSpoolDirectorySource.java 503ab4d 
  flume-ng-doc/sphinx/FlumeUserGuide.rst d120a74 

Diff: https://reviews.apache.org/r/17867/diff/


Testing
-------

Tested without specifying a consume order, by default oldest was used.
Tested with specifying oldest, youngest and random consume order.
Tested with large number of files, random order is fastest.


Thanks,

Muhammad Ehsan ul Haque


Re: Review Request 17867: FLUME-2309 Adds consume order feature in the spooling directory source to allow files to be consumed in the order of oldest, youngest or random.

Posted by Muhammad Ehsan ul Haque <m....@gmail.com>.
-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/17867/
-----------------------------------------------------------

(Updated Feb. 8, 2014, 12:58 p.m.)


Review request for Flume and Hari Shreedharan.


Changes
-------

Linked with JIRA ID


Bugs: FLUME-2309
    https://issues.apache.org/jira/browse/FLUME-2309


Repository: flume-git


Description
-------

This patch provides.
* A consume order feature in the Spooling directory source, which will allow the users to explicitly state in which order; oldest, youngest or randomly files should be consumed from the spooling directory.
* Fixes the old implementation of selecting the file from spooling directory. Previously, each file to be consumed was selected by sorting, which might become extremly time consuming if there are many files (of the order of 10K or more). The new implementation instead do a linear scan in case when the consume order is oldest or youngest.
* Updates the Flume user guide accordingly.


Diffs
-----

  flume-ng-core/src/main/java/org/apache/flume/client/avro/ReliableSpoolingFileEventReader.java a88ed6e 
  flume-ng-core/src/main/java/org/apache/flume/source/SpoolDirectorySource.java f42ed2d 
  flume-ng-core/src/main/java/org/apache/flume/source/SpoolDirectorySourceConfigurationConstants.java 83522c0 
  flume-ng-core/src/test/java/org/apache/flume/client/avro/TestReliableSpoolingFileEventReader.java 9d708c1 
  flume-ng-core/src/test/java/org/apache/flume/source/TestSpoolDirectorySource.java 503ab4d 
  flume-ng-doc/sphinx/FlumeUserGuide.rst d120a74 

Diff: https://reviews.apache.org/r/17867/diff/


Testing
-------

Tested without specifying a consume order, by default oldest was used.
Tested with specifying oldest, youngest and random consume order.
Tested with large number of files, random order is fastest.


Thanks,

Muhammad Ehsan ul Haque