You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Chamikara Jayalath (JIRA)" <ji...@apache.org> on 2017/11/17 09:59:00 UTC

[jira] [Created] (BEAM-3217) Add a performance test for HadoopInputFormatIO

Chamikara Jayalath created BEAM-3217:
----------------------------------------

             Summary: Add a performance test for HadoopInputFormatIO
                 Key: BEAM-3217
                 URL: https://issues.apache.org/jira/browse/BEAM-3217
             Project: Beam
          Issue Type: Test
          Components: sdk-java-extensions
            Reporter: Chamikara Jayalath
            Assignee: Chamikara Jayalath


We should add a large scale performance test for HadoopInputFormatIO. We should use PerfKitBenchmarker based performance testing framework [1] to manage Kubernetes based muti-node data store and to publish benchmark results.

Example input format implementation to use: DBInputFormat to connect to a Postgres instance.
https://github.com/hanborq/hadoop/blob/master/src/mapred/org/apache/hadoop/mapreduce/lib/db/DBInputFormat.java
 
Example docker image to use: https://hub.docker.com/_/postgres/

[1] https://beam.apache.org/documentation/io/testing/



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)