You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Chamikara Jayalath (JIRA)" <ji...@apache.org> on 2017/08/02 18:52:00 UTC

[jira] [Reopened] (BEAM-2708) Decompressing bzip2 files with multiple "streams" only reads the first stream

     [ https://issues.apache.org/jira/browse/BEAM-2708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chamikara Jayalath reopened BEAM-2708:
--------------------------------------
      Assignee: Chamikara Jayalath  (was: Ben Chambers)

Have to check if this applies to Python SDK as well.

> Decompressing bzip2 files with multiple "streams" only reads the first stream
> -----------------------------------------------------------------------------
>
>                 Key: BEAM-2708
>                 URL: https://issues.apache.org/jira/browse/BEAM-2708
>             Project: Beam
>          Issue Type: Bug
>          Components: sdk-java-extensions, sdk-py
>            Reporter: Pablo Estrada
>            Assignee: Chamikara Jayalath
>             Fix For: 2.1.0, 2.2.0
>
>
> I'm not sure which components to file this against. A user has observed that pbzip2 files are not being properly decompressed:
> https://stackoverflow.com/questions/45439117/google-dataflow-only-partly-uncompressing-files-compressed-with-pbzip2



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)