Posted to dev@parquet.apache.org by "Hatem Helal (JIRA)" <ji...@apache.org> on 2018/12/21 13:58:00 UTC

[jira] [Updated] (PARQUET-1481) [C++] SEGV when reading corrupt parquet file

     [ https://issues.apache.org/jira/browse/PARQUET-1481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hatem Helal updated PARQUET-1481:
---------------------------------
    Attachment: corrupt.parquet

> [C++] SEGV when reading corrupt parquet file
> --------------------------------------------
>
>                 Key: PARQUET-1481
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1481
>             Project: Parquet
>          Issue Type: Bug
>            Reporter: Hatem Helal
>            Assignee: Hatem Helal
>            Priority: Major
>         Attachments: corrupt.parquet
>
>
> >>> import pyarrow.parquet as pq
> >>> pq.read_table('corrupt.parquet')
> fish: 'python' terminated by signal SIGSEGV (Address boundary error)
>
> Stack report from macOS:
>
> 0 libsystem_kernel.dylib 0x00007fff51164cee __psynch_cvwait + 10
> 1 libsystem_pthread.dylib 0x00007fff512a1662 _pthread_cond_wait + 732
> 2 libc++.1.dylib 0x00007fff4f04acb0 std::__1::condition_variable::wait(std::__1::unique_lock<std::__1::mutex>&) + 18
> 3 libc++.1.dylib 0x00007fff4f04b728 std::__1::__assoc_sub_state::__sub_wait(std::__1::unique_lock<std::__1::mutex>&) + 46
> 4 libparquet.11.dylib 0x0000000115512d00 std::__1::__assoc_state<arrow::Status>::move() + 48
> 5 libparquet.11.dylib 0x00000001154faa15 parquet::arrow::FileReader::Impl::ReadTable(std::__1::vector<int, std::__1::allocator<int> > const&, std::__1::shared_ptr<arrow::Table>*) + 1093
> 6 libparquet.11.dylib 0x00000001154fb6fe parquet::arrow::FileReader::Impl::ReadTable(std::__1::shared_ptr<arrow::Table>*) + 350
> 7 libparquet.11.dylib 0x00000001154fce47 parquet::arrow::FileReader::ReadTable(std::__1::shared_ptr<arrow::Table>*) + 23
> 8 _parquet.so 0x000000011598d97b __pyx_pw_7pyarrow_8_parquet_13ParquetReader_9read_all(_object*, _object*, _object*) + 1035
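
Because the SIGSEGV above kills the interpreter that calls read_table, the reproduction can be run in a child process so the signal is observed instead of taking down the session. This is only a minimal sketch: the helper name run_isolated is hypothetical and not part of pyarrow or parquet-cpp, it assumes a POSIX system (where a negative return code means the child was killed by a signal), and it assumes pyarrow is installed and corrupt.parquet is in the working directory.

```python
import signal
import subprocess
import sys


def run_isolated(snippet):
    """Run a Python snippet in a child interpreter and describe how it exited.

    Hypothetical helper for illustration only. Returns ("signal", NAME) if the
    child was killed by a signal, otherwise ("exit", return_code).
    """
    proc = subprocess.run(
        [sys.executable, "-c", snippet],
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
    )
    if proc.returncode < 0:
        # On POSIX, subprocess reports death by signal N as return code -N.
        return "signal", signal.Signals(-proc.returncode).name
    return "exit", proc.returncode


# Usage (assumes pyarrow is installed and the attachment is saved locally):
#   run_isolated("import pyarrow.parquet as pq; pq.read_table('corrupt.parquet')")
```

With the attached file, the call in the usage comment would be expected to report a signal rather than a normal exit as long as the bug is present; once the reader rejects the corrupt file cleanly, it should report an ordinary non-zero exit from the raised Python exception instead.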



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)