You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] [Resolved] (PARQUET-2158) Upgrade Hadoop dependency to version 3.2.0 - posted by "Steve Loughran (Jira)" <ji...@apache.org> on 2022/08/01 09:45:00 UTC, 0 replies.
- [GitHub] [parquet-mr] theosib-amazon commented on a diff in pull request #960: Performance optimization: Move all LittleEndianDataInputStream functionality into ByteBufferInputStream - posted by GitBox <gi...@apache.org> on 2022/08/01 14:50:52 UTC, 12 replies.
- [jira] [Commented] (PARQUET-2126) Thread safety bug in CodecFactory - posted by "Timothy Miller (Jira)" <ji...@apache.org> on 2022/08/01 15:49:00 UTC, 0 replies.
- Fail to read back written large parquet file - posted by Jozef Vilcek <jo...@gmail.com> on 2022/08/04 10:08:47 UTC, 4 replies.
- [jira] [Commented] (PARQUET-2160) Close decompression stream to free off-heap memory in time - posted by "Adam Binford (Jira)" <ji...@apache.org> on 2022/08/04 20:18:00 UTC, 17 replies.
- [jira] [Comment Edited] (PARQUET-2160) Close decompression stream to free off-heap memory in time - posted by "Yujiang Zhong (Jira)" <ji...@apache.org> on 2022/08/05 02:51:00 UTC, 7 replies.
- [jira] [Created] (PARQUET-2170) Empty projection returns the wrong number of rows when column index is enabled - posted by "Ivan Sadikov (Jira)" <ji...@apache.org> on 2022/08/05 04:32:00 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2170) Empty projection returns the wrong number of rows when column index is enabled - posted by "Ivan Sadikov (Jira)" <ji...@apache.org> on 2022/08/05 04:33:00 UTC, 0 replies.
- [GitHub] [parquet-mr] parthchandra commented on a diff in pull request #968: PARQUET-2149: Async IO implementation for ParquetFileReader - posted by GitBox <gi...@apache.org> on 2022/08/05 21:34:35 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2149) Implement async IO for Parquet file reader - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/08/05 21:35:00 UTC, 1 replies.
- [GitHub] [parquet-mr] zhongyujiang opened a new pull request, #982: PARQUET-2160: Close ZstdInputStream to free off-heap memory in time. - posted by GitBox <gi...@apache.org> on 2022/08/08 03:01:22 UTC, 0 replies.
- [GitHub] [parquet-mr] zhongyujiang commented on a diff in pull request #982: PARQUET-2160: Close ZstdInputStream to free off-heap memory in time. - posted by GitBox <gi...@apache.org> on 2022/08/08 03:05:03 UTC, 3 replies.
- [GitHub] [parquet-mr] steveloughran opened a new pull request, #983: WiP: parquet to use openfile api and some other performance enhancements - posted by GitBox <gi...@apache.org> on 2022/08/08 10:17:08 UTC, 0 replies.
- [GitHub] [parquet-mr] sunchao commented on a diff in pull request #983: WiP: parquet to use openfile api and some other performance enhancements - posted by GitBox <gi...@apache.org> on 2022/08/08 16:46:26 UTC, 1 replies.
- [jira] [Commented] (PARQUET-2168) Potential bug in ParquetWriteProtocol - posted by "Joy Bestourous (Jira)" <ji...@apache.org> on 2022/08/09 14:01:00 UTC, 0 replies.
- [GitHub] [parquet-mr] steveloughran commented on a diff in pull request #983: WiP: parquet to use openfile api and some other performance enhancements - posted by GitBox <gi...@apache.org> on 2022/08/09 17:54:41 UTC, 1 replies.
- [GitHub] [parquet-mr] iemejia commented on pull request #981: PARQUET-2169: Upgrade Avro to version 1.11.1 - posted by GitBox <gi...@apache.org> on 2022/08/10 17:01:27 UTC, 2 replies.
- [jira] [Commented] (PARQUET-2169) Upgrade Avro to version 1.11.1 - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/08/10 17:02:00 UTC, 4 replies.
- [jira] [Commented] (PARQUET-2171) Implement vectored IO in parquet file format - posted by "Mukund Thakur (Jira)" <ji...@apache.org> on 2022/08/10 22:44:00 UTC, 3 replies.
- [jira] [Created] (PARQUET-2171) Implement vectored IO in parquet file format - posted by "Mukund Thakur (Jira)" <ji...@apache.org> on 2022/08/10 22:44:00 UTC, 0 replies.
- [GitHub] [parquet-mr] dependabot[bot] opened a new pull request, #984: Bump hadoop-common from 3.2.3 to 3.2.4 - posted by GitBox <gi...@apache.org> on 2022/08/11 21:21:36 UTC, 0 replies.
- [GitHub] [parquet-mr] sunchao commented on pull request #981: PARQUET-2169: Upgrade Avro to version 1.11.1 - posted by GitBox <gi...@apache.org> on 2022/08/11 21:46:19 UTC, 0 replies.
- [jira] [Created] (PARQUET-2172) [C++] Make field return const NodePtr& instead of forcing copy of shared_ptr - posted by "Micah Kornfield (Jira)" <ji...@apache.org> on 2022/08/12 06:09:00 UTC, 0 replies.
- [jira] [Assigned] (PARQUET-2172) [C++] Make field return const NodePtr& instead of forcing copy of shared_ptr - posted by "Micah Kornfield (Jira)" <ji...@apache.org> on 2022/08/12 06:09:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-2172) [C++] Make field return const NodePtr& instead of forcing copy of shared_ptr - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/08/12 06:16:00 UTC, 1 replies.
- [GitHub] [parquet-mr] sunchao commented on a diff in pull request #982: PARQUET-2160: Close ZstdInputStream to free off-heap memory in time. - posted by GitBox <gi...@apache.org> on 2022/08/12 23:03:34 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2172) [C++] Make field return const NodePtr& instead of forcing copy of shared_ptr - posted by "Yibo Cai (Jira)" <ji...@apache.org> on 2022/08/13 02:20:00 UTC, 0 replies.
- [jira] [Resolved] (PARQUET-2172) [C++] Make field return const NodePtr& instead of forcing copy of shared_ptr - posted by "Micah Kornfield (Jira)" <ji...@apache.org> on 2022/08/13 03:26:00 UTC, 0 replies.
- [GitHub] [parquet-mr] ggershinsky commented on a diff in pull request #968: PARQUET-2149: Async IO implementation for ParquetFileReader - posted by GitBox <gi...@apache.org> on 2022/08/16 11:32:04 UTC, 0 replies.
- [jira] [Created] (PARQUET-2173) Fix parquet build against hadoop 3.3.3+ - posted by "Steve Loughran (Jira)" <ji...@apache.org> on 2022/08/16 19:35:00 UTC, 0 replies.
- [GitHub] [parquet-mr] steveloughran opened a new pull request, #985: PARQUET-2173. Fix parquet build against hadoop 3.3.3+ - posted by GitBox <gi...@apache.org> on 2022/08/16 19:57:09 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2173) Fix parquet build against hadoop 3.3.3+ - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/08/16 19:58:00 UTC, 3 replies.
- [GitHub] [parquet-mr] steveloughran commented on pull request #985: PARQUET-2173. Fix parquet build against hadoop 3.3.3+ - posted by GitBox <gi...@apache.org> on 2022/08/16 20:04:36 UTC, 0 replies.
- [GitHub] [parquet-mr] theosib-amazon opened a new pull request, #986: Prevent IntelliJ from making unsolicited whitespace changes - posted by GitBox <gi...@apache.org> on 2022/08/17 20:59:22 UTC, 0 replies.
- [GitHub] [parquet-mr] theosib-amazon commented on pull request #986: Prevent IntelliJ from making unsolicited whitespace changes - posted by GitBox <gi...@apache.org> on 2022/08/17 21:03:27 UTC, 0 replies.
- [GitHub] [parquet-mr] parthchandra commented on pull request #986: Prevent IntelliJ from making unsolicited whitespace changes - posted by GitBox <gi...@apache.org> on 2022/08/17 21:21:04 UTC, 0 replies.
- [GitHub] [parquet-mr] gszadovszky merged pull request #981: PARQUET-2169: Upgrade Avro to version 1.11.1 - posted by GitBox <gi...@apache.org> on 2022/08/18 08:58:38 UTC, 0 replies.
- [jira] [Created] (PARQUET-2174) Encrypting the entire table is impossible - posted by "ren shangtao (Jira)" <ji...@apache.org> on 2022/08/18 11:31:00 UTC, 0 replies.
- [GitHub] [parquet-mr] renshangtao opened a new pull request, #987: Modify to true to encrypt - posted by GitBox <gi...@apache.org> on 2022/08/18 12:33:59 UTC, 0 replies.
- [GitHub] [parquet-mr] renshangtao commented on pull request #987: Modify to true to encrypt - posted by GitBox <gi...@apache.org> on 2022/08/18 12:36:01 UTC, 0 replies.
- [GitHub] [parquet-mr] matthieun opened a new pull request, #988: PARQUET-1711: Break circular dependencies in proto definitions - posted by GitBox <gi...@apache.org> on 2022/08/18 20:02:02 UTC, 0 replies.
- [jira] [Commented] (PARQUET-1711) [parquet-protobuf] stack overflow when work with well known json type - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/08/18 20:03:00 UTC, 11 replies.
- [GitHub] [parquet-mr] ggershinsky commented on pull request #987: Parquet-MR Encryption - Modify to true to encrypt - posted by GitBox <gi...@apache.org> on 2022/08/19 05:14:30 UTC, 0 replies.
- [GitHub] [parquet-mr] shangxinli commented on a diff in pull request #982: PARQUET-2160: Close ZstdInputStream to free off-heap memory in time. - posted by GitBox <gi...@apache.org> on 2022/08/21 18:27:41 UTC, 1 replies.
- [GitHub] [parquet-mr] shangxinli commented on a diff in pull request #985: PARQUET-2173. Fix parquet build against hadoop 3.3.3+ - posted by GitBox <gi...@apache.org> on 2022/08/21 18:57:00 UTC, 0 replies.
- [GitHub] [parquet-mr] shangxinli commented on pull request #986: Prevent IntelliJ from making unsolicited whitespace changes - posted by GitBox <gi...@apache.org> on 2022/08/21 18:59:24 UTC, 0 replies.
- [GitHub] [parquet-mr] shangxinli commented on a diff in pull request #988: PARQUET-1711: Break circular dependencies in proto definitions - posted by GitBox <gi...@apache.org> on 2022/08/21 19:09:26 UTC, 4 replies.
- [GitHub] [parquet-mr] shangxinli commented on a diff in pull request #960: Performance optimization: Move all LittleEndianDataInputStream functionality into ByteBufferInputStream - posted by GitBox <gi...@apache.org> on 2022/08/21 21:38:04 UTC, 7 replies.
- [jira] [Updated] (PARQUET-1416) [C++] Deprecate parquet/api/* in favor of simpler public API "parquet/api.h" - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/08/22 11:34:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-1814) [C++] TestInt96ParquetIO failure on Windows - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/08/22 11:35:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-1657) [C++] Change Bloom filter implementation to use xxhash - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/08/22 11:35:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-1859) [C++] Require error message when using ParquetException::EofException - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/08/22 11:35:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-2099) [C++] Statistics::num_values() is misleading - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/08/22 11:35:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-1614) [C++] Reuse arrow::Buffer used as scratch space for decryption in Thrift deserialization hot path - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/08/22 11:36:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-1646) [C++] Use arrow::Buffer for buffered dictionary indices in DictEncoder instead of std::vector - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/08/22 11:36:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-1634) [C++] Factor out data/dictionary page writes to allow for page buffering - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/08/22 11:36:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-1653) [C++] Deprecated BIT_PACKED level decoding is probably incorrect - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/08/22 11:36:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-1430) [C++] Add tests for C++ tools - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/08/22 11:37:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-1199) [C++] Support writing (and test reading) boolean values with RLE encoding - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/08/22 11:37:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-1515) [C++] Disable LZ4 codec - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/08/22 11:37:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-1158) [C++] Basic RowGroup filtering - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/08/22 11:38:00 UTC, 0 replies.
- [GitHub] [parquet-mr] steveloughran commented on a diff in pull request #985: PARQUET-2173. Fix parquet build against hadoop 3.3.3+ - posted by GitBox <gi...@apache.org> on 2022/08/22 15:36:21 UTC, 0 replies.
- [GitHub] [parquet-mr] matthieun commented on a diff in pull request #988: PARQUET-1711: Break circular dependencies in proto definitions - posted by GitBox <gi...@apache.org> on 2022/08/22 16:29:31 UTC, 2 replies.
- [GitHub] [parquet-mr] shangxinli merged pull request #986: Prevent IntelliJ from making unsolicited whitespace changes - posted by GitBox <gi...@apache.org> on 2022/08/23 15:32:15 UTC, 0 replies.
- [jira] [Created] (PARQUET-2175) Skip method skips levels and not rows for repeated fields - posted by "fatemah (Jira)" <ji...@apache.org> on 2022/08/23 21:30:00 UTC, 0 replies.
- Interest in adding the float16 logical type to the Parquet spec - posted by Anja <an...@gmail.com> on 2022/08/23 22:57:12 UTC, 1 replies.
- [jira] [Created] (PARQUET-2176) Parquet writers should allow for configurable index/statistics truncation - posted by "patchwork01 (Jira)" <ji...@apache.org> on 2022/08/24 11:18:00 UTC, 0 replies.
- [GitHub] [parquet-mr] patchwork01 opened a new pull request, #989: PARQUET-2176: Column index/statistics truncation in ParquetWriter - posted by GitBox <gi...@apache.org> on 2022/08/24 11:27:35 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2176) Parquet writers should allow for configurable index/statistics truncation - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/08/24 11:28:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-2176) Parquet writers should allow for configurable index/statistics truncation - posted by "patchwork01 (Jira)" <ji...@apache.org> on 2022/08/24 11:30:00 UTC, 0 replies.
- Skip method skips levels and not rows for repeated fields - posted by Fatemah Panahi <pa...@google.com.INVALID> on 2022/08/24 17:06:20 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2175) Skip method skips levels and not rows for repeated fields - posted by "Micah Kornfield (Jira)" <ji...@apache.org> on 2022/08/24 17:28:00 UTC, 0 replies.
- [GitHub] [parquet-mr] NickCrews commented on pull request #433: PARQUET-1115: Warn users when misusing parquet-tools merge - posted by GitBox <gi...@apache.org> on 2022/08/25 01:29:14 UTC, 0 replies.
- [jira] [Commented] (PARQUET-1115) Warn users when misusing parquet-tools merge - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/08/25 01:30:00 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2142) parquet-cli without hadoop throws java.lang.NoSuchMethodError on any parquet file access command - posted by "Kengo Seki (Jira)" <ji...@apache.org> on 2022/08/25 23:56:00 UTC, 1 replies.
- [jira] [Assigned] (PARQUET-2142) parquet-cli without hadoop throws java.lang.NoSuchMethodError on any parquet file access command - posted by "Kengo Seki (Jira)" <ji...@apache.org> on 2022/08/25 23:57:00 UTC, 0 replies.
- [GitHub] [parquet-mr] sekikn opened a new pull request, #990: PARQUET-2142: Update the parquet-cli document to avoid NoSuchMethodError - posted by GitBox <gi...@apache.org> on 2022/08/26 00:19:20 UTC, 0 replies.
- [jira] [Created] (PARQUET-2177) Fix parquet-cli not to fail showing descriptions - posted by "Kengo Seki (Jira)" <ji...@apache.org> on 2022/08/26 05:05:00 UTC, 0 replies.
- [GitHub] [parquet-mr] sekikn opened a new pull request, #991: PARQUET-2177: Fix parquet-cli not to fail showing descriptions - posted by GitBox <gi...@apache.org> on 2022/08/26 05:11:12 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2177) Fix parquet-cli not to fail showing descriptions - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/08/26 05:12:00 UTC, 0 replies.
- [GitHub] [parquet-format] anjakefala opened a new pull request, #184: PARQUET-758: Add Float16/Half-float logical type - posted by GitBox <gi...@apache.org> on 2022/08/26 21:21:16 UTC, 0 replies.
- [jira] [Commented] (PARQUET-758) HALF precision FLOAT Logical type - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/08/26 21:22:00 UTC, 7 replies.
- [GitHub] [parquet-mr] matthieun commented on pull request #988: PARQUET-1711: Break circular dependencies in proto definitions - posted by GitBox <gi...@apache.org> on 2022/08/26 23:40:12 UTC, 0 replies.
- [jira] [Created] (PARQUET-2178) ParquetReader constructed using builder fails to read encrypted files - posted by "Atul Mohan (Jira)" <ji...@apache.org> on 2022/08/26 23:53:00 UTC, 0 replies.
- [GitHub] [parquet-format] pitrou commented on a diff in pull request #184: PARQUET-758: Add Float16/Half-float logical type - posted by GitBox <gi...@apache.org> on 2022/08/29 08:57:31 UTC, 5 replies.
- [GitHub] [parquet-format] pitrou commented on pull request #184: PARQUET-758: Add Float16/Half-float logical type - posted by GitBox <gi...@apache.org> on 2022/08/29 09:02:33 UTC, 1 replies.
- [jira] [Updated] (PARQUET-758) [Format] HALF precision FLOAT Logical type - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/08/29 14:38:00 UTC, 0 replies.
- [GitHub] [parquet-format] anjakefala commented on a diff in pull request #184: PARQUET-758: Add Float16/Half-float logical type - posted by GitBox <gi...@apache.org> on 2022/08/29 21:48:01 UTC, 0 replies.
- [jira] [Commented] (PARQUET-758) [Format] HALF precision FLOAT Logical type - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/08/29 21:49:00 UTC, 8 replies.
- [GitHub] [parquet-format] emkornfield commented on pull request #184: PARQUET-758: Add Float16/Half-float logical type - posted by GitBox <gi...@apache.org> on 2022/08/30 06:05:36 UTC, 2 replies.
- [GitHub] [parquet-format] emkornfield commented on a diff in pull request #184: PARQUET-758: Add Float16/Half-float logical type - posted by GitBox <gi...@apache.org> on 2022/08/30 06:10:14 UTC, 0 replies.
- [GitHub] [parquet-format] gszadovszky commented on pull request #184: PARQUET-758: Add Float16/Half-float logical type - posted by GitBox <gi...@apache.org> on 2022/08/30 07:53:12 UTC, 2 replies.
- [jira] [Created] (PARQUET-2179) Add a test for skipping repeated fields - posted by "fatemah (Jira)" <ji...@apache.org> on 2022/08/30 18:34:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-2180) make the default behavior for proto writing not-backwards compatible - posted by "J Y (Jira)" <ji...@apache.org> on 2022/08/30 20:15:00 UTC, 1 replies.
- [jira] [Created] (PARQUET-2180) make the default behavior for proto writing not-backwards compatible - posted by "J Y (Jira)" <ji...@apache.org> on 2022/08/30 20:15:00 UTC, 0 replies.
- [GitHub] [parquet-mr] jinyius commented on pull request #988: PARQUET-1711: Break circular dependencies in proto definitions - posted by GitBox <gi...@apache.org> on 2022/08/31 05:11:33 UTC, 0 replies.
- [GitHub] [parquet-mr] jinyius commented on a diff in pull request #988: PARQUET-1711: Break circular dependencies in proto definitions - posted by GitBox <gi...@apache.org> on 2022/08/31 05:18:45 UTC, 0 replies.
- [jira] [Created] (PARQUET-2181) parquet-cli fails at supporting parquet-protobuf generated schemas that have repeated primitives in them - posted by "J Y (Jira)" <ji...@apache.org> on 2022/08/31 05:38:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-2181) parquet-cli fails at supporting parquet-protobuf generated schemas that have repeated primitives in them - posted by "J Y (Jira)" <ji...@apache.org> on 2022/08/31 05:40:00 UTC, 9 replies.
- [jira] [Created] (PARQUET-2182) Handle unknown logical types - posted by "Gabor Szadovszky (Jira)" <ji...@apache.org> on 2022/08/31 05:48:00 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2181) parquet-cli fails at supporting parquet-protobuf generated schemas that have repeated primitives in them - posted by "J Y (Jira)" <ji...@apache.org> on 2022/08/31 06:25:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-2181) parquet-cli fails at supporting parquet-protobuf generated files - posted by "J Y (Jira)" <ji...@apache.org> on 2022/08/31 06:27:00 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2181) parquet-cli fails at supporting parquet-protobuf generated files - posted by "J Y (Jira)" <ji...@apache.org> on 2022/08/31 17:03:00 UTC, 0 replies.