You are viewing a plain text version of this content. The canonical link for it is here.
- Re: parquet checksum coverage - posted by Micah Kornfield <em...@gmail.com> on 2022/12/01 05:53:02 UTC, 2 replies.
- [jira] [Assigned] (PARQUET-1404) [C++] Add index pages to the format to support efficient page skipping to parquet-cpp - posted by "Gang Wu (Jira)" <ji...@apache.org> on 2022/12/01 14:29:00 UTC, 0 replies.
- [jira] [Commented] (PARQUET-1404) [C++] Add index pages to the format to support efficient page skipping to parquet-cpp - posted by "Gang Wu (Jira)" <ji...@apache.org> on 2022/12/01 14:31:00 UTC, 3 replies.
- [jira] [Commented] (PARQUET-2216) Parquet writer classes don't close underlying output stream in case of errors. - posted by "Andrei Lopukhov (Jira)" <ji...@apache.org> on 2022/12/02 09:47:00 UTC, 1 replies.
- [GitHub] [parquet-mr] mr1716 opened a new pull request, #1013: Upgrade Jackson - posted by GitBox <gi...@apache.org> on 2022/12/02 15:23:49 UTC, 0 replies.
- [GitHub] [parquet-mr] mr1716 commented on pull request #1013: Upgrade Jackson - posted by GitBox <gi...@apache.org> on 2022/12/02 15:24:16 UTC, 0 replies.
- [GitHub] [parquet-mr] mr1716 closed pull request #1013: Upgrade Jackson - posted by GitBox <gi...@apache.org> on 2022/12/02 15:24:16 UTC, 0 replies.
- [GitHub] [parquet-mr] mr1716 commented on pull request #1005: PARQUET-2198 : Updating jackson data bind version to fix CVEs - posted by GitBox <gi...@apache.org> on 2022/12/02 15:24:55 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2198) Vulnerabilities in jackson-databind - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/02 15:25:00 UTC, 1 replies.
- [GitHub] [parquet-mr] shangxinli commented on a diff in pull request #968: PARQUET-2149: Async IO implementation for ParquetFileReader - posted by GitBox <gi...@apache.org> on 2022/12/03 16:54:50 UTC, 2 replies.
- [jira] [Commented] (PARQUET-2149) Implement async IO for Parquet file reader - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/03 16:55:00 UTC, 4 replies.
- [GitHub] [parquet-mr] shangxinli commented on pull request #968: PARQUET-2149: Async IO implementation for ParquetFileReader - posted by GitBox <gi...@apache.org> on 2022/12/03 17:31:16 UTC, 0 replies.
- [GitHub] [parquet-mr] shangxinli commented on pull request #1011: PARQUET-2159: java17 vector parquet bit-packing decode optimization - posted by GitBox <gi...@apache.org> on 2022/12/03 18:10:32 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2159) Parquet bit-packing de/encode optimization - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/03 18:11:00 UTC, 9 replies.
- [GitHub] [parquet-mr] shangxinli commented on pull request #985: PARQUET-2173. Fix parquet build against hadoop 3.3.3+ - posted by GitBox <gi...@apache.org> on 2022/12/03 18:31:38 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2173) Fix parquet build against hadoop 3.3.3+ - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/03 18:32:00 UTC, 3 replies.
- [GitHub] [parquet-mr] shangxinli commented on pull request #988: PARQUET-1711: Break circular dependencies in proto definitions - posted by GitBox <gi...@apache.org> on 2022/12/03 18:34:19 UTC, 0 replies.
- [jira] [Commented] (PARQUET-1711) [parquet-protobuf] stack overflow when work with well known json type - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/03 18:35:00 UTC, 1 replies.
- [GitHub] [parquet-mr] shangxinli closed pull request #988: PARQUET-1711: Break circular dependencies in proto definitions - posted by GitBox <gi...@apache.org> on 2022/12/03 18:35:41 UTC, 0 replies.
- [GitHub] [parquet-mr] shangxinli merged pull request #991: PARQUET-2177: Fix parquet-cli not to fail showing descriptions - posted by GitBox <gi...@apache.org> on 2022/12/03 18:36:21 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2177) Fix parquet-cli not to fail showing descriptions - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/03 18:37:00 UTC, 0 replies.
- [GitHub] [parquet-mr] shangxinli commented on pull request #993: PARQUET-2184: Improve the allocation behavior of SnappyCompressor - posted by GitBox <gi...@apache.org> on 2022/12/03 18:37:47 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2184) Improve SnappyCompressor buffer expansion performance - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/03 18:38:00 UTC, 1 replies.
- [GitHub] [parquet-mr] shangxinli merged pull request #1005: PARQUET-2198 : Updating jackson data bind version to fix CVEs - posted by GitBox <gi...@apache.org> on 2022/12/03 18:39:00 UTC, 0 replies.
- [GitHub] [parquet-mr] shangxinli commented on a diff in pull request #1008: PARQUET-2212: Add ByteBuffer api for decryptors to allow direct memory to be decrypted - posted by GitBox <gi...@apache.org> on 2022/12/03 19:00:38 UTC, 1 replies.
- [jira] [Commented] (PARQUET-2212) Add ByteBuffer api for decryptors to allow direct memory to be decrypted - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/03 19:01:00 UTC, 2 replies.
- [GitHub] [parquet-mr] shangxinli merged pull request #1009: PARQUET-2208: Add details to nested column encryption config doc and exception text - posted by GitBox <gi...@apache.org> on 2022/12/03 19:10:36 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2208) Add details to nested column encryption config doc and exception text - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/03 19:11:00 UTC, 0 replies.
- [GitHub] [parquet-mr] shangxinli commented on pull request #960: Performance optimization: Move all LittleEndianDataInputStream functionality into ByteBufferInputStream - posted by GitBox <gi...@apache.org> on 2022/12/03 19:30:06 UTC, 0 replies.
- [jira] [Updated] (PARQUET-2217) Support gorilla encoding for float numbers - posted by "Frank Dai (Jira)" <ji...@apache.org> on 2022/12/04 22:45:00 UTC, 0 replies.
- [jira] [Created] (PARQUET-2217) Support gorilla encoding for float numbers - posted by "Frank Dai (Jira)" <ji...@apache.org> on 2022/12/04 22:45:00 UTC, 0 replies.
- [GitHub] [parquet-mr] panbingkun closed pull request #974: PARQUET-2156: Column bloom filter: Show bloom filters in tools - posted by GitBox <gi...@apache.org> on 2022/12/05 01:27:00 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2156) Column bloom filter: Show bloom filters in tools - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/05 01:28:00 UTC, 0 replies.
- [GitHub] [parquet-mr] abaranec commented on pull request #993: PARQUET-2184: Improve the allocation behavior of SnappyCompressor - posted by GitBox <gi...@apache.org> on 2022/12/05 14:43:01 UTC, 0 replies.
- [GitHub] [parquet-mr] parthchandra commented on a diff in pull request #968: PARQUET-2149: Async IO implementation for ParquetFileReader - posted by GitBox <gi...@apache.org> on 2022/12/05 18:33:06 UTC, 0 replies.
- [GitHub] [parquet-mr] parthchandra commented on a diff in pull request #1008: PARQUET-2212: Add ByteBuffer api for decryptors to allow direct memory to be decrypted - posted by GitBox <gi...@apache.org> on 2022/12/05 18:49:02 UTC, 0 replies.
- [GitHub] [parquet-format] emkornfield commented on pull request #185: PARQUET-1222: [Format] Add deails about sort order to README.md - posted by GitBox <gi...@apache.org> on 2022/12/06 07:05:31 UTC, 0 replies.
- [jira] [Commented] (PARQUET-1222) Specify a well-defined sorting order for float and double types - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/06 07:06:00 UTC, 8 replies.
- [GitHub] [parquet-mr] wgtmac commented on a diff in pull request #1011: PARQUET-2159: java17 vector parquet bit-packing decode optimization - posted by GitBox <gi...@apache.org> on 2022/12/06 08:54:28 UTC, 0 replies.
- [GitHub] [parquet-mr] wgtmac commented on pull request #985: PARQUET-2173. Fix parquet build against hadoop 3.3.3+ - posted by GitBox <gi...@apache.org> on 2022/12/06 09:12:21 UTC, 0 replies.
- [GitHub] [parquet-mr] steveloughran commented on pull request #985: PARQUET-2173. Fix parquet build against hadoop 3.3.3+ - posted by GitBox <gi...@apache.org> on 2022/12/06 09:50:38 UTC, 0 replies.
- [GitHub] [parquet-mr] ggershinsky commented on pull request #985: PARQUET-2173. Fix parquet build against hadoop 3.3.3+ - posted by GitBox <gi...@apache.org> on 2022/12/06 12:05:41 UTC, 0 replies.
- [GitHub] [parquet-format] pitrou commented on a diff in pull request #185: PARQUET-1222: [Format] Add details about sort order to README.md - posted by GitBox <gi...@apache.org> on 2022/12/06 14:50:51 UTC, 0 replies.
- [GitHub] [parquet-format] emkornfield commented on a diff in pull request #185: PARQUET-1222: [Format] Add details about sort order to README.md - posted by GitBox <gi...@apache.org> on 2022/12/07 05:31:24 UTC, 0 replies.
- [GitHub] [parquet-format] emkornfield commented on pull request #185: PARQUET-1222: [Format] Add details about sort order to README.md - posted by GitBox <gi...@apache.org> on 2022/12/07 05:31:49 UTC, 1 replies.
- [GitHub] [parquet-format] pitrou merged pull request #185: PARQUET-1222: [Format] Add details about sort order to README.md - posted by GitBox <gi...@apache.org> on 2022/12/07 07:54:57 UTC, 0 replies.
- [GitHub] [parquet-format] pitrou commented on pull request #185: PARQUET-1222: [Format] Add details about sort order to README.md - posted by GitBox <gi...@apache.org> on 2022/12/07 07:55:04 UTC, 1 replies.
- [GitHub] [parquet-format] pitrou commented on pull request #186: MINOR: Update parquet.thrift to fix a typo - posted by GitBox <gi...@apache.org> on 2022/12/07 08:05:56 UTC, 0 replies.
- [GitHub] [parquet-format] gszadovszky commented on pull request #186: MINOR: Update parquet.thrift to fix a typo - posted by GitBox <gi...@apache.org> on 2022/12/07 08:09:18 UTC, 0 replies.
- [GitHub] [parquet-format] pitrou merged pull request #176: MINOR: Typo in parquet.thrift - posted by GitBox <gi...@apache.org> on 2022/12/07 08:13:53 UTC, 0 replies.
- [GitHub] [parquet-format] pitrou merged pull request #186: MINOR: Update parquet.thrift to fix a typo - posted by GitBox <gi...@apache.org> on 2022/12/07 08:14:53 UTC, 0 replies.
- [jira] [Assigned] (PARQUET-1222) Specify a well-defined sorting order for float and double types - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/12/07 14:10:00 UTC, 0 replies.
- [jira] [Resolved] (PARQUET-1222) Specify a well-defined sorting order for float and double types - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/12/07 14:11:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-1222) Specify a well-defined sorting order for float and double types - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/12/07 14:11:00 UTC, 0 replies.
- [GitHub] [parquet-format] gszadovszky commented on pull request #185: PARQUET-1222: [Format] Add details about sort order to README.md - posted by GitBox <gi...@apache.org> on 2022/12/07 17:39:20 UTC, 0 replies.
- Re: [Format] Clarifying Sort Order Requirements for Floating Points and Logical Types - posted by Micah Kornfield <em...@gmail.com> on 2022/12/07 17:43:19 UTC, 0 replies.
- [GitHub] [parquet-format] emkornfield commented on a diff in pull request #184: PARQUET-758: Add Float16/Half-float logical type - posted by GitBox <gi...@apache.org> on 2022/12/07 17:55:13 UTC, 0 replies.
- [GitHub] [parquet-format] emkornfield commented on pull request #184: PARQUET-758: Add Float16/Half-float logical type - posted by GitBox <gi...@apache.org> on 2022/12/07 17:55:28 UTC, 1 replies.
- [jira] [Commented] (PARQUET-758) [Format] HALF precision FLOAT Logical type - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/07 17:56:00 UTC, 4 replies.
- [jira] [Updated] (PARQUET-2201) Add Stress test for RecordReader SkipRecords - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/07 21:33:00 UTC, 0 replies.
- [GitHub] [parquet-format] anjakefala commented on pull request #184: PARQUET-758: Add Float16/Half-float logical type - posted by GitBox <gi...@apache.org> on 2022/12/07 22:07:49 UTC, 1 replies.
- [GitHub] [parquet-format] emkornfield commented on pull request #182: Fix typo under "Unsigned Integers" - posted by GitBox <gi...@apache.org> on 2022/12/08 05:38:13 UTC, 0 replies.
- [GitHub] [parquet-format] emkornfield merged pull request #182: Fix typo under "Unsigned Integers" - posted by GitBox <gi...@apache.org> on 2022/12/08 05:38:35 UTC, 0 replies.
- [jira] [Assigned] (PARQUET-2204) TypedColumnReaderImpl::Skip should reuse scratch space - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/12/08 15:42:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-2204) TypedColumnReaderImpl::Skip should reuse scratch space - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/12/08 15:42:00 UTC, 0 replies.
- [jira] [Resolved] (PARQUET-2204) TypedColumnReaderImpl::Skip should reuse scratch space - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/12/08 15:43:00 UTC, 0 replies.
- [GitHub] [parquet-mr] jiangjiguang commented on a diff in pull request #1011: PARQUET-2159: java17 vector parquet bit-packing decode optimization - posted by GitBox <gi...@apache.org> on 2022/12/09 08:16:29 UTC, 3 replies.
- [jira] [Assigned] (PARQUET-2075) Unified Rewriter Tool - posted by "Gang Wu (Jira)" <ji...@apache.org> on 2022/12/09 15:39:00 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2075) Unified Rewriter Tool - posted by "Gang Wu (Jira)" <ji...@apache.org> on 2022/12/09 15:40:00 UTC, 18 replies.
- [GitHub] [parquet-mr] jackzhangsir commented on pull request #936: PARQUET-2101: Fix wrong descriptions about the default block size - posted by GitBox <gi...@apache.org> on 2022/12/12 09:20:31 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2101) Fix wrong descriptions about the default block size - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/12 09:21:00 UTC, 0 replies.
- [GitHub] [parquet-format] pitrou commented on pull request #126: PARQUET-1539: Clarify CRC checksum in page header - posted by GitBox <gi...@apache.org> on 2022/12/13 09:50:57 UTC, 5 replies.
- [jira] [Commented] (PARQUET-1539) Clarify CRC checksum in page header - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/13 09:51:00 UTC, 9 replies.
- [GitHub] [parquet-format] mapleFU commented on pull request #126: PARQUET-1539: Clarify CRC checksum in page header - posted by GitBox <gi...@apache.org> on 2022/12/13 11:40:13 UTC, 2 replies.
- [jira] [Commented] (PARQUET-1629) Page-level CRC checksum verification for DataPageV2 - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/12/13 12:27:00 UTC, 0 replies.
- [GitHub] [parquet-format] wgtmac commented on pull request #126: PARQUET-1539: Clarify CRC checksum in page header - posted by GitBox <gi...@apache.org> on 2022/12/13 14:21:22 UTC, 0 replies.
- [jira] [Created] (PARQUET-2218) [Format] Clarify CRC computation - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/12/13 14:39:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-2218) [Format] Clarify CRC computation - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/12/13 14:40:00 UTC, 0 replies.
- [GitHub] [parquet-format] pitrou opened a new pull request, #188: PARQUET-2218: [Format] Clarify CRC computation - posted by GitBox <gi...@apache.org> on 2022/12/13 14:43:04 UTC, 0 replies.
- [GitHub] [parquet-format] pitrou commented on pull request #188: PARQUET-2218: [Format] Clarify CRC computation - posted by GitBox <gi...@apache.org> on 2022/12/13 14:43:31 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2218) [Format] Clarify CRC computation - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/13 14:44:00 UTC, 2 replies.
- [GitHub] [parquet-format] mapleFU commented on pull request #188: PARQUET-2218: [Format] Clarify CRC computation - posted by GitBox <gi...@apache.org> on 2022/12/13 14:55:35 UTC, 0 replies.
- [GitHub] [parquet-mr] jiangjiguang commented on pull request #1011: PARQUET-2159: java17 vector parquet bit-packing decode optimization - posted by GitBox <gi...@apache.org> on 2022/12/14 02:40:48 UTC, 1 replies.
- [GitHub] [parquet-mr] jatin-bhateja commented on a diff in pull request #1011: PARQUET-2159: java17 vector parquet bit-packing decode optimization - posted by GitBox <gi...@apache.org> on 2022/12/14 05:49:21 UTC, 1 replies.
- [GitHub] [parquet-mr] wgtmac opened a new pull request, #1014: PARQUET-2075: Implement ParquetRewriter - posted by GitBox <gi...@apache.org> on 2022/12/14 07:47:30 UTC, 0 replies.
- [GitHub] [parquet-mr] wgtmac commented on pull request #1014: PARQUET-2075: Implement unified file rewriter - posted by GitBox <gi...@apache.org> on 2022/12/14 15:52:36 UTC, 0 replies.
- Canceled event: Parquet Sync @ Tue Dec 27, 2022 8:30am - 9:30am (PST) (dev@parquet.apache.org) - posted by sh...@uber.com.INVALID on 2022/12/14 16:24:19 UTC, 0 replies.
- How the parquet-mr community support java17 and is compatible with java8 using any versions of jdks by default - posted by jiangjiguang719 <ji...@163.com> on 2022/12/15 02:25:54 UTC, 0 replies.
- [jira] [Created] (PARQUET-2219) ParquetFileReader throws a runtime exception when a file contains only headers and now row data - posted by "chris stockton (Jira)" <ji...@apache.org> on 2022/12/16 04:49:00 UTC, 0 replies.
- [jira] [Assigned] (PARQUET-2196) Support LZ4_RAW codec - posted by "Gang Wu (Jira)" <ji...@apache.org> on 2022/12/16 05:06:00 UTC, 0 replies.
- [jira] [Resolved] (PARQUET-2196) Support LZ4_RAW codec - posted by "Gang Wu (Jira)" <ji...@apache.org> on 2022/12/16 05:06:00 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2219) ParquetFileReader throws a runtime exception when a file contains only headers and now row data - posted by "Gang Wu (Jira)" <ji...@apache.org> on 2022/12/16 05:19:00 UTC, 1 replies.
- [GitHub] [parquet-mr] vectorijk opened a new pull request, #1015: add support re-encryption in ColumnEncryptor - posted by GitBox <gi...@apache.org> on 2022/12/18 00:20:55 UTC, 0 replies.
- [GitHub] [parquet-mr] shangxinli commented on a diff in pull request #1014: PARQUET-2075: Implement unified file rewriter - posted by GitBox <gi...@apache.org> on 2022/12/24 22:12:10 UTC, 6 replies.
- [GitHub] [parquet-mr] shangxinli commented on pull request #1014: PARQUET-2075: Implement unified file rewriter - posted by GitBox <gi...@apache.org> on 2022/12/24 22:58:15 UTC, 1 replies.
- [GitHub] [parquet-mr] ggershinsky commented on pull request #1014: PARQUET-2075: Implement unified file rewriter - posted by GitBox <gi...@apache.org> on 2022/12/26 05:44:04 UTC, 0 replies.
- [GitHub] [parquet-mr] ggershinsky commented on a diff in pull request #1014: PARQUET-2075: Implement unified file rewriter - posted by GitBox <gi...@apache.org> on 2022/12/27 07:45:09 UTC, 2 replies.
- [jira] [Created] (PARQUET-2220) Parquet Filter predicate storing nested string causing OOM's - posted by "Abhishek Jain (Jira)" <ji...@apache.org> on 2022/12/29 12:52:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-2220) Parquet Filter predicate storing nested string causing OOM's - posted by "Abhishek Jain (Jira)" <ji...@apache.org> on 2022/12/29 12:53:00 UTC, 2 replies.
- [GitHub] [parquet-mr] dongjoon-hyun commented on pull request #975: PARQUET-2157: add bloom filter fpp config - posted by GitBox <gi...@apache.org> on 2022/12/30 05:31:12 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2157) Add BloomFilter fpp config - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/30 05:32:00 UTC, 0 replies.
- [jira] [Assigned] (PARQUET-2157) Add BloomFilter fpp config - posted by "Huaxin Gao (Jira)" <ji...@apache.org> on 2022/12/30 05:44:00 UTC, 0 replies.
- [jira] [Resolved] (PARQUET-2157) Add BloomFilter fpp config - posted by "Huaxin Gao (Jira)" <ji...@apache.org> on 2022/12/30 05:55:00 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2220) Parquet Filter predicate storing nested string causing OOM's - posted by "Abhishek Jain (Jira)" <ji...@apache.org> on 2022/12/30 07:12:00 UTC, 2 replies.
- [GitHub] [parquet-mr] wgtmac commented on a diff in pull request #1014: PARQUET-2075: Implement unified file rewriter - posted by GitBox <gi...@apache.org> on 2022/12/30 13:30:45 UTC, 2 replies.