You are viewing a plain text version of this content. The canonical link for it is here.
- Re: Add FilteredPageReader to filter rows based on page statistics - posted by Micah Kornfield <em...@gmail.com> on 2022/11/01 05:58:06 UTC, 2 replies.
- [jira] [Created] (PARQUET-2211) [C++] Print ColumnMetaData.encoding_stats field - posted by "Gang Wu (Jira)" <ji...@apache.org> on 2022/11/01 06:01:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-2211) [C++] Print ColumnMetaData.encoding_stats field - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/11/01 06:05:00 UTC, 0 replies.
- [GitHub] [parquet-mr] ggershinsky commented on pull request #995: PARQUET-1711: support recursive proto schemas by limiting recursion depth - posted by GitBox <gi...@apache.org> on 2022/11/01 07:04:22 UTC, 0 replies.
- [jira] [Commented] (PARQUET-1711) [parquet-protobuf] stack overflow when work with well known json type - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/11/01 07:05:00 UTC, 5 replies.
- [GitHub] [parquet-mr] jinyius commented on pull request #995: PARQUET-1711: support recursive proto schemas by limiting recursion depth - posted by GitBox <gi...@apache.org> on 2022/11/01 15:50:38 UTC, 0 replies.
- [GitHub] [parquet-mr] emkornfield commented on pull request #995: PARQUET-1711: support recursive proto schemas by limiting recursion depth - posted by GitBox <gi...@apache.org> on 2022/11/01 18:06:38 UTC, 0 replies.
- [jira] [Assigned] (PARQUET-2209) [C++] Optimize skip for the case that number of values to skip equals page size - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/11/02 10:54:00 UTC, 0 replies.
- [jira] [Resolved] (PARQUET-2209) [C++] Optimize skip for the case that number of values to skip equals page size - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/11/02 10:54:00 UTC, 0 replies.
- Re: Modular encryption to support arrays and nested arrays - posted by nicolas paris <ni...@riseup.net> on 2022/11/02 14:17:43 UTC, 0 replies.
- [GitHub] [parquet-mr] shangxinli commented on pull request #995: PARQUET-1711: support recursive proto schemas by limiting recursion depth - posted by GitBox <gi...@apache.org> on 2022/11/02 14:44:16 UTC, 0 replies.
- [GitHub] [parquet-mr] shangxinli merged pull request #995: PARQUET-1711: support recursive proto schemas by limiting recursion depth - posted by GitBox <gi...@apache.org> on 2022/11/02 14:45:36 UTC, 0 replies.
- [GitHub] [parquet-mr] shangxinli commented on pull request #1000: PARQUET-2196: Support LZ4_RAW codec - posted by GitBox <gi...@apache.org> on 2022/11/02 14:50:50 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2196) Support LZ4_RAW codec - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/11/02 14:51:00 UTC, 1 replies.
- [GitHub] [parquet-mr] shangxinli merged pull request #1000: PARQUET-2196: Support LZ4_RAW codec - posted by GitBox <gi...@apache.org> on 2022/11/02 16:32:13 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR - posted by "Sabarishan (Jira)" <ji...@apache.org> on 2022/11/03 03:07:00 UTC, 1 replies.
- [GitHub] [parquet-format] emkornfield opened a new pull request, #185: PARQUET-1222: [Format] Add deails about sort order to README.md - posted by GitBox <gi...@apache.org> on 2022/11/05 04:34:50 UTC, 0 replies.
- [jira] [Commented] (PARQUET-1222) Specify a well-defined sorting order for float and double types - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/11/05 04:35:00 UTC, 1 replies.
- [Format] Clarifying Sort Order Requirements for Floating Points and Logical Types - posted by Micah Kornfield <em...@gmail.com> on 2022/11/05 04:54:09 UTC, 0 replies.
- [jira] [Resolved] (PARQUET-2211) [C++] Print ColumnMetaData.encoding_stats field - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/11/06 14:57:00 UTC, 0 replies.
- [jira] [Assigned] (PARQUET-2211) [C++] Print ColumnMetaData.encoding_stats field - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/11/06 14:58:00 UTC, 0 replies.
- [GitHub] [parquet-mr] shangxinli commented on pull request #998: PARQUET-2195: Add scan command to parquet-cli - posted by GitBox <gi...@apache.org> on 2022/11/07 17:20:44 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2195) Add scan command to parquet-cli - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/11/07 17:21:00 UTC, 1 replies.
- [GitHub] [parquet-mr] shangxinli merged pull request #998: PARQUET-2195: Add scan command to parquet-cli - posted by GitBox <gi...@apache.org> on 2022/11/07 17:21:23 UTC, 0 replies.
- [GitHub] [parquet-mr] shangxinli commented on pull request #1005: PARQUET-2198 : Updating jackson data bind version to fix CVEs - posted by GitBox <gi...@apache.org> on 2022/11/07 17:22:46 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2198) Vulnerabilities in jackson-databind - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/11/07 17:23:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-2210) Skip pages based on header metadata using a callback - posted by "fatemah (Jira)" <ji...@apache.org> on 2022/11/07 21:35:00 UTC, 3 replies.
- [GitHub] [parquet-mr] wgtmac commented on a diff in pull request #957: PARQUET-2069: Allow list and array record types to be compatible. - posted by GitBox <gi...@apache.org> on 2022/11/08 06:25:21 UTC, 0 replies.
- [GitHub] [parquet-mr] wzx140 commented on a diff in pull request #520: PARQUET-1410: Refactor modules to use the new logical type API - posted by GitBox <gi...@apache.org> on 2022/11/08 15:12:48 UTC, 1 replies.
- [jira] [Commented] (PARQUET-1410) Refactor modules to use the new logical type API - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/11/08 15:13:00 UTC, 1 replies.
- [GitHub] [parquet-format] emkornfield commented on pull request #185: PARQUET-1222: [Format] Add deails about sort order to README.md - posted by GitBox <gi...@apache.org> on 2022/11/08 20:01:56 UTC, 0 replies.
- [jira] [Updated] (PARQUET-2210) [C++] Skip pages based on header metadata using a callback - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/11/09 11:29:00 UTC, 0 replies.
- [jira] [Created] (PARQUET-2212) Add ByteBuffer api for decryptors to allow direct memory to be decrypted - posted by "Parth Chandra (Jira)" <ji...@apache.org> on 2022/11/09 19:22:00 UTC, 0 replies.
- [GitHub] [parquet-mr] parthchandra opened a new pull request, #1008: PARQUET-2212: Add ByteBuffer api for decryptors to allow direct memory to be decrypted - posted by GitBox <gi...@apache.org> on 2022/11/09 19:55:23 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2212) Add ByteBuffer api for decryptors to allow direct memory to be decrypted - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/11/09 19:56:00 UTC, 11 replies.
- [GitHub] [parquet-mr] parthchandra commented on pull request #1008: PARQUET-2212: Add ByteBuffer api for decryptors to allow direct memory to be decrypted - posted by GitBox <gi...@apache.org> on 2022/11/09 19:59:32 UTC, 1 replies.
- [GitHub] [parquet-mr] wgtmac commented on a diff in pull request #959: PARQUET-2126: Make cached (de)compressors thread-safe - posted by GitBox <gi...@apache.org> on 2022/11/10 07:42:54 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2126) Thread safety bug in CodecFactory - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/11/10 07:43:00 UTC, 0 replies.
- [GitHub] [parquet-mr] ggershinsky commented on pull request #1008: PARQUET-2212: Add ByteBuffer api for decryptors to allow direct memory to be decrypted - posted by GitBox <gi...@apache.org> on 2022/11/10 13:16:53 UTC, 1 replies.
- [jira] [Created] (PARQUET-2213) Add an alternative InputFile.newStream that allow an input range - posted by "Chao Sun (Jira)" <ji...@apache.org> on 2022/11/10 18:07:00 UTC, 0 replies.
- [GitHub] [parquet-mr] sunchao opened a new pull request, #1010: PARQUET-2213: add InputFile.newStream with a read range - posted by GitBox <gi...@apache.org> on 2022/11/10 18:09:25 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2213) Add an alternative InputFile.newStream that allow an input range - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/11/10 18:10:00 UTC, 9 replies.
- [GitHub] [parquet-mr] wgtmac commented on a diff in pull request #960: Performance optimization: Move all LittleEndianDataInputStream functionality into ByteBufferInputStream - posted by GitBox <gi...@apache.org> on 2022/11/11 15:35:37 UTC, 0 replies.
- [GitHub] [parquet-mr] wgtmac commented on a diff in pull request #1010: PARQUET-2213: add InputFile.newStream with a read range - posted by GitBox <gi...@apache.org> on 2022/11/11 15:43:43 UTC, 0 replies.
- [GitHub] [parquet-mr] renshangtao closed pull request #987: Parquet-MR Encryption - Modify to true to encrypt - posted by GitBox <gi...@apache.org> on 2022/11/13 11:02:32 UTC, 0 replies.
- [GitHub] [parquet-format] Jimexist opened a new pull request, #186: Update parquet.thrift to fix a typo - posted by GitBox <gi...@apache.org> on 2022/11/14 03:45:07 UTC, 0 replies.
- parquet checksum coverage - posted by Steve Loughran <st...@cloudera.com.INVALID> on 2022/11/14 11:38:40 UTC, 0 replies.
- [GitHub] [parquet-mr] steveloughran commented on pull request #1010: PARQUET-2213: add InputFile.newStream with a read range - posted by GitBox <gi...@apache.org> on 2022/11/14 11:48:45 UTC, 2 replies.
- [GitHub] [parquet-mr] steveloughran commented on a diff in pull request #1010: PARQUET-2213: add InputFile.newStream with a read range - posted by GitBox <gi...@apache.org> on 2022/11/14 12:35:41 UTC, 2 replies.
- [GitHub] [parquet-mr] ggershinsky commented on a diff in pull request #1008: PARQUET-2212: Add ByteBuffer api for decryptors to allow direct memory to be decrypted - posted by GitBox <gi...@apache.org> on 2022/11/14 14:02:24 UTC, 4 replies.
- [GitHub] [parquet-mr] sunchao commented on a diff in pull request #1010: PARQUET-2213: add InputFile.newStream with a read range - posted by GitBox <gi...@apache.org> on 2022/11/14 16:47:00 UTC, 0 replies.
- [GitHub] [parquet-mr] theosib-amazon commented on a diff in pull request #960: Performance optimization: Move all LittleEndianDataInputStream functionality into ByteBufferInputStream - posted by GitBox <gi...@apache.org> on 2022/11/14 17:41:51 UTC, 3 replies.
- [GitHub] [parquet-mr] parthchandra commented on a diff in pull request #1008: PARQUET-2212: Add ByteBuffer api for decryptors to allow direct memory to be decrypted - posted by GitBox <gi...@apache.org> on 2022/11/14 21:05:57 UTC, 1 replies.
- [jira] [Commented] (PARQUET-1647) [Java] support for Arrow's float16 - posted by "JAVIER ANDRES RECASENS SANCHEZ (Jira)" <ji...@apache.org> on 2022/11/15 00:27:00 UTC, 0 replies.
- [GitHub] [parquet-mr] jiangjiguang opened a new pull request, #1011: PARQUET-2159: java17 vector parquet bit-packing decode optimization - posted by GitBox <gi...@apache.org> on 2022/11/15 07:32:17 UTC, 0 replies.
- [jira] [Commented] (PARQUET-2159) Parquet bit-packing de/encode optimization - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/11/15 07:33:00 UTC, 10 replies.
- [jira] [Commented] (PARQUET-632) Parquet file in invalid state while writing to S3 from EMR - posted by "Emil Kleszcz (Jira)" <ji...@apache.org> on 2022/11/15 11:35:00 UTC, 0 replies.
- [jira] [Resolved] (PARQUET-2206) Microbenchmark for ColumnReadaer ReadBatch and Skip - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/11/15 16:20:00 UTC, 0 replies.
- [jira] [Assigned] (PARQUET-2206) Microbenchmark for ColumnReadaer ReadBatch and Skip - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/11/15 16:20:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-2206) Microbenchmark for ColumnReadaer ReadBatch and Skip - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/11/15 16:20:00 UTC, 0 replies.
- [GitHub] [parquet-mr] sunchao commented on pull request #1010: PARQUET-2213: add InputFile.newStream with a read range - posted by GitBox <gi...@apache.org> on 2022/11/15 17:06:53 UTC, 0 replies.
- [jira] [Created] (PARQUET-2214) Support re-encryption in ColumnEncryptor - posted by "Kai Jiang (Jira)" <ji...@apache.org> on 2022/11/17 04:45:00 UTC, 0 replies.
- [GitHub] [parquet-mr] Jimexist opened a new pull request, #1012: Update README.md to reflect thrift 0.17 - posted by GitBox <gi...@apache.org> on 2022/11/17 13:12:43 UTC, 0 replies.
- [GitHub] [parquet-mr] wgtmac commented on pull request #968: PARQUET-2149: Async IO implementation for ParquetFileReader - posted by GitBox <gi...@apache.org> on 2022/11/17 16:37:50 UTC, 2 replies.
- [jira] [Commented] (PARQUET-2149) Implement async IO for Parquet file reader - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/11/17 16:38:00 UTC, 9 replies.
- [GitHub] [parquet-mr] parthchandra commented on pull request #968: PARQUET-2149: Async IO implementation for ParquetFileReader - posted by GitBox <gi...@apache.org> on 2022/11/17 18:29:31 UTC, 4 replies.
- [GitHub] [parquet-mr] jiangjiguang commented on pull request #1011: PARQUET-2159: java17 vector parquet bit-packing decode optimization - posted by GitBox <gi...@apache.org> on 2022/11/20 11:02:19 UTC, 5 replies.
- [jira] [Created] (PARQUET-2215) Document how DELTA_BINARY_PACKED handles overflow for deltas - posted by "Rok Mihevc (Jira)" <ji...@apache.org> on 2022/11/22 19:19:00 UTC, 0 replies.
- [jira] [Updated] (PARQUET-2215) Document how DELTA_BINARY_PACKED handles overflow for deltas - posted by "Rok Mihevc (Jira)" <ji...@apache.org> on 2022/11/22 19:20:00 UTC, 0 replies.
- [jira] [Assigned] (PARQUET-2215) Document how DELTA_BINARY_PACKED handles overflow for deltas - posted by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/11/23 08:35:00 UTC, 0 replies.
- [GitHub] [parquet-format] pitrou commented on pull request #187: PARQUET-2215: [Format] Document overflow handling in DELTA_BINARY_PACKED - posted by GitBox <gi...@apache.org> on 2022/11/23 08:37:57 UTC, 1 replies.
- [jira] [Commented] (PARQUET-2215) Document how DELTA_BINARY_PACKED handles overflow for deltas - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/11/23 08:38:00 UTC, 7 replies.
- [GitHub] [parquet-format] rok commented on a diff in pull request #187: PARQUET-2215: [Format] Document overflow handling in DELTA_BINARY_PACKED - posted by GitBox <gi...@apache.org> on 2022/11/23 09:05:53 UTC, 2 replies.
- [GitHub] [parquet-format] pitrou commented on a diff in pull request #187: PARQUET-2215: [Format] Document overflow handling in DELTA_BINARY_PACKED - posted by GitBox <gi...@apache.org> on 2022/11/23 09:31:46 UTC, 1 replies.
- [GitHub] [parquet-mr] wgtmac commented on a diff in pull request #968: PARQUET-2149: Async IO implementation for ParquetFileReader - posted by GitBox <gi...@apache.org> on 2022/11/23 15:57:10 UTC, 0 replies.
- [GitHub] [parquet-mr] parthchandra commented on a diff in pull request #968: PARQUET-2149: Async IO implementation for ParquetFileReader - posted by GitBox <gi...@apache.org> on 2022/11/23 19:34:26 UTC, 0 replies.
- [GitHub] [parquet-mr] gszadovszky commented on pull request #1011: PARQUET-2159: java17 vector parquet bit-packing decode optimization - posted by GitBox <gi...@apache.org> on 2022/11/24 08:07:14 UTC, 3 replies.
- [GitHub] [parquet-format] pitrou merged pull request #187: PARQUET-2215: [Format] Document overflow handling in DELTA_BINARY_PACKED - posted by GitBox <gi...@apache.org> on 2022/11/25 08:47:34 UTC, 0 replies.
- [jira] [Assigned] (PARQUET-2159) Parquet bit-packing de/encode optimization - posted by "Gabor Szadovszky (Jira)" <ji...@apache.org> on 2022/11/25 13:15:00 UTC, 0 replies.
- [jira] [Created] (PARQUET-2216) Parquet writer classes don't close underlying output stream in case of errors. - posted by "Andrei Lopukhov (Jira)" <ji...@apache.org> on 2022/11/25 13:46:00 UTC, 0 replies.