You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2021/01/01 11:39:22 UTC
[GitHub] [arrow] Dandandan opened a new pull request #9068: ARROW-11100: [Rust] Speed up numeric to string cast using lexical_core
Dandandan opened a new pull request #9068:
URL: https://github.com/apache/arrow/pull/9068
Uses lexical_core to speed up num to string casts.
```
cast i64 to string 512 time: [22.209 us 22.462 us 22.885 us]
change: [-38.438% -37.979% -37.154%] (p = 0.00 < 0.05)
Performance has improved.
Found 14 outliers among 100 measurements (14.00%)
11 (11.00%) low mild
3 (3.00%) high severe
Benchmarking cast f32 to string 512: Collecting 100 samples in estimated 5.0698 cast f32 to string 512 time: [25.587 us 25.692 us 25.786 us]
change: [-62.364% -62.215% -62.076%] (p = 0.00 < 0.05)
Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
1 (1.00%) low severe
1 (1.00%) low mild
4 (4.00%) high mild
1 (1.00%) high severe
```
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] jorgecarleitao commented on a change in pull request #9068: ARROW-11100: [Rust] Speed up numeric to string cast using lexical_core
Posted by GitBox <gi...@apache.org>.
jorgecarleitao commented on a change in pull request #9068:
URL: https://github.com/apache/arrow/pull/9068#discussion_r550870918
##########
File path: rust/arrow/src/util/mod.rs
##########
@@ -21,5 +21,6 @@ pub mod display;
pub mod integration_util;
#[cfg(feature = "prettyprint")]
pub mod pretty;
+pub mod serialization;
Review comment:
what do you think about making this `pub(crate)` or `pub(super)`? In general I think that we should avoid exposing public unless needed, so that it is easier to refactor without breaking public apis.
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] codecov-io edited a comment on pull request #9068: ARROW-11100: [Rust] Speed up numeric to string cast using lexical_core
Posted by GitBox <gi...@apache.org>.
codecov-io edited a comment on pull request #9068:
URL: https://github.com/apache/arrow/pull/9068#issuecomment-753307368
# [Codecov](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=h1) Report
> Merging [#9068](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=desc) (48f5522) into [master](https://codecov.io/gh/apache/arrow/commit/cc0ee5efcf9f6a67bcc407a11e8553a0409275c1?el=desc) (cc0ee5e) will **increase** coverage by `0.05%`.
> The diff coverage is `100.00%`.
[![Impacted file tree graph](https://codecov.io/gh/apache/arrow/pull/9068/graphs/tree.svg?width=650&height=150&src=pr&token=LpTCFbqVT1)](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=tree)
```diff
@@ Coverage Diff @@
## master #9068 +/- ##
==========================================
+ Coverage 82.55% 82.61% +0.05%
==========================================
Files 203 204 +1
Lines 50043 49942 -101
==========================================
- Hits 41313 41259 -54
+ Misses 8730 8683 -47
```
| [Impacted Files](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=tree) | Coverage Δ | |
|---|---|---|
| [rust/arrow/src/compute/kernels/cast.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvY29tcHV0ZS9rZXJuZWxzL2Nhc3QucnM=) | `96.99% <100.00%> (ø)` | |
| [rust/arrow/src/csv/writer.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvY3N2L3dyaXRlci5ycw==) | `78.82% <100.00%> (-0.56%)` | :arrow_down: |
| [rust/arrow/src/util/serialization.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvdXRpbC9zZXJpYWxpemF0aW9uLnJz) | `100.00% <100.00%> (ø)` | |
| [...datafusion/src/optimizer/hash\_build\_probe\_order.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9kYXRhZnVzaW9uL3NyYy9vcHRpbWl6ZXIvaGFzaF9idWlsZF9wcm9iZV9vcmRlci5ycw==) | `54.73% <0.00%> (-3.70%)` | :arrow_down: |
| [rust/arrow/src/array/array\_string.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvYXJyYXkvYXJyYXlfc3RyaW5nLnJz) | `90.16% <0.00%> (-0.11%)` | :arrow_down: |
| [rust/datafusion/src/scalar.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9kYXRhZnVzaW9uL3NyYy9zY2FsYXIucnM=) | `58.99% <0.00%> (-0.08%)` | :arrow_down: |
| [rust/arrow/src/array/array\_binary.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvYXJyYXkvYXJyYXlfYmluYXJ5LnJz) | `90.54% <0.00%> (-0.07%)` | :arrow_down: |
| [rust/parquet/src/arrow/schema.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9wYXJxdWV0L3NyYy9hcnJvdy9zY2hlbWEucnM=) | `90.93% <0.00%> (+0.31%)` | :arrow_up: |
| [rust/arrow/src/array/array\_primitive.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvYXJyYXkvYXJyYXlfcHJpbWl0aXZlLnJz) | `92.32% <0.00%> (+0.68%)` | :arrow_up: |
| ... and [2 more](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree-more) | |
------
[Continue to review full report at Codecov](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=continue).
> **Legend** - [Click here to learn more](https://docs.codecov.io/docs/codecov-delta)
> `Δ = absolute <relative> (impact)`, `ø = not affected`, `? = missing data`
> Powered by [Codecov](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=footer). Last update [cc0ee5e...48f5522](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=lastupdated). Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] Dandandan commented on a change in pull request #9068: ARROW-11100: [Rust] Speed up numeric to string cast using lexical_core
Posted by GitBox <gi...@apache.org>.
Dandandan commented on a change in pull request #9068:
URL: https://github.com/apache/arrow/pull/9068#discussion_r550871768
##########
File path: rust/arrow/src/util/mod.rs
##########
@@ -21,5 +21,6 @@ pub mod display;
pub mod integration_util;
#[cfg(feature = "prettyprint")]
pub mod pretty;
+pub mod serialization;
Review comment:
Thanks, makes sense. I think it is indeed best to expose as little as internal code as possible :+1:
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] jorgecarleitao closed pull request #9068: ARROW-11100: [Rust] Speed up numeric to string cast using lexical_core
Posted by GitBox <gi...@apache.org>.
jorgecarleitao closed pull request #9068:
URL: https://github.com/apache/arrow/pull/9068
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] codecov-io edited a comment on pull request #9068: ARROW-11100: [Rust] Speed up numeric to string cast using lexical_core
Posted by GitBox <gi...@apache.org>.
codecov-io edited a comment on pull request #9068:
URL: https://github.com/apache/arrow/pull/9068#issuecomment-753307368
# [Codecov](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=h1) Report
> Merging [#9068](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=desc) (b840784) into [master](https://codecov.io/gh/apache/arrow/commit/cc0ee5efcf9f6a67bcc407a11e8553a0409275c1?el=desc) (cc0ee5e) will **not change** coverage.
> The diff coverage is `100.00%`.
[![Impacted file tree graph](https://codecov.io/gh/apache/arrow/pull/9068/graphs/tree.svg?width=650&height=150&src=pr&token=LpTCFbqVT1)](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=tree)
```diff
@@ Coverage Diff @@
## master #9068 +/- ##
=======================================
Coverage 82.55% 82.55%
=======================================
Files 203 204 +1
Lines 50043 50043
=======================================
Hits 41313 41313
Misses 8730 8730
```
| [Impacted Files](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=tree) | Coverage Δ | |
|---|---|---|
| [rust/arrow/src/compute/kernels/cast.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvY29tcHV0ZS9rZXJuZWxzL2Nhc3QucnM=) | `96.99% <100.00%> (ø)` | |
| [rust/arrow/src/csv/writer.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvY3N2L3dyaXRlci5ycw==) | `78.82% <100.00%> (-0.56%)` | :arrow_down: |
| [rust/arrow/src/util/serialization.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvdXRpbC9zZXJpYWxpemF0aW9uLnJz) | `100.00% <100.00%> (ø)` | |
------
[Continue to review full report at Codecov](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=continue).
> **Legend** - [Click here to learn more](https://docs.codecov.io/docs/codecov-delta)
> `Δ = absolute <relative> (impact)`, `ø = not affected`, `? = missing data`
> Powered by [Codecov](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=footer). Last update [cc0ee5e...b840784](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=lastupdated). Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] codecov-io commented on pull request #9068: ARROW-11100: [Rust] Speed up numeric to string cast using lexical_core
Posted by GitBox <gi...@apache.org>.
codecov-io commented on pull request #9068:
URL: https://github.com/apache/arrow/pull/9068#issuecomment-753307368
# [Codecov](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=h1) Report
> Merging [#9068](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=desc) (05e1901) into [master](https://codecov.io/gh/apache/arrow/commit/cc0ee5efcf9f6a67bcc407a11e8553a0409275c1?el=desc) (cc0ee5e) will **not change** coverage.
> The diff coverage is `100.00%`.
[![Impacted file tree graph](https://codecov.io/gh/apache/arrow/pull/9068/graphs/tree.svg?width=650&height=150&src=pr&token=LpTCFbqVT1)](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=tree)
```diff
@@ Coverage Diff @@
## master #9068 +/- ##
=======================================
Coverage 82.55% 82.55%
=======================================
Files 203 204 +1
Lines 50043 50043
=======================================
Hits 41313 41313
Misses 8730 8730
```
| [Impacted Files](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=tree) | Coverage Δ | |
|---|---|---|
| [rust/arrow/src/compute/kernels/cast.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvY29tcHV0ZS9rZXJuZWxzL2Nhc3QucnM=) | `96.99% <100.00%> (ø)` | |
| [rust/arrow/src/csv/writer.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvY3N2L3dyaXRlci5ycw==) | `78.82% <100.00%> (-0.56%)` | :arrow_down: |
| [rust/arrow/src/util/serialization.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvdXRpbC9zZXJpYWxpemF0aW9uLnJz) | `100.00% <100.00%> (ø)` | |
------
[Continue to review full report at Codecov](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=continue).
> **Legend** - [Click here to learn more](https://docs.codecov.io/docs/codecov-delta)
> `Δ = absolute <relative> (impact)`, `ø = not affected`, `? = missing data`
> Powered by [Codecov](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=footer). Last update [cc0ee5e...b840784](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=lastupdated). Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] github-actions[bot] commented on pull request #9068: ARROW-11100: [Rust] Speed up numeric to string cast using lexical_core
Posted by GitBox <gi...@apache.org>.
github-actions[bot] commented on pull request #9068:
URL: https://github.com/apache/arrow/pull/9068#issuecomment-753306954
https://issues.apache.org/jira/browse/ARROW-11100
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] codecov-io edited a comment on pull request #9068: ARROW-11100: [Rust] Speed up numeric to string cast using lexical_core
Posted by GitBox <gi...@apache.org>.
codecov-io edited a comment on pull request #9068:
URL: https://github.com/apache/arrow/pull/9068#issuecomment-753307368
# [Codecov](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=h1) Report
> Merging [#9068](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=desc) (8929078) into [master](https://codecov.io/gh/apache/arrow/commit/cc0ee5efcf9f6a67bcc407a11e8553a0409275c1?el=desc) (cc0ee5e) will **increase** coverage by `0.05%`.
> The diff coverage is `100.00%`.
[![Impacted file tree graph](https://codecov.io/gh/apache/arrow/pull/9068/graphs/tree.svg?width=650&height=150&src=pr&token=LpTCFbqVT1)](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=tree)
```diff
@@ Coverage Diff @@
## master #9068 +/- ##
==========================================
+ Coverage 82.55% 82.61% +0.05%
==========================================
Files 203 204 +1
Lines 50043 49942 -101
==========================================
- Hits 41313 41258 -55
+ Misses 8730 8684 -46
```
| [Impacted Files](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=tree) | Coverage Δ | |
|---|---|---|
| [rust/arrow/src/compute/kernels/cast.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvY29tcHV0ZS9rZXJuZWxzL2Nhc3QucnM=) | `96.99% <100.00%> (ø)` | |
| [rust/arrow/src/csv/writer.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvY3N2L3dyaXRlci5ycw==) | `78.82% <100.00%> (-0.56%)` | :arrow_down: |
| [rust/arrow/src/util/serialization.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvdXRpbC9zZXJpYWxpemF0aW9uLnJz) | `100.00% <100.00%> (ø)` | |
| [...datafusion/src/optimizer/hash\_build\_probe\_order.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9kYXRhZnVzaW9uL3NyYy9vcHRpbWl6ZXIvaGFzaF9idWlsZF9wcm9iZV9vcmRlci5ycw==) | `54.73% <0.00%> (-3.70%)` | :arrow_down: |
| [rust/parquet/src/encodings/encoding.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9wYXJxdWV0L3NyYy9lbmNvZGluZ3MvZW5jb2RpbmcucnM=) | `95.24% <0.00%> (-0.20%)` | :arrow_down: |
| [rust/arrow/src/array/array\_string.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvYXJyYXkvYXJyYXlfc3RyaW5nLnJz) | `90.16% <0.00%> (-0.11%)` | :arrow_down: |
| [rust/datafusion/src/scalar.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9kYXRhZnVzaW9uL3NyYy9zY2FsYXIucnM=) | `58.99% <0.00%> (-0.08%)` | :arrow_down: |
| [rust/arrow/src/array/array\_binary.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9hcnJvdy9zcmMvYXJyYXkvYXJyYXlfYmluYXJ5LnJz) | `90.54% <0.00%> (-0.07%)` | :arrow_down: |
| [rust/parquet/src/arrow/schema.rs](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree#diff-cnVzdC9wYXJxdWV0L3NyYy9hcnJvdy9zY2hlbWEucnM=) | `90.93% <0.00%> (+0.31%)` | :arrow_up: |
| ... and [3 more](https://codecov.io/gh/apache/arrow/pull/9068/diff?src=pr&el=tree-more) | |
------
[Continue to review full report at Codecov](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=continue).
> **Legend** - [Click here to learn more](https://docs.codecov.io/docs/codecov-delta)
> `Δ = absolute <relative> (impact)`, `ø = not affected`, `? = missing data`
> Powered by [Codecov](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=footer). Last update [cc0ee5e...8929078](https://codecov.io/gh/apache/arrow/pull/9068?src=pr&el=lastupdated). Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org