You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "guocity (via GitHub)" <gi...@apache.org> on 2024/04/01 20:53:43 UTC

[I] pyarrow data type error cause lot of dependent project crash [arrow]

guocity opened a new issue, #40926:
URL: https://github.com/apache/arrow/issues/40926

   ### Describe the bug, including details regarding any error messages, version, and platform.
   
   several project depends on pyarrow crash on type conversation
   
   - Data Wrangler
   - Polars
   
   
   df = pl.DataFrame(pd)
   
   ArrowInvalid: Could not convert '230' with type str: tried to convert to double
   
   ### Component(s)
   
   Python


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Re: [I] [Python] pyarrow data type error cause lot of dependent project crash [arrow]

Posted by "AlenkaF (via GitHub)" <gi...@apache.org>.
AlenkaF commented on issue #40926:
URL: https://github.com/apache/arrow/issues/40926#issuecomment-2036535343

   In case you have any follow-up questions, do not hesitate to re-open the issue.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Re: [I] [Python] pyarrow data type error cause lot of dependent project crash [arrow]

Posted by "guocity (via GitHub)" <gi...@apache.org>.
guocity commented on issue #40926:
URL: https://github.com/apache/arrow/issues/40926#issuecomment-2034915856

   thanks, 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Re: [I] pyarrow data type error cause lot of dependent project crash [arrow]

Posted by "AlenkaF (via GitHub)" <gi...@apache.org>.
AlenkaF commented on issue #40926:
URL: https://github.com/apache/arrow/issues/40926#issuecomment-2031299788

   Could you share a concrete reproducible example so we can see where the issue might be?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Re: [I] [Python] pyarrow data type error cause lot of dependent project crash [arrow]

Posted by "AlenkaF (via GitHub)" <gi...@apache.org>.
AlenkaF closed issue #40926: [Python] pyarrow data type error cause lot of dependent project crash
URL: https://github.com/apache/arrow/issues/40926


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Re: [I] [Python] pyarrow data type error cause lot of dependent project crash [arrow]

Posted by "guocity (via GitHub)" <gi...@apache.org>.
guocity commented on issue #40926:
URL: https://github.com/apache/arrow/issues/40926#issuecomment-2032554089

   I have this dataset, when the string is mixed with float datatype, pandas will read as mixed datatype but not pyarrow,
   therefore it breaks Data wrangler, and ability convert pd to polars, or export as parquet 
   https://drive.google.com/file/d/1uxKSIqZ7IUEQ1Qp_LgB2qSAZDQEKteJl/view?usp=sharing


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Re: [I] [Python] pyarrow data type error cause lot of dependent project crash [arrow]

Posted by "AlenkaF (via GitHub)" <gi...@apache.org>.
AlenkaF commented on issue #40926:
URL: https://github.com/apache/arrow/issues/40926#issuecomment-2032622821

   Yes, Arrow array (or column) has one data type by [definition](https://arrow.apache.org/docs/dev/format/Columnar.html#terminology). Defining the column (with mixed float and string) as a string should solve the problem?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org