You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2021/07/22 19:25:05 UTC

[GitHub] [arrow] jvanalstine opened a new issue #10784: Are there line parsers in the C++ lib?

jvanalstine opened a new issue #10784:
URL: https://github.com/apache/arrow/issues/10784


   Greetings,
   
   I have been parsing my text files via
   ```
     int64_t buf_size = 65536;
   
     arrow::MemoryPool* pool = arrow::default_memory_pool();
     ARROW_RETURN_NOT_OK(arrow::jemalloc_memory_pool(&pool));
   
     ARROW_ASSIGN_OR_RAISE(auto text_readable,
                           arrow::io::ReadableFile::Open(text_file, pool));
     ARROW_ASSIGN_OR_RAISE(auto buffered_text_stream,
                           arrow::io::BufferedInputStream::Create(buf_size, pool, text_readable));
     ARROW_ASSIGN_OR_RAISE(auto gzip_codec,
                           arrow::util::Codec::Create(arrow::Compression::GZIP));
     ARROW_ASSIGN_OR_RAISE(auto compressed_text_stream,
                           arrow::io::CompressedInputStream::Make(gzip_codec.get(), buffered_text_stream));
   ```
   and then using `reinterpret_cast<const char*>(buf->data());` to parse the buffer into lines. 
   
   Is there a utility that i'm overlooking in the library that will give me an iterator parsing a text file line by line?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [arrow] jvanalstine closed issue #10784: Are there line parsers in the C++ lib?

Posted by GitBox <gi...@apache.org>.
jvanalstine closed issue #10784:
URL: https://github.com/apache/arrow/issues/10784


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org