You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@arrow.apache.org by uw...@apache.org on 2019/02/21 09:06:11 UTC
[arrow] branch maint-0.12.x created (now cef7e71)
This is an automated email from the ASF dual-hosted git repository.
uwe pushed a change to branch maint-0.12.x
in repository https://gitbox.apache.org/repos/asf/arrow.git.
at cef7e71 ARROW-4501: Fix out-of-bounds read in DoubleCrcHash
This branch includes the following new commits:
new d5aa7c4 Initial Commit
new cbc56bf Update readme and add license in root.
new fa5f029 ARROW-1: Initial Arrow Code Commit
new 16e44e3 ARROW-3: This patch includes a WIP draft specification document for the physical Arrow memory layout produced over a series of discussions amongst the to-be Arrow committers during late 2015. There are also a few small PNG diagrams that illustrate some of the Arrow layout concepts.
new 23c4b08 ARROW-4: This provides an partial C++11 implementation of the Apache Arrow data structures along with a cmake-based build system. The codebase generally follows Google C++ style guide, but more cleaning to be more conforming is needed. It uses googletest for unit testing.
new 7e76e3a ARROW-5: Update drill-fmpp-maven-plugin to 1.5.0
new e9cc8ce ARROW-5: Correct Apache Maven repo for maven plugin use
new e6905ef ARROW-9: Replace straggler references to Drill
new a385622 ARROW-8: Add .travis.yml and test script for Arrow C++. OS X build fixes
new 8f2ca24 ARROW-13: Add PR merge tool from parquet-mr, suitably modified
new 1000d11 ARROW-36: Remove fixVersions from JIRA resolve code path
new e418020 ARROW-19: Add an externalized MemoryPool interface for use in builder classes
new b88b69e ARROW-20: Add null_count_ member to array containers, remove nullable_ member
new 89c6afd ARROW-21: Implement a simple in-memory Schema data structure
new 307977e ARROW-15: Fix a naming typo for memory.AllocationManager.AllocationOutcome
new 0c95d3c ARROW-10: Fix mismatch of javadoc names and method parameters
new 3b777c7 ARROW-26: Add instructions for enabling Arrow C++ Parquet adapter build
new 9c2b954 ARROW-23: Add a logical Column data structure
new 612fbc7 ARROW-24: C++: Implement a logical Table container type
new 572cdf2 ARROW-7: Add barebones Python library build toolchain
new 8caa287 ARROW-35: Add a short call-to-action in the top level README.md
new 571343b ARROW-9: Rename some unchanged "Drill" to "Arrow" (follow-up)
new 9afb667 ARROW-31: Python: prototype user object model, add PyList conversion path with type inference
new ae95dbd ARROW-44: Python: prototype object model for array slot values ("scalars")
new 45cd9fd ARROW-43: Python: format array values to in __repr__ for interactive computing
new 1650026 ARROW-17: set some vector fields to package level access for Drill compatibility
new 243ed4e ARROW-18: Fix decimal precision and scale in MapWriters
new 31def7d ARROW-51: Add simple ValueVector tests
new e822ea7 ARROW-46: ListVector should initialize bits in allocateNew
new 8367527 ARROW-42: Add Python tests to Travis CI build
new 6fdcd49 ARROW-54: [Python] Rename package to "pyarrow"
new 883c62b ARROW-55: [Python] Fix unit tests in 2.7
new 5881aac ARROW-64: Add zsh support to C++ build scripts
new c996610 ARROW-68: Better error handling for not fully setup systems
new 3a99f39 ARROW-73: Support older CMake versions
new 016b92b ARROW-72: Search for alternative parquet-cpp header
new 4ec034b ARROW-28: Adding google's benchmark library to the toolchain
new 093f9bd ARROW-75: Fix handling of empty strings
new 65db0da ARROW-67: C++ metadata flatbuffer serialization and data movement to memory maps
new a4002c6 ARROW-70: Add adapt 'lite' DCHECK macros from Kudu as also used in Parquet
new fbbee3d ARROW-77: [C++] Conform bitmap interpretation to ARROW-62; 1 for nulls, 0 for non-nulls
new c06b765 ARROW-62: Clarify null bitmap interpretation, indicate bit-endianness, add null count, remove non-nullable physical distinction
new 0a8979d ARROW-37: [C++ / Python] Implement BooleanArray and BooleanBuilder. Handle Python built-in bool
new d3cb6b4 ARROW-22: [C++] Convert flat Parquet schemas to Arrow schemas
new d6d53b2 ARROW-63: [C++] Enable ctest to work on systems with Python 3 as the default Python
new 0171877 ARROW-65: Be less restrictive on PYTHON_LIBRARY search paths
new 1fd0668 ARROW-30: [Python] Routines for converting between arrow::Array/Table and pandas.DataFrame
new ecadd0b ARROW-80: Handle len call for pre-init arrays
new 80ec2c1 ARROW-79: [Python] Add benchmarks
new df7726d ARROW-88: [C++] Refactor usages of parquet_cpp namespace
new 38897ee ARROW-83: [C++] Add basic test infrastructure for DecimalType
new 2d8627c ARROW-87: [C++] Add all four possible ways to encode Decimals in Parquet to schema conversion
new 5a68f8d ARROW-93: Fix builds when using XCode 7.3
new b3ebce1 ARROW-89: [Python] Add benchmarks for Arrow<->Pandas conversion
new 6d31d59 ARROW-49: [Python] Add Column and Table wrapper interface
new 79fddd1 ARROW-90: [C++] Check for SIMD instruction set support
new 5d12999 ARROW-71: [C++] Add clang-tidy and clang-format to the the tool chain.
new 9d88a50 ARROW-86: [Python] Implement zero-copy Arrow-to-Pandas conversion
new 7b2153b ARROW-85: memcmp can be avoided in Equal when comparing with the same …
new 37f7271 ARROW-94: [Format] Expand list example to clarify null vs empty list
new 5843e68 ARROW-103: Add files to gitignore
new 0b472d8 ARROW-82: Initial IPC support for ListArray
new a541644 ARROW-100: [C++] Computing RowBatch size
new 56514d9 ARROW-104: [FORMAT] Add alignment and padding requirements + union clarification
new 355f7c9 ARROW-92: Arrow to Parquet Schema conversion
new ad3d01d ARROW-188: Add numpy as install requirement
new 3302257 ARROW-190: Python: Provide installable sdist builds
new c9ffe54 ARROW-194: C++: Allow read-only memory mapped source
new 1f04f7f ARROW-193: typos "int his" fix to "in this"
new 4bd13b8 ARROW-91: Basic Parquet read support
new 68b80a8 ARROW-197: Working first draft of a conda recipe for pyarrow
new 6968ec0 ARROW-199: [C++] Refine third party dependency
new 9c59158 ARROW-185: Make padding and alignment for all buffers be 64 bytes
new 978de1a ARROW-204: Add Travis CI builds that post conda artifacts for Linux and OS X
new e0fb369 ARROW-201: [C++] Initial ParquetWriter implementation
new c0985a4 Make BaseValueVector#MAX_ALLOCATION_SIZE configurable
new e316b3f Fix BaseAllocator.java NPE when assertions are disabled
new 7035467 Add java support to Travis CI
new cd1d770 ARROW-206: Expose a C++ api to compare ranges of slots between two arrays
new c8b8078 [Doc] Update Layout.md
new 6574095 ARROW-209: [C++] Triage builds due to unavailable LLVM apt repo
new ce2fe7a ARROW-211: [Format] Fixed typos in layout examples
new 9ce13a0 ARROW-60: [C++] Struct type builder API
new bc6c4c8 ARROW-200: [C++/Python] Return error status on string initialization failure
new 8197f24 ARROW-212: Change contract of PrimitiveArray to reflect its abstractness
new ec66ddd ARROW-203: Python: Basic filename based Parquet read/write
new b4e0e93 ARROW-217: Fix Travis w.r.t conda 4.1.0 changes
new 790d541 ARROW-218: Add optional API token authentication option to PR merge tool
new 27edd25 ARROW-210: Cleanup of the string related types in C++ code base
new a3e3849 ARROW-219: Preserve CMAKE_CXX_FLAGS, fix compiler warnings
new f7ade7b ARROW-223: Do not link against libpython
new ef90830 ARROW-222: Prototyping an IO interface for Arrow, with initial HDFS target
new 2f52cf4 ARROW-215: Support other integer types and strings in Parquet I/O
new fab4c82 ARROW-234: Build libhdfs IO extension in conda artifacts
new 77598fa ARROW-233: Add visibility macros, add static build option
new ff6132f ARROW-237: Implement parquet-cpp's abstract IO interfaces for memory allocation and file reading
new 62390d8 ARROW-106: [C++] Add IPC to binary/string types
new 55bfa83 ARROW-238: Change InternalMemoryPool::Free() to return Status::Invalid when ther…
new 59e5f98 ARROW-236: Bridging IO interfaces under the hood in pyarrow
new a2fb756 ARROW-241: Add missing implementation for splitAndTransfer in UnionVector
new dc79ceb ARROW-244: Some global APIs of IPC module should be visible to the outside
new 356d015 ARROW-240: Provide more detailed installation instructions for pyarrow. Closes
new 3a2dfba ARROW-101: Fix java compiler warnings
new 56835c3 ARROW-246: [Java] UnionVector doesn't call allocateNew() when creating it's vectorType
new 5df7d4d ARROW-247: Missing explicit destructor in RowBatchReader causes an incomplete type error
new 34e7f48 ARROW-250: Fix for ARROW-246 may cause memory leaks
new 2742d37 ARROW-254: remove Bit type as it is redundant with Boolean
new dc01f09 ARROW-253: restrict ints to 8, 16, 32, or 64 bits in V1
new e8724f8 ARROW-260: Fix flaky oversized tests
new 689cd27 ARROW-245: add endianness to RecordBatch
new 268e108 ARROW-251: Expose APIs for getting code and message of the status
new 246a126 ARROW-107: [C++] Implement IPC for structs
new e7e399d ARROW-259: Use Flatbuffer Field type instead of MaterializedField
new fd2e524 Revert version to 0.1-SNAPSHOT
new 282fcac ARROW-265: Pad negative decimal values with1
new c2eb161 ARROW-265: Fix few decimal bugs
new 812201a ARROW-266: [C++] Fix broken build due to Flatbuffers namespace change
new 7861968 ARROW-252: Add implementation guidelines to the documentation
new 8960a2e ARROW-255: Finalize Dictionary representation
new ec51d56 ARROW-269: Include typeVector buffers UnionVector.getBuffers()
new 803afeb ARROW-264: File format
new 907cc5a ARROW-262: Start metadata specification document
new e081a4c ARROW-271: Update Field structure to be more explicit
new 0a411fd ARROW-242: Support Timestamp Data Type
new e197b2d ARROW-279: rename vector module to arrow-vector
new 2d8ec78 ARROW-274: Add NullableMapVector to support nullable maps
new 637584b ARROW-284: Disable arrow_parquet module in Travis CI to triage builds
new 214b861 ARROW-283: [C++] Account for upstream changes in parquet-cpp
new 270ab4e ARROW-278: [Format] Rename Tuple to Struct_ in flatbuffers IDL
new 52089d6 ARROW-285: Optional flatc download
new a5f2861 ARROW-286: Build thirdparty dependencies in parallel
new 077c72b ARROW-256: [Format] Add a version number to the IPC/RPC metadata
new 6b8abb4 ARROW-289: Install test-util.h
new 6f99156 ARROW-287: Make nullable vectors use a BitVecor instead of UInt1Vector for bits
new 3487c2f ARROW-292: [Java] Upgrade Netty to 4.0.41
new 17e90e1 ARROW-290: Specialize alloc() in ArrowBuf
new 559b865 ARROW-280: [C++] Refactor IPC / memory map IO to use common arrow_io interfaces. Create arrow_ipc leaf library
new 5f1556c ARROW-297: Fix Arrow pom for release
new 5358328 ARROW-298: create release scripts
new 430bd95 ARROW-299: Use absolute namespace in macros
new 7e39747 ARROW-267: [C++] Implement file format layout for IPC/RPC
new 32fd692 ARROW-296: [Python / C++] Remove arrow::parquet, make pyarrow link against parquet_arrow
new 45d8832 ARROW-293: [C++] Implement Arrow IO interfaces for operating system files
new 03134b1 ARROW-270: Define more generic Interval logical type
new bae33d6 ARROW-304: NullableMapReaderImpl.isSet() always returns true
new 768c7d0 ARROW-257: Add a typeids Vector to Union type
new bd195e3 ARROW-308: UnionListWriter.setPosition() should not call startList()
new bf30235 ARROW-306: Add option to pass cmake arguments via environment variable
new 30f6083 ARROW-305: Add compression and use_dictionary options to Parquet
new 391ab64 ARROW-309: Types.getMinorTypeForArrowType() does not work for Union type
new c7b0480 ARROW-314: JSONScalar is unnecessary and unused
new c3930a0 ARROW-301: Add user field metadata to IPC schemas
new c7e6a07 ARROW-302: [C++/Python] Implement C++ IO interfaces for interacting with Python file and bytes objects
new c3cfa3d ARROW-313: Build on any version of XCode
new 7fb4d24 ARROW-315: finalize timestamp
new dd1b95b ARROW-318: Revise python/README.md given recent changes in codebase
new 04cf874 ARROW-321: fix arrow licenses
new f1a4bd1 ARROW-320: ComplexCopier.copy(FieldReader, FieldWriter) should not st…
new 3f85cee ARROW-324: Update arrow metadata diagram
new 2d8e820 ARROW-319: Add canonical Arrow Schema json representation
new 1196691 ARROW-326: Initialize nested writers in MapWriter based on the underlying MapVector's field
new eb1491a ARROW-325: make TestArrowFile not dependent on timezone
new e7080ef [maven-release-plugin] prepare release apache-arrow-0.1.0
new 17cd7a6 [maven-release-plugin] prepare for next development iteration
new a9747ce ARROW-312: Read and write Arrow IPC file format from Python
new fb799bc ARROW-112: Changed constexprs to kValue naming.
new 8c8d341 ARROW-326: Include scale and precision when materializing decimal writer
new 994aa5a ARROW-189: Build 3rd party with ExternalProject.
new caa843b ARROW-333: Make writers update their internal schema even when no data is written
new 3919a27 ARROW-332: Add RecordBatch.to_pandas method
new bf749f5 ARROW-275: Add tests for UnionVector in Arrow File
new 4ecf327 ARROW-191: Python: Provide infrastructure for manylinux1 wheels
new 8520061 ARROW-336: Run Apache Rat in Travis builds
new 8e8b17f ARROW-97: API documentation via sphinx-apidoc
new 732a205 ARROW-261: Refactor String/Binary code paths to reflect unnested (non-list-based) structure
new 676c32c ARROW-317: Add Slice, Copy methods to Buffer
new e2c0a18 ARROW-327: [Python] Remove conda builds from Travis CI setup
new 446ec9b ARROW-334: [Python] Remove INSTALL_RPATH_USE_LINK_PATH
new 2f84493 ARROW-342: Set Python version on release
new 3d2e4df ARROW-337: UnionListWriter.list() is doing more than it should, this …
new 6178bf7 ARROW-350: Added Kerberos to HDFS client
new da24c1a ARROW-339: Python 3 compatibility in merge_arrow_pr.py
new d946e79 ARROW-354: Fix comparison of arrays of empty strings
new 772bc6e ARROW-349: Add six as a requirement
new ca088dd ARROW-339: [Dev] Lingering Python 3 fixes
new d414875 ARROW-348: [Python] Add build-type command line option to setup.py, build CMake extensions in a build type subdirectory
new c7db80e ARROW-355: Add tests for serialising arrays of empty strings to Parquet
new e70d97d ARROW-358: Add explicit environment variable to locate libhdfs in one's environment
new 2a059bd ARROW-359: Document ARROW_LIBHDFS_DIR
new 17c9ae7 ARROW-357: Use a single RowGroup for Parquet files as default.
new 25e0106 ARROW-323: [Python] Opt-in to pyarrow.parquet extension rather than attempting and failing silently
new e8bc1fe ARROW-368: Added note for LD_LIBRARY_PATH in Python README
new 121e826 ARROW-361: Python: Support reading a column-selection from Parquet files
new 79344b3 ARROW-362: Fix memory leak in zero-copy arrow to NumPy/pandas conversion
new 6996c17 ARROW-312: [Java] IPC file round trip tool for integration testing
new 4fa7ac4 ARROW-372: json vector serialization format
new 7f048a4 ARROW-356: Add documentation about reading Parquet
new 48f9780 ARROW-375: Fix unicode Python 3 issue in columns argument of parquet.read_table
new 78288b5 ARROW-371: Handle pandas-nullable types correctly
new 8417096 ARROW-367: converter json <=> Arrow file format for Integration tests
new ed6ec3b ARROW-373: [C++] JSON serialization format for testing
new 997f502 ARROW-382: Extend Python API documentation
new f082b17 ARROW-383: [C++] Integration testing CLI tool
new 197120c ARROW-390: Only specify dependencies for json-integration-test on ARROW_BUILD_TESTS=ON
new 86f56a6 ARROW-202: Integrate with appveyor ci for windows
new e3c167b ARROW-363: [Java/C++] integration testing harness, initial integration tests
new 65b74b3 ARROW-393: [JAVA] JSON file reader fails to set the buffer size on String data vector
new 859018b ARROW-392: [C++/Java] String IPC integration testing / fixes. Add array / record batch pretty-printing
new 072b7d6 ARROW-395: Arrow file format writes record batches in reverse order.
new 3b946b8 ARROW-396: [Python] Add pyarrow.schema.Schema.equals
new 33c731d ARROW-398: Java file format requires bitmaps of all 1's to be written…
new 06be7ae ARROW-389: Python: Write Parquet files to pyarrow.io.NativeFile objects
new ebe7dc8 ARROW-335: Improve Type apis and toString() by encapsulating flatbuffers better
new b5de9e5 ARROW-369: [Python] Convert multiple record batches at once to Pandas
new 0ac01a5 ARROW-379: Use setuptools_scm for Python versioning
new 599d516 ARROW-401: Floating point vectors should do an approximate comparison…
new 82575ca ARROW-403: [Java] Create transfer pairs for internal vectors in UnionVector transfer impl
new 4b72329 ARROW-406: [C++] Set explicit 64K HDFS buffer size, test large reads
new 72f80d4 ARROW-409: [Python] Change record batches conversion to Table
new c8eb49e ARROW-408: Remove defunct conda recipes
new e139b8b ARROW-404: [Python] Fix segfault caused by HdfsClient getting closed before an HdfsFile
new a5362c2 ARROW-346: Use conda environment to build API docs
new d06c491 ARROW-399: ListVector.loadFieldBuffers ignores the ArrowFieldNode len…
new 14ed1be ARROW-400: set struct length on load
new 8995c92 ARROW-402: Fix reference counting issue with empty buffers. Close #232
new 45ed7e7 ARROW-394: [Integration] Generate tests cases for numeric types, strings, lists, structs
new 73fe556 ARROW-328: Return shared_ptr<T> by value instead of const-ref
new 2c10d7c ARROW-418: [C++] Array / Builder class code reorganization, flattening
new 7e93075 ARROW-405: Use vendored hdfs.h if not found in include/ in $HADOOP_HOME
new 9352790 ARROW-422: IPC should depend on rapidjson_ep if RapidJSON is vendored
new 063c190 ARROW-423: Define BUILD_BYPRODUCTS for CMake 3.2+
new cfb544d ARROW-425: Add private API to get python Table from a C++ object
new a2ead2f ARROW-380: [Java] optimize null count when serializing vectors
new c369709 ARROW-426: Python: Conversion from pyarrow.Array to a Python list
new 68e39c6 ARROW-429: Revert ARROW-379 until git-archive issues are resolved
new cfde460 ARROW-243: [C++] Add option to switch between libhdfs and libhdfs3 when creating HdfsClient
new d7845fc ARROW-420: Align DATE type with Java implementation
new fe53fa4 ARROW-435: Fix spelling of RAPIDJSON_VENDORED
new 6ff5fcf ARROW-433: Correctly handle Arrow to Python date conversion for timezones west of London
new f6bf112 ARROW-434: [Python] Correctly handle Python file objects in Parquet read/write paths
new 73455b5 ARROW-430: Improved version handling
new 268ffbe ARROW-374: More precise handling of bytes vs unicode in Python API
new fd4eb98 ARROW-440: [C++] Support pkg-config
new 65af9ea ARROW-432: [Python] Construct precise pandas BlockManager structure for zero-copy DataFrame initialization
new 1079a32 ARROW-437: [C++} Fix clang compiler warning
new ab5f66a ARROW-428: [Python] Multithreaded conversion from Arrow table to pandas.DataFrame
new cfbdb68 ARROW-322: [C++] Remove ARROW_HDFS option, always build the module
new 8aab00e ARROW-330: CMake functions to simplify shared / static library configuration
new 3095f2c ARROW-444: [Python] Native file reads into pre-allocated memory. Some IO API cleanup / niceness
new 4733ee8 ARROW-445: arrow_ipc_objlib depends on Flatbuffer generated files
new 23fe6ae ARROW-338: Implement visitor pattern for IPC loading/unloading
new e15c6a0 ARROW-447: Always return unicode objects for UTF-8 strings
new e8b6231 ARROW-450: Fixes for PARQUET-818
new 806239f ARROW-449: Python: Conversion from pyarrow.{Table,RecordBatch} to a Python dict
new 9f7d4ae ARROW-108: [C++] Add Union implementation and IPC/JSON serialization tests
new d9df556 ARROW-294: [C++] Do not use platform-dependent fopen/fclose functions for MemoryMappedFile
new 26140dc ARROW-387: [C++] Verify zero-copy Buffer slices from BufferReader retain reference to parent Buffer
new fdbc579 ARROW-417: Add Equals implementation to compare ChunkedArrays
new 9513ca7 ARROW-411: [Java] Move compactor functions in Integration to a separate Validator module
new 320f587 ARROW-455: [C++] Add dtor to BufferOutputStream that calls Close()
new 5bf6ae4 ARROW-456: Add jemalloc based MemoryPool
new 74685f3 ARROW-427: [C++] Implement dictionary array type
new 7d1f1cf ARROW-360: C++: Add method to shrink PoolBuffer using realloc
new 1094d89 ARROW-463: C++: Support jemalloc 4.x
new 3195948 ARROW-438: [C++/Python] Implement zero-data-copy record batch and table concatenation.
new f44b6a3 ARROW-442: [Python] Inspect Parquet file metadata from Python
new 8d917c1 ARROW-466: Add ExternalProject for jemalloc
new 543e508 ARROW-469: C++: Add option so that resize doesn't decrease the capacity
new 7d3e2a3 ARROW-421: [Python] Retain parent reference in PyBytesReader
new c5663c6 ARROW-385: Refactors metric system
new 5ffbda1 ARROW-479: Python: Test for expected schema in Pandas conversion
new ad0e57d ARROW-472: [Python] Expose more C++ IO interfaces. Add equals methods to Parquet schemas. Pass Parquet metadata separately in reader
new cb83b8d ARROW-96: Add C++ API documentation
new 281cb96 ARROW-481: [Python] Fix 2.7 regression in Parquet path to open file code path
new 876ae85 ARROW-239: Test reading remainder of file in HDFS with read() with no args
new 47115aa ARROW-478: Consolidate BytesReader and BufferReader to accept PyBytes or Buffer
new a098fd0 ARROW-486: [C++] Use virtual inheritance for diamond inheritance
new 850774e ARROW-484: Revise README to include more detail about software components
new b147230 ARROW-490: Python: Update manylinux1 build scripts
new 353772f ARROW-468: Python: Conversion of nested data in pd.DataFrames
new 9b1b397 ARROW-461: [Python] Add Python interfaces to DictionaryArray data, pandas interop
new 6811d3f ARROW-474: [Java] Add initial version of streaming serialized format.
new 512bc16 ARROW-386: [Java] Respect case of struct / map field names
new 8ca7033 ARROW-499: Update file serialization to use the streaming serialization format.
new 5888e10 ARROW-495: [C++] Implement streaming binary format, refactoring
new 5a161eb ARROW-505: [C++] Fix compiler warning in gcc in release mode
new 53a478d ARROW-475: [Python] Add support for reading multiple Parquet files as a single pyarrow.Table
new 69cdbd8 ARROW-494: [C++] Extend lifetime of memory mapped data if any buffers reference it
new c327b5f ARROW-506: Java: Implement echo server for integration testing.
new 1f81adc ARROW-503: [Python] Implement Python interface to streaming file format
new 2821030 ARROW-508: [C++] Add basic threadsafety to normal files and memory maps
new 085c875 ARROW-81: [Format] Augment dictionary encoding metadata to accommodate additional use cases
new c90ca60 ARROW-378: Python: Respect timezone on conversion of Pandas datetime columns
new 61a54f8 ARROW-509: [Python] Add support for multithreaded Parquet reads
new a68af9d ARROW-498 [C++] Add command line utilities that convert between stream and file.
new a90b5f3 ARROW-512: C++: Add method to check for primitive types
new aac2e70 ARROW-513: [C++] Fixing Appveyor / MSVC build
new 30bb0d9 ARROW-514: [Python] Automatically wrap pyarrow.io.Buffer in BufferReader
new 4226adf ARROW-515: [Python] Add read_all methods to FileReader, StreamReader
new 7ac320b ARROW-519: [C++] Refactor array comparison code into a compare.h / compare.cc in part to resolve Xcode 6.1 linker issue
new be5d73f ARROW-410: [C++] Add virtual Writeable::Flush
new 0ae4d86 ARROW-497: Integration harness for streaming file format
new c05292f ARROW-523: Python: Account for changes in PARQUET-834
new 720d422 ARROW-467: [Python] Run Python parquet-cpp unit tests in Travis CI
new 08f38d9 ARROW-477: [Java] Add support for second/microsecond/nanosecond timestamps in-memory and in IPC/JSON layer
new e881f11 ARROW-525: Python: Add more documentation to the package
new 5b35d6b ARROW-457: Python: Better control over memory pool
new 84f1662 ARROW-381: [C++] Simplify primitive array type builders to use a default type singleton
new c45c3b3 ARROW-527: Remove drill-module.conf file
new 70c05be ARROW-524: provide apis to access nested vectors and buffers
new 5bee596 ARROW-529: Python: Add jemalloc and Python 3.6 to manylinux1 build
new 74bc4dd ARROW-511: Python: Implement List conversions for single arrays
new 5439b71 ARROW-33: [C++] Implement zero-copy array slicing, integrate with IPC code paths
new f268e92 ARROW-540: [C++] Build fixes after ARROW-33, PARQUET-866
new 4c3481e ARROW-535: [Python] Add type mapping for NPY_LONGLONG
new e97fbe6 ARROW-531: Python: Document jemalloc, extend Pandas section, add Getting Involved
new c322cbf ARROW-366 Java Dictionary Vector
new 1407abf ARROW-537: [C++] Do not compare String/Binary data in null slots when comparing arrays
new b99d049 ARROW-351: Time type has no unit
new 4440e40 ARROW-543: C++: Lazily computed null_counts counts number of non-null entries
new 0bdfd5e ARROW-538: [C++] Set up AddressSanitizer (ASAN) builds
new 31f145d ARROW-545: [Python] Ignore non .parq/.parquet files when reading directories as Parquet datasets
new dc6cefd ARROW-521: [C++] Track peak allocations in default memory pool
new 3add918 ARROW-476: Add binary integration test fixture, add Java support
new 0ab4252 ARROW-546: Python: Account for changes in PARQUET-867
new 42b55d9 ARROW-544: [C++] Test writing zero-length record batches, zero-length BinaryArray fixes
new e4845c4 ARROW-551: C++: Construction of Column with nullptr Array segfaults
new 1f26040 ARROW-548: [Python] Add nthreads to Filesystem.read_parquet and pass through
new ad01575 ARROW-553: C++: Faster valid bitmap building
new 66f650c ARROW-547: [Python] Add zero-copy slice methods to Array, RecordBatch
new 69cf692 ARROW-556: [Integration] Configure C++ integration test executable with a single environment variable. Update README
new d50f152 ARROW-558: Add KEYS files
new fa8d27f ARROW-561:[JAVA][PYTHON] Update java & python dependencies to improve downstream packaging experience
new f6924ad [maven-release-plugin] prepare release apache-arrow-0.2.0
new ab15e01 [maven-release-plugin] prepare for next development iteration
new ef6b465 ARROW-563: Support non-standard gcc version strings
new 4598c1a ARROW-570: Determine Java tools JAR location from project metadata
new 5e279f0 ARROW-569: [C++] Set version for *.pc
new d28f1c1 ARROW-459: [C++] Dictionary IPC support in file and stream formats
new 89dc557 ARROW-580: C++: Also provide jemalloc_X targets if only a static or shared version is found
new 8afe92c ARROW-578: [C++] Add -DARROW_CXXFLAGS=... option to make CMake more consistent
new ef3b6b3 ARROW-451: [C++] Implement DataType::Equals as TypeVisitor. Add default implementations for TypeVisitor, ArrayVisitor methods
new 16c9759 ARROW-577: [C++] Use private implementation pattern in ipc::StreamWriter and ipc::FileWriter
new dc103fe ARROW-557: [Python] Add option to explicitly opt in to HDFS tests, do not implicitly skip
new 01a67f3 ARROW-493: [C++] Permit large (length > INT32_MAX) arrays in memory
new 2c3bd93 ARROW-588: [C++] Fix some 32 bit compiler warnings
new 0637e05 ARROW-576: [C++] Complete file/stream implementation for union types
new 8378c48 ARROW-589: C++: Use system provided shared jemalloc if static is unavailable
new 9deb325 ARROW-109: [C++] Add nesting stress tests up to 500 recursion depth
new fb9fbe4 ARROW-604: Python: boxed Field instances are missing the reference to their DataType
new b109a24 ARROW-566: Bundle Arrow libraries in Python package
new 6b3ae2a ARROW-605: [C++] Refactor IPC adapter code into generic ArrayLoader class. Add Date32Type
new f7f915d ARROW-615: [Java] Moved ByteArrayReadableSeekableByteChannel to src main o.a.a.vector.util
new d99958d ARROW-452: [C++/Python] Incorporate C++ and Python codebases for Feather file format
new fdc25b4 ARROW-606: [C++] upgrade flatbuffers version to 1.6.0
new e5a11da ARROW-534: [C++] Add IPC tests for date/time after ARROW-452, fix bugs
new 344ad1f ARROW-619: Fix typos in setup.py args and LD_LIBRARY_PATH
new d4ecb5e ARROW-612: [Java] Added not null to Field.toString output
new 492b3d4 ARROW-613: WIP TypeScript Implementation
new 2cf36ef ARROW-574: Python: Add support for nested Python lists in Pandas conversion
new 331be49 ARROW-623: Fix segfault in __repr__ of empty field
new 00df40c ARROW-618: [Python/C++] Support timestamp+timezone conversion to pandas
new 6aed18f ARROW-619: [Python] Fixed remaining typo for LD_LIBRARY_PATH
new f442879 ARROW-624: [C++] Restore MakePrimitiveArray function, use in feather.cc
new cef4615 ARROW-598: [Python] Add support for converting pyarrow.Buffer to a memoryview with zero copy
new a32ae59 ARROW-625: [C++] Add TimeUnit to TimeType::ToString. Add timezone to TimestampType::ToString if present
new dd8204c ARROW-628: [Python] Install nomkl metapackage when building parquet-cpp in Travis CI
new c8d15d4 ARROW-491: [Format / C++] Add FixedWidthBinary type to format, C++ implementation
new 3b65001 ARROW-520: [C++] STL-compliant allocator
new 49f666e ARROW-542: Adding dictionary encoding to FileWriter
new 1c101ff ARROW-636: [C++] Update README about Boost system requirement
new 0cf2bbb ARROW-642: [Java] Remove temporary file in java/tools
new 867f924 ARROW-231 [C++]: Add typed Resize to PoolBuffer
new 3ee3822 ARROW-593 [C++]: Rename ReadableFileInterface to RandomAccessFile
new c13d671 ARROW-644: Python: Cython should be a setup-only requirement
new 39c7274 ARROW-631: [GLib] Import
new 57b537a ARROW-637: [Format] Add timezone to Timestamp metadata, comments describing the semantics
new 16dd871 ARROW-650: [GLib] Follow ReadableFileInterface -> RnadomAccessFile change
new f5157a0 ARROW-648: [C++] Support multiarch on Debian
new 5ef6840 ARROW-652: Remove trailing f in merge script output
new 019f90d ARROW-647: [C++] Use Boost shared libraries for tests and utilities
new 98c9490 ARROW-639: [C++] Invalid offset in slices
new a9f0c63 ARROW-651: [C++] Set version to shared library
new 4c5f79c ARROW-617: [Format] Add additional Time metadata and comments based on discussion in ARROW-617
new df2220f ARROW-661: [C++] Add LargeRecordBatch metadata type, IPC support, associated refactoring
new cd4544d ARROW-664: [C++] Make C++ Arrow serialization deterministic
new 02bdbf4 ARROW-502 [C++/Python]: Logging memory pool
new 6cd82c2 ARROW-671: [GLib] Install missing license file
new 98a52b4 ARROW-316: [Format] Changes to Date metadata format per discussion in ARROW-316
new a8bf0fb ARROW-673: [Java] Support additional Time metadata
new a9a5701 ARROW-208: Add checkstyle policy to java project
new 55d8f99 ARROW-677: [java] Fix checkstyle jcl-over-slf4j conflict issue
new 82b15a4 ARROW-678: [GLib] Fix dependencies
new d252867 ARROW-675: [GLib] Update package metadata
new 96734ef ARROW-654: [C++] Serialize timezone in IPC metadata
new b179ad2 ARROW-681: [C++] Disable boost's autolinking if shared boost is used …
new 5fda247 ARROW-680: [C++] Support CMake 2 or older again
new 1b957dc ARROW-688: [C++] Use CMAKE_INSTALL_INCLUDEDIR for consistency
new 3610314 ARROW-689: [GLib] Fix install directories
new 71424c2 ARROW-683: [C++/Python] Refactor to make Date32 and Date64 types for new metadata. Test IPC roundtrip
new ced9d76 ARROW-679: [Format] Change FieldNode, RecordBatch lengths to long, remove LargeRecordBatch. Refactoring
new 2406d4e ARROW-552: [Python] Implement getitem for DictionaryArray by returning a value from the dictionary
new bf2acf6 ARROW-454: pojo.Field doesn't implement hashCode()
new 990e2bd ARROW-691: [Java] Encode dictionary type in message format
new 2926183 ARROW-347: Add method to pass CallBack when creating a transfer pair
new f67974b ARROW-700: Add headroom interface for allocator
new e8f6a49 ARROW-674: [Java] Support additional Timestamp timezone metadata
new 7594492 ARROW-704: Fix bad import caused by conflicting changes
new 2a568f0 ARROW-662: [Format] Move Schema flatbuffers into their own file that can be included
new e968ca6 ARROW-621: [C++] Start IPC benchmark suite for record batches, implement "inline" visitor. Code reorg
new dcaa8e5 ARROW-702: fix BitVector.copyFromSafe to reAllocate instead of returning false
new 13c12c6 ARROW-682: [Integration] Check implementations against themselves
new bc185a4 ARROW-595: [Python] Set schema attribute on StreamReader
new 016a209 ARROW-706: [GLib] Add package install document
new dc3cb30 ARROW-550: [Format] Draft experimental Tensor flatbuffer message type
new 5ad4988 ARROW-708: [C++] Simplify metadata APIs to all use the Message class, perf analysis
new 60b5832 ARROW-686: [C++] Account for time metadata changes, add Time32 and Time64 types
new c7947dc ARROW-709: [C++] Restore type comparator for DecimalType
new 685ebf4 ARROW-626: [Python] Replace PyBytesBuffer with zero-copy, memoryview-based PyBuffer
new ab848f0 ARROW-713: [C++] Fix cmake linking issue in new IPC benchmark
new fd87669 ARROW-684: [Python] More helpful error message if libparquet_arrow not built
new 6d4e862 ARROW-712: [C++] Reimplement Array::Accept as inline visitor
new 3aac4ad ARROW-341: [Python] Move pyarrow's C++ code to the main C++ source tree, install libarrow_python and headers
new d2d2755 ARROW-658: [C++] Implement a prototype in-memory arrow::Tensor type
new e717d47 ARROW-716: [Python] Update README build instructions after moving libpyarrow to C++ tree
new 3b71d87 ARROW-620: [C++] Implement JSON integration test support for date, time, timestamp, fixed width binary
new dac648d ARROW-701: [Java] Support Additional Date Type Metadata
new b032363 ARROW-732: [C++] Schema comparison bugs in struct and union types
new 8f38637 ARROW-731: [C++] Add shared library related versions to .pc
new f7b287a ARROW-627: [C++] Add compatibility macros for exported extern templates
new 642b753 ARROW-698: Add flag to FileWriter::WriteRecordBatch for writing record batches with lengths over INT32_MAX
new 47fad3f ARROW-728: [C++/Python] Add Table::RemoveColumn method, remove name member, some other code cleaning
new 15b874e ARROW-699: [C++] Resolve Arrow and Arrow IPC build issues on Windows;
new 957a0e6 ARROW-717: [C++] Implement IPC zero-copy round trip for tensors
new 4938d8d ARROW-726: [C++] Fix segfault caused when passing non-buffer object to arrow::py::PyBuffer
new ae2da98 ARROW-743: [C++] Consolidate all but decimal array tests into array-test, collect some tests in type-test.cc
new ba4f478 ARROW-715: [Python] Make pandas not a hard requirement, flake8 fixes
new edd6cfc ARROW-727: [Python] Ensure that NativeFile.write accepts any bytes, unicode, or object providing buffer protocol. Rename build_arrow_buffer to pyarrow.frombuffer
new 4915ecf ARROW-632: [Python] Add support for FixedWidthBinary type
new f5967ed ARROW-603: [C++] Add RecordBatch::Validate method, call in RecordBatch ctor in debug builds
new 005826f ARROW-719: [GLib] Release source archive
new ad8a0cf ARROW-739: Don't install jemalloc in parallel
new e5b6827 ARROW-744: [GLib] Re-add an assertion for garrow_table_new() test
new 4e77d33 ARROW-746: [GLib] Add garrow_array_get_data_type()
new 067cd4e ARROW-630: [C++] Create boolean batches for IPC testing, properly account for nonzero offset
new d75d7a9 ARROW-736: [Python] Mixed-type object DataFrame columns should not silently co…
new 9f5e174 ARROW-733: [C++/Python] Rename FixedWidthBinary to FixedSizeBinary for consistency with FixedSizeList
new fd00096 ARROW-723: [Python] Ensure that passing chunk_size=0 when writing Parquet file does not enter infinite loop
new 31a1f53 ARROW-710: [Python] Read/write with file-like Python objects from read_feather/write_feather
new 651ea92 ARROW-745: [C++] Allow use of system cpplint
new baf38e4 ARROW-747: [C++] Calling add_dependencies with dl causes spurious CMake warning
new 7fec7d3 ARROW-754: [GLib] Add garrow_array_is_null()
new e333576 ARROW-718: [Python] Implement pyarrow.Tensor container, zero-copy NumPy roundtrips
new d54ab9a ARROW-737: [C++] Enable mutable buffer slices, SliceMutableBuffer function
new c4d535c ARROW-753: [Python] Fix linker error for python-test on OS X
new 9f720b1 ARROW-738: Fix manylinux1 build
new 8f113b4 ARROW-755: [GLib] Add garrow_array_get_value_type()
new 96f3d61 ARROW-749: [Python] Delete partially-written Feather file when column write fails
new 7232e5b ARROW-676: move from MinorType to FieldType in ValueVectors to carry all the relevant type bits
new 7d1d4e7 ARROW-657: [C++/Python] Expose Tensor IPC in Python. Add equals method. Add pyarrow.create_memory_map/memory_map functions
new f05b7c6 ARROW-443: [Python] Support ingest of strided NumPy arrays from pandas
new d0cd03d ARROW-763: C++: Use to find libpythonX.X.dylib
new d560e30 ARROW-656: [C++] Add random access writer for a mutable buffer. Rename WriteableFileInterface to WriteableFile for better consistency
new ec6188e ARROW-769: [GLib] Support building without installed Arrow C++
new 2aed784 ARROW-765: [Python] Add more natural Exception type hierarchy for thirdparty users
new 5d6c6ad ARROW-770: [C++] Move .clang* files back into cpp source tree
new 360942e ARROW-672: [Format] Add MetadataVersion::V3 for Arrow 0.3
new e29a7d4 ARROW-668: [Python] Box timestamp values as pandas.Timestamp if available, attach tzinfo
new f4fcb42 ARROW-510 ARROW-582 ARROW-663 ARROW-729: [Java] Added units for Time and Date types, and integration tests
new ddf880b ARROW-752: [Python] Support boxed Arrow arrays as input to DictionaryArray.from_arrays
new 621d527 ARROW-582: [Java] Added JSON reader/writer unit test for date, time, and timestamp
new 49b3e0e ARROW-774: [GLib] Remove needless LICENSE.txt copy
new ff744ef ARROW-775: add simple constructors to value vectors
new 56f1e91 ARROW-771: [Python] Add read_row_group / num_row_groups to ParquetFile
new 58fa4c2 ARROW-776: [GLib] Fix wrong type name
new e371ebd ARROW-756: [C++] MSVC build fixes and cleanup, remove -fPIC flag from EP builds on Windows, Dev docs
new e53357c ARROW-778: Port merge tool to work on Windows
new 1c66097 ARROW-758: [C++] Build with /WX in Appveyor, fix MSVC compiler warnings
new 027c6b8 ARROW-781 [C++/Python] Increase reference count of the numpy base array?
new 8ae3283 ARROW-787: [GLib] Fix compilation error caused by introducing BooleanBuilder::Append overload
new 3591103 ARROW-793: [GLib] Fix indent
new b0e3122 ARROW-724: Add How to Contribute section to README
new 739ed82 ARROW-762: [Python] Start docs page about files and filesystems, adapt C++ docs about HDFS
new b0863cb ARROW-788: [C++] Align WriteTensor message
new 449f991 ARROW-773: [CPP] Add Table::AddColumn API
new 754bcce ARROW-655: [C++/Python] Implement DecimalArray
new 137aade ARROW-722: [Python] Support additional date/time types and metadata, conversion to/from NumPy and pandas.DataFrame
new 72e1e08 ARROW-800: [C++] Boost headers being transitively included in pyarrow
new acbda18 ARROW-794: [C++/Python] Disallow strided tensors in ipc::WriteTensor
new ddda303 ARROW-526: [Format] Revise Format documents for evolution in IPC stream / file / tensor formats
new d1a9aff ARROW-795: [C++] Consolidate arrow/arrow_io/arrow_ipc into a single shared and static library
new 793f4e0 ARROW-782: [C++] API cleanup, change public member access in DataType classes to functions, use class instead of struct
new e327c2e ARROW-761: [C++/Python] Add GetTensorSize method, Python bindings
new c2f28cd ARROW-741: [Python] Switch Travis CI to use Python 3.6 instead of 3.5
new 06d92bb ARROW-779: [C++] Check for old metadata and raise exception if found
new 85b870e ARROW-802: [GLib] Add read examples
new b7423a6 ARROW-803: [GLib] Update package repository URL
new b3cec80 ARROW-804: [GLib] Update build document
new f5245cc ARROW-806: [GLib] Support add/remove a column from table
new 7b4723b ARROW-807: [GLib] Update "Since" tag
new ab520cb ARROW-808: [GLib] Remove needless ignore entries
new 5e5a587 ARROW-805: [C++] Don't throw IOError when listing empty HDFS dir
new 6443b82 ARROW-810: [GLib] Remove io/ipc prefix
new 9db96fe ARROW-811: [GLib] Add GArrowBuffer
new 9d532c4 ARROW-539: [Python] Add support for reading partitioned Parquet files with Hive-like directory schemes
new 3d9bfc2 ARROW-646: [Python] Conda s3 robustness, set CONDA_PKGS_DIR env variable and add Travis CI caching
new e934365 ARROW-797: [Python] Make more explicitly curated public API page, sphinx cleanup
new 8b64a4f ARROW-751: [Python] Make all Cython modules private. Some code tidying
new 19da86a ARROW-817: [Python] Fix comment in date32 conversion
new 874666a ARROW-816: [C++] Travis CI script cleanup, add C++ toolchain env with Flatbuffers, RapidJSON
new b4892fd ARROW-528: [Python] Utilize improved Parquet writer C++ API, add write_metadata function, test _metadata files
new 01114d8 ARROW-783: [Java/C++] Fixes for 0-length record batches
new b603337 ARROW-815 [Java] Exposing reAlloc for ValueVector
new 794d020 ARROW-777: restore getObject behavior on Date and Time
new 88c351a ARROW-720: arrow should not have a dependency on slf4j bridges in com…
new 4b030dd ARROW-828: [C++] Add new dependency to README
new ce5b98e ARROW-820: [C++] Build dependencies for Parquet library without arrow…
new 4d2ac87 ARROW-826: [C++/Python] Fix compilation error on Mac with -DARROW_PYTHON=on
new edb8252 ARROW-829: Don't deactivate Parquet dictionary encoding on column-wis…
new 0f9c88f ARROW-725: [Formats/Java] FixedSizeList message and java implementation
new 30e03a9 ARROW-703: Fix issue where setValueCount(0) doesn’t work in the case that we’ve shipped vectors across the wire
new ee5cb2a ARROW-830: [Python] Expose jemalloc memory pool and other memory pool functions in public pyarrow API
new dad1a8e ARROW-832: [C++] Update to gtest 1.8.0, remove now unneeded test_main.cc
new 09e6ead ARROW-836: add test for pandas conversion of timedelta, currently unimplemented
new f512590 ARROW-827: [Python] Miscellaneous improvements to help with Dask support
new 312a665 ARROW-707: [Python] Return NullArray for array of all None in Array.from_pandas. Revert from_numpy -> from_pandas
new 7238d54 ARROW-734: [C++/Python] Support building PyArrow on MSVC
new 84d725b ARROW-825: [Python] Rename pyarrow.from_pylist to pyarrow.array, test on tuples
new bb8514c ARROW-833: [Python] Add Developer quickstart for conda users
new 0bcb785 ARROW-839: [Python] Use mktime variant that is reliable on MSVC
new bb287e2 ARROW-845: [Python] Sync changes from PARQUET-955; explicit ARROW_HOME will override pkgconfig
new 7f20f6e ARROW-818: [Python] Expand Sphinx API docs, pyarrow.* namespace. Add factory functions for time32, time64
new 38efabe ARROW-844: [Format] Update README documents in format/
new 4baaa88 ARROW-847: Specify BUILD_BYPRODUCTS for gtest
new a94c03a ARROW-809: [C++] Do not write excess bytes in IPC writer after slicing arrays
new 59cd801 ARROW-852: Also search for ARROW libs when pkg-config provided the path
new 4555ab9 ARROW-841: [Python] Add pyarrow build to Appveyor
new 41a8ff9 ARROW-853: [Python] Only set RPATH when bundling the shared libraries
new 391242a ARROW-848: [Python] Another pass on conda dev guide
new 74f89cf ARROW-858: Remove boost_regex from arrow dependencies
new 0dc6fe8 ARROW-846: [GLib] Add GArrowTensor, GArrowInt8Tensor and GArrowUInt8Tensor
new a68f31b ARROW-860: [C++] Remove typed Tensor containers
new 3f9b26c ARROW-863: [GLib] Use GBytes to implement zero-copy
new 7c1fef5 ARROW-864: [GLib] Unify Array files
new 6c352e2 ARROW-822: [Python] StreamWriter Wrapper for Socket and File-like Objects without tell()
new 6867e93 ARROW-869 [JS] Rename directory to js/
new 16ea370 ARROW-616: [C++] Do not include debug symbols in release builds by default
new b4a75b1 ARROW-871: [GLib] Unify DataType files
new 423235c ARROW-868: [GLib] Use GBytes to reduce copy
new 76dfd98 ARROW-876: [GLib] Unify ArrayBuilder files
new 578b0ff ARROW-877: [GLib] Add garrow_array_get_null_bitmap()
new 07c6ade ARROW-849: [C++] Support setting production build dependencies with ARROW_BUILD_TOOLCHAIN
new 39a37f7 ARROW-878: [GLib] Add garrow_binary_array_get_buffer()
new a0a925b ARROW-875: Avoid setting an extra empty in fillEmpties()
new 26e5bb1 ARROW-879: compat with pandas v0.20.0
new 33ac8a2 ARROW-882: [C++] Rename statically build library on Windows to avoid …
new 95f489c ARROW-880: [GLib] Support getting raw data of primitive arrays
new de54eff ARROW-659: [C++] Add multithreaded memcpy implementation
new 76d56d3 ARROW-95: Add Jekyll-based website publishing toolchain, migrate existing arrow-site
new 6239abd ARROW-862: [Python] Simplify README landing documentation to direct users and developers toward the documentation
new eaf2118 ARROW-887: add default value to units for backward compatibility
new f00e2ab ARROW-890: [GLib] Add GArrowMutableBuffer
new 1a73c35 ARROW-895: Fix lastSet in fillEmpties() and copyFrom()
new 0bee804 ARROW-888: Transfer ownership of buffer in BitVector transferTo()
new 68decb6 ARROW-865: [Python] Add unit tests validating Parquet date/time type roundtrips
new 6ae49a1 ARROW-892: [GLib] Fix GArrowTensor document
new 015b284 ARROW-894: [GLib] Add GArrowResizableBuffer and GArrowPoolBuffer
new 949249d ARROW-893: Add GLib document to Web site
new 7d433dc ARROW-483: [C++/Python] Provide access to "custom_metadata" Field attribute in IPC setting
new 3ad9d09 ARROW-904: [GLib] Simplify error check codes
new 02c32ff ARROW-903: [GLib] Remove a needless "."
new 8bf61d1 ARROW-697: JAVA Throw exception for record batches > 2GB
new 3fdeac7 ARROW-886 [Java] Fixing reallocation of VariableLengthVector offsets
new e876abb ARROW-898: [C++/Python] Use shared_ptr to avoid copying KeyValueMetadata, add to Field type also
new 909f826 ARROW-867: [Python] pyarrow MSVC fixes
new 81be9c6 ARROW-866: [Python] Be robust to PyErr_Fetch returning a null exc value
new 03dce9d ARROW-900: [Python] Fix UnboundLocalError in ParquetDatasetPiece.read
new 14bec24 ARROW-908: [GLib] Unify OutputStream files
new f13a928 ARROW-907: C++: Construct Table from schema and arrays
new f7ab727 ARROW-896: Support Jupyter Notebook in Web site
new 53c093b ARROW-914 [C++/Python] Fix Decimal ToBytes
new ed5a1d4 ARROW-916: [GLib] Add GArrowBufferOutputStream
new ce0c962 ARROW-917: [GLib] Add GArrowBufferReader
new 2d5142c ARROW-918: [GLib] Use GArrowBuffer for read buffer
new b4886da ARROW-919: [GLib] Use "id" to get type enum value from GArrowDataType
new 00994b8 ARROW-920: [GLib] Add Lua examples
new d4a2a75 ARROW-910: [C++] Write 0 length at EOS in StreamWriter
new 6950e45 ARROW-922: Allow Flatbuffers and RapidJSON to be used locally on Windows
new 8013cf3 ARROW-925: [GLib] Fix GArrowBufferReader test
new c9e61cd ARROW-926: Add wesm to KEYS
new da523ce ARROW-928: [C++] Detect supported MSVC versions
new 569426b ARROW-930: javadoc generation fails with java 8
new 02a121f ARROW-927: C++/Python: Add manylinux1 builds to Travis matrix
new f1bd49d ARROW-931: [GLib] Reconstruct input stream
new 0eff217 ARROW-933: [Python] Remove debug print statement
new e794a59 ARROW-936: fix release README
new 32a4d70 ARROW-936: add missing file; revert tag change
new 928b63f ARROW-938: Fix Rat license warnings
new 2c3e111 ARROW-923: Changelog generation Python script, add 0.1.0 and 0.2.0 changelog
new 80b72d4 [maven-release-plugin] prepare for next development iteration
new bcf073c ARROW-945: [GLib] Add a Lua example to show Torch integration
new 9a48773 ARROW-943: [GLib] Support running unit tests with source archive
new ba2880c ARROW-946: [GLib] Use "new" instead of "open" for constructor name
new cc06197 ARROW-948: [GLib] Update C++ header file list
new f63ff08 ARROW-52: Set up project blog, draft 0.3 release posting
new 1a6d135 ARROW-952: fix regex include from C++ standard library
new 316c63d ARROW-824: Date and Time Vectors should reflect timezone-less semantics
new 5af8069 ARROW-866: [Python] Normalize PyErr exc_value to be more predictable
new 995317a ARROW-929: Remove KEYS file from git
new 8febd03 ARROW-953: Use conda-forge cmake, curl in CI toolchain
new c3a122e ARROW-939: fix division by zero if one of the tensor dimensions is zero
new 75ebf5c ARROW-956: [Python] compat with pandas >= 0.20.0
new 959ec47 ARROW-856: Also read compiler info from stdout
new bd36f6f ARROW-899: [Doc] Add 0.3.0 changelog
new 03c242c ARROW-947: [Python] Improve execution time of manylinux1 build
new 20228a2 ARROW-909: Link jemalloc statically if build as external project
new c48f649 ARROW-963: [GLib] Add equal
new d7a2a1e ARROW-958: [Python] Fix conda source build instructions
new cb5e7b6 ARROW-446: [Python] Expand Sphinx documentation for 0.3
new 3d19831 ARROW-967: [GLib] Support initializing array with buffer
new fe945a2 ARROW-965: Website updates for 0.3.0
new af0c21e ARROW-957: [Doc] Add HDFS and Windows documents to doxygen output
new 74ad4a8 ARROW-984: [GLib] Add Go examples
new ccf83f4 ARROW-985: [GLib] Update package information
new 670612e ARROW-954: Flag for compiling Arrow with header-only boost
new 22c738c ARROW-874: [JS] Read dictionary-encoded vectors
new 2d6453b ARROW-940: [JS] Generate multiple artifacts
new 0216145 ARROW-991: [Python] Create new dtype when deserializing from Arrow to NumPy datetime64
new 8856541 ARROW-1001: [GLib] Unify writer files
new 3f22b5f ARROW-1000: [GLib] Move install document to Website
new c10b7d7 ARROW-1003: [C++] Check flag _WIN32 instead of __WIN32
new 84413b0 ARROW-901: [Python] Add Parquet unit test for fixed size binary
new 1c6f3ef ARROW-813: [Python] setup.py sdist must also bundle dependent cmake m…
new a7722dc ARROW-993: [GLib] Add missing error checks in Go examples
new a8338f1 ARROW-995: [Website] Fix a typo
new 010bd22 ARROW-482 [Java] Exposing custom field metadata
new b066660 ARROW-996: [Website] Add 0.3.0 release announce in Japanese
new a4f29f3 ARROW-29: [C++] FindRe2 cmake module
new 05e8f68 ARROW-1010: [Website] Provide for translations without repeating blog post in blogroll
new 95ee96b ARROW-1016: Python: Include C++ headers (optionally) in wheels
new 9e875a6 ARROW-819: Public Cython and C++ API in the style of lxml, arrow::py::import_pyarrow method
new dbbbc66 ARROW-988 [JS] Add entry to Travis CI matrix
new 99ff240 ARROW-1011: [FORMAT] fix typo and mistakes in Layout.md
new 5739e04 ARROW-1008: [C++] Add abstract stream writer and reader C++ APIs. Give clearer names to IPC reader/writer classes
new d8d3d84 ARROW-1022: [Python] Add multithreaded read option to read_feather
new 852ee4f ARROW-1024: Python: Update build time numpy version to 1.10.1
new c7839e9 ARROW-1017: [Python] Fix memory leaks in conversion to pandas.DataFrame
new 393f46a ARROW-1023: Python: Fix bundling of arrow-cpp for macOS
new 37dbddf ARROW-1004: [Python] Add conversions for numpy object arrays with integers and floats
new 0543379 ARROW-1028: [Python] Fix IPC docs per API changes
new edfb2dc ARROW-1027: [Python] Allow negative indexing in fields/columns on pyarrow Table and Schema objects
new ba9348f ARROW-1031: [GLib] Support pretty print
new 4381845 ARROW-1033: [Python] pytest discovers scripts/test_leak.py
new 681afab ARROW-977: [java] Add Timezone aware timestamp vectors
new b23b864 ARROW-1015 [Java] Schema-level metadata
new abbd815 ARROW-1025: [Website] Improved changelog for website, include git shortlog
new 222cbfe ARROW-998: [Format] Clarify that the IPC file footer contains an additional copy of the schema
new 86a9055 ARROW-182: [C++] Factor out Array::Validate into a separate function
new 47e289a ARROW-961: [Python] Rename InMemoryOutputStream to BufferOutputStream
new ce0bb53 ARROW-1002: [C++] Fix inconsistency with padding at start of IPC file format
new 8a8e7bb ARROW-1037: [GLib] Follow reader name change
new 49c5398 ARROW-1038: [GLib] Follow writer name change
new e7e8d61 ARROW-1040: [GLib] Support tensor IO
new bed0197 ARROW-881: [Python] Reconstruct Pandas DataFrame indexes using metadata
new c4086fe ARROW-997: [Java] Implementing transferPair for FixedSizeListVector
new a4f3259 ARROW-1030: Python: Account for library versioning in parquet-cpp
new fee4475 ARROW-1029: [Python] Fixes for building pyarrow with Parquet support on MSVC. Add to appveyor build
new 62a17b7 ARROW-1044: [GLib] Support Feather
new 0eec40a ARROW-1046: [Python] Reconcile pandas metadata spec
new 37cdc6e ARROW-970: [Python] Nicer experience if user accidentally calls pyarrow.Table ctor directly
new ff72951 ARROW-1053: [Python] Remove unnecessary Py_INCREF in PyBuffer causing memory leak
new a8f8ba0 [maven-release-plugin] prepare release apache-arrow-0.4.0
new cf4ef5e Increment version to 0.5.0-SNAPSHOT
new a6e77f4 ARROW-1054: [Python] Fix test failure on pandas 0.19.2, some refactoring
new b06602d ARROW-1049: [java] vector template cleanup
new d2cc199 ARROW-1062: [GLib] Follow API changes in examples
new 84b7ee1 ARROW-1057: Fix cmake warning and msvc debug asserts
new aa652cb ARROW-1060: [Python] Add unit tests for reference counts in memoryview interface
new 33117d9 ARROW-1034: [PYTHON] Resolve wheel build issues on Windows
new 1cb18d5 ARROW-1061: [C++] Harden decimal parsing against invalid strings
new 078357a ARROW-1066: [Python] pandas 0.20.1 deprecation of pd.lib causes a warning on import
new 4e4435e ARROW-424: [C++] Make ReadAt, Write HDFS functions threadsafe
new 8a700cc ARROW-1063: [Website] Updates for 0.4.0 release, release posting
new 03e8b54 ARROW-1069: Add instructions for publishing maven artifacts
new 51b6bf2 ARROW-897: [GLib] Extract CI configuration for GLib
new 530f0da [Doc] Change cpp api doc, std:shared_pointer_cast to std::static_pointer_cast
new 8229688 ARROW-1078: [Python] Account for Apache Parquet shared library consolidation
new 5c155c3 ARROW-1075: [GLib] Fix build error on macOS
new 092afb6 ARROW-990: [JS] Add tslint support for linting TypeScript
new ba97f34 ARROW-1084: Implementations of BufferAllocator should handle Netty's OutOfDirectMemoryError
new 0576ff5 ARROW-1085: [java] Follow up on template cleanup. Missing method for …
new 931a877 ARROW-1070: [C++] Use physical types for Feather date/time types
new a81aefb ARROW-1082: [GLib] Add CI on macOS
new 8f2b44b ARROW-1051: [Python] Opt in to Parquet unit tests to avoid accidental suppression of dynamic linking errors
new a44155d ARROW-986: [Format] Add brief explanation of dictionary batches in IPC.md
new cfaddab ARROW-1050: [C++] Export arrow::ValidateArray
new 316930c ARROW-1056: [Python] Ignore pandas index in parquet+hdfs test
new 4e134e5 ARROW-1091: Decimal scale and precision are flipped
new a367fd4 ARROW-1086: include additional pxd files during package build
new 44dba71 ARROW-1020: [Format] Revise language for Timestamp type in Schema.fbs to avoid possible confusion about tz-naive timestamps
new 1a72acd [Doc] Fix a few links for files moved in ARROW-957
new 5589dda ARROW-1080: C++: Add tutorial about converting to/from row-wise representation
new c3e865d ARROW-1090: Improve build_ext usability with --bundle-arrow-cpp
new 402baa4 ARROW-1092: More Decimal and scale flipped follow-up
new ac54075 ARROW-1088: [Python] Only test unicode filenames if system supports them
new 2a12482 ARROW-1094: [C++] Always truncate buffer read in ReadableFile::Read if actual number of bytes less than request
new 4631543 [maven-release-plugin] prepare release apache-arrow-0.4.1
new 41b58e4 [maven-release-plugin] prepare for next development iteration
new e344066 ARROW-1095: Add Arrow logo PNG to website img folder
new 7a7b0c2 ARROW-1048: Use existing LD_LIBRARY_PATH in source release script to accommodate non-system toolchain libs
new a382034 ARROW-1101: Implement write(TypeHolder) methods in UnionListWriter
new a44d584 ARROW-1102: Make MessageSerializer.serializeMessage() public
new ae6142d ARROW-1107: [JAVA] Fix getField() for NullableMapVector
new 06c26a2 ARROW-1108: [JAVA] Check if ArrowBuf is empty buffer in getActualConsumedMemory() and getPossibleConsumedMemory()
new 0e680f0 ARROW-1109: [JAVA] transferOwnership fails when readerIndex is not 0
new c6cf124 ARROW-1110: [JAVA] make union vector naming consistent
new 11deee6 ARROW-1111: [JAVA] Make aligning buffers optional, and allow -1 for unknown null count
new ac64853 ARROW-1112: [JAVA] Set lastSet for VarLength and List vectors when loading
new 2a2b109 ARROW-742: [C++] std::wstring_convert exceptions handling
new 25ba44c ARROW-460: [C++] JSON read/write for dictionaries
new d25ea63 ARROW-1115: [C++] use CCACHE_FOUND value for ccache path
new 697df1b ARROW-1117: [Docs] Minor issues in GLib README
new d1de66b ARROW-1118: [Site] Website updates for 0.4.1
new 5b66c25 ARROW-1122: [Website] Add turbodbc + arrow blog post
new 3f26dfa ARROW-1096: [C++] CreateFileMapping maximum size calculation issue
new d54bf48 ARROW-1122: [Website] Change timestamp to yield correct Jekyll date
new 1a23419 ARROW-1124: Increase numpy dependency to >=1.10.x
new 5be05ac ARROW-742: [C++] Use gflags from toolchain; Resolve cmake FindGFlags …
new d874d4e ARROW-1081: Fill null_bitmap correctly in TestBase
new b5e8a48 ARROW-1128: [Docs] command to build a wheel is not properly rendered
new 86c67d0 ARROW-1129: [C++] Fix gflags issue in Linux/macOS toolchain builds
new f0f1ca6 ARROW-1138: Travis: Use OpenJDK7 instead of OracleJDK7
new ef579ca ARROW-1123: Make jemalloc the default allocator
new 5e34309 ARROW-1104: Integrate in-memory object store into arrow
new 222628c ARROW-1140: [C++] Allow optional build of plasma
new 608b89e ARROW-1073: C++: Adapative integer builder
new a16c124 ARROW-1137: Python: Ensure Pandas roundtrip of all-None column
new c1ec0c7 ARROW-1039: Python: Remove duplicate column
new 074dde4 ARROW-1143: C++: Fix comparison of NullArray
new e209e58 ARROW-1144: [C++] Remove unused variable
new 6768f52 ARROW-1139: Silence dlmalloc warning on clang-4.0
new 8bf567e ARROW-1136: [C++] Add null checks for invalid streams
new b7befeb ARROW-1132: [Python] Unable to write pandas DataFrame w/MultiIndex containing duplicate values to parquet
new 1514016 ARROW-1146: Add .gitignore for *_generated.h files in src/plasma/format
new 98f7cac ARROW-1142: [C++] Port over compression toolchain and interfaces from parquet-cpp, use Arrow-style error handling
new 73007de ARROW-1147: [C++] Allow optional vendoring of flatbuffers in plasma
new 41524d6 ARROW-1135: [C++] Use clang 4.0 in one of the Linux builds
new f3bcf76 ARROW-1145: [GLib] Add get_values()
new bea30d6 ARROW-1113: [C++] Upgrade to gflags 2.2.0, use tarball instead of git tag
new fc3f8c2 ARROW-1131: [Python] Enable the Parquet unit tests by default if the extension imports
new 5de6eb5 ARROW-978: [Python] - Change python documentation sphinx theme to bootstrap
new ec6e183 ARROW-1151: [C++] Add branch prediction to RETURN_NOT_OK
new bfe15db ARROW-1152: [Cython] read_tensor should work with a readable file
new 3e754a0 ARROW-1155: [Python] Add null check when user improperly instantiates ArrayValue instances
new cb5f2b9 ARROW-1157: C++/Python: Decimal templates are not correctly exported on OSX
new b065228 ARROW-1154: [C++] Import miscellaneous computational utility code from parquet-cpp
new a588938 ARROW-1159: [C++] Use dllimport for visibility when not building Arrow library
new bddb219 ARROW-834: Python Support creating from iterables
new 65558db ARROW-1162: Empty data vector transfer between list vectors should no…
new 6958252 ARROW-1165: [C++] Refactor PythonDecimalToArrowDecimal to not use templates
new af83c45 ARROW-1166: Fix errors in example and missing reference in Layout.md
new 9f500af ARROW-1170: C++: Link to pthread on ARROW_JEMALLOC=OFF
new 456330f ARROW-599: CMake support of LZ4 compression lib
new 930db87 ARROW-1169: [C++] jemalloc externalproject doesn't build with CMake's ninja generator
new c294ec3 ARROW-1125: partial schemas for Table.from_pandas
new 96e7e99 ARROW-960: Add section on how to develop with pip
new e268ce8 ARROW-915: [Python] Struct Array reads limited support
new 9e4906f ARROW-1160: C++: Implement DictionaryBuilder
new 2e5ddfe ARROW-1179: C++: Add missing virtual destructors
new 2c3e8b0 ARROW-692: Integration test data generator for dictionary types
new a6d0c26 ARROW-1180: [GLib] Fix a returning invalid address bug in garrow_tensor_get_dimension_name()
new e18abac ARROW-1181: [Python] Parquet multiindex test should be optional
new cdee23c ARROW-600: ZSTD compression lib support
new 681479d ARROW-1182: C++: Specify BUILD_BYPRODUCTS for zlib and zstd
new e5a08dd ARROW-1098. [Format] modify document mistake
new 7c18ddd ARROW-966: [Python] Also accept Field instance in pyarrow.list_
new edcded3 ARROW-1148: [C++] Raise minimum CMake version to 3.2
new cbbd04b ARROW-1172: [C++] Refactor to use unique_ptr for builders
new 7d86c28 ARROW-693: [Java] Add dictionary support to JSON reader and writer
new 00a7d55 ARROW-1185: [C++] Status class cleanup, warn_unused_result attribute and Clang warning fixes
new 83a4405 ARROW-599: [C++] Lz4 compression codec support
new c398fda ARROW-462: [C++] Implement in-memory conversions between non-nested primitive types and DictionaryArray equivalent
new 3309d12 ARROW-1174: [GLib] Fix ListArray test failure
new b6b876c ARROW-1193: [C++] Support pkg-config for arrow_python.so
new e894532 ARROW-1197: [GLib] Fix a bug that record batch related functions for C++ aren't included
new 7870804 ARROW-1074: Support lists and arrays in pandas DataFrames without explicit schema
new f73c1c3 ARROW-1201: [Python] Incomplete Python types cause a core dump when repr-ing
new cab07c2 ARROW-1202: [C++] Remove semicolons from status macros
new bc16e0e ARROW-1196: [C++] Release, Debug, Toolchain, NMake Generator Appveyor…
new 471a85f ARROW-1168: [Python] pandas metadata may contain "mixed" data types
new ad57ea8 ARROW-1125: Python: Add public C++ API to unwrap PyArrow object
new 8452071 ARROW-1199: [C++] Implement mutable POD struct for Array data
new dbedc8d ARROW-1186: [C++] Add support to build only Parquet dependencies
new e8c09c6 ARROW-1205: C++: Reference to type objects in ArrayLoader may cause segmentation faults
new afb1928 ARROW-1206: [C++] Add finer grained control of compression library support, do not expose symbols which may not be built in compression.h
new f0ecc06 ARROW-1208: [C++] Temporary remove conda's build of zstd from Toolcha…
new 28e06d8 ARROW-1194: [Python] Expose MockOutputStream in pyarrow.
new 74bc873 ARROW-1150: Silence AdaptiveIntBuilder compiler warning on MSVC
new 85892a2 ARROW-1187: Python: Feather: Serialize a DataFrame with None column
new 248a9d8 ARROW-1212: [GLib] Add garrow_binary_array_get_offsets_buffer()
new c7e0995 ARROW-1208: [C++] Install zstd from conda for Toolchain Appveyor buil…
new 8cad26e ARROW-1200: C++: Switch DictionaryBuilder to signed integers
new cb31b8b ARROW-1215: [Python] Generate documentation for class members in API Reference
new bfe3959 ARROW-962: [Python] Add schema attribute to RecordBatchFileReader
new f62db83 ARROW-1100: [Python] Add mode property to NativeFile
new d46b7ea ARROW-992: [Python] Try to set a __version__ in in-place local builds
new 9ff39f3 ARROW-1216: [Python] Fix creating numpy array from arrow buffers on python 2
new 099f61c ARROW-1218: [C++] Fix arrow build if no compression library is used
new bb0a758 ARROW-1214: [Python/C++] Add C++ functionality to more easily handle encapsulated IPC messages, Python bindings
new dc4216f ARROW-575: Python: Auto-detect nested lists and nested numpy arrays in Pandas
new e438e15 ARROW-1217: [GLib] Add GInputStream based arrow::io::RandomAccessFile
new f266f17 ARROW-1220: [C++] Cmake script errors out if lib is not found under *…
new bf01966 [Python] Correct function name in use with pandas documentation
new 50b518a ARROW-1183: [Python] Implement pandas conversions between Time32, Time64 types and datetime.time
new cdf7db9 ARROW-1223: [GLib] Fix function name that returns wrapped object
new d538426 ARROW-1228: [GLib] Fix test file name
new 8644ee1 ARROW-1227: [GLib] Support GOutputStream
new e370174 ARROW-1222: [Python] Raise exception when passing unsupported Python object type to pyarrow.array
new 5fbfd8e ARROW-597: [Python] Add read_pandas convenience to stream and file reader classes. Add some data type docstrings
new b474cac ARROW-1221: [C++] Add run_clang_format.py script, exclusions file. Pin clang-format-3.9
new ea9bc83 ARROW-1229: [GLib] Use "read" instead of "get" for reading record batch
new 0396240 ARROW-1190: [JAVA] Fixing VectorLoader for duplicate field names
new 1541a08 ARROW-1177: [C++] Check for int32 offset overflow in ListBuilder, BinaryBuilder
new b4d34f8 ARROW-1191: [JAVA] Implement getField() method for complex readers
new a1c8b83 ARROW-1079: [Python] Filter out private directories when building Parquet dataset manifest
new 6035d9b ARROW-1233: [C++] Validate libs availability in conda toolchain
new 8152433 ARROW-1188: [Python] Handle Feather case where category values are null type
new a73252d ARROW-1235: [C++] Make operator<< for Array/Status and std::ostream inline
new 362e754 ARROW-1103: [Python] Support read_pandas (with index metadata) on directory of Parquet files
new c5a89b7 ARROW-1120: Support for writing timestamp(ns) to Int96
new fe9c7ef ARROW-1236: Fix lib path in pkg-config file
new 6999dbd ARROW-935: [Java] Build Javadoc and site with OpenJDK8 in Java CI build
new 2c5b412 ARROW-1167: [Python] Support chunking string columns in Table.from_pandas
new 5aa0809 [GLib] Update rat_exclusion_files.txt
new db181d1 ARROW-1244: Exclude C++ Plasma source tree when creating source release
new 62ef2cd [C++] Remove Plasma source tree for 0.5.0 release pending IP Clearance
new e9f76e1 [maven-release-plugin] prepare release apache-arrow-0.5.0
new 9b26ed8 [maven-release-plugin] prepare for next development iteration
new 2c81015 [C++] Restore Plasma source tree after 0.5.0 release
new fabf7fb ARROW-1241: [C++] Appveyor build matrix extended with Visual Studio 2…
new e1b098e ARROW-1240: [JAVA] security: upgrade slf4j to 1.7.25 and logback to 1.2.3
new 457bb07 ARROW-1237: [JAVA] expose the ability to set lastSet
new 05f7058 ARROW-1239: [JAVA] upgrading git-commit-id-plugin
new a94f471 ARROW-1149: [Plasma] Create Cython client library for Plasma
new 6042c48 ARROW-1195: [C++] CpuInfo init with cores number, frequency and cache…
new ecdc86b ARROW-1249: [JAVA] expose fillEmpties from Nullable variable length vectors
new 886e2af ARROW-1259: [Plasma] Speed up plasma tests
new 9e692af ARROW-1245: [Integration] Enable JavaTester in Integration tests
new 11c92bf ARROW-1246: [Format] Draft Flatbuffer metadata description for Map
new 204f148 ARROW-1260: [Plasma] Use factory method to create Python PlasmaClient
new 07b89bf ARROW-1219: [C++] Use Google C++ code formatting
new 08cec90 ARROW-1252: [Website] Updates for 0.5.0 and short blog post summarizing the release
new ed54dce ARROW-1253: [C++/Python] Speed up C++ / Python builds by using conda-forge toolchain for thirdparty libraries
new f90fa49 [Website] Fix link to 0.5.0 post on install page
new e9e17b5 ARROW-1258: [C++] Suppress Clang dlmalloc compiler warnings
new 2eeaa95 ARROW-1248: [Python] Suppress return-type-c-linkage warning in Cython clang builds
new 676a4a9 ARROW-1255: [Plasma] Fix typo in plasma protocol; add DCHECK for ReadXXX in plasma protocol.
new 5708cd1 [Java] Fix some typos in code comments and exception messages
new dca5d96 ARROW-1275: [C++] Deafult Snappy static lib suffix updated to "_static"
new d76e43e ARROW-1268: [WEBSITE] Added blog post for Spark integration toPandas()
new cae3510 ARROW-1274: [C++] Fix CMake >= 3.3 warning. Also add option to suppress ExternalProject output
new 7b3378f ARROW-1204: [C++] Remove WholeProgramOptimization(/GL) compilation fl…
new f72279b ARROW-1288: Fix many license headers to use proper ASF one
new b7639c1 ARROW-1285: [Python] Delete any incomplete file when attempt to write single Parquet file fails
new ff6c6e0 ARROW-1276: enable parquet serialization of empty DataFrames
new 8841bc0 ARROW-1281: [C++/Python] Add Docker setup for testing HDFS IO in C++ and Python
new 33c85cd [Java] Fix letter case in rat plugin config
new 4df2a0b ARROW-1290: [C++] Double buffer size when exceeding capacity in arrow::BufferBuilder as in array builders
new 44855bb ARROW-1273: [Python] Add Parquet read_metadata, read_schema convenience functions
new 3b14765 ARROW-1289: [Python] Add PYARROW_BUILD_PLASMA CMake option, follow semantics of --with-parquet
new 1dd0f5f ARROW-1267: [Java] Handle zero length case in BitVector.splitAndTransfer
new 05af640 ARROW-276: [JAVA] Nullable Vectors should extend BaseValueVector and not Bas…
new ec32617 ARROW-1192: [JAVA] Use buffer slice for splitAndTransfer in List and Union Vectors.
new 5aea3a3 ARROW-1287: [Python] Implement whence argument for pyarrow.NativeFile.seek
new b4e9ba1 ARROW-968: [Python] Support slices in RecordBatch.__getitem__
new ea1b67c ARROW-1294: [C++] Pin cmake=3.8.0 in MSVC toolchain build
new 4108bda ARROW-1291: [Python] Cast non-string DataFrame columns to strings in RecordBatch/Table.from_pandas
new 2288bfc ARROW-1264: [Python] Raise exception in Python instead of aborting if cannot connect to Plasma store
new b4eec62 ARROW-932: [Python] Fix MSVC compiler warnings, build Python with /WX and -Werror in CI
new af2aeaf ARROW-1213: [Python] Support s3fs filesystem for Amazon S3 in ParquetDataset
new 900105a ARROW-187: [C++] Add development style notes to C++ README, note about esoteric exceptions in constructors
new b5ff2f6 ARROW-1251: [C++] Update C++ README to account for toolchain evolution
new 3a84653 ARROW-1265: [Plasma] Clean up all resources on SIGTERM to keep valgrind output clean
new e1d574c ARROW-1301: [C++/Python] More complete filesystem API for HDFS
new b8754eb ARROW-884: [C++] Exclude internal namespaces from generated Doxygen docs
new aa1d753 ARROW-573: [C++/Python] Implement IPC metadata handling for ordered dictionaries, pandas conversions
new e5ed31f ARROW-1093: [Python] Run flake8 in Travis CI. Add note about development to README
new 7e7861c ARROW-1257: Plasma documentation
new e50b6ae ARROW-1308: [C++] Link utility executables to Arrow shared library if ARROW_BUILD_STATIC=off
new b95bed0 ARROW-1303: [C++] Support downloading Boost
new 5917e07 ARROW-1305: [GLib] Add GArrowIntArrayBuilder
new ee928d2 ARROW-1211: [C++] Enable builder classes to automatically use the default memory pool
new 93b51a0 ARROW-1315: [GLib] Add missing status check for arrow::ArrayBuilder::Finish()
new 21a0191 ARROW-1323: [GLib] Add garrow_boolean_array_get_values()
new 84b7a0d ARROW-1312: [C++] Make ARROW_JEMALLOC OFF by default until ARROW-1282 is resolved
new 1874a8b ARROW-1310: [JAVA] revert changes made in ARROW-886
new 3732324 ARROW-1224: [Format] Clarify language around buffer padding and align…
new f775af7 ARROW-1312: [Python] Follow-up: do not use jemalloc in manylinux1 builds
new a388ddf ARROW-1330: [Plasma] Turn on plasma tests on manylinux1
new aa5d417 ARROW-1326: [Python] Fix Sphinx Build in Travis CI, treat Sphinx warnings as errors
new 717bed0 ARROW-1328: [Python] Set correct Arrow type when coercing to milliseconds and passing explicit type
new 3bc7d46 ARROW-1296: [Java] Fix allocationSizeInBytes in FixedValueVectors.res…
new 25439e7 ARROW-1300: [JAVA] Fix Tests for ListVector
new 3200e91 ARROW-1327: [Python] Always release GIL before calling check_status in Cython
new 619472e ARROW-1225: [Python] Decode bytes to utf8 unicode if possible when passing explicit utf8 type to pyarrow.array
new c0acb86 ARROW-1333: [Plasma] Example code for using Plasma to sort a DataFrame
new f9d9833 ARROW-1283: [JAVA] Allow VectorSchemaRoot to close more than once
new 7a4026a ARROW-1304: [Java] Fix Indentation, WhitespaceAround and EmptyLineSeparator checkstyle warnings in Java
new 0b91cad ARROW-622: [Python] Add coerce_timestamps option to parquet.write_table, deprecate timestamps_to_ms argument
new 2015198 ARROW-1263: [C++] Get CPU info on Windows; Resolve patching whitespac…
new 02ab748 ARROW-1336: [C++] Add arrow::schema factory function, simply some awkward constructors
new 66ab6b2 ARROW-1309: [Python] Handle nested lists with all None values in Array.from_pandas
new 03dcce4 ARROW-1173: [Plasma] Add blog post describing Plasma object store
new 939957f ARROW-1335: [C++] Add offset to PrimitiveArray::raw_values to make consistent with other raw_values
new 5281a82 ARROW-1334: [C++] Add alternate Table constructor that takes vector of Array
new 20cee70 ARROW-1338: [Python] Do not close RecordBatchWriter on dealloc in case sink is no longer valid
new 2615b47 ARROW-1306: [C++] Use UTF8 filenames in local file error messages
new 6e26701 ARROW-439: [Python] Add option in "to_pandas" conversions to yield Categorical from String/Binary arrays
new a9c2f19 ARROW-1242: [JAVA] - upgrade jackson to mitigate security vulnerabilities
new 7fdbcc6 ARROW-1243: [JAVA] update all libs to latest versions
new 86154f0 ARROW-1340: [Java] Fix NullableMapVector field metadata
new e44ede8 ARROW-1343: [Java] Aligning serialized schema, end of buffers in RecordBatches
new 2972c9d ARROW-1342: [Python] Support strided ndarrays in pandas conversion from nested lists
new b795e5c ARROW-1240: [JAVA] security: upgrade logback to address CVE-2017-5929 (take 2)
new 2143349 ARROW-1242: [JAVA] - upgrade jackson to mitigate security vulnerabilities (take 2)
new 63954c0 ARROW-1350: [C++] Do not exclude Plasma source tree from source release
new b173334 [maven-release-plugin] prepare release apache-arrow-0.6.0
new 4db732c [maven-release-plugin] prepare for next development iteration
new 6135958 ARROW-1348: [C++/Python] Release verification script for Windows
new 142f74e ARROW-1331: [JAVA] Refactor unit tests
new a2f4323 ARROW-1352: [Integration] Added specific formatting for producer consumer output
new 94b7cfa ARROW-1339: [C++] Use of boost::filesystem::path to handle file paths
new 31457ae ARROW-801: Provide direct access to underlying buffer memory addresses
new c2fb9cb ARROW-1356: [Website] Add new committers
new b78e2ef ARROW-1353: [Website] Update website for 0.6.0 release and add short release blog post
new 4471dc9 [C++] DOC: Fix a typo in plasma.md
new c0fa8e0 [Python] DOC: Fix Parquet docs to use pyarrow.parquet namespace for write_table
new 3c5290a ARROW-1365: [Python] Remove outdated pyarrow.jemalloc_memory_pool example. Update API doc site build instructions
new 4ef7c89 ARROW-1355: [Java] Make Arrow buildable with jdk9
new c9805d6 ARROW-1373: Implement getBuffer() methods for ValueVector
new e1bad9f [C++] Fix a typo in in plasma.md
new 652fd36 ARROW-1366: [Plasma] Define entry point for the plasma store
new 10f7158 ARROW-1372: [Plasma] enable HUGETLB support on Linux to improve plasma put performance
new b50f235 ARROW-759: [Python] Serializing large class of Python objects in Apache Arrow
new de7c671 ARROW-1357: [Python] Account for chunked arrays when converting lists back to pandas form
new 6ad976e ARROW-1375: [C++] Remove dependency on msvc version for Snappy build
new 4e0aa3c ARROW-1387: [C++] Set up GPU leaf library, add unit test module for CUDA tests
new 5303594 ARROW-1395: [C++/Python] Remove APIs deprecated from 0.5.0 onward
new 3c70ff1 ARROW-1384: [C++] Add SerializeRecordBatch API for writing a record batch as an IPC message to a new buffer
new 2c3a5f4 ARROW-1392: [C++] Add GPU IO interfaces for CUDA
new b312697 ARROW-1386: [C++] Unpin CMake version in MSVC toolchain builds
new 750b77d ARROW-786: [Format] In-memory format for 128-bit Decimals, handling of sign bit
new f50f2ea ARROW-1408: [C++] IPC public API cleanup, refactoring. Add SerializeSchema, ReadSchema public APIs
new a3607d2 ARROW-1410: Remove MAP_POPULATE flag when mmapping files in Plasma store.
new ea503a5 ARROW-1379: [Java] adding maven-dependency-plugin and fixing all reported dependency errors
new b36aab5 ARROW-1411: [Python] Booleans in Float Columns cause Segfault
new 26d4e3f ARROW-1399: [C++] Add CUDA build version defines in public headers
new 6cd2a17 ARROW-1419: [GLib] Suppress sign-conversion warnings
new f250d46 ARROW-1416: Clarify memory layout documentation
new 32e2668 ARROW-1414: [GLib] Cast after status check
new c027a03 ARROW-1412: [Plasma] Add higher level API for putting and getting Python objects
new 0728148 ARROW-1364: [C++] IPC support machinery for record batch roundtrips to GPU device memory
new c0beb87 ARROW-1415: [GLib] Support date32 and date64
new 0ced74e ARROW-1363: [C++] Use buffer layout from dictionary index type in IPC messages
new e6456b6 ARROW-1344: [C++] Do not permit writing to closed BufferOutputStream
new a009aab ARROW-1421: [Python] Extend Python serialization API to accept non-list types
new 5cda693 ARROW-1402: [C++] Deprecate APIs which return std::shared_ptr<MutableBuffer> in favor of std::shared_ptr<Buffer>
new 254078e ARROW-1426: [Site] Fix the title of the top page.
new 441f965 ARROW-1418: [Python] Introduce SerializationContext to register custom serialization callbacks
new 59fb8d0 ARROW-1427: [GLib] Add arrow cpp link to readme
new 19bbf12 ARROW-1428: [C++] Append steps to clone source code to README.mb
new 6af2931 ARROW-1302: C++: Set MAKE to make if not defined
new af78293 ARROW-1429: [Python] Open common Parquet metadata file using passed file system
new 7510ae6 ARROW-837: [Python] Add public pyarrow.allocate_buffer API. Rename FixedSizeBufferOutputStream
new d8c651c ARROW-1433: [C++] Simplify Array::Slice to be non-virtual
new f450025 ARROW-1381: [Python] Use FixedSizeBufferWriter in SerializedPyObject.to_buffer
new 6b391f0 ARROW-1432: [C++] Build bundled jemalloc functions with private prefix
new 78c952d ARROW-1351: Update CHANGELOG.md in 00-prepare.sh when creating release candidate
new 9ab0c95 ARROW-1441: [Site] Add Ruby to Flexible section
new c6295f3 ARROW-1438: [Python] Pull serialization context through PlasmaClient put and get
new 8344f28 ARROW-1434: [Python] Support datetime64[D] numpy arrays
new 28553b4 ARROW-1443: [Java] Fixed a small bug on ArrowBuf.setBytes with unsliced ByteBuffers
new 75d1f61 ARROW-1413: [C++] Add include-what-you-use configuration
new 4956e90 ARROW-1407: Fix bug where DictionaryEncoder can only encode vector le…
new 848a0f7 ARROW-1444: [JAVA] fix last byte copy in BitVector splitAndTransfer
new 1a8dafb ARROW-1447: [C++] Fix many include-what-you-use warnings
new ebca1af ARROW-1452: [C++] Make macros in arrow/util/macros.h more unique
new 016c45a ARROW-1450: [Python] Raise proper error if custom serialization handler fails
new 08b41f9 ARROW-1238: [Java] Adding Decimal type JSON read and write support
new 6f2fd8d ARROW-1449: Implement Decimal using only Int128
new 9968d95 ARROW-1400: [Python] Adding parquet.write_to_dataset() method for writing partitioned .parquet files
new e5aeb90 ARROW-1383: [C++] Add vector append variant to primitive array builders that accepts std::vector<bool>
new cc30512 ARROW-1390: [Python] Add more serialization tests
new f391950 ARROW-1452: [C++] Restore DISALLOW_COPY_AND_ASSIGN usages removed in ARROW-1452 patch
new 0e0da74 ARROW-1460: [C++] Pin clang-format at LLVM 4.0
new b1e56a2 ARROW-1453: [C++/Python] Support non-contiguous Tensors in WriteTensor
new ec32013 ARROW-1417: [Python] Allow more generic filesystem objects to be passed to ParquetDataset
new 338a187 ARROW-1461: [C++] Disable builds using LLVM apt repo until installation issues resolved
new ab328ec ARROW-1401: [C++] Add ARROW_EXTRA_ERROR_CONTEXT option
new df0ca4e ARROW-1461: [C++] Restore LLVM apt usage
new 3283d2a ARROW-1458: [Python] Document that create_parents=False is unsupported in HadoopFileSystem
new 1ba19c2 ARROW-1401: [C++] Add note to readme about ARROW_EXTRA_ERROR_CONTEXT
new bea5ab0 ARROW-1442: [Website] Add note about nightly builds to /install
new 2433289 ARROW-696: [C++] Support decimals in IPC and JSON reader/writer to enable integration tests
new 2660dda ARROW-1377: [Python] Add ParquetFile.scan_contents function to use for benchmarking
new f355354 ARROW-1317: [Python] Attempt to set Hadoop CLASSPATH when using JNI
new 9634f4b ARROW-1462: [GLib] Add GArrowTime32Array and GArrowTime64Array
new e9f3a12 ARROW-695: Add decimal integration test.
new a3514a3 ARROW-1396: [C++] Add PrettyPrint for schemas that outputs dictionaries
new 5abd123 ARROW-1468: [C++] Add primitive Append variants that accept std::vector<T>
new 54d624b ARROW-1405: [Python] Expose LoggingMemoryPool in Python API
new 3d285c6 ARROW-1451: [C++] Add public API file for IO section in arrow/io/api.h
new d9f895e ARROW-1446: [Python] Add (very slow) large memory unit test for int32 overflow in PARQUET-1090
new 3033eac ARROW-1466: [C++] Implement PrettyPrint for DecimalArray
new 6e5f7be ARROW-1435: [Python] Properly handle time zone metadata in Parquet round trips
new a9a80fe ARROW-941: Add "cold start" instructions for running integration tests
new b0b125f ARROW-229: [C++] Implement cast functions for numeric types, booleans
new 3f2fa03 ARROW-1430: [Python] Python CI build outside of a bash function scope, enable flake8 to fail build
new b698227 ARROW-622 [Python] deprecate timestamps_to_ms in .from_pandas()
new 6f27a64 ARROW-1467: [JAVA] Fix reset() and allocateNew() in Nullable Value Vectors t…
new fe45c2b ARROW-1490: [Java] Allow failures for JDK9 for now
new de2edc8 ARROW-1156: [C++/Python] Expand casting API, add UnaryKernel callable. Use Cast in appropriate places when converting from pandas
new 11ebe93 ARROW-1494: [C++] Improve doxygen comments in arrow/table.h, note that RecordBatch::column returns new object
new 5aca7b6 ARROW-1480: [Python] Improve performance of serializing sets
new 8911a34 ARROW-1493: [C++] Flush stream in PrettyPrint functions
new 0c8853f ARROW-1479: [JS] Expand JavaScript implementation
new 8b70ab0 ARROW-1507: [C++] Include arrow/array.h for arrow::internal::ArrayData
new 875be96 ARROW-1504: [GLib] Add GArrowTimestampArray
new 1706aab ARROW-1505: [GLib] Simplify arguments check
new 675e251 ARROW-1506: [C++] Add .pc for compute modules
new b5d510f ARROW-1358: Update sha{1, 256, 512} checksums per latest ASF release policy
new 947ca87 ARROW-594: [C++/Python] Write arrow::Table to stream and file writers
new 4a6a6cb ARROW-1359: [C++] Add flavor='spark' option to write_parquet that sanitizes schema field names
new f211463 ARROW-1514: [C++] Fix a typo in document
new ce623a9 ARROW-1512: [C++] Fix API change in documentation
new c1492bf ARROW-1510: [GLib] Support cast
new 0ac81b5 ARROW-1515: [GLib] Detect version directly
new a534e5d ARROW-1516: [GLib] Update document
new 840b687 ARROW-1517: Remove unnecessary temporary in DecimalUtil::ToString function
new a45b047 ARROW-1307: [Python] Expand IPC section to include object serialization, Feather format. Add Feather functions to API listing
new 94f6247 ARROW-1508: C++: Add support for FixedSizeBinaryType in DictionaryBuilder
new 7e95764 ARROW-1513: C++: Add cast from Dictionary to plain arrays
new 9ede7fb ARROW-1527: Fix Travis CI JDK9 build
new b590c24 ARROW-1519: [C++] Move DecimalUtil functions to methods on the Int128 class
new 18388d6 ARROW-1331: [JAVA] include package statement
new 4c008c2 ARROW-1511: [C++] Promote ArrayData, MakeArray to public API, deprecate MakePrimitiveArray
new 6a020a1 ARROW-1459: [Python] Use list values length to advance offset when reconstructing array of ndarrays
new b77af1a ARROW-1528: [GLib] Resolve recursive include dependency
new 3fbf760 ARROW-1530: [C++] Install arrow/util/parallel.h
new cf1ac9c ARROW-1531: [C++] Return ToBytes by value from Decimal128
new 576fb66 ARROW-1532: [Python] Referencing an Empty Schema causes a SegFault
new b671dcc [Release] Update CHANGELOG.md for 0.7.0
new 97f9029 [maven-release-plugin] prepare release apache-arrow-0.7.0
new d0b0b74 ARROW-1529: [GLib] Use Xcode 8.3 on Travis CI
new 96d451f ARROW-1537: [C++] Support building with full path install_name on macOS
new 72ad07e ARROW-1542: [C++] Install packages in temporary directory in MSVC build verification script
new 59b24ba ARROW-559: Add release verification script for Linux
new bf73d27 ARROW-1545: Remove deprecated args of builder
new bfe6579 ARROW-1546: [GLib] Support GLib 2.40 again
new e093772 ARROW-1544: [JS] Export Vector types
new b635d4c Remove garbage ")"
new 63e7966 ARROW-1464: [GLib] Add "Common build problems" section into the README.md of c_glib
new 4a65fea ARROW-1548: [GLib] Support bulk append in builder
new e1d9c7f ARROW-1550: [Python] Explicitly close owned file handles in ParquetWriter.close to avoid Windows flakiness
new 0d5e699 ARROW-1551: [Website] Website updates, blog post for 0.7.0
new b448f66 ARROW-1550: [Python] Followup: fix flake8 warning
new aebc412 ARROW-1551: [Website] Add 0.7.0 changelog
new 2706b7f ARROW-1533: [JAVA] realloc should consider the existing buffer capacity for computing target memory requirement
new d4685f4 ARROW-1547: [JAVA] Fix 8x memory over-allocation in BitVector
new c4f5a12 ARROW-1536:[C++] Do not transitively depend on libboost_system
new 2551050 ARROW-1554: [Python] Update Sphinx install page to note that VC14 runtime may need to be installed on Windows
new 903d03b ARROW-1553: [JAVA] Implement setInitialCapacity for MapWriter
new 9997a1a ARROW-1557 [Python] Validate names length in Table.from_arrays
new 975f32b ARROW-1497: [Java] Fix JsonReader to initialize count correctly
new 203fb63 ARROW-1500: [C++] Do not ignore return value from truncate in MemoryMa…
new d154c10 ARROW-1578: [C++] Run lint checks in Travis CI much earlier at before_script stage to fail faster
new c470c9c ARROW-1591: C++: Xcode 9 is not correctly detected
new cfcee74 ARROW-1347: [JAVA] Return consistent child field name for List Vectors
new 8fd73b4 ARROW-1595: [Python] Fix package dependency resolution issue causing broken builds
new 8996a4f ARROW-1590: [JS] Flow TS Table method generics
new c0a5019 ARROW-1592: [GLib] Add GArrowUIntArrayBuilder
new b41a4ee ARROW-1598: [C++] Fix diverged code comment in plasma tutorial
new 096b877 ARROW-1601: [C++] Do not read extra byte from validity bitmap, add internal::BitmapReader in lieu of macros
new 39e487c ARROW-1608: Support Release verification script on macOS
new b640cc0 ARROW-1610: C++/Python: Only call python-prefix if the default PYTHON_LIBRARY is not present
new 5da6b87 ARROW-1606: [Python] Copy .lib files in addition to .dll when bundling libraries for Windows
new f9d1e1b ARROW-1611: [C++] Add BitmapWriter, do not perform out of bounds read in BitmapReader when length is 0
new 808a143 ARROW-1612:[GLib] Update readme for mac os
new 3a53f93 [Release] Update version to 0.7.1-SNAPSHOT
new fbabde5 [Release] Update CHANGELOG.md for 0.7.1
new 0e21f84 [maven-release-plugin] prepare release apache-arrow-0.7.1
new 686a8f7 ARROW-1607: [C++] Implement DictionaryBuilder for Decimals
new c358154 ARROW-1609: [Plasma] Xcode 9 compilation workaround
new bdfa65e ARROW-1620: Python: Download Boost in manylinux1 build from bintray
new ac997fb ARROW-1618: [JAVA] Reduce Heap Usage (Phase 1)
new 545496c ARROW-1619: [Java] Set lastSet in JsonFileReader
new 7045b42 ARROW-1615 Added BUILD_WARNING_LEVEL and BUILD_WARNING_FLAGS to Setup…
new a03e093 ARROW-1600: [C++] Add Buffer constructor that wraps std::string
new ccbf644 ARROW-838: [Python] Expand pyarrow.array to handle NumPy arrays not originating in pandas
new cc3b27c ARROW-1626 Add make targets to run the inter-procedural static analys…
new 9aa6eb5 ARROW-1624: [C++] Fix build on LLVM 4.0, remove some clang warning suppressions
new 811e668 ARROW-1629: [C++] Add miscellaneous DCHECKs and minor changes based on infer tool output
new af167fd [Python] Update README.md to reflect that wheels are available on all platforms
new c905783 ARROW-1625: [Serialization] Support OrderedDict and defaultdict serialization
new 82eea49 [Java] Update pom versions to 0.8.0-SNAPSHOT
new 988338c ARROW-1634: [Website] Add release page for 0.7.1, update front page
new ff39cb5 ARROW-1640: Fix HTTPS failures in cmake / libcurl caused by ca-certificates clash
new 8ceee56 ARROW-1543: [C++] Correct C++ tutorial to use std::unique_ptr instead of std::shared_ptr
new 67c6317 ARROW-950: [Website] Add Google Analytics tag to site
new 592c4e8 [Website] jekyll must be run with JEKYLL_ENV=production
new 87fc577 ARROW-1584: [C++/Python] Support Null type in IPC round trips, fix serialize_pandas on empty DataFrame
new 31d33e0 [Website] Update website with new committers
new dc129d6 ARROW-1627: New class to handle collection of BufferLedger(s) within …
new 81319d9 ARROW-1647: [Plasma] Make sure to read length header as int64_t instead of size_t.
new 8b5b22b ARROW-1525: [C++] New compare functions that return boolean instead of Status
new 0f819fa ARROW-1603: [C++] Add BinaryArray::GetString helper method
new 7511cfd ARROW-226: [C++] If opening an HDFS file fails and it does not exist, say so to help with debugging
new f0873a9 ARROW-942: Support running integration tests with both Python 2.7 and 3.6
new 9805ada ARROW-1633: [Python] Support NumPy string and unicode types in pyarrow.array, Array.from_pandas
new ab6aa9a ARROW-1486: [C++] Make Column, RecordBatch, and Table non-copyable
new 909a6f6 ARROW-1616: [Python] Add unit test for RecordBatchWriter.write dispatching to write_table or write_batch
new bd73166 ARROW-1526: [Python] Add unit test for fix in PARQUET-1100
new 51905e5 ARROW-1498: Add CONTRIBUTING.md to .github special directory
new bea3495 ARROW-1539: [C++] Remove APIs deprecated as of 0.7.0 or prior releases
new eaa9538 ARROW-1649: C++: Print number of nulls in PrettyPrint for NullArray
new 3ae4355 ARROW-1653: [Plasma] Use static cast to avoid compiler warning.
new f8cdafa ARROW-1541: [C++] Fix race conditions in arrow_gpu with generated Flatbuffers files. Do not put generated files in source tree
new ac1b66d ARROW-1540: Add NO_VALGRIND option to ADD_ARROW_TEST and disable valgrind in a few problematic tests
new 898f5e2 ARROW-1602: [C++] Add IsValid method to pair with IsNull
new 0a4c5b1 ARROW-1226: [C++] Docs cleaning in arrow/ipc. Doxyfile fixes, move ipc/metadata-internal.h symbols to internal NS
new 8309556 ARROW-1556: [C++] Move verbose AssertArraysEqual function used in PARQUET-1100 into arrow/test-util.h
new b29b065 ARROW-1641: [C++] Hide std::mutex from public headers
new eaeb5d4 ARROW-1250: [Python] Add pyarrow.types module with useful type checking functions
new e31c2e3 ARROW-1585/ARROW-1586: [PYTHON] serialize_pandas roundtrip loses columns name
new 208e798 ARROW-1594: [Python] Multithreaded conversions to Arrow in from_pandas
new 33d446d ARROW-1656: [C++] Endianness Macro is Incorrect on Windows And Mac
new 81a0e67 ARROW-1657: [C++] Multithreaded Read Test Failing on Arch Linux
new a0555c0 ARROW-1535: [Python] Enable sdist tarballs to be installed
new bf2e3ab ARROW-1593: [Python] Pass through preserve_index to RecordBatch.from_pandas in serialize_pandas
new 166f0a8 ARROW-1635: Add release management guide
new ee78cdc ARROW-1503: [Python] Add default serialization context, callbacks for pandas.Series/DataFrame
new 4cb3e97 ARROW-1662: Move to using Homebrew/bundle and Brewfile
new 60cb1c3 ARROW-905 [Docs] Dockerize document generation
new 0d1e69c ARROW-1630: [Serialization] Support Python datetime objects
new 434df8a ARROW-1488: [C++] Implement ArrayBuilder::Finish in terms of FinishInternal based on ArrayData
new 47e6ff6 ARROW-1665: [Serialization] Support more custom datatypes in the default serialization context
new dc53321 ARROW-1648: C++: Add cast from Dictionary[NullType] to NullType
new 894f740 ARROW-1670: [Serialization] Speed up deserialization by getting rid of smart pointer overhead
new a6a97a9 ARROW-1631 [C++] Add GRPC to ThirdpartyToolchain
new e39b479 ARROW-1667: [GLib] Support Meson
new 1571fb4 ARROW-1677: [Blog] Post on ray and arrow serialization
new 2f2a0c1 ARROW-1676: [C++] Only pad null bitmap up to a factor of 8 bytes in Feather format
new 1926bdc ARROW-1613: [Java] Alternative ArrowReader close to free resources but leave ReadChannel open
new 8eb2b0e ARROW-1679: [GLib] Add garrow_record_batch_reader_read_next()
new a043018 ARROW-1678: [Python] Implement numpy.float16 SerDe
new a4813bd ARROW-1685: [GLib] Add GArrowTableBatchReader
new 298e343 ARROW-1673: [Python] Add support for numpy 'bool' type
new d7bf5f2 ARROW-1690: [GLib] Add garrow_array_is_valid()
new a8f5185 ARROW-1666: [GLib] Enable gtk-doc on Travis CI Mac environment
new 971e99d ARROW-1695: [Serialization] Fix reference counting of numpy arrays created in custom serializer
new deaa0cf ARROW-1686: [Docs] rsync contents of apidocs directory into site java directory
new 989aba6 ARROW-1698: [JS] File reader attempts to load the same dictionary batch more than once
new 2ee900c ARROW-1702: Update jemalloc in manylinux1 build
new 3549fa8 ARROW-1683: [Python] Restore TimestampType to pyarrow namespace
new 9d12c7c ARROW-1697: [GitHub] Add ISSUE_TEMPLATE.md
new 05788d0 ARROW-1701: [Serialization] Support zero copy PyTorch Tensor serialization
new fbeaeea ARROW-1704: [GLib] Fix Go example failure
new 9ee5508 ARROW-1522: [Python] Zero copy buffer deserialization
new 61d8a76 ARROW-641: [C++] Do not build io-hdfs-test if ARROW_HDFS is off
new 53dd0c8 ARROW-1087: [Python] Add pyarrow.get_include function. Bundle includes in all builds
new f40618d ARROW-1671: [C++] Deprecate arrow::MakeArray that returns Status, refactor existing code to new variant
new 6209489 ARROW-1708: [JS] Fix linter error
new 8eb2a1b ARROW-1707: Update dev README after movement to GitBox
new 4eb38a2 ARROW-571: [Python] Add unit test for incremental Parquet file building, improve docs
new 2b77b7c ARROW-507: [C++] Complete ListArray::FromArrays implementation, add unit tests
new 8e00ee9 ARROW-1114: [C++] Add simple RecordBatchBuilder class
new 935a3cf ARROW-1654: [Python] Implement pickling for DataType, Field, Schema
new e876e17 ARROW-1720: [Python] Implement bounds check in chunk getter
new b08f7e3 ARROW-1711: [Python] Fix flake8 calls to lint the right directories
new ecb7605 ARROW-1134: [C++] Support for C++/CLI compilation, add NULLPTR define to avoid using nullptr in public headers
new b2596f6 ARROW-1588: [C++/Format] Harden Decimal Format
new 8148b6d ARROW-1726: [GLib] Add setup description to verify C GLib build
new 54d5c81 ARROW-1484: [C++/Python] Implement casts between date, time, timestamp units
new 48a6ff8 ARROW-1721: [Python] Implement null-mask check in places where it isn't supported in numpy_to_arrow.cc
new 238881f ARROW-1675: [Python] Use RecordBatch.from_pandas in Feather write path
new c30a7e3 ARROW-1732: [Python] Permit creating record batches with no columns, test pandas roundtrips
new 6b16cca ARROW-1689: [Python] Allow user to request no data copies
new 7abaa00 ARROW-587: Add fix version to PR merge tool
new a385e2b ARROW-1739: [Python] Fix broken build due to using unittest.TestCase methods
new 3596a43 ARROW-1737: [GLib] Use G_DECLARE_DERIVABLE_TYPE
new 2e04089 ARROW-1736: [GLib] Add GArrowCastOptions:allow-time-truncate
new 59030fe ARROW-1730, ARROW-1738: [Python] Fix wrong datetime conversion
new 2ed886e ARROW-1723: [C++] add ARROW_STATIC to mark static libs on Windows
new 4db0046 ARROW-1555 [Python] Implement Dask exists function
new 2eb78b0 ARROW-1728: [C++] Run clang-format checks in Travis CI
new cc03a45 ARROW-1745: [Plasma] Include gtest after plasma/compat.h in tests.
new 74a934a ARROW-1689: [Python] Implement zero-copy conversions for DictionaryArray
new b221a2c ARROW-1751: [Python] Pandas 0.21.0 introduces a breaking API change for MultiIndex construction
new f257b00 ARROW-1746: [Python] Add build dependencies for Arch Linux
new ec22228 ARROW-1747: [C++] Don't export symbols of statically linked libraries
new 1d36dd2 ARROW-1748: [GLib] Add GArrowRecordBatchBuilder
new 30158ad ARROW-1718: [C++/Python] Implement casts from timestamp to date32/64, properly handle NumPy datetime64[D] -> date32
new 39243ff ARROW-1409: [Format] Remove page id from Buffer metadata, increment metadata version number
new 72b50bc [C++] Fix clang-format failure from ARROW-1409
new 0880550 ARROW-1754: [Python] Fix buggy Parquet roundtrip when an index name is the same as a column name
new eca9924 ARROW-1658: [Python] Add boundschecking of dictionary indices when creating CategoricalBlock
new 9dc4c58 ARROW-1753: [Python] Provide for matching subclasses with register_type in serialization context
new 142e6ee ARROW-1455 [Python] Add Dockerfile for validating Dask integration
new 0373541 ARROW-1766: [GLib] Fix failing builds on OSX
new 527af63 ARROW-1652: [JS] housekeeping, vector cleanup
new 82cd6e5 ARROW-1764: [Python] Add -c conda-forge for Windows dev installation instructions
new 5d66576 ARROW-1727: [Format] Expand Arrow streaming format to permit deltas / additions to existing dictionaries
new b9a2ce9 ARROW-1765: [Doc] Use dependencies from conda in C++ docker build
new fc7104f ARROW-1742: C++: clang-format is not detected correct on OSX anymore
new 62190d7 ARROW-1756: [Python] Fix large file read/write error
new b513c8d ARROW-1762: [C++] Add note to readme about need to set LC_ALL on some Linux systems
new ea4a8f5 ARROW-1714: [Python] Fix invalid serialization/deserialization None name Series
new 1ee73ef ARROW-1770: [GLib] Fix GLib compiler warning
new d7f1398 ARROW-1749: [C++] Handle range of Decimal128 values that require 39 digits to be displayed
new b25b243 ARROW-1663: [Java] use consistent name for null and not-null in FixedSizeLis…
new 9721930 ARROW-480: [Python] Implement RowGroupMetaData.ColumnChunk
new 0106f53 ARROW-1750: [C++] Remove the need for arrow/util/random.h
new 99ea353 ARROW-1771: [C++] ARROW-1749 Breaks Public API test in parquet-cpp
new 3995eb3 ARROW-1768: [Python] Fix suppressed exception in ParquetWriter.__del__
new e631119 [Format] Fix link to Flatbuffers project in IPC.md
new 3188d70 ARROW-1716: [Format/JSON] Use string integer value for Decimals in JSON
new bfc0f24 ARROW-1776: [C++] Define arrow::gpu::CudaContext::bytes_allocated()
new 252a2a5 [GLib] Fix a typo in document
new 78872a1 ARROW-1775: Ability to abort created but unsealed Plasma objects
new dffa486 ARROW-1709: [C++] Decimal.ToString is incorrect for negative scale
new 65a9055 ARROW-972: UnionArray in pyarrow
new ed8aef2 ARROW-1793: fix a typo for README.md
new 2d34f34 ARROW-1788 Fix Plasma store abort bug on client disconnection
new 7c205b0 ARROW-1787: [Python] Support reading parquet files into DataFrames in a backward compatible way
new 21112f8 ARROW-1800: [C++] Fix and simplify random_decimals
new 357eedc ARROW-1781: Don't use brew when using the toolchain
new 550a39f ARROW-1801: [Docs] Update install instructions to use red-data-tools repos
new 7adadd8 ARROW-1763: [Python] Implement __hash__ for DataType
new e8331f4 ARROW-1794: [C++/Python] Rename DecimalArray to Decimal128Array
new 4a33bad ARROW-1767: [C++] Support file reads and writes over 2GB on Windows
new 6f8e287 ARROW-1743: [Python] Avoid non-array writeable-flag check
new 8f2d152 ARROW-1802: [GLib] Support arrow-gpu
new b18bbeb ARROW-1371: [Website] Add "Powered By" page to the website
new e3db5da ARROW-1806: [GLib] Add garrow_record_batch_writer_write_table()
new 9fb806c ARROW-1811: [C++/Python] Rename all Decimal based APIs to Decimal128
new 7255460 ARROW-1810: [Plasma] Remove unused Plasma test shell scripts
new 1d951b5 ARROW-1809: [GLib] Use .xml instead of .sgml for GTK-Doc main file
new 9812aea ARROW-1812: [C++] Plasma store modifies hash table while iterating during client disconnect
new 42353ba ARROW-1473: ValueVector new hierarchy prototype (implementation phase 1)
new 9ee838a ARROW-1474:[JAVA] ValueVector hierarchy (Implementation Phase 2)
new 5bea983 ARROW-1717: [Java] Refactor JsonReader for new class hierarchy and fix integration tests
new 837150e ARROW-1476: [JAVA] Implement Final ValueVector Updates
new ca3acdc ARROW-1821: [INTEGRATION] Add integration test case for when Field has zero null count and optional validity buffer
new ac26eb7 ARROW-1829: [Plasma] Fixes to eviction policy.
new cacbacd ARROW-1795: [Plasma] Create flag to make Plasma store use a single memory-mapped file.
new f2806fa ARROW-1559: [C++] Add Unique kernel and refactor DictionaryBuilder to be a stateful kernel
new eb7be48 ARROW-1805: [Python] Ignore special private files when traversing ParquetDataset
new 202e650 ARROW-1791: Limit generated data range to physical limits for temporal types
new 952ec05 ARROW-1773: [C++] Add casts from date/time types to compatible signed integers
new 37214ef ARROW-1827: [Java] Add checkstyle file and license template
new 9f9dc5b ARROW-1575: [Python] Add tests for pyarrow.column factory function
new d92735e ARROW-1834: [Doc] Build documentation in separate build folders
new b3a3a74 ARROW-1693: [JS] Expand JavaScript implementation, build system, fix integration tests
new cb5da9c ARROW-1778: [Python] Link parquet-cpp statically, privately in manylinux1 wheels
new 284e6c9 ARROW-1826: [JAVA] Avoid branching in copyFrom for fixed width scalars
new d887d91 ARROW-1830: [Python] Relax restriction that Parquet files in a dataset end in .parq or .parquet
new e98adc3 ARROW-1840: [Website] The installation command failed on Windows10 anaconda envir…
new c436376 ARROW-1838: [C++] Conform kernel API to use Datum for input and output
new cac0912 ARROW-1841: [JS] Update text-encoding-utf-8 and tslib for node ESModules support
new 15ed080 ARROW-1703: [C++] Vendor exact version of jemalloc we depend on
new 3fb1491 ARROW-1268: [SITE][FOLLOWUP] Update Spark Post to Reflect Conf Change
new fc4e2c3 ARROW-1808: [C++] Make RecordBatch, Table virtual interfaces for column access
new 1516306 ARROW-1047: [Java] Add Generic Reader Interface for Stream Format
new 9b2dc77 ARROW-1845: [Python] Expose Decimal128Type
new dda2d34 ARROW-1828: [C++] Hash kernel specialization for BooleanType
new 1524ed7 ARROW-1782: [Python] Add pyarrow.compress, decompress APIs
new ea0fb37 ARROW-1577: [JS] add ASF release scripts
new 6ec4f34 ARROW-1047: [Java] [FollowUp] Change ArrowMagic to be non-public class
new ac4bb69 ARROW-1852: [C++] Make retrieval of Plasma manager fd a const operation
new 05bfb26 ARROW-1849: [GLib] Add input checks to GArrowRecordBatch
new aaa0443 ARROW-1855: [GLib] Add workaround for build failure on macOS
new 82e42c5 ARROW-1777: [C++] Add ArrayData::Make static ctor for more convenient construction
new b20beff ARROW-1836: [C++] Remove deprecated static_visitor struct to avoid msvc C4996 warning
new ad82c9a ARROW-1853: [Plasma] Fix off-by-one error in retry processing
new bf1cf3b [Python] Add more detail to development docs (#1356)
new ebb6c76 ARROW-1859: [GLib] Add GArrowDictionaryDataType
new 85e2d89 ARROW-1758: [Python] Remove pickle=True option for object serialization
new 42fc57b ARROW-1178: [C++/Python] Add option to set chunksize in TableBatchReader, Table.to_batches method
new 6176350 [Release] Apache Arrow JavaScript 0.2.0
new 682e248 ARROW-1850: [C++] Use void* / const void* for buffers in file APIs
new b19e183 ARROW-1783: [Python] Provide a "component" dict representation of a serialized Python object with minimal allocation
new a75325a ARROW-1710: [Java] Remove Non-Nullable Vectors
new ffb37db ARROW-1735: [C++] Test CastKernel writing into output array with non-zero offset
new 155bf07 ARROW-1854: [Python] Use pickle to serialize numpy arrays of objects.
new b92c435 ARROW-1684: [Python] Support selecting nested Parquet fields by any path prefix
new bbbbbfb ARROW-1844: [C++] Add initial Unique benchmarks for int64, variable-length strings
new ff8efbf ARROW-1869: [JAVA] Fix LowCostIdentityHashMap name
new 1fd3457 ARROW-1862: [GLib] Add GArrowDictionaryArray
new 705d842 ARROW-1874: [GLib] Add garrow_array_unique()
new ad9105e ARROW-1817: [Java] Configure JsonReader to read floating point NaN values
new 3bed4eb ARROW-1865: [C++] Do not alter number of rows attribute when removing last column from Table
new ff4e407 ARROW-1878: [GLib] Add garrow_array_dictionary_encode()
new 3ee75a0 ARROW-1719: [Java] Remove accessor and mutator interface
new 3b438bc ARROW-1755: [C++] CMake option to link msvc crt statically
new 5dbfe30 ARROW-1863: [Python] PyObjectStringify could render bytes-like output for more types of objects
new 8f2dbee ARROW-1879: [Python] Dask integration tests are not skipped if dask is not installed
new fe6f60c ARROW-1885: [Java] Restore MapVector class names prior to ARROW-1710
new 611a4b9 ARROW-1785: [Format/C++/Java] Remove VectorLayout from serialized schemas
new b241eb6 ARROW-1839/ARROW-1871: [C++/Python] Add Decimal Parquet Read/Write Tests
new d64947e ARROW-1881: Ignore JS tags for Python packages
new 33b628a ARROW-1882: [C++] Reintroduce DictionaryBuilder
new 665da71 ARROW-1890: [Python] Fix mask handling for Date32 NumPy conversions
new aee1a0b ARROW-1889: [Python] --exclude is not available in older git versions
new 712b9d2 ARROW-1784: [Python] Enable zero-copy serialization, deserialization of pandas.DataFrame via components
new aad0782 ARROW-1857: [Python] Add switch for boost linkage with static parquet in wheels
new fb80013 ARROW-1892: [Python] Support binaries in lists
new 1d519d8 ARROW-1487: [C++] Implement casts from List<A> to List<B>, where a cast function is defined from any A to B
new ad454e8 ARROW-1877: [Java] Fix incorrect equals method in JsonStringArrayList
new 7ed4ca7 ARROW-1867: [Java] Add missing methods to BitVector from legacy vector class
new ad30138 ARROW-1891: [Python] Always use NumPy NaT sentinels to mark nulls when converting to array
new aaa978b ARROW-1893: [Python] Convert memoryview to bytes when loading from pickle in Python 2.7
new 64af87b ARROW-1884: [C++] Exclude integration test JSON reader/writer classes from public API
new 2994815 ARROW-1873: [Python] Catch more possible Python/OOM errors in to_pandas conversion path
new 543808d ARROW-1864: [Java] Upgrade Netty to 4.1.17
new fe33155 ARROW-1901: [Python] Support recursive mkdir for DaskFilesystem
new fcf46d9 ARROW-1902: [Python] Remove mkdir race condition from write_to_dataset
new 74b2ebe ARROW-1908: [Python] Construction of arrow table from pandas DataFrame with duplicate column names crashes
new 501d60e ARROW-1895/ARROW-1897: [Python] Add field_name to pandas index metadata
new 84f18c5 ARROW-1910: [C++] Use c_glib Brewfile in README for installing dependencies on macOS (#1407)
new 97678c1 ARROW-1883: [Python] Fix handling of metadata in to_pandas when not all columns are present
new 20bbbcb ARROW-1906: [Python] Do not override user-supplied type in pyarrow.array when converting DatetimeTZ pandas data
new abb9341 ARROW-1905: [Python] Add more comprehensive list of exact type checking functions to pyarrow.types
new 6a7ce32 ARROW-1904: [C++] Deprecate PrimitiveArray::raw_values
new 0095c4e ARROW-1911: [JS] Add Graphistry to Arrow JS proof points
new 16ddb12 ARROW-1914: [C++] Fix build dependency for GPU support build
new 4f1bc97 ARROW-1916: [Java] Include java/dev/checkstyle in git archive for source releases
new 8d006dc ARROW-1915: [Python] Add missing parquet decorator to decimal tests
new e4770b2 ARROW-1917: Fixes to enable verify-release-candidate.sh to work for 0.8.0
new 3afb8f4 [Release] Update CHANGELOG.md for 0.8.0
new 1d689e5 [maven-release-plugin] prepare release apache-arrow-0.8.0
new 5afae14 [maven-release-plugin] prepare for next development iteration
new 9e262f4 ARROW-1924: [Python] Bring back pickle=True option for serialization
new 72e660a ARROW-1933: [GLib] Fix build error with --with-arrow-cpp-build-dir
new db72bfe ARROW-1932: [Website] Update site for 0.8.0 release info
new 44c5a7c ARROW-1926: [GLib] Add garrow_timestamp_data_type_get_unit()
new 9335e28 ARROW-1832: [JS] Implement JSON reader for integration tests
new d023b40 ARROW-1922: Blog post on JAVA vector changes
new bcfd83d ARROW-1935: [Website] Remove link to nightly builds. Fix signature / checksum links, add verification instructions
new ac789ba ARROW-232: [Python] Add unit test for writing Parquet file from chunked table
new 9895181 ARROW-1934: [Website] 0.8.0 release highlights blog post
new ba00956 ARROW-1939: Correct links in release blog post
new ce95db3 ARROW-1931: [C++] Suppress C4996 deprecation warning in MSVC builds for now
new 65f5add ARROW-1944: [C++] Fix ARROW_STATIC_LIB in FindArrow
new 8986521 ARROW-1943: [JAVA] handle setInitialCapacity for deeply nested lists
new 16c79cc ARROW-1951: [Python] Add memcopy threads argument to PlasmaClient put.
new 2ca69d1 ARROW-1948: [Java] Load ListVector validity buffer with BitVectorHelper to handle all non-null
new 0401e6d ARROW-1945: [C++] Fix doxygen documentation of array.h
new d86fdc7 Fix a comment typo (#1448)
new 4daffad ARROW-1953: [JS] Fix JS build
new 0f38a95 ARROW-1947: [Plasma] Change Client Create and Get to use Buffers
new 37c3265 ARROW-1946: [JAVA] Add APIs to decimal vector for writing big endian data
new 4a3e6e3 ARROW-1941: [Python] Fix empty list roundtrip in to_pandas
new 3a69efc ARROW-1920 [C++/Python] Add experimental reader for Apache ORC files
new cb9c52e ARROW-1909: [C++] Enables building with benchmarks on windows
new 0115e93 ARROW-1919: [Plasma] Test that object ids are 20 bytes
new 4dbce60 ARROW-1962: [Java] Adding reset to ValueVector interface
new bda85bb ARROW-1970: [GLib] Add garrow_chunked_array_get_value_data_type() and garrow_chunked_array_get_value_type()
new 234ff71 ARROW-1958: [Python] Error in pandas conversion for datetimetz row index
new 95d894d ARROW-1972: [Python] Import pyarrow in DeserializeObject.
new ee134f0 ARROW-1969: [C++] Don't build ORC extension by default
new 1b0732c ARROW-1965: [GLib] Add garrow_array_builder_get_value_data_type()
new d0a93ba ARROW-1980: [Python] Fix race condition in write_to_dataset
new 3e14e12 ARROW-1978: [Website] Consolidate Powered By project list, add more visibly to front page
new 289e205 ARROW-1977: [C++] Update windows dev docs
new 9eae508 ARROW-764: [C++] Improves performance of CopyBitmap and adds benchmarks
new f82b7e4 ARROW-1927: [Plasma] Add delete function
new b49e8f3 ARROW-1971: [Python] Add pandas serialization to the default
new 99e58da ARROW-1979: [JS] Fix JS builds hanging in es2015
new 9be080f ARROW-1991: [Website] Fix Docker documentation build
new 9e2fc04 ARROW-2002: [Python] check write_queue is not full and writer_thread is alive before enqueue new record when download file.
new 1ffce26 ARROW-1856: [Python] Auto-detect Parquet ABI version when using PARQUET_HOME
new 58a24c5 ARROW-2004: [C++] Add shrink_to_fit parameter to BufferBuilder::Resize, add Reserve method
new bc9f9e5 ARROW-1966: [C++] Accommodate JAVA_HOME on Linux that includes the jre/ directory, or is the full path to directory with libjvm
new a11da7f ARROW-2005: [Python] Fix incorrect flake8 config path to Cython lint config
new 305b54c ARROW-1872: [Website] Minor edits and addition of YAML for versions
new 9e4a6e4 ARROW-1930: [C++] Adds Slice operation to ChunkedArray and Column
new e446084 ARROW-2011: [Python] Allow setting the pickler in the serialization context.
new d135974 ARROW-2000: [Plasma] Deduplicate file descriptors when replying to GetRequest.
new 1bbaf7e [Python] Fix small typos in bytes, String/UTF-8 and FixedSizeBinary type check exceptions. (#1495)
new ed27243 ARROW-2012: [GLib] Support "make distclean"
new 422efd9 ARROW-1580: [Python] Instructions for setting up nightly builds on Linux
new 72dea17 ARROW-1997: [C++/Python] Ignore zero-copy-option in to_pandas when `strings_to_categorical` is True
new 0930b1d ARROW-2018: [C++] fix Build instruction on macOS and Homebrew
new 0a49022 ARROW-1712: [C++] Add method to BinaryBuilder to reserve space for value data
new 2126ebf ARROW-2025: [C++] Creating multiple equivalent `HadoopFileSystem`s works fine
new 6bb1d1b ARROW-2003: [Python] Remove use of fastpath parameter to pandas.core.internals.make_block
new db83fb4 [C++] Update README for linting (#1515)
new 68b119b ARROW-2029: [Python] NativeFile.tell errors after close
new 1a9d024 ARROW-2019: [JAVA] Control the memory allocated for inner vector in LIST (#1497)
new 8edd62e ARROW-2031: [Python] HadoopFileSystem is pickleable
new 51046a0 ARROW-1961: [Python] Preserve pre-existing schema metadata in Parquet files when passing flavor='spark'
new bfce44b ARROW-2017: [Python] Use unsigned PyLong API for uint64 values over int64 range
new f680dac ARROW-2007: [Python] Implement float32 conversions, use NumPy dtype when possible for inner arrays
new a95465b ARROW-2035: [C++] Update vendored cpplint.py to a Py3-compatible one
new 074eafc ARROW-2043: [C++] change description from OS X to macOS
new a71bc83 ARROW-2010: [C++] Do not suppress shorten-64-to-32 warnings from clang, fix warnings in ORC adapter
new 6299a9c ARROW-2032: [C++] ORC ep installs on each call to ninja build
new edde5c1 ARROW-1999: [Python] Type checking in `from_numpy_dtype`
new 450bf47 ARROW-1835: [C++] Create Arrow schema from std::tuple types
new 0543953 ARROW-1646: [Python] Handle NumPy scalar types
new e50b1b3 ARROW-2028: [Python] extra_cmake_args needs to be passed through shlex.split
new d322634 ARROW-1992: [C++/Python] Fix segfault when string to categorical empty string array
new 0621765 ARROW-2048: [Python/C++] Upate Thrift pin to 0.11
new 40dd9cc ARROW-2033: [Python] Fix pa.array() with iterator input
new 673125f ARROW-2054: [C++] Fix compilation warnings
new 8d78376 ARROW-2047: [Python] Use sys.executable instead of one in the search path.
new 5c704bc ARROW-1705: [Python] allow building array from dicts
new 3e63084 ARROW-2036: [Python] Support standard IOBase methods on NativeFile
new cd0676f ARROW-2064: [GLib] Add common build problems link to the install section
new e112995 ARROW-2062: [Python] Do not use memory maps in test_serialization.py to try to improve Travis CI flakiness
new 0d6817a ARROW-2042: [Plasma] Revert API change of plasma::Create to output a MutableBuffer
new 0e04f6d ARROW-2070: [Python] Fix chdir logic in setup.py
new 1ed4019 ARROW-2072: [Python] Fix crash in decimal128.byte_width
new 2d649f9 ARROW-1623: [C++] Add convenience method to construct Buffer from a string that owns its memory
new ff28c76 ARROW-2024: [Python] Remove torch serialization from default serialization context.
new 0ada875 ARROW-1861: [Python] Rework benchmark suite [skip ci]
new c1d77a1 ARROW-2076: [Python] Display slowest test durations
new f84af8f ARROW-2065: [Python] Fix bug in SerializationContext.clone().
new e327747 ARROW-1990: [JS] C++ Refactor, Add DataFrame
new 3098c14 ARROW-2052: [C++ / Python] Rework OwnedRef, remove ScopedRef
new a1fd796 ARROW-2050: [Python] Support `setup.py pytest`
new 5042863 ARROW-1754: [Python] alternative fix for duplicate index/column name that preserves index name if available
new 2e9f7dd ARROW-1706: [Python] Coerce array inputs to StructArray.from_arrays. Flip order of arguments
new e26f3da ARROW-2085: [Python] HadoopFileSystem.isdir/.isfile return False on missing paths
new d949ee6 ARROW-2084: [C++] Support newer Brotli static library names
new febc19d ARROW-1589: [C++] Fuzzing for certain input formats
new 416d565 ARROW-2071: [Python] Lighten serialization tests
new de850b9 ARROW-2089: [GLib] Rename to GARROW_TYPE_BOOLEAN for consistency
new bdedf23 ARROW-2088: [GLib] Add GArrowNumericArray
new 30ade84 ARROW-2071: [Python] Fix test slowness on Travis-CI
new 11c9b42 ARROW-2090: [Python] Add context methods to ParquetWriter
new c49636d ARROW-2086: [Python] Shrink size of arrow_manylinux1_x86_64_base docker image
new 0d02a7d ARROW-2080: [Python] Update documentation about pandas serialization context.
new 1a8c9a4 ARROW-1976: [Python] Handling unicode pandas columns on parquet.read_table
new 2a881a0 ARROW-2044: [JS] Typings should be a regular dependency
new 310e5b1 ARROW-1757: [C++] Add DictionaryArray::FromArrays alternate ctor that can check or sanitized "untrusted" indices
new 54859c5 ARROW-1394: [Plasma] Add optional extension for allocating memory on GPUs
new 3b6b93d ARROW-2108: [Python] Update instructions for ASV
new 57563cd ARROW-2109: [C++] Completely disable boost autolink on MSVC build
new 5dd9bff ARROW-2107: [GLib] Follow arrow::gpu::CudaIpcMemHandle API change
new bd1e6e5 ARROW-2095: [C++] Less verbose building 3rd party deps
new 90bb4bd ARROW-2111: [C++] Lint in parallel
new 45ad0b0 ARROW-2087: [Python] Binaries of 3rdparty are not stripped in manylinux1 base image
new ea4c7b1 ARROW-2110: [Python] Only require pytest-runner on test commands
new f69e9db ARROW-633/634: [Java] Add FixedSizeBinary support in Java and integration tests (Updated)
new 7bf7b2e ARROW-1950: [Python] pandas_type in pandas metadata incorrect for List types
new 6e6f9b7 ARROW-2083: [CI] Detect changed components on Travis-CI
new 88b72df ARROW-2073: [Python] Create struct array from sequence of tuples
new 4fe3a0b ARROW-2114: [Python] Pull latest docker manylinux1 image [skip appveyor]
new 163be42 [Doc] Refine top of README, add GLib bindings to README, fix ASF link
new 0417b14 ARROW-2123: [JS] Upgrade to TS 2.7.1
new ddae240 ARROW-2120: [C++] Add possibility to use empty _MSVC_STATIC_LIB_SUFFIX for Thirdparties
new b18e159 ARROW-1973: [Python] Memory leak when converting Arrow tables with array columns to Pandas dataframes.
new 9d9cc8f ARROW-2049: [Python] Use python -m cython to run Cython, instead of CYTHON_EXECUTABLE
new 41a42de ARROW-2128: [Python] Support arrays of empty lists
new a6bcaa7 ARROW-2106: [Python] Add conversion for a series of datetime objects
new f7d3c64 ARROW-2134: [CI] Make Travis-CI commit inspection more robust
new 90d9daf ARROW-2129: [Python] Handle conversion of empty tables to Pandas
new 08d664a ARROW-2137: [Python] Don't print paths that are ignored when reading Parquet files
new 31a1b9b [Website] Add FPGA acceleration framework Fletcher to "Powered By" (#1593)
new 296e7b0 ARROW-1912: [Website] Add committer affiliations and roles to website
new 62c55ec ARROW-2133: [Python] Fix segfault on conversion of empty nested array to Pandas
new e8a93ca ARROW-1021: [Python] Add documentation for C++ pyarrow API
new cf00b57 ARROW-2138: [C++] abort on failed debug check
new 30ab43f ARROW-2124: [Python] Add test for empty item in array
new 0ffc882 ARROW-1998: [Python] fix crash on empty Numpy arrays
new 4a8a102 ARROW-969: [C++] Add add/remove field functions for RecordBatch
new d28f055 ARROW-2146: [GLib] Add Slice api to ChunkedArray
new c78c92a ARROW-2149: [Python] Reorganize test_convert_pandas.py
new 5dce01f ARROW-2155: [Python] frombuffer() should respect mutability of argument
new 78152f1 ARROW-1815: [Java] Rename MapVector to StructVector
new 7c88443 [Website] Add SciDB to "Powered By" (#1604)
new e15f545 ARROW-2039: [Python] Avoid crashing on uninitialized Buffer
new 0265fb4 ARROW-2151: [Python] Fix conversion from np.uint64 scalars
new c077986 ARROW-2068: [Python] Expose array's buffers
new 148ab9c ARROW-2156: [CI] Isolate Sphinx dependencies
new e8870d1 ARROW-2166: [GLib] Add Slice api to Column
new 7c37b0d ARROW-2161: [Python] Skip test_cython_api if ARROW_HOME isn't defined
new c71a565 ARROW-2117: [C++] Update codebase / CI toolchain for clang 5.0
new 62b9eb2 ARROW-1579: [Java] Adding containerized Spark Integration tests
new a1a34b8 ARROW-2169: [C++] MSVC is complaining about uncaptured variables
new 864c939 ARROW-2174: [JS] export arrow format and schema enums
new 2106152 ARROW-2163: [CI] Make apt installs explicit
new d64e144 ARROW-2170: [Python] construct_metadata fails on reading files where no index was preserved
new 080ce12 ARROW-2173: [C++/Python] Hold the GIL in NumPyBuffer destructor
new 64d2af6 ARROW-2144: [JS] Don't repeat dictionary lookups in DataFrame ops
new f8445a3 ARROW-2171: [C++/Python] Make OwnedRef safer
new d5c507a ARROW-2143: [Python] Provide a manylinux1 wheel for cp27m
new 3da9fd3 ARROW-2178: [JS] Fix JS html FileReader example
new 8e90836 ARROW-2168: [C++] Build toolchain on CI with jemalloc
new 77f2841 ARROW-2159: [JS] Support custom predicates
new 7d992de [Release] Apache Arrow JavaScript 0.3.0
new 46aa99e ARROW-2121: [Python] Handle object arrays directly in pandas serializer.
new ac2637a ARROW-2175: [Python] Install Arrow libraries in Travis CI builds when only Python directory is affected
new e06cfdd ARROW-2179: [C++] Install omitted headers in arrow/util
new f357906 ARROW-1942: [C++] Hash table specializations for small integers
new 94fd3ee ARROW-2172: [C++/Python] Fix converting from Numpy array with non-natural stride
new 7d11d02 ARROW-2190: [GLib] Add add/remove field functions for RecordBatch
new 9298c93 ARROW-2192: [CI] Always build on master branch and repository
new 9fefc23 ARROW-2162: [Python/C++] Decimal Values with too-high precision are multiplied by 100
new 897cc4d ARROW-1345: [Python] Test conversion from nested NumPy arrays with smaller int, float types
new 3e87f77 ARROW-2131: [Python] Prepend module path to PYTHONPATH when spawning subprocess
new bcbcf02 [JS] Fix typo in npm target for esNext/CommonJS. (#1645)
new 5f10067 ARROW-2180: [C++] Remove deprecated APIs from 0.8.0 cycle
new cdc347c ARROW-2132: Add link to Plasma in main README
new 27f7eba ARROW-2069: [Python] Add note that Plasma is not supported on Windows
new 81bfb38 ARROW-2185: Strip CI directives from commit messages
new d52f2ff [Dev] Follow-up, use angle brackets for commit author instead of square brackets
new c2865d0 ARROW-2093: [Python] Do not install PyTorch in Travis CI
new 2f01658 ARROW-2201: [Website] Publish JS API Docs
new 3e3f7c2 ARROW-2066: [Python] Document using pyarrow with Azure Blob Store
new cca4a74 ARROW-2197: Document C++ ABI issue and workaround
new e2dd864 ARROW-2184: [C++] Add static constructor for FileOutputStream returning shared_ptr to OutputStream
new c0b0e33 ARROW-2191: [C++] Only use specific version of jemalloc
new 2fd8f0a ARROW-2204: Fix TLS errors in manylinux1 build
new 27d8339 ARROW-2214: [JS] add nullBitmap getter to DictionaryData that proxies to its indices' nullBitmap
new 655eb74 ARROW-2212: [C++/Python] Build Protobuf in base manylinux 1 docker image
new 5521bcf ARROW-2094: [C++] Install libprotobuf and set PROTOBUF_HOME when using toolchain
new 564fefe ARROW-2213: [JS] fix npm release
new 8c493cd ARROW-2219: [JS] rename indicies to indices
new e50a8ec ARROW-2206: [JS] Document Perspective project
new e0328b0 ARROW-2023: [C++] Fix ASAN failure on malformed / empty stream input, enable ASAN builds, add more dev docs
new c017a63 ARROW-1035: [Python] Add streaming dataframe reconstruction benchmark
new a5c5ad2 ARROW-2203: [C++] StderrStream class
new 887e893 ARROW-1937: [Python] Document nested array initialization
new 482fc58 ARROW-2210: [C++] Reset ptr on failed memory allocation
new 1a92846 ARROW-2223: [JS] compile src/bin as es5-cjs to all output targets
new d3fabe0 ARROW-2230: [Python] Strip catch-all tag matching from git-describe
new 524b522 ARROW-2218: [Python] PythonFile should infer mode when not given
new 0a672bc ARROW-2226, ARROW-2233: [JS] Dictionary bugfixes
new 1d9b834 ARROW-2225: [JS] support tables split across buffers
new 671b53c ARROW-2046: [Python] Support path-like objects
new 3d5880a ARROW-2040: [Python] Deserialized Numpy array must keep ref to underlying tensor
new 5321582 ARROW-2231: [CI] Use clcache on AppVeyor for faster builds
new af2047e ARROW-2215: [Plasma] Hugetables munmap issue
new 8b3bbae ARROW-2198: [Python] correct docstring for parquet.read_table
new b2eb6ac ARROW-1632: [Python] Permit categorical conversions in Table.to_pandas on a per-column basis
new bfac60d ARROW-2145/ARROW-2153/ARROW-2157/ARROW-2160/ARROW-2177: [Python] Decimal conversion not working for NaN values
new 99899d6 ARROW-2232: [Python] pyarrow.Tensor constructor segfaults
new 29495ce ARROW-2176: [C++] Extend DictionaryBuilder to support delta dictionaries
new f403804 ARROW-2205: [Python] Option for integer object nulls
new 5994094 ARROW-2209: [Python] Partition columns are not correctly loaded in schema of ParquetDataset
new 34c33f1 [Python] Document serialization parameter as "string" instead of "bytes"
new 8b1c811 ARROW-2245: ARROW-2246: [Python] Revert static linkage of parquet-cpp in manylinux1 wheel
new 6e699d7 ARROW-2252: [Python] Create buffer from address, size and base
new 03db8a3 ARROW-2251: [GLib] Keep GArrowBuffer alive while GArrowTensor for the buffer is live
new 9ceda35 ARROW-2244: [C++] Add unit test to explicitly check that NullArray internal data set correctly in Slice operations
new b89c124 ARROW-2253: [Python] Support __eq__ on scalar values
new 49f1d00 ARROW-2258: [Python] Add additional information to find Boost on windows
new 55bdae5 ARROW-2254: [Python] Ignore JS tags in local dev versions
new c6359cb ARROW-1929: [C++] Copy over testing utility code from PARQUET-1092
new 45f5da2 ARROW-1982: [Python] Coerce Parquet statistics as bytes to more useful Python scalar types
new 01a099c ARROW-2199: [JAVA] Control the memory allocated for inner vectors in containers. (#1646)
new 06e9fb4 [Python] Add missing dependency to development.rst
new 57e4dd8 ARROW-2265: [Python] Use CheckExact when serializing lists and numpy arrays.
new 51e117d ARROW-2154: [Python] Implement equality on buffers
new cde18a6 ARROW-2234: [JS] Read timestamp low bits as Uint32s
new a58bd72 ARROW-2272: [Python] Clean up leftovers in test_plasma.py
new 5f8a793 ARROW-2279: [Python] Better error message if lib cannot be found
new 60c8081 ARROW-2261: [GLib] Improve memory management for GArrowBuffer data
new 9effbed ARROW-2283: [C++] Support Arrow C++ installed in /usr detection by pkg-config
new fb2316c ARROW-2238: [C++] Detect and use clcache in cmake configuration
new c372dfb ARROW-2280: [Python] Return the offset for the buffers in pyarrow.Array
new 5e945a3 ARROW-2239: [C++] Update Windows build docs
new f3f91b0 ARROW-2263: [Python] Prepend local pyarrow/ path to PYTHONPATH in test_cython.py
new 34b18f7 ARROW-1940: [Python] Extra metadata gets added after multiple conversions between pd.DataFrame and pa.Table
new 04f4e6b ARROW-2289: [GLib] Add Numeric, Integer, FloatingPoint data types
new f56fdc9 ARROW-2270: [Python] Fix lifetime of ForeignBuffer base object
new 40a0008 [Python] Adding more missing Linux dependencies to developer docs
new 23d08b7 ARROW-2150: [Python] Raise NotImplementedError when comparing with pyarrow.Array for now
new 7354a19 ARROW-2284: [Python] Fix error display on test_plasma error
new 8167472 ARROW-2275: [C++] Guard against bad use of Buffer.mutable_data()
new 3511c65 ARROW-2268: Drop usage of md5 checksums for source releases, verification scripts
new c7c2393 ARROW-2269: [Python] Make boost namespace selectable in wheels
new fc9f89a ARROW-2250: [Python] Do not create a subprocess for plasma but just use existing process
new d0284cb ARROW-2236: [JS] Add more complete set of predicates
new 412bb91 ARROW-2291: [C++] Add additional libboost-regex-dev to build instructions in README
new 907a27d ARROW-2288: [Python] Fix slicing logic
new 2f718d7 ARROW-2262: [Python] Support slicing on pyarrow.ChunkedArray
new d64a231 ARROW-2181: [PYTHON][DOC] Add doc on usage of concat_tables
new dc45a1a ARROW-2099: [Python] Add safe option to DictionaryArray.from_arrays to do boundschecking of indices by default
new c7b3c05 ARROW-2297: [JS] babel-jest is not listed as a dev dependency
new 8f2ff30 ARROW-2240: [Python] Array initialization with leading numpy nan fails with exception
new 3917e85 ARROW-2292: [Python] Rename frombuffer() to py_buffer()
new 58fa873 ARROW-2282: [Python] Create StringArray from buffers
new 317b543 ARROW-2293: [JS] Print release vote e-mail template when making source release
new 6fc9922 ARROW-2118: [C++] Fix misleading error when memory mapping a zero-length file
new 171340f ARROW-2135: [Python] Fix NaN conversion when casting from Numpy array
new 0b28dc5 ARROW-2142: [Python] Allow conversion from Numpy struct array
new 7c7b09f ARROW-1643: [Python] Accept hdfs:// prefixes in parquet.read_table and attempt to connect to HDFS
new 33d1091 ARROW-2227: [Python] Fix off-by-one error in chunked binary conversions
new a430758 ARROW-2306: [Python] Fix partitioned Parquet test against HDFS
new 385656c ARROW-2304: [C++] Fix HDFS MultipleClients unit test
new e25e3ef ARROW-2307: [Python] Allow reading record batch streams with zero record batches
new 98012cb ARROW-2312: [JS] run test_js before test_integration
new b185951 ARROW-2313: [C++] Add -NDEBUG flag to arrow.pc
new 630ce5e ARROW-2311: [Python/C++] Fix struct array slicing
new 019a560 ARROW-2309: [C++] Use std::make_unsigned
new 60749b2 ARROW-2316: [C++] Revert Buffer::mutable_data to inline so that linkers do not have to remember to define NDEBUG for release builds
new 20ea781 [Python] Pin Cython to 0.27.3 in verify-release-candidate.sh (#1758)
new e29df7d ARROW-2320: [C++] Vendored Boost build does not build regex library
new 79e19c3 [JS] Small fixes to source release workflow and e-mail template (#1750)
new 82c8b6f ARROW-2318: [Plasma] Run plasma store tests with unique socket
new 95ba6ef ARROW-2321: [C++] Release verification script fails with if CMAKE_INSTALL_LIBDIR is not $ARROW_HOME/lib
new 7be8d37 [Release] Update CHANGELOG.md for 0.9.0
new c695a5d [maven-release-plugin] prepare release apache-arrow-0.9.0
new bb17a0d [maven-release-plugin] prepare for next development iteration
new a50ef9f ARROW-2329: [Website] 0.9.0 release update
new 60848c0 ARROW-2299: [Go] Import Go arrow implementation from influxdata/arrow
new 607c7fa ARROW-2340: [Website] Add blog post about Go code donation
new 948cb4a ARROW-2336: [Website] Add 0.9.0 release blog post
new f45abf0 [Website] Add link to press release
new 07beb51 ARROW-2333: [Python] Fix bundling boost with default namespace
new 47fcef3 ARROW-2334: [C++] Update boost to 1.66.0
new d623567 ARROW-2341: [Python] Improve pa.union() mode argument behaviour
new eecb1bc ARROW-2281: [Python] Add Array.from_buffers()
new f50d858 ARROW-2343: [Java/Packaging] Run mvn clean in API doc builds
new 29268ec ARROW-2342: [Python] Allow pickling more types
new 0c8d164 ARROW-2345: [Documentation] Fix bundle exec and set sphinx nosidebar to True
new e6d8eed ARROW-2322: [Java] Document dev environment requirements for publishing Java release artifacts
new a0ca9b4 ARROW-2346: [Python] Fix PYARROW_CXX_FLAGS with multiple options
new 777f986 ARROW-2331: [Python] Fix indexing for negative or out-of-bounds indices
new 7b2c797 ARROW-2349: [Python] Opt in to bundling Boost shared libraries separately
new af6e3ec ARROW-1913: [Java] Disable Javadoc doclint with Java 8
new 29f744f ARROW-2350: Consolidated RUN step in spark_integration Dockerfile
new 9c7e06b ARROW-2348: [GLib] Remove GLib + Go example
new 6156b1d ARROW-640: [Python] Implement __hash__ and equality for Array scalar values Arrow scalar values
new 27f5a42 ARROW-2301: [Python] Build source distribution inside the manylinux1 docker
new f9f8320 ARROW-2354: [C++] Make PyDecimal_Check() faster
new 3d4b6c1 ARROW-2356: [JS] Fix JSON Reader FixedSizeBinary Vectors
new f29e5a1 ARROW-2368: [JAVA] Correctly pad negative values in DecimalVector#setBigEndian (#1809)
new 866e9b8 ARROW-2327: [JS] Table.fromStruct missing from externs
new 97f5ec0 [C++] Fix documentation typo in arrow/array.h
new ba0cea3 ARROW-2140: [Python] Improve float16 support
new 3f72d14 ARROW-2361: [Rust] Starting point for a native Rust implementation of Arrow
new 3975de5 Update README.md to include new components
new 00b334f [Rust] Update READMEs to add Rust libraries link and to remove out-of-data comment about memory alignment (#1817)
new be049fa ARROW-2370: [GLib] Fix include path in .pc on Meson build
new d2d4cc7 ARROW-2371: [GLib] Update "Requires" in .pc on GNU Autotools build
new 7e27cf5 ARROW-2376: [Rust] Travis builds the Rust library
new 8fdad18 ARROW-2377: [GLib] Support old GObject Introspection
new 11b15a5 ARROW-2357: [Python] Add microbenchmark for PandasObjectIsNull()
new fff992a ARROW-2122: [Python] Pyarrow fails to serialize dataframe with timestamp.
new b6e8b4b ARROW-2381: [Rust] Adds iterator support to Buffer<T>
new fce183c ARROW-2378: [Rust] Rustfmt
new 4c68eca ARROW-2375: [Rust] Implement Drop for Buffer so memory is released
new 65d2558 ARROW-2351 [C++] StringBuilder::append(vector<string>...) not impleme…
new 65493a6 ARROW-2014: [Python] Document read_pandas method in pyarrow.parquet
new 9fc4d89 DOC: Fix a tiny typo in parquet documentation (#1824)
new b0f376a Fix broken build on master (remove duplicate Drop impl for Buffer) (#1828)
new 82d4555 ARROW-2141: [Python] Support variable length binary conversion from Pandas
new 933b32b ARROW-2388: [C++] Use valid_bytes API for StringBuilder::Append
new 806979b ARROW-2382: [Rust] Bug fix: List was not using aligned mem
new 7081752 ARROW-2385: [Rust] implement to_json for DataType and Field
new cf39686 ARROW-2195: [Plasma] Return auto-releasing buffers
new 26bc4ab ARROW-2308: [Python] Make deserialized numpy arrays 64-byte aligned.
new 640fc83 ARROW-2276: [Python] Expose buffer protocol on Tensor
new 76edf43 ARROW-1463: [Java] Cleanup usage of Types.MinorType to MinorType
new 486d592 ARROW-2384: [Rust] Additional test & Trait standardization
new 02b0c72 ARROW-2325: [Python] Update setup.py to use Markdown project description
new 045470c ARROW-2396: [Rust] Unify Rust Errors
new 9515fe9 ARROW-2380: [Python] Streamline conversions
new 29c376d ARROW-2398: [Rust] Create Builder<T> for building buffers directly in aligned memory
new 83bfb39 ARROW-2404: [C++] Fix "declaration of 'type_id' hides class member" w…
new 946517d ARROW-2405: [C++] <function> is required for std::function
new e3f7edc ARROW-2401 Support filters on Hive partitioned Parquet files
new f9c0701 ARROW-2402: [C++] Avoid spurious copies with FixedSizeBinaryBuilder
new 87284a5 [Site] Add Antoine to committers list (#1853)
new f88949b ARROW-2418: [Rust] BUG FIX: reserve memory when building list
new 408aa5a ARROW-2416: [C++] Support system libprotobuf
new b4dafa5 ARROW-2414: Fix a variety of typos.
new 55c1075 ARROW-2353: [CI] Check correctness of built wheel on AppVeyor
new b095994 ARROW-2408: [Rust] Remove build warnings
new 57db8b5 ARROW-2419: [Site] Hard-code timezone
new 7376aab ARROW-2413: [Rust] Remove useless calls to format!().
new ca3dbbb ARROW-2415: [Rust] Fix clippy ref-match-pats warnings.
new abf4ed2 ARROW-2408: [Rust] Ability to get `&mut [T]` from `Buffer<T>`
new 5030e23 ARROW-2420: [Rust] Fix major memory bug and add benches
new ad39d1f ARROW-2424: [Rust] Fix build - add missing import
new 1bb7fba ARROW-2100: [Python] Drop Python 3.4 support
new f56d765 ARROW-2305: [Python] Bump Cython requirement to 0.27+
new 27417b2 ARROW-2328: [C++] Fixed and unit tested feather writing with slice
new e941af8 ARROW-2391: [C++/Python] Segmentation fault from PyArrow when mapping Pandas datetime column to pyarrow.date64
new 33d92a0 ARROW-2434: [Rust] Add windows support
new ca277ae ARROW-2425: [Rust] BUG FIX: Add u8 mappings for Array::from
new c5574f4 ARROW-2426: [GLib] Follow python -> python@3 change in Homebrew
new 6633cc9 ARROW-2433: [Rust] Add Builder.push_slice(&[T])
new 91ec792 ARROW-2411: [C++] Add StringBuilder::Append(const char **values)
new 265142b ARROW-2441: [Rust] Builder<T>::slice_mut assertions are too strict
new 42e195b ARROW-2440: [Rust] Implement ListBuilder<T>
new 1ee7d11 ARROW-2407: [GLib] Add garrow_string_array_builder_append_values()
new ed7db7c ARROW-2097: [CI, Python] Reduce Travis-CI verbosity
new 4009b62 ARROW-2224: [C++] Remove boost-regex dependency
new 6e8ecb5 ARROW-2445: [Rust] Add documentation and make some fields private
new db03663 ARROW-2182: [Python] Build C++ libraries in benchmarks build step
new 9ad8602 ARROW-2432: [Python] Fix Pandas decimal type conversion with None values
new f177404 ARROW-2369: [Python] Fix reading large Parquet files (> 4 GB)
new 685147c ARROW-2451: [Python] Handle non-object arrays more efficiently in custom serializer.
new 0f87c12 ARROW-2437: [C++] Add ReadMessage without aligned argument.
new c96747b ARROW-2455: [C++] Initialize the atomic bytes_allocated_ properly
new 7de1264 ARROW-2387: [Python] Flip test for rescale loss if value < 0
new 98d250e ARROW-2397: [Documentation] Update format documentation to describe tensor alignment.
new b2167e4 ARROW-2435: [Rust] Add memory pool abstraction.
new 3eee3e4 ARROW-2101: [Python/C++] Correctly convert numpy arrays of bytes to arrow arrays of strings when user specifies arrow type of string
new 2d0fbf1 ARROW-2464: [Python] Use a python_version marker instead of a condition
new 72c7f5d ARROW-2454: [C++] Allow zero-array chunked arrays
new 66d0ad1 ARROW-2315: [C++/Python] Flatten struct array
new 2876a3f ARROW-2463: [C++] Update flatbuffers to 1.9.0
new f1ef708 ARROW-2319: [C++] Add BufferedOutputStream class
new d7d3196 ARROW-2442: [C++] Disambiguate builder Append() overloads
new 72df18c ARROW-2465: [Plasma/GPU] Preserve plasma_store rpath
new 4c31b37 ARROW-2147: [Python] Fix type inference of numpy arrays
new 25eff99 ARROW-2468: [Rust] Builder::slice_mut() should take mut self.
new d58057b ARROW-2473: [Rust] List empty slice assertion
new c2e0d42 ARROW-2423: [Python] Enable DataType, Field and plasma ObjectID equality checks against no…
new 18999bb ARROW-2469: [C++] Make out arguments last in ReadMessage.
new 1299931 ARROW-2443: [Python] Allow creation of empty Dictionary indices
new 7eeca3a ARROW-2458: [Plasma] Use one thread pool per PlasmaClient
new 09be7b4 ARROW-2472: [Rust] Remove public attributes from Schema and Field and add accessors
new 46fe09a ARROW-2471: [Rust] Builder zero capacity fix
new c19b1f0 ARROW-2481: [Rust] Move all calls to free() into memory.rs
new 249e039 ARROW-1928: [C++] Add BitmapReader/BitmapWriter benchmarks
new 4c71f30 ARROW-2390: [C++/Python] Map Python exceptions to Arrow status codes
new c9ad33e ARROW-2457: [GLib] Support large is_valids in builder's append_values()
new 54df19d ARROW-1018: [C++] Create FileOutputStream, ReadableFile from file descriptor
new 2452a46 ARROW-2393: [C++] Moving ARROW_CHECK_OK_[PREPEND] macros from status.h into util/logging.h since they use the logging infrastructure and shouldn't be in the public API.
new 3b69c5a ARROW-2450: [Python] Test for Parquet roundtrip of null lists
new 1ba7d51 ARROW-2222: handle untrusted inputs
new 5381295 ARROW-2314: [C++/Python] Fix union array slicing
new 138717a ARROW-1858: [Python] Added documentation for pq.write_dataset
new a6c9d30 ARROW-2453: [Python] Improve Table column access
new a5ae134 ARROW-1731: [Python] Add columns selector in Table.from_array
new 03251e9 ARROW-2427: [C++] Implement ReadAt properly
new 77a5c59 ARROW-2494: [C++] Return status codes from PlasmaClient::Seal instead of crashing
new 7545e3e ARROW-2492: [Python] Prevent segfault on accidental call of pyarrow.Array
new b65205e ARROW-2470: [C++] Avoid seeking in GetFileSize
new 2abc889 ARROW-2489: [Plasma] Fix PlasmaClient ABI variation
new a609309 ARROW-2502: [Rust] Restore Windows Compatibility
new 2d278ab ARROW-2508: [Python] Fix pytest.raises msg to message
new 3d7a5a6 ARROW-2074: [Python] Infer lists of dicts as struct arrays
new 5f9cf9c ARROW-2448: [Plasma] Reference counting for PlasmaClient::Impl
new c8a3ed8 ARROW-2286: [C++/Python] Allow subscripting pyarrow.lib.StructValue
new c574006 ARROW-2498: [Java] Use java 1.8 instead of java 1.7
new c8f17dd ARROW-2518: [Java] Re-instate JDK tests in matrix, but with JDK 8 instead of JDK 7
new 16820a2 ARROW-2452: [TEST] Spark integration test fails with permission error
new 3f5819a [GLib] Fix a typo
new e8d45eb ARROW-2515 [Python] Add DictionaryValue class, fixing bugs with nested dictionaries
new e3fafae ARROW-2513: [Python] DictionaryType should give access to index type and dictionary array
new bdd5cad ARROW-2529: [C++] Update mention of clang-format to 5.0 in the docs
new a2aba52 ARROW-2436: [Rust] Add windows CI
new 101b319 ARROW-2462: [C++] Fix Segfault in UnpackBinaryDictionary
new 07e0778 ARROW-2302: [GLib] Unify GNU Autotools build and Meson build into one Travis CI job
new af5bd80 ARROW-2526: [GLib] Update .gitignore
new 1c5f06e ARROW-2474: [Rust] Add windows support for memory pool abstraction
new 109942c ARROW-2527: [GLib] Enable GPU document
new 03be2a0 ARROW-2525: [GLib] Add garrow_struct_array_flatten()
new 3c8cb4d ARROW-2482: [Format] Clarify struct field alignment
new 4ac7b29 ARROW-2507: [Rust] Don't take a reference when not needed.
new 138c491 ARROW-2422: Support more operators for partition filtering
new 113835a ARROW-2530: [GLib] Support out-of-source directory build again
new f056ef0 ARROW-2485: Re-write of run_clang_format.py, such that it outputs the diffs of th…
new ea76265 ARROW-2484: [C++] Document ABI compliance checking
new a3aaff5 ARROW-2503: [Python] Prevent trailing space character for string statistics
new eda5ca8 ARROW-2509: Build for node 9.8
new 0562d3b ARROW-2417: [Rust] Fix API safety issues
new 2bc110d ARROW-2499: [C++] Factor out Python iteration routines
new 6f9c4db ARROW-2534: [C++] Hide all zlib symbols from libarrow.so
new 9b76ee4 ARROW-2533: [CI] Fast finish failing AppVeyor builds
new 5bef985 ARROW-2332: Feather Reader option to return Table
new 4cc9157 ARROW-2466: [C++] Fix "append" flag to FileOutputStream
new 15e4811 ARROW-2531: [C++] Update clang bits to 6.0
new 7323b60 ARROW-2493: [Python] Add support for pickling to buffers and arrays
new 6bf1c66 ARROW-2505: [C++] Disable MSVC warning C4800
new fbce08d ARROW-2511: [Java] Fix BaseVariableWidthVector.allocateNew to not swallow exception (#1947)
new c884f32 ARROW-2536: [Rust] optimize capacity allocation for ListBuilder
new f75bbc0 [C++] Fix a typo in cpplint (#1986)
new 8797390 ARROW-2522: [C++] Version shared library files
new 26c6805 ARROW-1886: [C++/Python] Flatten struct columns in table
new 8428c51 ARROW-2516: [CI] Filter changes in AppVeyor builds
new 73f0d8e ARROW-2478: [C++] Introduce a checked_cast function that performs a dynamic_cast in debug mode
new b916c79 ARROW-2539: [Plasma] Use unique_ptr instead of raw pointer
new 5bdfff8 ARROW-2541: [Plasma] Replace macros with constexpr
new 3d594bc ARROW-2273: [Python] Raise NotImplementedError when pandas Sparse types serializing
new bcd5967 [Website] Update SciDB in "Powered By" (#2004)
new 1b78ac2 ARROW-2543: [Rust] Cache dependencies when building our rust library
new 2f453a0 ARROW-2544: [CI] Run the C++ tests with two jobs
new a28a198 ARROW-2389: [C++] Add CapacityError
new 0cb992a ARROW-2547: Fix off-by-one in `List<List<byte>>` example
new 0ef8ce3 ARROW-2548: Clarify `List<Char>` Array example
new 0ecbbf4 ARROW-2285: [C++/Python] Can't convert Numpy string arrays
new c46f625 ARROW-2477: [Rust] Set up code coverage in CI
new 9c31eec ARROW-2545: [Python] Link against required system libraries
new 0f07171 ARROW-2540: [Plasma] Create constructors & destructors for ObjectTableEntry
new 488a8c0 ARROW-2546: [JS] Update to npm>=5.7.1 to fight EINTEGRITY problems
new 64923d5 ARROW-2550: [C++] Add missing status codes into arrow::Status::CodeAsString()
new a426c1b ARROW-2549: [GLib] Apply arrow::StatusCode changes to GArrowError
new bb67939 ARROW-2491: [Python] raise NotImplementedError on from_buffers with nested types
new ac91d72 ARROW-2552: [Plasma] Fix memory error
new 35da703 ARROW-2557: [Rust] Add badge for code coverage in README
new a0299bf ARROW-2564: [C++] Replace deprecated method in documentation
new 777e6b5 ARROW-2562: [CI] C++ and Rust code coverage using codecov.io
new b93faac ARROW-2566: [CI] Add codecov.io badge
new 414268b ARROW-1964: [Python] Expose StringBuilder to Python
new 2093f6e ARROW-2479: [C++] Add ThreadPool class
new bb47c36 ARROW-2565: [Plasma] new subscriber cannot receive notifications about existing objects
new 1c1c187 ARROW-2570: [Python] Add support for writing parquet files with LZ4 compression
new 1d42efa ARROW-2500: [Java] IPC Writers/readers are not always setting validity bits correctly
new 6a7ccd2 ARROW-2207: [GLib] Support GArrowDecimal128
new e1c3d4b ARROW-2517: [Java] Add list<decimal> writer (#1965)
new 321773c ARROW-2567: [C++] Not only compare type ids on Array equality
new da3f843 ARROW-2569: [C++] Improve thread pool size heuristic
new 90fb5a3 ARROW-2571: [C++] Lz4Codec doesn't properly handle empty data
new 19af862 ARROW-2576: [GLib] Add abs functions for Decimal128
new 4b8511f ARROW-2578: [Plasma] Use mersenne twister to generate random number
new 5a29ab8 ARROW-2563: [Rust] Poor caching in Travis-CI
new 63da09a ARROW-2580: [GLib] Fix abs functions for Decimal128
new 75acaba ARROW-2577: [Plasma] Add asv benchmarks for plasma
new df9bdc1 Serialize tensors in PyTorch 0.4 (#2033)
new 27fc25c ARROW-2332: Add Feather Dataset class
new 48eee79 Removing extraneous debug print statement from run_clang_format.py (#2045)
new 71d487a ARROW-2584: [JS] Fixes for node v10
new 38db8ed ARROW-2558: [Plasma] avoid walk through all the objects when a client disconnects
new 6ca246a ARROW-2582: [GLib] Add negate functions for Decimal128
new 5bb3d85 ARROW-2589: [Python] Workaround regression in Pandas 0.23.0
new e237918 ARROW-2561: [C++] Fix double free in cuda-test under code coverage
new df20683 ARROW-2595: [Plasma] Use map.find instead of operator[] to avoid producing garbage data
new 941a1b7 ARROW-2486: [C++/Python] Provide a Docker image that contains all dependencies for development
new 7764bc8 ARROW-2574: [Python] Add Cython and Python code coverage
new 635ee1f ARROW-2521: [Rust] Refactor Rust API to use traits and generic to represent Array instead of enum
new 936115c ARROW-2594: [Java] When realloc Vectors, zero out all unfilled bytes of new buffer
new bd54041 [GLib] Use the default directory of GTK-Doc (#2058)
new 3d41978 ARROW-2611: [Python] Fix Python 2 integer serialization
new bd25a65 ARROW-2612: [Plasma] Fix deprecated PLASMA_DEFAULT_RELEASE_DELAY
new f319bca ARROW-2597: [Plasma] remove UniqueIDHasher
new 6132fd3 ARROW-2615: [Rust] Post refactor cleanup
new 15e416e ARROW-2614: Remove 'group: deprecated' in Travis
new 0374adb ARROW-2613: [Docs] Update the gen_apidocs docker script
new fcc13f5 ARROW-2585: [C++] Add Decimal::FromBigEndian, which was formerly a static method in parquet-cpp/src/parquet/arrow/reader.cc
new 1d9d893 ARROW-2629: [Plasma] Iterator invalidation for pending_notifications_
new fc7a382 ARROW-2116: [JS] implement IPC writers
new d9f0cca ARROW-2630: [JAVA] typo fix
new 5c5e95a ARROW-2604: [Java] Add convenience method to VarCharVector to set Text
new 338e597 ARROW-2537: [Ruby] Import
new 01202cc ARROW-2568: [Python] Expose thread pool size setting to Python, and deprecate "nthreads" where possible
new ce23c06 ARROW-1163: [Java] Java client support for plasma
new 6646864 ARROW-2636: [Ruby] Add missing "unofficial" notes
new 1a922fd ARROW-2593: [Python] TypeError: data type "mixed-integer" not understood
new d8bce06 ARROW-2632: [Java] ArrowStreamWriter accumulates ArrowBlock but does not use them
new cdacb09 ARROW-2644: [Python] Fix prototype declaration in Parquet binding
new d19089e ARROW-2641: [C++] Avoid spurious memset() calls, improve bitmap write performance
new 94409a6 ARROW-906: [C++/Python] Read and write field metadata in IPC
new 0894d97 ARROW-2645: [Java] Refactor ArrowWriter to remove all ArrowFileWriter specifc logic
new 79a2207 ARROW-2430: [Packaging] MVP for branch based packaging automation
new 3960db9 ARROW-2668: [C++] Suppress -Wnull-pointer-arithmetic when compiling plasma/malloc.cc on clang
new 5eaa3ab ARROW-2655: [C++] Fix compiler warnings with gcc 7
new 44bfd0d ARROW-2638: [Python] Prevent calling extension class constructors directly
new 9aa7c52 ARROW-2640: [JS] Write schema metadata
new 2fdacde ARROW-2603: [Python] Allow date and datetime subclassing
new b9b4376 ARROW-2664: [Python] Implement __getitem__ / slicing on Buffer
new b1d1633 ARROW-2661: [Python] Adding the ability to programmatically pass hdfs configration key/value pairs via pyarrow
new 88fe5d7 ARROW-2678: [GLib] Add more common problems compiling c_glib on OSX
new 0a3bdba ARROW-2287: [Python] chunked array not iterable, not indexable
new b165e46 ARROW-2675: Fix build error with clang-10 (Apple Clang / LLVM)
new e82a34a ARROW-530: [C++/Python] Provide subpools for better memory allocation …
new dffd193 ARROW-2672: [Python] Build ORC extension in manylinux1 wheels
new f0a8811 ARROW-2682: [CI] Notify in Slack about broken builds
new 27b869a ARROW-2649: [C++] Add GenerateBits() function to improve bitmap writing performance
new 5b4728b ARROW-2635: [Ruby] Add LICENSE.txt and NOTICE.txt for Apache Arrow Ruby
new 3155291 ARROW-2680: [Python] Add documentation about type inference in Table.from_pandas
new d5c5f39 [JS] Add note about IPC format to readme (#2122)
new 8156e25 ARROW-2551: [Plasma] Improve notification logic
new 1e4f8dd Add Ruby to README.md
new 6df28d3 ARROW-2691: [Rust] Update code formatting with latest Rust stable
new dc80a76 ARROW-2695: [Python] Prevent calling scalar constructors directly
new 34890cc ARROW-2689: [Python] Remove parameter timestamps_to_ms
new 9a76caa ARROW-2692: [Python] Add test for writing dictionary encoded columns to chunked Parquet files
new 6b80fa8 ARROW-2693: [Python] pa.chunked_array causes a segmentation fault on empty input
new 53cc09e ARROW-2626: [Python] Add column name to exception message when writing pandas df fails
new df44691 ARROW-2694 - [Python] ArrayValue string conversion returns the representation instead of the converted python object string
new 8d296cc ARROW-2554: [Python] fix timestamp unit detection from python lists
new 2b00b8a ARROW-2488: [C++] Add Boost 1.67 and 1.68 as recognized versions
new d4755e4 ARROW-2677: [Python] Expose Parquet ZSTD compression
new c15eee7 ARROW-2395: [Python] Fix flake8 warnings outside of pyarrow/ directory. Check in CI
new 392fd02 ARROW-2696: [JAVA] enhance AllocationListener with an onFailedAllocation() call (#2133)
new a82a027 ARROW-2721: [C++] Fix ORC and Protocol Buffers link error
new e17f95d ARROW-1780 - JDBC Adapter to convert Relational Data objects to Arrow Data Format Vector Objects (#1759)
new 7c81bd6 ARROW-2723: [C++] Add .pc for arrow orc
new 97c24d7 ARROW-2726: [C++] Fix the latest Boost version
new 884474c ARROW-2699: [C++/Python] Add Table method that replaces a column with a new supplied column
new 1442816 ARROW-2400: [C++] Fix Status destructor performance
new 87cb95c ARROW-2716: [Python] Make manylinux1 base image independent of Python patch releases
new 1ab2abc ARROW-2729: [GLib] Add decimal128 array builder
new 5cd1df6 ARROW-902: [C++] Script for downloading all thirdparty build dependencies and configuration for offline builds
new df327f6 ARROW-2732: [GLib] Update brew packages for macOS
new 6ef9888 ARROW-2733: [GLib] Cast garrow_decimal128 to gint64
new 58a2366 ARROW-2676: [Packaging] Deploy build artifacts to github releases
new 4db8a36 ARROW-2731 Add external Orc capability
new ac0e521 ARROW-2725: [Java] make Accountant.AllocationOutcome publicly visible (#2149)
new b165c86 ARROW-2727: [Java] Fix POM file issue causing build failure in java/adapters/jdbc
new 27ddd3f ARROW-2741: [Python] pa.array from np.datetime[D] and type=pa.date64 produces invalid results
new 2484278 ARROW-2738: [GLib] Use Brewfile on installation process
new e757e7e ARROW-2739: [GLib] Use G_DECLARE_DERIVABLE_TYPE
new 4481b07 ARROW-2608: [Java/Python] Add pyarrow.{Array,Field}.from_jvm / jvm_buffer
new c73a88d ARROW-2745: [C++] ORC ExternalProject needs to declare dependency on vendored protobuf
new c4d1001 ARROW-2740: [Python] Add address property to Buffer
new ae69a78 [JS] Add example to load data via fetch (#2125)
new 061aa5e [JS] Add NPM shield, add yarn installation (#2124)
new 98253f3 [C++] Fix typo in README.md
new 7e6dd60 ARROW-2662: [Python] Add to_pandas to ChunkedArray
new 8c9890c ARROW-2742: [Python] Allow Table.from_batches to use iterator of record batches
new 9a9445a ARROW-2748: [GLib] Add garrow_decimal_data_type_get_scale() (and _precision())
new 66c017e [GLib] Fix a typo
new 6a70680 ARROW-2753: [GLib] Add garrow_schema_*_field()
new 5e6c773 ARROW-2752: [GLib] Document garrow_decimal_data_type_new()
new 9979729 [Python] Fix typo in ipc.rst
new 32de512 ARROW-2751: [GLib] Add garrow_table_replace_column()
new 4a20fc7 ARROW-2683: [Python] Resource Warning (Unclosed File) when using pyarrow.parquet.read_table()
new d38e862 ARROW-2758: [Plasma] Use Scope enum in Plasma
new cac2c07 ARROW-2749: [GLib] Rename *garrow_decimal128_array_get_value to *garrow_decimal128_array_format_value
new 6cf5809 ARROW-2755: [Python] Allow using Ninja to build extension
new c9ce215 ARROW-2747: [Python] Fix huge pages Plasma test
new c56d881 ARROW-2761: [Python] Add support for set operations in hive partition filtering
new 88f6794 ARROW-2763: [Python] Make _metadata file accessible in ParquetDataset
new 4b2e52c ARROW-2335: [Go] move README one directory higher
new c9613cc ARROW-2781: [Python] Download boost using curl in manylinux1 image
new 35b1f9a [Python] MNT: Switch manylinux1 image back to latest tag (#2204)
new ceae13d ARROW-2344: [Go] Run Go unit tests in Travis CI
new 18f31e2 ARROW-2782: [Plasma] xfail plasma hugepage test
new 7deb141 ARROW-1954: [Python] Add metadata accessor to pyarrow.Field
new 42c2101 ARROW-2780: [Go] Run code coverage analysis
new 782c5a5 ARROW-2773: [Python] corrected partition_cols parameter name
new c8d97fa ARROW-2650: [JS] Implement Unions
new e7aaf7b ARROW-2657: [Python] Import TensorFlow python extension before pyarrow to avoid segfault
new 037c156 ARROW-2794: [Plasma] Add the RPC of a list of Delete Objects in Plasma
new 0175167 ARROW-2795: [Python] Run TensorFlow import workaround only on Linux platforms
new d5d39f7 ARROW-2779: [JS] stream reader fixes
new 9d1432e ARROW-2798: [Plasma] Use hashing function that takes into account all UniqueID bytes
new 148c05c ARROW-2754: [Python] Change Python setup.py to make release builds by default
new 010c874 ARROW-2805: [Python] Use official way to find TensorFlow module
new 836afac ARROW-2634: [Go] Add Go license details to LICENSE.txt
new d7ad70d ARROW-2802: [Docs] Move all release management instructions to Confluence
new 0933b4f ARROW-2601: [Python] Prevent user from calling *MemoryPool constructors directly
new 528cb7d ARROW-1454: [Python] Also match ArrowNotImplementedError in unsupported type conversions from pandas
new 735e38b ARROW-2784: [C++] MemoryMappedFile::WriteAt allow writing past the end
new ae5e4b8 ARROW-2809: [C++] Only print cpplint and clang-format output for failures by default
new fbfddfd ARROW-1722: [C++] Add linting script to find C++/CLI incompatibilities
new ddcb8f5 ARROW-2789: [JS] Add iterator to DataFrame
new 1a419fe ARROW-2790: [C++] Buffers can contain uninitialized memory
new 8cfa51a ARROW-2790: [C++] Minor style changes from the review
new b926574 ARROW-2722: [Python] Sanitize dtype number to handle edge cases
new fa08ddf [CI] Add missing quote for omittable variable (#2235)
new e515ba3 ARROW-2823: [C++] Search for flatbuffers in <root>/lib64
new d130a00 ARROW-2812: [Ruby] Fix Arrow::Array#[] interface for Arrow::StructArray
new a096eb1 ARROW-2821: [C++] Remove redundant memsets in BooleanBuilder
new bd1c332 ARROW-2724: [Packaging] Determine whether all the expected artifacts are uploaded
new bce0ca4 [Packaging] Fix fallback value code in Ruby
new 113430a ARROW-2383: [deb] Use system Protocol Buffers
new 7a413fe ARROW-2827: [C++] Stop to use -jN in sub make
new 35e85b8 ARROW-2820: [Python] Check that array lengths in RecordBatch.from_arrays are all the same
new e366604 ARROW-2824: [GLib] Add garrow_decimal128_array_get_value()
new e147057 ARROW-2834: [GLib] Remove "enable_" prefix from Meson options
new fa4f2aa ARROW-2838: [Python] Speed up PandasObjectIsNull
new 5ee8ed7 ARROW-2830: [deb] Enable parallel build again
new 2c90eff ARROW-2829: [GLib] Add GArrowORCFileReader
new b4e3017 ARROW-2836: [Packaging] Expand build matrices to multiple tasks
new 82f2843 ARROW-2848: [Packaging] Use lib*10.deb instead of lib*0.deb
new 59ec2b1 ARROW-2849: [Ruby] Arrow::Table#load supports ORC
new 4e8165c ARROW-2825: [C++] Add AllocateBuffer / AllocateResizableBuffer variants with default memory pool
new cbb13ea ARROW-2851: [C++] Update RAT excludes for new install file names
new 6a2d4e8 ARROW-2846: [Packaging] Update nightly build in crossbow as well as the sample configuration
new d6aa492 ARROW-2850: [C++/Python] Correctly set RPATHs on all binaries
new dea6ced ARROW-2841: [Go] support building in forks
new 56497ba ARROW-2844: [Packaging] Test OSX wheels after build
new ebc8dab ARROW-2771: [JS] Add row proxy object accessor
new 3fd913e ARROW-2861: [Python] Add note about how to not write DataFrame index to Parquet
new cecbcf7 ARROW-1744: [Plasma] Provide TensorFlow operator to transfer Tensors between Plasma and TensorFlow
new c49a57d ARROW-2663: [Python] Make dictionary_encode and unique accesible on Column / ChunkedArray
new 3419058 ARROW-2806: [C++/Python] More consistent null/nan handling
new faf16be ARROW-2787: [Python] Fix Cython usage instructions
new 8d8645c ARROW-564: [Python] Add Array.to_numpy()
new 3d5b4d8 ARROW-2847: [Packaging] Fix artifact name matching for conda forge packages
new 5063b33 ARROW-2690: [Plasma] Use uniform function names in public APIs in Plasma. Add namespace around Flatbuffers
new 4ba8769 ARROW-2872: [Python] Add tensorflow mark to opt-in to TF-related unit tests
new 31a9d38 ARROW-2868: [Packaging] Fix Apache Arrow ORC GLib related problems
new 8443759 ARROW-2871: [Python] Raise when calling to_numpy() on boolean array
new 401c8df ARROW-2874: [Packaging] Pass job prefix when putting on Queue
new 277365a [Packaging] Make OSX wheel build verbose
new 35ef303 ARROW-2810: [Plasma] Remove flatbuffers from public API
new f8cd36a ARROW-2326: [Python] Use @loader_path/ as rpath instead of @loader_path when bundling C++ libraries in wheels on macOS
new aedba2c ARROW-2822: [C++] Zero padding bytes in PoolBuffer
new ec9b72a ARROW-1715: [Python] Implement pickling for Column, ChunkedArray, RecordBatch, Table
new 7d2fbeb ARROW-2822: [C++] Remove the unneeded const qualifier and clarify the comments
new 35c541a ARROW-2883: [C++] Fix Clang warnings in code built with -DARROW_GPU=ON
new 87c976a ARROW-2855: [C++] Blog post that outlines the benefits of using jemalloc
new 89d92dd ARROW-2890: [Plasma] Make python client release method private
new 4b74592 ARROW-2877: [Packaging] crossbow submit results in duplicate Travis CI build
new 73aee8c ARROW-889: [Python/C++] Unify PrettyPrints between Python and C++
new 0172169 ARROW-2886: [Release] Remove an unused variable
new 2ff8708 ARROW-2878: [Packaging] README.md does not mention setting GitHub API token in user's crossbow repo settings
new 3c10ed2 ARROW-2884: [Packaging] Support RC
new 0b838f9 ARROW-2845: [Packaging] Upload additional debian artifacts
new a8ec080 ARROW-2894: [Glib] Adjust tests to format refactor
new 26cf346 ARROW-2896: [GLib] Add missing exports
new d42a7d7 ARROW-2895: [CI] Add missing Ruby dependency on C++
new 355ff08 ARROW-2816: [Python] Make NativeFile BufferedIOBase-compliant
new 13c103a ARROW-2876: [Packaging] Replace ssh-URLs with https://
new 0ecf88d ARROW-2891: [Python] Preserve schema in write_to_dataset
new 141b2c3 ARROW-2711: [Python] Fix inference from Pandas column with first empty list
new c8728de ARROW-2897: [Packaging] Organize supported Ubuntu versions
new a495885 ARROW-2514: [Python] Speed up inferring nested Numpy array
new 491114b ARROW-2744: [C++] Avoid creating list arrays with a null values buffer
new eaa6053 ARROW-2300: [C++/Python] Integration test for HDFS
new 566e398 ARROW-2586: [C++] Changing the type of ListBuilder's and StructBuilder's children from unique_ptr to shared_ptr so that it can support deserialization from Parquet to Arrow with arbitrary nesting
new b0ac721 ARROW-2902: [Python] Clean up after build artifacts created by root docker user in HDFS integration test
new aaf8d5d ARROW-2575: [Python] Exclude hidden files starting with . in ParquetManifest
new 078b806 ARROW-2656: [Python] Improve creation time of ParquetManifest for partitioned datasets using thread pool
new 2a0128d ARROW-2859: [Python] Accept buffer-like objects as sources in open_file, open_stream APIs
new 0269203 ARROW-2756: [Python] Remove redundant imports and minor fixes in parquet tests
new 4bed3bc ARROW-2864: [Plasma] Add deletion cache to delete objects later when they are not in use.
new a8a48c2 ARROW-2901: [Java] Build is failing on Java9
new 6d0747a ARROW-2906: [Website] Remove the link to slack channel
new 9b85118 ARROW-2852: [Rust] Make Array sync and send
new 752d20c ARROW-2907: [GitHub] Improve the first paragraph of "How to contribute patches"
new 2657f9d ARROW-2701: [C++] Make MemoryMappedFile resizable redux
new f019601 ARROW-2908: [Rust] Update version to 0.10.0
new 3642e81 ARROW-2881: [Website] Add community tab to header, add link and callout to dev wiki
new 5033f6e ARROW-2826: [C++] Remove ArrayBuilder::Init method, clean up Resize, remove PoolBuffer from public API
new 16bbdd4 ARROW-2777: [JS] Friendlier onboarding readme
new 8e6af29 ARROW-2914: [Integration] Add WindowPandasUDFTests to Spark integration script
new 47e462f ARROW-2915: [Packaging] Remove artifact form ubuntu-trusty build
new fdc8e6a ARROW-2917: [Python] Use detach() to avoid PyTorch gradient errors
new 76033f4 ARROW-2918: [C++] Improve formatting of Struct pretty prints
new 49ccf6a ARROW-2760: [Python] Remove legacy property definition syntax from parquet module and test them
new 4ba2d19 ARROW-2921: [Release] Update .deb/.rpm changelogs in preparation
new 537e7f7 ARROW-2920: [Python] Fix pytorch segfault
new a18f362 ARROW-2922: [Release] Make python command name customizable
new 432dd93 ARROW-2923: [DOC] Adding Apache Spark integration test instructions
new c6e33d8 ARROW-2924: [Java] mvn release fails when an older maven javadoc plugin is installed
new 0707b16 ARROW-2927: [Packaging] AppVeyor wheel task is failing on initial checkout
new b56d269 ARROW-2700: [Python] Add simple examples to Array.cast docstring
new bbf2e0f ARROW-2929: [C++] ARROW-2826 Breaks parquet-cpp 1.4.0 builds
new cd162f5 ARROW-2704: [Java] Change MessageReader API to improve custom message handling for streams
new 7ffb6c5 ARROW-2934: [Packaging] Add checksums creation to sign subcommand
new 161d1f0 ARROW-2926: [Python] Do not attempt to write tables with invalid schemas in ParquetWriter.write_table
new 2422d9c ARROW-2660: [Python] Experimental zero-copy pickling
new 734828d ARROW-2937: [Java] Followup to ARROW-2704. Make MessageReader classes immutable and clarify docs
new 26affd7 ARROW-2940: [Python] Fix OSError when trying to load libcaffe2.so in pytorch 0.3.0
new 300d914 ARROW-2935: [Packaging] Add verify_binary_artifacts function to verify-release-candidate.sh
new 4c0d49f ARROW-2875: [Packaging] Don't attempt to download arrow archive in linux builds
new 8c964a7 ARROW-2945: [Packaging] Update argument check
new f0de121 ARROW-2947: [Packaging] Remove Ubuntu Artful
new 29408fd ARROW-2946: [Packaging] Stop to use $PWD
new c7175c4 ARROW-2943: [C++] Implement BufferedOutputStream::Flush
new f61d739 ARROW-2944: [Format] Synchronize some metadata changes to columnar format Markdown documents
new beefaca ARROW-2949: [CI] Add retry logic when downloading miniconda to reduce flakiness
new f0cbfc3 ARROW-2955: Fix typo in pyarrow's HDFS API result
new 3b24bc2 ARROW-2954: [Plasma] Reduce plasma store memory usage
new 0fc224b ARROW-2930: [C++] migrated MacOS specific code for shared library target
new 567e33b ARROW-2960: [Packaging] Fix verify-release-candidate for binary packages and fix release cutting script for lib64 cmake issue
new cf79192 ARROW-2911: [Python] Parquet binary statistics that end in '\0' truncate last byte
new d48dce2 ARROW-2953: [Plasma] Reduce plasma memory usage
new 7afa4af ARROW-2963: [C++] Make thread pool fork-safe
new 446dd45 [Release] Update CHANGELOG.md for 0.10.0
new d38bc66 [Release] Update .deb/.rpm changelogs for 0.10.0
new 07f142d [maven-release-plugin] prepare release apache-arrow-0.10.0
new 0f5fb20 ARROW-2813: [CI] Mute uninformative lcov warnings
new ef933a6 ARROW-2666: [Python] Add __array__ method to Array, ChunkedArray, Column
new 0c29673 ARROW-2962: [Packaging] Bintray descriptor files are no longer needed
new 495bf36 ARROW-2480: [C++] Enable casting the value of a decimal to int32_t or int64_t
new 1b2a42e ARROW-2978: [Rust] Change argument to rust fmt to fix build
new 7c953a0 ARROW-2977: [Packaging] Release verification script should check rust too
new de50744 ARROW-2869: [Python] Add documentation for Array.to_numpy
new 072fa77 ARROW-2985: [Ruby] Add support for verifying RC
new 00aed05 ARROW-2990: [GLib] Support building with rpath-ed Arrow C++ on macOS
new 91eab98 ARROW-2951: [CI] Don't skip AppVeyor build on format-only changes
new ea9157a ARROW-2982: Ensure release verification script works with wget < 1.16, build ORC in C++ libraries
new e10f2b3 ARROW-2815: [CI] Skip Java tests and style checks on C++ job [skip appveyor]
new d3c9c1d ARROW-2061: [C++] Make tests a bit faster with Valgrind
new 71145cd ARROW-2988: Improve Windows release verification script to be more automated
new 551e9ce ARROW-2813: [CI] [Followup] Disable gcov output in Travis-CI logs
new edfbf84 ARROW-2964: [Go] wire all primitive arrays into array.MakeFromArray
new 34d383b ARROW-2996: [C++] Fix typo in cpp/.clang-tidy
new a51d81e ARROW-1491: [C++] Add casting from strings to numbers and booleans
new 7bead55 ARROW-3001: [Packaging] Don't modify PATH during rust release verification
new 6a5d6c9 ARROW-2220: Only suggest default fix version that is a mainline release in merge tool
new 632b8ce ARROW-3003: [Doc] Enable Java doc generation
new bfe5eb3 ARROW-2992: [CI] Remove some AppVeyor build configurations
new b6e5653 ARROW-2999: [Python] Disable ASV runs in Travis CI for now
new 9f0a63d ARROW-3005: [Release] Update website, draft simple release blog post for 0.10.0
new e4f2713 ARROW-2992: [Python] Fix Parquet benchmark
new 29c4f76 ARROW-3007: [Packaging] Remove needless dependencies
new 568f1cb ARROW-3011: [CI] Remove Slack notification
new 1ad11fd ARROW-3008: [Packaging] Verify GPU related modules if available
new 25cc0d7 ARROW-3006: [GLib] Fix a bug that .gir/.typelib for GPU aren't installed
new 91ffc00 ARROW-3009: [Python] Fix pyarrow ORC reader
new 5fda431 ARROW-3010: [GLib] Update README to use Bundler
new d247c5b ARROW-3002: [Python] Hash more parts of pyarrow.Field
new 0cef55a ARROW-2976: [Python] Fix pyarrow.get_library_dirs
new c114783 ARROW-3019: [Packaging] Use Bundler to verify Arrow GLib
new 479c011 ARROW-2950: [C++] Clean up util/bit-util.h
new e9c52f2 ARROW-2986: [C++] Use /EHsc flag for exception handling on MSVC, disable C4772 compiler warning in arrow/util/logging.h
new 5c97cd6 ARROW-2994: [Python] Only include Python and NumPy include directories for libarrow_python targets
new 4660833 ARROW-2975: [Plasma] Fix TensorFlow operator compilation with pip package
new 9b24c3e ARROW-3024: [C++] Remove mutex in MemoryPool implementations
new 67aae05 ARROW-3021: [Go] add support for List arrays
new 7b611f7 [C++] Fix a typo in `FindClangTools.cmake`. (#2404)
new 89120f0 ARROW-3023: [C++] Add gold linker enabling logic from Apache Kudu
new 2359de4 ARROW-3027: [Ruby] Stop "git tag" by "rake release"
new 6e5bfe2 ARROW-3026: [Python] [Plasma] Only run Plasma unit tests with valgrind under Python 3.6
new 18edb97 ARROW-3012: [Python] Fix setuptools_scm usage
new 699b0d2 ARROW-3000: [C++] Add option to label test groups then only build those unit tests
new 216f3e4 ARROW-3028: [Python] Do less work to test Python documentation build
new aff1dca ARROW-3015: [Python] Fix typo in uint8() docstring
new 78a4d65 ARROW-2814: [Python] Unify conversion paths for sequences of Python objects
new 0c38a21 ARROW-3029: [Python] Generate version file when building
new b5a97cb ARROW-2998: [C++] Add unique_ptr versions of Allocate[Resizable]Buffer
new baff6d8 ARROW-2979: [GLib] Add operator functions in GArrowDecimal128
new ed3306b ARROW-3046: [GLib] Use rubyish method
new 2dfb01f ARROW-3022: [Go] add support for Struct arrays
new 21c3a18 ARROW-3042: [Go] add godoc badge to README
new affd6d7 ARROW-2952: [C++] Dockerized include-what-you-use
new 7031a86 ARROW-3047: [C++/Python] Better build instructions with ORC
new f708c63 ARROW-3043: [C++] pthread doesn't exist on MinGW
new 7c93d0a ARROW-3034: [Packaging] Resolve symbolic link in tar.gz
new b99d8f3 ARROW-3045: [Python] Remove nullcheck from ipc Message and MessageReader
new ef4622b ARROW-3044: [Python] Remove all occurrences of cython's legacy property definition syntax
new 72e4470 ARROW-3031: [Go] streamline Release of Arrays and Builders
new 1810db2 ARROW-3036: [Go] implement array.NewSlice
new 08fa38b ARROW-3037: [Go] implement Null array
new 8c42737 ARROW-3035 [Rust] Examples in README.md do not run
new 3edfd7c ARROW-3057: [INTEGRATION] Fix spark and hdfs dockerfiles
new 3214e99 ARROW-2925: [JS] Documentation failing in docker container
new f62e07a ARROW-2687: [JS] Example usage in README is outdated
new 51410bc ARROW-3062: [Python] Fix python package finder to also work in Python 2.7
new 719c954 ARROW-3059: [C++] Remove namespace arrow::test
new 9428b86 ARROW-3064: [C++] Add option to ADD_ARROW_TEST to indicate additional dependencies for particular unit test executables
new 170dc75 ARROW-1799: [Plasma C++] Make unittest does not create plasma store executable
new 1209a80 ARROW-2583: [Rust] Buffer should be typeless
new c82dfcd ARROW-3067: [Packaging] Support dev/rc/release .deb/.rpm builds
new 16bbec8 ARROW-3068: [Packaging] Bump version to 0.11.0-SNAPSHOT
new 8db4e10 ARROW-3060: [C++] Factor out string-to-X conversion routines
new da73971 ARROW-3053: [Python] Add unit test for strided object conversion that was failing in 0.9.0
new 349e6b0 ARROW-2971: [Python] Give some modules in arrow/python more descriptive names
new c453a7a ARROW-1968: [C++/Python] Add basic unit tests for ORC reader
new a43e670 ARROW-3083: [CI] Use higher clone depth to pick up tags [skip appveyor]
new 94e8196 ARROW-3088: [Rust] Use internal `Result<T>` type instead of `Result<T, ArrowError>`
new dbf531b ARROW-3050: [C++] Adopt HiveServer2 client codebase from cloudera/hs2client. Add Thrift to thirdparty toolchain
new e8e82d0 ARROW-3094: [Python] Easier construction of schemas and struct types
new 27f990a ARROW-3098: [C++/Python] Allow seeking at end of BufferReader and FixedSizeBufferWriter
new 53e7594 ARROW-3095: [Plasma] Move plasma store
new 9f04fc2 ARROW-3100: [GLib] Follow Homebrew change that lua splits luarocks
new 47f831b ARROW-3106: [Website] Update committers and PMC roster on website
new 8602509 ARROW-3099: [C++] Add benchmark for number parsing
new 4969da7 ARROW-3109: [Python] Add Python 3.7 virtualenvs to manylinux1 container
new 12a5755 ARROW-3110: [C++] Fix warnings with gcc 7.3.0
new 23045d2 ARROW-2965: [Python] Guard against overflow when serializing Numpy uint64 scalar
new f3542c2 ARROW-2705: [JS] CombinationPredicates should take list of predicates
new fda4b3d ARROW-3105: [Plasma] Improve flushing error message
new 6cf8ea7 ARROW-1661: [Python] Build Python 3.7 in manylinux container
new 65a391a ARROW-3111: [Java] Adding logback config file to allow running tests with different log level
new 06e7a1b ARROW-3018: [Plasma] Remove Mersenne twister
new 588c1fa ARROW-1563: [C++] Implement logical unary and binary kernels for boolean arrays
new 5a01af8 ARROW-3114: [Website] Add information about user@ mailing list to website / Community page
new b73aa20 ARROW-1563: [C++][FOLLOWUP] Use std::function instead of declaring auxiliary helper classes
new 628b74b ARROW-3116: [Plasma] Add "ls" to object store
new a5c8a02 ARROW-3117: [GLib] Add garrow_chunked_array_to_string()
new f841b90 ARROW-3017: [C++] Don't throw exception in arrow/util/thread-pool.h
new d20147e ARROW-3125: [Python] Update ASV instructions
new 498215f ARROW-3049: [C++/Python] Fix reading empty ORC file
new 3e61e63 ARROW-3129: [Packaging] Stop to use deprecated BuildRoot and Group in .spec
new 0311f63 ARROW-2734: [Python] Cython api example doesn't work by default on macOS
new ef21ec9 ARROW-3130: [Go] add initial support for Go modules
new 56b50f6 ARROW-3119: [Packaging] Nightly packaging script fails
new 44c2fa7 [Python] [DOC] Fix a typo in doc/source/ipc.rst
new 658087e ARROW-3140: [Plasma] Fix Plasma build with GPU support
new b79b869 ARROW-3148: [C++] Remove needless U+00A0 NO-BREAK SPACE (#2500)
new 5829f10 ARROW-3147: [C++] Improve MSVC version detection
new 927bd34 ARROW-3018: [Plasma][FOLLOWUP] Update plasma documentation
new 84fcb82 ARROW-3142: [C++] Fetch all libs from toolchain environment
new c282a14 ARROW-3115: [JAVA] Style checks - fix import ordering
new 20c0405 ARROW-3153: [Packaging] Fix broken nightly package builds introduced with recent cmake changes and orc tests
new a712165 ARROW-1949: [Python/C++] Add option to Array.from_pandas and pyarrow.array to perform unsafe casts
new 7483331 ARROW-3158: [C++] Handle float truncation during casting
new f209645 ARROW-3163: [Python] Add missing Cython dependency to source package
new ea728c7 ARROW-3167: [CI] Limit clcache cache size
new ad7e6c3 ARROW-2989: [C++/Python] Remove API deprecations in 0.10
new 3eaf46e ARROW-3090: [Rust] Accompany error messages with assertions
new 21ef18b ARROW-3160: [Python] Improve pathlib.Path support in parquet and filesystem modules
new e0c07f4 ARROW-3172: [Rust] Update documentation for datatypes.rs
new c1f528c ARROW-2974: [Python] Replace usages of "source activate" with "conda activate" in CI scripts
new 5d4d6ba ARROW-3177: [Rust] Update expected error messages for tests that 'should panic'
new 4007aff ARROW-2948: [Packaging] Generate changelog with crossbow
new fbba3be ARROW-3061: [JAVA] Fix BufferAllocator#getHeadroom (#2434)
new e7b269d ARROW-1325: [R] Initial R package that builds against the arrow C++ library
new f3247e8 ARROW-3127: [Doc] Add Tutorial for Sending Tensor from C++ to Python
new 03e9bb9 ARROW-3171: [Java] Enable checkstyle for line length and indentation
new 17999a8 [R] Add link to R folder in README.md
new 0427442 Initial pre-ASF parquet-cpp development by Nong Li
new b30c09c PARQUET-232: minor compilation issue
new 4f47d10 PARQUET-416: C++11 compilation, code reorg, libparquet and installation targets
new 09cec85 PARQUET-267: Detach thirdparty code from build configuration.
new aecf146 PARQUET-449: updated to latest parquet.thrift
new aac1934 PARQUET-437: Add googletest setup and ADD_PARQUET_TEST helper
new cea2092 PARQUET-469: Roll back Thrift thirdparty and compiled sources to 0.9.0
new fbda61f PARQUET-434: Add a ParquetFileReader class
new fb06287 PARQUET-418: Refactored parquet_reader utility for printing file contents.
new 7c33cc9 PARQUET-451: Add RowGroupReader helper class and refactor parquet_reader.cc into DebugPrint
new b1d44f1 PARQUET-439: Conform copyright headers to ASF requirements
new 2152bfc PARQUET-428: Support INT96 and FIXED_LEN_BYTE_ARRAY types
new ecf5e7d PARQUET-472: Changed the ownership of InputStream in ColumnReader.
new 6cefaba PARQUET-435: Change column reader methods to be array-oriented rather than scalar
new e837ee9 PARQUET-496: Fix cpplint configuration to catch more style errors
new 3bac789 PARQUET-438: Update RLE encoding tools and add unit tests from Impala
new 41eed7e PARQUET-485: Decouple page deserialization from column reader to facilitate unit testing
new 88e5e0e PARQUET-503: Reenable parquet 2.0 encoding implementations.
new fda229a PARQUET-454: Fix inconsistencies with boolean PLAIN encoding
new 143485d PARQUET-442: Nested schema conversion, Thrift struct decoupling, dump-schema utility
new 783b76c PARQUET-507: Reduce the runtime of rle-test
new 5e91cff PARQUET-455: Fix OS X / Clang compiler warnings
new 569ffb2 PARQUET-448: Add cmake options to not build tests and/or executables
new b47f67e PARQUET-513: Fail build if valgrind finds error during ctest, fix a core dump
new b268bb8 PARQUET-169: Implement support for bulk reading and writing rep/def levels
new bc47477 PARQUET-501: Add OutputStream abstract interface, refactor encoding code paths
new d35efe2 PARQUET-497: Decouple serialized file internals from the ParquetFileReader public API
new 4e18915 PARQUET-505: Column reader should automatically handle large data pages
new c857ddd PARQUET-456: Finish gzip implementation and unit test all compressors
new bf3bf75 PARQUET-446: Hide Thrift compiled headers and Boost from public API, #include scrubbing
new af04814 PARQUET-515: Add "SetData" to LevelDecoder
new a13b20f PARQUET-499: Complete PlainEncoder implementation for all primitive types and test end to end
new 218fe8e PARQUET-468: Use thirdparty Thrift compiler to compile parquet.thrift at make time
new 5d05c2e PARQUET-457: Verify page deserialization for GZIP and SNAPPY codecs, related refactoring
new 70665ce PARQUET-533: Add a Buffer abstraction, refactor input/output classes to be simpler using Buffers
new 3eb2233 PARQUET-525: Add test coverage for failure modes in ParseMetaData
new f0f40f6 PARQUET-538: Improve ColumnReader Tests
new ae1bb51 PARQUET-494: Implement DictionaryEncoder and test dictionary decoding
new cee3691 PARQUET-545: Improve API to support decimal type
new e5f04e6 PARQUET-518: Remove -Wno-sign-compare and scrub integer signedness
new 22f352c PARQUET-520: Add MemoryMapSource and add unit tests for both it and LocalFileSource
new 4923018 PARQUET-463: Add local DCHECK macros, fix some dcheck bugs exposed
new 853b128 PARQUET-537: Ensure that LocalFileSource is properly closed.
new 4bcfb0e PARQUET-519: Remove last of suppressed compiler warnings
new 8d43015 PARQUET-482: Organize public API headers
new 4b0a2d9 PARQUET-549: Add column reader tests for dictionary pages
new 0c764ea PARQUET-555: Dictionary page metadata handling inconsistencies
new 3db3240 PARQUET-561: Add destructor to PIMPL
new cd531f1 PARQUET-556:Extend RowGroupStatistics to include "min" "max" statistics
new 41920f5 PARQUET-488: Add SSE cmake toggle, fix build on systems without SSE
new b2c072e PARQUET-564: Add cmake option to run valgrind on each unit test executable
new c6e269d PARQUET-542: Support custom memory allocators
new 43b895f PARQUET-568: Enable top-level column selection.
new 1ce2869 PARQUET-559: Enable external RandomAccessSource as input to the ParquetFileReader
new 0a0ab70 PARQUET-566: Add method to retrieve the full column path
new bba4d52 PARQUET-572: Rename public namespace to parquet from parquet_cpp
new 8af2caf PARQUET-582: Conversions functions for Parquet enums to Thrift enums
new 2284964 PARQUET-583: Parquet to Thrift schema conversion
new f044f2a PARQUET-587: Implement BufferReader::Read(int64_t,uint8_t*)
new b3cdb9e PARQUET-436: Implement basic Write Support
new 7dc51f8 PARQUET-547: Refactor templates to all be based on DataType structs
new fe1f3b7 PARQUET-477: Add clang-format / clang-tidy checks to toolchain
new bb67f88 PARQUET-512: Add Google benchmark for performance testing
new d050bf4 PARQUET-597: Add data rates to benchmark output
new 862a975 PARQUET-589: Implement BufferedInputStream for better memory usage
new 37e621e PARQUET-599: Better size estimation for levels
new 968e2bd PARQUET-604: Add writer headers to installation
new c41cff6 PARQUET-605: Expose schema node in ColumnDescriptor
new 43bd263 PARQUET-603: Implement missing information in schema descriptor
new dd58e39 PARQUET-607: Public writer header
new 227f66f PARQUET-610: Print additional ColumnMetaData for each RowGroup
new af71bad PARQUET-614: Remove unneeded LZ4-related code
new e1cba98 PARQUET-616: WriteBatch should accept const arrays
new 3ff3b58 PARQUET-600: Add benchmarks for RLE-Level encoding
new e1e0d28 PARQUET-620: Ensure metadata is written only once
new cd8906c PARQUET-619: Add OutputStream for local files
new 634132e PARQUET-621: Add flag to indicate if decimalmetadata is set
new 01d31db PARQUET-598: Test writing all primitive data types
new 466aa3d PARQUET-625: Improve RLE read performance
new 2e42359 PARQUET-629: RowGroupSerializer should only close itself once
new b1a816d PARQUET-633: Add version to WriterProperties
new ba3012e PARQUET-634: Consistent private linking of dependencies
new ce0cdeb PARQUET-592: Support compressed writes
new c21b041 PARQUET-636: Expose selection for different encodings
new 427d0a2 PARQUET-641: Instantiate stringstream only if needed in SerializedPageReader::NextPage
new a8ad3a4 PARQUET-639: Do not export DCHECK in public headers
new 8e98b5c PARQUET-643: Add const modifier to schema pointer reference
new 05c4c9d PARQUET-646: Add options to make developing with clang and 3rd-party gcc easier
new b3d9a14 PARQUET-489: Shared library symbol visibility
new b6abc6d PARQUET-551:Handle compiler warnings due to disabled DCHECKs in relea…
new 1e73c0a PARQUET-657: Do not define DISALLOW_COPY_AND_ASSIGN if already defined
new 3b88b05 PARQUET-658: Add virtual destructor to ColumnReader
new e21857a PARQUET-659: Export extern templates for typed column reader/writer classes
new b283264 PARQUET-662: Compile ParquetException implementation and explicitly export
new a0bfd9d PARQUET-671: performance improvements for rle/bit-packed decoding
new 602b9be PARQUET-666: Add support for writing dictionaries
new 4078b87 PARQUET-694: Revert default data page size back to 1M
new 18aa1ac PARQUET-573: Create a public API for reading and writing file metadata
new f128d51 PARQUET-699: Update parquet.thrift from https://github.com/apache/parquet-format
new 3976997 PARQUET-701: Ensure that Close can be called multiple times
new 08ce126 PARQUET-700: Disable dictionary encoding for boolean columns
new ddf0297 PARQUET-676: Fix incorrect MaxBufferSize for small bit widths
new 78b1de3 PARQUET-681: Add tool to scan a parquet file
new cc1fdec PARQUET-704: Install scan-all.h
new 66e7299 PARQUET-703: Validate that ColumnChunk metadata counts nulls in num_values
new 8268107 PARQUET-708: account for "worst case scenario" in MaxBufferSize for bit_width > 1
new ce843c8 PARQUET-710: Remove unneeded private member variables from RowGroupReader ABI
new 10ebdbd PARQUET-711: Use metadata builders in parquet writer
new adcabc4 PARQUET-687: C++: Switch to PLAIN encoding if dictionary grows too large
new 20c2cb2 PARQUET-718: Fix I/O of non-dictionary encoded pages
new 87ff504 PARQUET-719: Fix WriterBatch API to handle NULL values
new 9e46b37 PARQUET-689: C++: Compress DataPages eagerly
new b8de4d0 PARQUET-720: Mark ScanAllValues as inline to prevent link error
new 13980f9 PARQUET-712: Add library to read into Arrow memory
new 9dfa948 PARQUET-724: Test more advanced properties setting
new 287cf01 PARQUET-728: Incorporate upstream Arrow API changes
new c9c7f4a PARQUET-721: benchmarks for reading into Arrow
new 9aae125 PARQUET-731: API to return metadata size and Skip reading values
new 43c7154 PARQUET-593: Add API for writing Page statistics
new 53958b1 PARQUET-737: Use absolute namespace in macros
new 20d4a17 PARQUET-741: Always allocate fresh buffers while compressing
new 408d788 PARQUET-739: Don't use a static buffer for data accessed by multiple threads
new c81a26e PARQUET-739: Don't use a static buffer for data accessed by multiple threads
new 350e520 PARQUET-747: Better hide TypedRowGroupStatistics in public API
new 73eb456 PARQUET-742: Add missing license headers
new 67ae6e5 PARQUET-752: Account for upstream Arrow API changes
new 05a168c PARQUET-759: Fix handling of columns of empty strings
new 9c527b7 PARQUET-760: Store correct encoding in fallback data pages
new 96a6dd4 PARQUET-745: TypedRowGroupStatistics fails to PlainDecode min and max in ByteArrayType
new 4409707 PARQUET-763: C++: Expose ParquetFileReader through Arrow reader
new 676d61c PARQUET-766: Expose ParquetFileReader through Arrow reader as const
new 6da9e8a PARQUET-764: Support batches for PLAIN boolean writes that aren't a multiple of 8
new 163b2ac PARQUET-762: C++: Use optimistic allocation instead of Arrow Builders
new 5abf985 PARQUET-775: Make TrackingAllocator thread-safe
new 246ec91 PARQUET-702: Add a writer + reader example with detailed comments
new a414be7 PARQUET-778: Standardize the schema output to match the parquet-mr format
new e0f9806 PARQUET-769: Add support for Brotli compression
new fbdba4f PARQUET-779: Export TypedRowGroupStatistics in libparquet
new 7a5f274 PARQUET-780: WriterBatch API does not properly handle NULL values for byte array types
new 38416c4 PARQUET-782: Support writing to Arrow sinks
new 8bbb5d7 PARQUET-789: Catch/translate ParquetExceptions in parquet::arrow::FileReader
new b801505 PARQUET-785: LIST schema conversion for Arrow lists
new 912d7af PARQUET-797: Updates for ARROW-418 header API changes
new 7752273 PARQUET-799: Fix bug in MemoryMapSource::CloseFile
new b50e626 PARQUET-805: Read Int96 into Arrow Timestamp(ns)
new 7790183 PARQUET-812: Read BYTE_ARRAY with no logical type as arrow::BinaryArray
new ffb7f06 PARQUET-816: Workaround for incorrect column chunk metadata in parquet-mr <= 1.2.8
new e348a6d PARQUET-813: Build thirdparty dependencies using ExternalProject
new deb5680 PARQUET-818: Refactoring to utilize common IO, buffer, memory management abstractions and implementations
new bfb24f6 PARQUET-819: Don't try to install no longer existing arrow/utils.h
new d36dc11 PARQUET-807: Allow user to retain ownership of parquet::FileMetaData.
new 1867981 PARQUET-809: Add SchemaDescriptor::Equals method
new 6d354a1 PARQUET-827: Account for arrow::MemoryPool API change and fix bug in reading Int96 timestamps
new ea9c4d3 PARQUET-828: Do not implicitly cast ParquetVersion enum to int
new 2cbd797 PARQUET-829: Make use of ARROW-469
new 6312724 PARQUET-830: Add parquet::arrow::OpenFile with additional properties and metadata args
new 97e69b4 PARQUET-820: Decoders should directly emit arrays with spacing for null entries
new f3a3c69 PARQUET-833: C++: Provide API to write spaced arrays
new 4e52f61 PARQUET-837: Remove RandomAccessSource::Seek method which can be a source of thread safety problems
new 18caeab PARQUET-835: Read Arrow columns in parallel with thread pool
new c195976 PARQUET-836: Bugfix + testcase for column subsetting in arrow::FileReader::ReadFlatTable
new d0446e1 PARQUET-691: Write ColumnChunk metadata after chunk is complete
new 38a6a98 PARQUET-841: Version number being incorrectly written for v1 files
new 493603d PARQUET-842: Do not set unnecessary fields in the parquet::SchemaElement
new 5a21610 PARQUET-843: Impala is thrown off by a REPEATED root schema node
new c016b72 PARQUET-844: Schema, compression consolidation / flattening
new 61b7b12 PARQUET-846: CpuInfo::Init() is not thread safe
new 270bda0 PARQUET-848: Build Thrift bits as part of main parquet_objlib component
new 8fda954 PARQUET-834: Support I/O of arrow::ListArray
new 7f305a6 PARQUET-857: Flatten parquet/encodings directory, consolidate code
new 7a65d43 PARQUET-862: Provide defaut cache size values
new d53bb1a PARQUET-866: API fixes for ARROW-33 patch
new 3eda0d2 PARQUET-867: Support writing sliced Arrow arrays
new ee62a34 PARQUET-874: Use default memory allocator from Arrow
new 6a9631a PARQUET-793: Do not return incorrect statistics
new 72cb04b PARQUET-877: Update Arrow Hash, update Version in metadata.
new cff54fa PARQUET-880: Prevent destructors from throwing
new 74db8d1 PARQUET-882: Improve Application Version parsing
new cb8eab9 PARQUET-888: Add missing virtual dtor.
new 0d2b951 PARQUET-889: Fix compilation when SSE is enabled
new 220aa56 PARQUET-894: Fix compilation warning
new 5ab15c6 PARQUET-895: Fix broken reading of nested repeated columns
new 48b70d0 PARQUET-894: Fix compilation warnings
new 9ca26c7 PARQUET-903: Add option to set RPATH to origin
new fb325c3 PARQUET-890: Support I/O of DATE columns in parquet_arrow
new 6060d83 PARQUET-908: Fix shared library visibility of some symbols in types.h
new b6b5aac PARQUET-909: Reduce buffer allocations (mallocs) on critical path
new aaf4ffd PARQUET-897: Only use designated public headers from libarrow
new 9d27375 PARQUET-919: Account for ARROW-683 changes, but make no functional changes. Set PARQUET_ARROW=on by default
new f0d1456 PARQUET-923: Account for Time type changes in Arrow
new 22d95d2 PARQUET-928: Support pkg-config
new 0f93007 PARQUET-933: Account for API changes in ARROW-728
new 22279eb PARQUET-934: Support multiarch on Debian
new 7bf8f04 PARQUET-935: Set version to shared library
new 81c2696 PARQUET-943: Fix build error on x86
new 4b53921 PARQUET-946: Add ReadRowGroup and num_row_group methods to arrow::FileReader
new d2c347d PARQUET-947: Account for Arrow library consolidation in ARROW-795, API changes in ARROW-782
new 2ea0d60 PARQUET-918: FromParquetSchema API crashes on nested schemas
new b3dedf4 PARQUET-953: Add static constructors to arrow::FileWriter for initializing from schema, add WriteTable method
new ada05fa PARQUET-918: Keep ordering in column indices when converting Parquet Schema
new c3dc8a1 PARQUET-898: Upgrade to googletest 1.8.0, move back to Xcode 6.4 in Travis CI
new 4ea7124 PARQUET-508: Add ParquetFilePrinter
new dac6505 PARQUET-958: [C++] Print Parquet metadata in JSON format
new 61f3b1d PARQUET-915: Support additional Arrow date/time types and metadata
new 6a27975 PARQUET-963: Return NotImplemented when attempting to read a struct field
new 35d09d4 PARQUET-595: API for KeyValue metadata
new 5e60bfc PARQUET-965: Add FIXED_LEN_BYTE_ARRAY read and write support in parquet-arrow
new 4e96056 PARQUET-679: Local Windows build and Appveyor support
new f444dfe PARQUET-936: Return Invalid Status if chunk_size <= 0 when WriteTable in parquet-arrow
new 7242b1c PARQUET-914: Rewording exception message in column writer.
new 8255ccc PARQUET-679: [C++] Resolve unit tests issues on Windows; Run unit tes…
new bd02cca PARQUET-930: Add timestamp[us] to schema test
new 8bc6ec5 PARQUET-679: Fix debug asserts in tests (msvc/debug build)
new 2fab6a2 PARQUET-984: Add abi and so version to pkg-config
new 0e4c4a1 PARQUET-992: Do not transitively include zlib.h in public API
new 7638af1 PARQUET-995: Use sizeof(Int96) instead of Int96Type
new a821f09 PARQUET-997: Fix override compiler warnings
new 0e1f467 PARQUET-978: [C++] Minimizing footer reads for small(ish) metadata
new 7d476b2 PARQUET-991: Resolve msvc warnings; Appveyor treats msvc warnings as …
new a8d8d22 PARQUET-967: Combine libparquet, libparquet_arrow libraries
new 5aa2339 PARQUET-999: Improve MSVC build - Enable PARQUET_BUILD_BENCHMARKS
new 5f42afa PARQUET-1008: [C++] TypedColumnReader::ReadBatch method updated to ac…
new 9dcb12d PARQUET-1003: Modify DEFAULT_CREATED_BY value for every new release v…
new 94e351c PARQUET-1029: [C++] Some extern template symbols not being exported in gcc
new 8f7282b PARQUET-1007: Update parquet.thrift
new 13f3fde PARQUET-991: Fix msvc warning C4100: '<id>': unreferenced formal parameter
new 514b74c PARQUET-1033: Improve documentation about WriteBatchSpaced
new 2d98407 PARQUET-911: [C++] Support nested structs in parquet_arrow
new cc46aff PARQUET-1038: Key value metadata should be nullptr if not set
new 1fdd816 PARQUET-1042: Fix Compilation breaks on GCC 4.8
new 61da26c PARQUET-1041: Support Arrow's NullArray
new 0a32c6b PARQUET-1043: Raise minimum CMake version to 3.2, delete cruft.
new b0414cc PARQUET-1044: Use compression libraries from Apache Arrow
new 81db371 PARQUET-858: Flatten column directory, minor code consolidation
new 40527c3 PARQUET-1045: Remove code that's being moved to Apache Arrow in ARROW-1154
new 5374737 PARQUET-1040: Add missing writer methods
new 3e34c37 PARQUET-1048: Apache Arrow static transitive dependencies
new 658c7fb PARQUET-1053: Fix unused result warnings due to unchecked Statuses
new 68315b8 PARQUET-1054: Fixes for Arrow API changes in ARROW-1199
new 2395770 PARQUET-1035: Write Int96 from Arrow timestamp(ns)
new 6c97fe6 PARQUET-1068: Modify .clang-format to use straight Google format with 90-character line width
new facce86 PARQUET-1072: Build with ARROW_NO_DEPRECATED_API in Travis CI
new 82d516e PARQUET-1078: Add option to coerce Arrow timestamps to a particular unit
new 7fd1519 PARQUET-1079: Remove Arrow offset shift unneeded after ARROW-1335
new 38a4e9f PARQUET-1083: Factor logic in parquet-scan.cc into a library function to help with perf testing
new eadc62e PARQUET-1085: [C++] Use namespaced macros from arrow/util/macros.h, work around UNUSED rename
new 5f54be7 PARQUET-1087: Add ScanContents function to arrow::FileReader that catches Parquet exceptions
new 4845e76 PARQUET-1088: Remove parquet_version.h from version control since it gets auto generated
new 751eb00 PARQUET-1090: Add max row group length option, fix int32 overflow
new 200774e PARQUET-1093: Improve Arrow level generation error message
new dcf96ed PARQUET-1002: Compute statistics based on Sort Order
new 92e7dae PARQUET-1098: Install util/comparison.h
new d29d4a9 PARQUET-1104: Upgrade to Apache Arrow 0.7.0 RC0
new 75cf66a PARQUET-929: Handle arrow::DictionaryArray when writing Arrow data
new cd1c622 PARQUET-1094: Add benchmark for boolean Arrow column I/O
new d7003c0 PARQUET-1100: Introduce RecordReader interface to better support nested data, refactor parquet/arrow/reader
new 468e737 PARQUET-1037: allow arbitrary size row-groups
new 9809754 PARQUET-1108: Fix Int96 comparators
new ac1a5d3 PARQUET-1114 Apply changes for ARROW-1601 ARROW-1611, change shared l…
new f5c7aee PARQUET-1123: [C++] Update parquet-cpp to use Arrow's AssertArraysEqual
new f36231d PARQUET-1121: Handle Dictionary[Null] arrays on writing Arrow tables
new f1dabe9 PARQUET-1138: Fix Arrow 0.7.1 build
new dcea0ab PARQUET-1150: Hide statically linked boost symbols
new da29595 PARQUET-1095: [C++] Read and write Arrow decimal values
new d619050 PARQUET-1164: [C++] Account for API changes in ARROW-1808
new 1124a79 PARQUET-970: Add Lz4 and Zstd compression codecs
new adc569a PARQUET-1167: [C++] FieldToNode function should return a status when throwing an exception
new 4acd139 PARQUET-1175: Fix arrow::ArrayData method rename from ShallowCopy to Copy
new 2b37b1f PARQUET-1165: Pin clang-format version to 4.0
new 5324ee9 PARQUET-859: Flatten parquet/file directory, consolidate file reader, file writer code
new bcc1f88 PARQUET-1177: Add PARQUET_BUILD_WARNING_LEVEL option and more rigorous Clang warnings
new 46e1d4e PARQUET-1092: Support writing chunked arrow::Table columns
new 7dbe374 PARQUET-1180: Fix behaviour of num_children element of primitive nodes
new 4538a2e PARQUET-1086: [C++] Remove usage of arrow/util/compiler-util.h
new d257a88 PARQUET-1193: [CPP] Implement ColumnOrder to support min_value and max_value
new 39c0b7b PARQUET-1179: Upgrade to Thrift 0.11, use std::shared_ptr instead of boost::shared_ptr
new 5ebb78c PARQUET-1200: Support reading a single Arrow column from a Parquet file
new 7f1b0c0 PARQUET-1226: Fixes for CHECKIN compiler warning level with clang 5.0
new cae28c0 PARQUET-1218: More informative error message on too short pages
new 96a0265 PARQUET-1233: Enable option to switch between stl classes and boost c…
new 15e8661 PARQUET-1225: NaN values may lead to incorrect filtering under certai…
new b3f3c09 PARQUET-1245: Fix creating Arrow table with duplicate column names
new 102d951 PARQUET-1166: Add GetRecordBatchReader in parquet/arrow/reader
new f28563d PARQUET-1071: Check that arrow::FileWriter::Close() is idempotent
new de865da PARQUET-1255: Fix error message when PARQUET_TEST_DATA isn't defined
new b73771b PARQUET-1265: Segfault on static ApplicationVersion initialization
new 828783d PARQUET-1267: [C++] replace "unsafe" std::equal by std::memcmp
new a251714 PARQUET-1268: Fix conversion of null list Arrow arrays
new 9d99820 PARQUET-1273: Properly write dictionary values when writing in chunks
new 42f287c PARQUET-1274: Prevent segfault that was occurring when writing a nanosecond timestamp with arrow writer properties set to coerce timestamps and support deprecated int96 timestamps.
new 2d0a904 PARQUET-1272: Return correct row count for nested columns in ScanFileContents
new fa53ea7 PARQUET-1279: [C++] Adding use of ASSERT_NO_FATAL_FAILURE in unit tests when calling helper functions that call ASSERT_ macros
new b9e80c8 PARQUET-1283: [C++] Remove trailing space for string and int96 statis…
new 076fbc6 PARQUET-979: Limit size of min, max or disable stats for long binary types
new f20fe7e PARQUET-1307: Fix memory-test for newer Arrow
new 129d845 PARQUET-1315: ColumnChunkMetaData.has_dictionary_page() should return…
new f38245b PARQUET-1340: Fix Travis Ci valgrind errors related to std::random_de…
new ea8798d PARQUET-1334: [C++] memory_map parameter seems missleading in parquet file opener
new 08ca177 PARQUET-1333: [C++] Reading of files with dictionary size 0 fails on Windows with bad_alloc
new 079ae70 PARQUET-1346: [C++] Protect against empty Arrow arrays with null values
new 630cf0a PARQUET-1350: [C++] Use abstract ResizableBuffer instead of concrete PoolBuffer
new fee8d70 PARQUET-1323: Fix compiler warnings on clang-6
new bd5243e PARQUET-1358: index_page_offset should be unset as it is not supported
new 0ccf832 PARQUET-1348: Add ability to write FileMetaData in arrow FileWriter
new 673ccfa PARQUET-1360: Use conforming API style, variable names in WriteFileMetaData functions
new 0e0f838 PARQUET-1227: Thrift crypto metadata structures
new 40b21b3 PARQUET-1357: FormatStatValue truncates binary statistics on zero character
new b6ad261 PARQUET-1366: [C++] Streamline use of Arrow's bit-util.h APIs
new ed7242e PARQUET-1301: [C++] Crypto package in parquet-cpp
new bdeed71 PARQUET-1332: Add bloom filter for parquet
new e26afc2 PARQUET-1378: Allow RowGroups with zero rows to be written
new 72795ef PARQUET-1308: [C++] Use Arrow thread pool, not Arrow ParallelFor, fix deprecated APIs, upgrade clang-format version. Fix record delimiting bug
new d146452 PARQUET-1382: [C++] Prepare for arrow::test namespace removal
new aa166ed PARQUET-1384: fix clang build error for bloom_filter-test.cc
new 41ae86d PARQUET-1256: Add --print-key-value-metadata option to parquet_reader tool
new cdf2e3f PARQUET-1276: [C++] Reduce the amount of memory used for writing null decimal values
new 1dffe22 PARQUET-1392: Read multiple RowGroups at once into an Arrow table
new 1463276 PARQUET-1372: Add an API to allow writing RowGroups based on size
new 9b4cd9c ARROW-3075: [C++] Incorporate parquet-cpp codebase into Arrow C++ build system. Add unit test label granularity options, ability to add component group targets like 'make parquet' that build libraries and tests
new f397ce9 ARROW-2750: [MATLAB] Initial MATLAB interface, support for reading numeric types from Feather files
new e751999 ARROW-2799: [Python] Add safe option to Table.from_pandas to avoid unsafe casts
new 282bc52 ARROW-3173: [Rust] dynamic_types example does not run
new 92b6863 ARROW-3195: [C++] Add missing error check for NumPy initialization in test
new 462fb00 ARROW-3188: [Python] Table.from_arrays segfaults if lists and schema are passed
new 7a2d228 ARROW-2520: [Rust] CI should also build against nightly Rust
new a42d4bf ARROW-2617: [Rust] Schema should contain fields not columns
new 9f9563f ARROW-3170: [C++] Experimental readahead spooler
new 3f36f62 ARROW-3125: [C++] Add support for finding libpython on MSYS2
new 3147bf1 ARROW-3216: [C++] Add missing libpython link to libarrow_python in MinGW build
new 05d204d ARROW-3217: [C++] Add missing ARROW_STATIC definition in MinGW build
new 22eba33 ARROW-3218: [C++] Remove needless links to utilities in MinGW build
new a66f576 ARROW-3211: [C++] Disable gold linker with MinGW-w64
new 758be7a ARROW-3213: [C++] Use CMake to build vendored Snappy on Windows
new c898ba0 ARROW-3219: [C++] Use Win32 API in MinGW build
new 8393119 ARROW-3214: [C++] Disable insecure warnings in MinGW build
new e912675 ARROW-2936: [Python] Implement Table.cast for casting from one schema to another (if possible)
new 8d71906 ARROW-1963: [C++/Python] Create Array from sequence of numpy.datetime64
new 65154c9 ARROW-3128: [C++] Support system shared zlib
new c950e10 ARROW-3223: [GLib] Use the same shared object versioning rule in C++
new c6d6164 ARROW-1380: [Plasma] Fix "still reachable" valgrind warnings when PLASMA_VALGRIND=1
new 67c05c2 ARROW-2646: [C++/Python] Pandas roundtrip for date objects
new 99190d0 ARROW-1424: [Python] Add CUDA support to pyarrow
new 6daf91b ARROW-3235: [Packaging] Update deb names
new a5f5319 ARROW-3234: [C++] Fix libprotobuf shared library link order
new d851295 ARROW-3231: [Python] Sphinx's autodoc_default_flags is now deprecated
new c48eaec ARROW-3229: [Packaging]: Adjust wheel package scripts to account for Parquet codebase migration
new 6d51137 ARROW-3181: [Packaging] Adjust conda package scripts to account for Parquet codebase migration
new ed89eeb ARROW-3237: [CI] Update linux packaging filenames in rat exclusion list
new 362ba74 ARROW-3236: [C++] Fix stream accounting bug causing garbled schema message when writing IPC file format
new a2a1a66 ARROW-3132: Regenerate 0.10.0 changelog given JIRA metadata updates
new 8b7be05 ARROW-2958: [C++] Bump Flatbuffers EP version to master to build on gcc 8.1
new 4121c5a ARROW-3228: [Python] Do not allow PyObject_GetBuffer to obtain non-readonly Py_buffer when pyarrow Buffer is not mutable
new c698be3 ARROW-3242: [C++] Make CpuInfo a singleton, use coarser-grained dispatch to SSE4 in Parquet dictionary encoding
new 4488110 ARROW-3227: [Python] Require bytes-like input to NativeFile.write
new 2df2a9e ARROW-3157: [C++] Add Buffer::Wrap, MutableBuffer::Wrap convenience methods for wrapping typed memory, std::vector<T>
new 9c9dfa5 ARROW-3190: [C++] Rename Writeable references to Writable, add backwards compatibility, deprecations
new 6258e91 ARROW-3183: [Python] Fix get_library_dirs on Windows
new f9e88e5 ARROW-3251: [C++] Fix conversion warnings in cast.cc
new 3ca916d PARQUET-1398: [C++] move iv_prefix to Algorithms
new 3c9dc76 ARROW-3258: [GLib] Fix CI failure on macOS
new c49cccc ARROW-3257: [C++] Stop to use IMPORTED_LINK_INTERFACE_LIBRARIES
new e23f280 ARROW-3259: [GLib] Rename "writeable" to "writable"
new 8dd3fc7 ARROW-3056: [Python] Add notes to NativeFile docstrings for BufferedIOBase methods that are not implemented
new 92a2e6a ARROW-3240: [GLib] Add build instructions using meson
new 7104d64 ARROW-3187: [C++] Add support for using glog (Google logging library)
new d6b9dc2 ARROW-3072: [C++] Add RETURN_NOT_OK linting rule, use ARROW_RETURN_NOT_OK in header files
new 498776c ARROW-3268: [CI] Reduce conda times on AppVeyor [skip travis]
new e81683c ARROW-3252: [C++] Do not hard code the "v" part of versions in thirdparty toolchain
new c469aca ARROW-3274: [Packaging] Missing glog dependency from conda-forge recipes
new 9d142a6 ARROW-3262: [Python] Implement __getitem__ with integers on pyarrow.Column
new 78886e4 ARROW-3069: [Release] Stop using SHA1 checksums per ASF policy
new 35b5bce ARROW-3267: [Python] Create empty table from schema
new db0ef22 ARROW-3146: [C++] Prototype Flight RPC client and server implementations
new cdd458e ARROW-3261: [Python] Add "field" method to select fields from StructArray
new 7df2966 ARROW-3249: [Python] Run flake8 on integration_test.py and crossbow.py
new e2c4b09 ARROW-3270: [Release] Adjust release verification scripts to recent parquet migration
new 7edfbae ARROW-3287: [C++] Suppress "redeclared without dllimport attribute" warning from MinGW
new b254d34 ARROW-3286: [C++] Add missing ARROW_EXPORT to RecordBatchBuilder
new 773bd67 ARROW-3264: [Java] Checkstyle fix whitespace
new b91c844 ARROW-3284: [R][C++] Status code R error
new 19a1da0 PARQUET-1401: [C++] optional RowGroup fields for handling hidden columns
new 63087e4 ARROW-3288: [GLib] Add missing new API index for 0.11.0
new c25ed39 ARROW-3299: [C++] Make RecordBatchBuilder non-copyable to appease MSVC
new 996fc3c ARROW-3285: [GLib] Add arrow_cpp_build_type and arrow_cpp_build_dir options
new e2f4f9a ARROW-3300: [Release] Update deb package names in preparation
new ba83816 ARROW-3143: [C++] CopyBitmap into existing memory
new c2dc5e1 ARROW-2772: [JS] Commit package-lock.json and/or yarn.lock
new ca7cae9 ARROW-2697: [JS] Add note about published API documentation to JS README
new 3160cf1 ARROW-3196: Add support for merging both ARROW and PARQUET patches
new 53e359b ARROW-3279: [C++] Allow linking Arrow tests dynamically on Windows
new 4b53b30 ARROW-3269: [Python] Fix warnings in unit test suite
new 4d8619b ARROW-3197: [C++] Add instructions for building Parquet libraries and running the unit tests
new 391516d ARROW-3152: [Packaging] Add zlib to runtime dependencies for arrow-cpp conda package
new 6a6740f [Website] Add Falcon to powered_by.md (#2610)
new cc8d636 ARROW-2854: [C++/Python] Casting float NaN to int should raise an error on safe cast
new 94c1d00 ARROW-3305: [JS] Incorrect development documentation link in javascript readme
new 433a898 ARROW-3309: [JS] Missing links from DEVELOP.md
new a20d281 ARROW-3322: [CI] Fix AppVeyor script to skip Rust job when no Rust changes
new c9ac869 ARROW-3212: [C++] Make IPC metadata deterministic, regardless of current stream position. Clean up stream / tensor alignment logic
new 5167502 ARROW-3321: [C++] Improve integer parsing performance
new ea8940a ARROW-3282: [R] initial R functionality
new b3cd616 ARROW-249: [JAVA] Flight GRPC Implementation
new 9d007b1 ARROW-3319: [GLib] Add align() to GArrowInputStream and GArrowOutputStream
new 2a6c0cb ARROW-3313: [R] Move .clang-format to top level. Add r/lint.sh script for linting R C++ files in Travis CI
new ee9b1ba ARROW-3141: [Python] Raise numpy global requirement to 1.14
new 2381636 ARROW-3313: [R] Follow-up: install clang-format in R CI entry
new d1152a6 ARROW-3335: [Python] Add ccache to manylinux1 container
new c1cf985 ARROW-3256,3304: [JS] fix file footer inconsistency, yield all messages from the stream reader
new c800f15 ARROW-3334: [Python] Update conda packages to new numpy requirement
new d54f13d ARROW-3327: [Python] Use local Arrow checkout instead of separate clone
new 6beeaf4 PARQUET-681: Add tool to scan a parquet file
new d1e8433 PARQUET-711: Use metadata builders in parquet writer
new 5d0f5ac PARQUET-728: Incorporate upstream Arrow API changes
new de3f844 PARQUET-702: Add a writer + reader example with detailed comments
new 713c6cd PARQUET-818: Refactoring to utilize common IO, buffer, memory management abstractions and implementations
new 58ad1a8 PARQUET-807: Allow user to retain ownership of parquet::FileMetaData.
new 22a96d9 PARQUET-892: Specify public link targets for parquet_static so that transitive dependencies are linked in executables
new 4b444c6 PARQUET-909: Reduce buffer allocations (mallocs) on critical path
new bfb1e9a PARQUET-508: Add ParquetFilePrinter
new 174cc09 PARQUET-958: [C++] Print Parquet metadata in JSON format
new 3e15021 PARQUET-595: API for KeyValue metadata
new ebf0507 PARQUET-679: Local Windows build and Appveyor support
new 67f84a9 PARQUET-991: Resolve msvc warnings; Appveyor treats msvc warnings as …
new 4d21fd3 PARQUET-1029: [C++] Some extern template symbols not being exported in gcc
new 80dc883 PARQUET-1048: Apache Arrow static transitive dependencies
new 09cd545 PARQUET-1053: Fix unused result warnings due to unchecked Statuses
new d02cd9d PARQUET-1068: Modify .clang-format to use straight Google format with 90-character line width
new 144699c PARQUET-1083: Factor logic in parquet-scan.cc into a library function to help with perf testing
new eefd36e PARQUET-1177: Add PARQUET_BUILD_WARNING_LEVEL option and more rigorous Clang warnings
new 3a9dbdf PARQUET-1196: Example parquet_arrow project
new fca0625 PARQUET-1270: Install executable tools
new 37207bd PARQUET-1256: Add --print-key-value-metadata option to parquet_reader tool
new 0749154 PARQUET-1372: Add an API to allow writing RowGroups based on size
new 723a437 PARQUET-1427: [C++] Incorporate with build system, parquet target. Fix parquet-arrow example per repo changes
new 42cf69a ARROW-3136: [C++] Clean up public API
new 830a2b1 ARROW-3338: [Python] Crash when schema and columns do not match
new a10d40a ARROW-3065: [Python] concat_tables() failing from bad Pandas Metadata
new 81e813f [Website] Add FASTDATA.io to "Powered By" (#2639)
new b796b57 ARROW-3336: [JS] Fix IPC writer serializing sliced arrays
new 3a503ef ARROW-3073,3074: [JS] Add DateVector.from and fix DateVector.indexOf
new 85cab23 ARROW-3174: [Rust] run examples as part of CI
new d46e36a ARROW-3350: [Website] Fix powered by links
new 3c24f7a ARROW-3341: [R] Support for logical vector
new b719e73 ARROW-3349: [C++] Use aligned_* API in MinGW
new c3c024e ARROW-3013: [Website] Fix download links on website for tarballs, checksums
new ceed385 ARROW-3320: [C++] Improve float parsing performance
new 25927a5 ARROW-3352: [Packaging] Fix recently failing wheel builds
new e217802 ARROW-3175: [Java] Switch to official flatbuffers Java artifact and com.github.icexelloss for flatc executable artifact
new 86497f3 ARROW-3281: [Java] Make sure that WritableByteChannel in WriteChannel writes
new 28c4617 [Gandiva] Bootstrap evaluation using LLVM code generation
new 5315d20 [Gandiva] Make use of the modular features of cmake
new 5f721ae [Gandiva] Introduce error codes as error handling strategy.
new 030735f [Gandiva] Support functions of type NULL_INTERNAL
new fd00ba9 [Gandiva] Simplify the api to make function nodes
new d06db12 [Gandiva] Support if-else expression
new ba71c91 [Gandiva] expr decomposition moved to visitor
new 8b0dfe0 [Gandiva] Support literal expressions
new f8738db [Gandiva] Reduce bitmap updates for if-else
new 600f407 [Gandiva] First draft of Gandiva Java APIs
new 203bb7e [Gandiva] Add CMake support for proto files
new 2b27a99 [Gandiva] Add a zero-copy variant to Evaluate
new cadb463 [Gandiva] switch to /// or // style comments
new 3de315f [Gandiva] Add unit tests for bitmap/time fns
new d943f7d [Gandiva] Fix order of includes.
new 585e79c [Gandiva] Add Java APIs
new 6f67ce8 [Gandiva] Integrate java with travis CI.
new 0559290 [Gandiva] update benchmark results
new c949c11 [Gandiva] Add validation checks for Java coding guidelines in the build
new 9ce2377 [Gandiva] Added validation to projector build.
new bd2aa97 [Gandiva] Fixed licenses and minor corrections in build.
new 0f46c5b [Gandiva] Support boolean and/or
new 1fc01ee [Gandiva] Support null literals
new 5216242 [Gandiva] Support AND/OR control expressions
new d1f161b [Gandiva] Support null literals
new 366a972 [Gandiva] Support date/time functions and datatypes
new e344807 [Gandiva] Dynamically load dependencies.
new c0ae3c3 [Gandiva] Support variable len arrow vectors
new 35c9203 [Gandiva] Made Gandiva JNI a packagable library.
new 6099f6d [Gandiva] clang-format to validate/fix style
new 18a9052 [Gandiva] Deploy to ossrh after build.
new 9bc45a0 [Gandiva] support varlen types in gandiva
new 4eb7cde [Gandiva] Add cpp/Java microbenchmarks
new 75ca520 [Gandiva] Added support for time32 and timestampdiff functions
new 025d1b9 [Gandiva] Add support to print expressions
new 39d180f [Gandiva] Support more date/time functions
new eb11ea9 [Gandiva] link libstdc++ statically
new 9e98f03 [Gandiva] Export supported types from Gandiva.
new 8707793 [Gandiva] Fix missing include directory of gtest in CMakeLists.txt
new 5af1119 [Gandiva] Fixed extract second from time.
new e001c1d [Gandiva] Add hash functions on all data types
new a2c3300 [Gandiva] Fixed literals and nulls for time types.
new 8a794e8 [Gandiva] Fixed reference initializations.
new e490a7f [Gandiva] Add support for more date/time functions
new ac661ff [Gandiva] Match gandiva mod operator to dremio for mod zero.
new c0eab74 [Gandiva] Add support for filters
new 743b3c1 [Gandiva] Add java bindings for filter expr
new 7ce243a [Gandiva] Fixed filter bugs.
new 3cfbdb5 [Gandiva] Fixed selection vector array type
new e8ee7f9 [Gandiva] Executing TPCH queries.
new cf51280 [Gandiva] Perf Improvments
new 2cb9c0d [Gandiva] Caching projectors and filters for re-use.
new 71aad3d [Gandiva] Fixed concurrency issue in cache.
new 465e7e8 [Gandiva] Fixed Literal ToString.
new 28915eb [Gandiva] Add support for sql regex functions
new 8beb066 [Gandiva] Add a helper library containing cpp stubs
new abb24a9 [Gandiva] switch to a more efficient date impl
new 27673c8 [Gandiva] switch from std::regex to re2
new b8e3492 [Gandiva] add some minor functions required by tpch-h
new 8e9a915 ARROW-3182: [Gandiva] Integrate gandiva to arrow build. Update licenses to apache license.
new b7d4e08 [Gandiva] Fix copyright header in cpp/src/gandiva/README.md
new 97da1aa ARROW-3354: [Python] Swap cuda.read_record_batch arguments
new 037f6d9 ARROW-3339: [R] Support for character vectors
new 2ccfd0a PARQUET-1431: [C++] Automaticaly set thrift to use boost for thrift versions before 0.11
new 7157a74 ARROW-3276: [Packaging] Add support for Parquet deb/rpm packages
new c6a42e7 ARROW-3357: [Rust] Add a mutable buffer implementation
new f981a11 ARROW-3348: [Plasma] Fix bug in which plasma store dies when object created by remo…
new 74bf736 PARQUET-1160: [C++] Implement BYTE_ARRAY-backed Decimal reads
new 7a4b48c ARROW-3363: [C++/Python] Add helper functions to detect scalar Python types
new 74e98f8 ARROW-3369: [Packaging] Wheel builds are failing due to wheel 0.32 release
new 1649864 ARROW-3180: [C++] Add docker-compose setup to simulate Travis CI run locally
new 5ac7ffe ARROW-3371: [Python] Remove check_metadata argument for Field.equals docstring
new af20905 ARROW-3356: [Python] Document parameters of Table.to_pandas method
new 3545186 ARROW-3373: [Plasma] Fix bug when plasma client requests multiple objects and add test.
new dfe9523 PARQUET-1369: Disregard column sort order if statistics max/min are equal
new 803815d ARROW-3301: [Website] Update Jekyll and Bootstrap 4
new 08e0f68 ARROW-3375: [Rust] remove unused mempool
new d38cf86 ARROW-3076: [Website] Add Google Analytics scripts to Sphinx, Doxygen API docs
new acb0b13 ARROW-1521: [C++] Add BufferOutputStream::Reset method
new 08142bf ARROW-3250: [C++] Buffer implementation which owns memory from a std::string
new 3510ac8 ARROW-3370: [Packaging] Suppress BFD warnings on CentOS 6
new c522bea ARROW-2555: [C++/Python] Allow Parquet-Arrow writer to truncate timestamps instead of failing
new b83db61 ARROW-3374: [Python] Implicitly set from_pandas=True when passing pandas.Categorical to pyarrow.array. Preserve ordered categories
new 5ebab5a ARROW-25: [C++] Implement CSV reader
new ef9ccbb ARROW-3206: [C++] Fix CMake error when ARROW_HIVESERVER2=ON but tests disabled
new 1c7bded ARROW-2983: [Packaging] Verify source release and binary artifacts in different scripts
new 0101468 ARROW-3394: [Java] Remove duplicate dependency in Flight for grpc-netty
new d15da35 ARROW-3376: [C++] Add double-conversion to cpp/thirdparty/download_dependencies.sh
new b5e9cc5 ARROW-3397: [C++] Change a CMake relative path for modules
new efa28a7 ARROW-3393: [C++] Add missing override on virtual dtor in task-group.cc
new 8cb5cbf ARROW-3360: [GLib] Import Parquet GLib
new 93a8f2b ARROW-3395: [C++/Python] Add docker container for linting
new 94352b7 ARROW-3392: [Python] Support filters in disjunctive normal form in ParquetDataset
new bd23961 ARROW-3403: [Website] Source tarball link missing from install page
new fa4c1aa ARROW-3400: [Packaging] Add support for Parquet GLib deb/rpm
new a978786 ARROW-3404: [C++] Make CSV chunker faster
new 9c8129e ARROW-3377: [Gandiva][C++] Replace If statement with bit operations for bitmap
new e30bc75 ARROW-3411: [Packaging] Make dev/release/01-perform.sh executable
new 2862932 ARROW-3413: [Packaging] Include Parquet GLib document to source archive
new 0d1f120 ARROW-3412: [Packaging] Update rat exclude files
new c33a139 ARROW-3416: [Packaging] Use SHA512 instead of SHA1
new 959fa5e ARROW-3415: [Packaging] Fix "conda activate" failure
new 83945f0 ARROW-3417: [Packaging] Fix Parquet C++ test failure
new f7b7f96 ARROW-3418: [C++] Update parquet-cpp version to 1.5.1-SNAPSHOT
new 96affdc ARROW-2165: [JAVA] enhance AllocationListener with onChildAdded()/onChildRemoved() calls (#2697)
new 18a61f6 ARROW-3423: [Packaging] Remove RC information from deb/rpm packages
new 7940ffe [Release] Update CHANGELOG.md for 0.11.0
new 141e1a9 [Release] Update .deb/.rpm changelogs for 0.11.0
new 98db705 [maven-release-plugin] prepare release apache-arrow-0.11.0
new 01dcde2 [maven-release-plugin] prepare for next development iteration
new 222632c [Release] Update .deb package names for 0.12.0
new df745a7 ARROW-3430: [Packaging] Add workaround to verify 0.11.0
new 274237c ARROW-3431: [GLib] Include Gemfile to archive
new 07f2e78 ARROW-3432: [Packaging] Expand variables in commit message
new 9fc896e ARROW-3331: [Gandiva][C++] Add re2 to toolchain
new c6986e9 ARROW-3438: [Packaging] Fix too much Markdown escape in CHANGELOG
new 736c6b6 ARROW-2337: Use Boost shared libraries in Windows release verification script. Parquet fixes
new c0822bc ARROW-3382: [Gandiva][C++] Run tests in travis
new b8dc87a ARROW-3420: [C++] include-what-you-use cleanups for most of src/arrow, add "iwyu" target to top level docker-compose
new 8299684 ARROW-3429: [Packaging] Add binary upload script
new ef02c81 ARROW-3445: [GLib] Fix libarrow-glib link for libparquet-glib
new b46236c ARROW-3225: [C++/Python] Pandas object conversion of ListType<DateType> and ListType<TimeType>
new b8591e8 ARROW-3355: [R] Support for factors
new 9448f58 [Website] Add Petastorm to "Powered By" (#2716)
new bcd372b ARROW-3462: [Packaging] Update CHANGELOG for 0.11.0
new 4b3e5e6 PARQUET-1426: [C++] parquet-dump-schema has poor usability
new c951043 ARROW-3460: [Packaging] Add a script to rebase master on local release branch
new 739ea77 ARROW-3455: [Gandiva][C++] Support pkg-config for Gandiva
new 3769cd1 ARROW-3198: [Website] Blog post for the 0.11 release
new c785e6e ARROW-3463: [Website] Update for 0.11.0
new ea96080 ARROW-3461: [Packaging] Add a script to upload RC artifacts as the official release
new 2928468 ARROW-3467: [C++] Fix building against external double-conversion
new f4f6269 ARROW-3466: [C++] Avoid leaking protobuf symbols
new 97c8692 ARROW-3480: [Website] Fix broken install document for Ubuntu
new b4ed7c8 ARROW-3465: [Documentation] Fix gen_apidocs' docker image
new 8c3b879 [Gandiva] Fixed equality issue in validator.
new 62c5a30 [Gandiva] Fix divide by zero errors.
new 89163c5 [Gandiva] short-circuit regex startsW/endsW
new 89f8925 [Gandiva] add a java perf test for filter
new 17ebd46 [Gandiva] fix a bug caused due to fake cache hits
new 14615b8 [Gandiva] add evaluate variant that skips refs
new fa2273b [Gandiva] add debug msgs to print expressions
new a23fd1b [Gandiva] Error handling support.
new e975616 [Gandiva] Fixed build issues due to cherry pick.
new 9265fe3 [Gandiva] Hide stdc++ from being exported.
new fa98906 [Gandiva] Fixes for build issues.
new 9c10953 [Gandiva] fix bug with nested if in condition
new fb0a588 [Gandiva] allow multiple module instances in cache
new 1ae9235 [Gandiva] Added support for months_between
new 1ecda89 [Gandiva] Added more functions.
new 478f2f6 [Gandiva] math functions, utf8_length
new 7a2f147 [Gandiva] Modify as per arrow code base.
new ed8e566 ARROW-2808: [Python] Add MemoryPool tests
new 32960a1 ARROW-3479: [R] Support to write record_batch as stream
new 862c7df ARROW-3473: [Format] Clarify that 64-bit lengths and null counts are permitted, but not recommended
new c06cbae ARROW-3383: [Gandiva][Java] Fix java build
new 500134e ARROW-3482: [C++] Build with JEMALLOC by default
new cd6e094 ARROW-3353: [Packaging] Build python 3.7 wheels
new 1222996 ARROW-3398: [Rust] Update existing Builder to use MutableBuffer internally
new 35cd7ae ARROW-3474: [GLib] Extend gparquet API with get_schema and read_column
new 6caab7c ARROW-3425: [JS] Programmatically created dictionary vectors don't get dictionary IDs
new 098bd46 ARROW-3336: [JS] Fix IPC writer serializing sliced Utf8Vectors
new 1c85e41 ARROW-3492: [C++] Build jemalloc in parallel
new b027f5a ARROW-3350: [R] Wrap MemoryMappedFile class
new 146e7df ARROW-3442: [C++] Allow dynamic linking of (most) unit tests
new 2b95a00 ARROW-3343: [Java] Disable flaky tests
new afc8c63 ARROW-3440: [Gandiva] fix readme for builds
new 54634dd ARROW-3456: [CI] Reuse docker images and optimize docker-compose containers
new 65f0316 ARROW-3409: [C++] Streaming compression and decompression interfaces
new 4fe30d5 ARROW-3516: [C++] Use unsigned type for difference of pointers in parallel_memcpy
new d0d25bb ARROW-3518: Detect HOMEBREW_PREFIX automatically
new 0689a58 ARROW-3504: [Plasma] Add support for Plasma Client to put/get raw bytes without pyarrow serialization.
new aaf70a4 ARROW-3521: [GLib] Run Python using find_program in meson.build
new 66d9a30 ARROW-3514: [C++] Work around insufficient output size estimate on old zlibs
new c1ede67 ARROW-3494: [Gandiva][C++] fix re2 error in cmake
new 1532f1d ARROW-3528: [R] Fixed typo in R package documentation
new 7abda6d ARROW-3527: [R] remove unused variables
new eddd363 ARROW-1696: [C++] Add (de)compression benchmarks
new 2ba9016 ARROW-2483: [Rust] use bit-packing for boolean vectors
new acd798c ARROW-3534: [Python] Update zlib library in manylinux1 image [skip appveyor]
new 2b9e1e4 ARROW-45: [Python] Add unnest/flatten function for List types
new ed19152 ARROW-3533: [Python/Documentation] Use sphinx_rtd_theme instead of Bootstrap
new 8aea70c ARROW-3540: [Rust] Incorporate BooleanArray into PrimitiveArray
new aba77a6 ARROW-3530: [Java/Python] Add conversion for pyarrow.Schema from org.apache…pojo.Schema
new 390f7d0 ARROW-3542: [C++] Use unsafe appends when building array from CSV
new 56e72ba ARROW-3278: [Python] Retrieve StructType's and StructArray's field by name
new 4ed4053 ARROW-3549: [Rust] Replace i64 with usize for some bit utility functions
new d3ec690 ARROW-3490: [R] streaming of arrow objects to streams
new 90d21e9 ARROW-3551: Update MapD to OmniSci on Powered By page
new 82f8b05 ARROW-3546: [Python] Provide testing setup to verify wheel binaries work in one or more common Linux distributions
new eab7d5f ARROW-1019: [C++] Implement compressed streams
new a8bd1c9 ARROW-3548: [Plasma] Add CreateAndSeal object store method for faster puts for small objects.
new 45c4955 ARROW-3559: [Plasma] Static linking for plasma_store_server.
new 21ca340 ARROW-3565: [Python] Pin tensorflow to 1.11.0 in manylinux1 container
new b26c5ae ARROW-3556: [CI] Disable optimizations on Windows
new 0ecba4f ARROW-3558: [Plasma] Remove fatal error when calling get on unsealed object.
new 6067992 ARROW-3562: [R] Disallow creation of objects with shared_ptr<T>(nullptr), use bits64::integer64
new 2d7fbb2 ARROW-3380: [Python] Support reading gzipped CSV files
new 7810a7e ARROW-3323: [Java] Fix checkstyle naming
new fdbe926 ARROW-3131: [Go] add Go1.11 to the build matrix
new b63430d ARROW-3581: [Gandiva][C++] Use protobuf as shared library when -DARROW_PROTOBUF_USE_SHARED=ON
new 565cb7b ARROW-3568: [Packaging] Run pyarrow unittests for windows wheels
new b1572b5 ARROW-3582: [CI] fix incantation for C++/Java detection tool
new de5d7de ARROW-3539: [CI/Packaging] Update scripts to build against vendored jemalloc
new d1e7ab9 ARROW-3580: [Gandiva][C++] Fix build error with g++ 8.2.0
new d656dc3 ARROW-3573: [Rust] with_bitset does not set valid bits correctly
new ada04ca ARROW-3577: [Go] implement Chunked array
new 2d0d3d0 ARROW-3574: [Plasma] Use static libraries in plasma library.
new 5dce680 ARROW-3472: [Gandiva] remove gandiva_helpers library
new ae448f2 ARROW-3589: [Gandiva] Make gandiva JNI wrappers optional
new 0176b8f ARROW-3557: [Python] Set Cython language level
new 03e7346 ARROW-3598: [Plasma] Fix Plasma GPU linking error.
new 73d379f ARROW-3588: [Java] Fix checkstyle for header license
new 536e4f4 ARROW-3381: [C++] Add bz2 codec
new d904106 ARROW-3451: [C++/Python] pyarrow and numba CUDA interop
new fc2b3b5 ARROW-3469: [Gandiva] Add gandiva travis OSX entry
new 0387ebc ARROW-3603: [Gandiva][C++] Support building with ARROW_BOOST_VENDORED=ON
new d5510e0 PARQUET-1449: [C++] Support building with ARROW_BOOST_VENDORED=ON
new b4f7ed6 ARROW-3605: [Plasma] Remove dependence of plasma/events.h on ae.h.
new deeef92 ARROW-3260: [CI] Move linting to separate Travis-CI job [skip appveyor]
new 5b116ab ARROW-3489: [Gandiva][C++] Added support for IN expressions
new 0b9fad3 ARROW-3515: [C++] Introduce NumericTensor class
new 227ad0d ARROW-3541: [Rust] Update BufferBuilder to allow for new bit-packed BooleanArray
new e5122e3 ARROW-3567: [Gandiva][GLib] Add GLib bindings of Gandiva
new d9ee70c ARROW-3591: [R] Support for collecting decimal types
new 9e0fd73 ARROW-3487: [Gandiva] simplify fns that return errors
new 6d52b22 ARROW-3618: [Packaging/Documentation] Add `-c conda-forge` option to avoid PackagesNotFoundError
new 8e22482 ARROW-2767: [JS] Add generic to Table for column names
new 34e48b5 [R] Add R to allow failures until ARROW-3593 resolved (#2846)
new 9402542 ARROW-2712: [C#] Initial C# .NET library
new f8f43f6 ARROW-3600: [CI/Packaging] Add Ubuntu 18.10
new a718a02 ARROW-3633: [Packaging] Update deb names in dev/tasks/tasks.yml for 0.12.0
new 6f6da3e ARROW-3632: [Packaging] Update deb names in dev/tasks/tasks.yml in release process
new d88b9d0 ARROW-3634: [GLib] Follow CudaDeviceManager::AllocateHost() API change
new 4a3d8c9 ARROW-3569: [Packaging] Run pyarrow unittests when building conda package
new d3ab00b ARROW-3385: [Gandiva][C++][Java] Crossbow support for deploying gandiva jars
new 66bc066 ARROW-3583: [Python/Java] Create RecordBatch from VectorSchemaRoot
new d61988d ARROW-3638: [C++][Python] Move reading from Feather as Table feature to C++ from Python
new cf646e4 ARROW-3584: [Go] Implement Table, Schema and Column
new 0d6d51d ARROW-3576: [Python] Implemented compressed streams
new ce8bc4e ARROW-3555: [Plasma] Unify plasma client get function using metadata.
new a2f52fc ARROW-3405: [Python] Document CSV reader
new 4b9b530 ARROW-3592: [Python] Allow getting view of a binary scalar
new 5f0d3d2 ARROW-3642: [C++] Add arrowConfig.cmake generation
new 8993f8b ARROW-3636: [C++/Python] Update arrow/python/pyarrow_api.h
new 1a6545a ARROW-3358: [Gandiva][C++] Deprecate Gandiva Status.
new 85fd54c ARROW-3647: [R] Fix R bit64 crash and formatting
new f2bf068 ARROW-3566: [Format] Clarify the type of dictonary encoded field
new 3a1dd3f ARROW-3608: [R] Support for time32 and time64 array types
new c89e9b6 ARROW-3529: [Ruby] Import Red Parquet
new b5fafd8 ARROW-3519: [Gandiva] Arena for varlen output fns
new fe01c93 ARROW-3537: [Rust] Implement Tensor Type
new 059e2d0 ARROW-3630: [Plasma][GLib] Add GLib bindings of Plasma
new 2ac04fd ARROW-3661: [Gandiva][GLib] Use "_" as word separator in constant name
new 4ef1dac ARROW-3658: [Rust] Incorrect `List<T>` tests
new 4ee05ab ARROW-3624: [Python/C++] Support for zero-sized device buffers and device-to-device copying
new a56c009 ARROW-3616: [Java] Fix remaining checkstyle issues
new e5b7f72 ARROW-3621: [Go] implement Table, Record, RecordReader and TableReader
new 2cdc4d4 ARROW-3640: [Go] implement Tensors
new 9fb776c ARROW-3637: [Go] implement Stringer for arrays
new eeaf121 ARROW-3038: [Go] implement String array
new 8cc92f4 ARROW-3626: [Go] implement CSV reader
new c61f882 ARROW-3659: [CI] Fix Travis matrix entry 2 documentation to use gcc
new ed41003 ARROW-3657: [R] there is no package called bit64
new 4fbbffa ARROW-3660: [C++] Don't unnecessarily lock MemoryMappedFile for resizing in readonly files
new 5fd9ac9 ARROW-3649: [Rust] Refactor MutableBuffer's resize
new 84e181e ARROW-2535: [Python] Provide pre-commit hooks that check flake8
new e69659d ARROW-3662: [C++] Add a const overload to MemoryMappedFile::GetSize
new 919119f ARROW-3347: [Rust] Implement PrimitiveArrayBuilder
new f0fa18f ARROW-3683: [Go] add functional-option style to configure the CSV reader
new 84ac1dc ARROW-3682: [Go] unexport encoding/csv.Reader from CSV reader
new e65663c ARROW-3670: [C++] Use FindBacktrace to find execinfo.h support
new ed70f05 ARROW-3691: [JS] Update dependencies, switch to terser
new 729f2a4 ARROW-3694: [Java] Avoid superfluous string creation when logging level is disabled
new 010fad2 ARROW-3610: [C++] Add interface to turn stl_allocator into arrow::MemoryPool
new 756f645 ARROW-3648: [Plasma][Java] Add API to get metadata and data at the same time
new eb500b6 ARROW-3517: [C++] Add a workaround for MinGW-w64 32bit crash
new fe222fa ARROW-3695: [Gandiva] Use add_arrow_lib()
new b1d490d ARROW-3696: [C++] Add feather::TableWriter::Write(table)
new 7aac525 ARROW-3063: [Go] remove list of TODOs from go/README
new 100a8eb ARROW-3697: [Ruby] Add schema#[]
new 8a90e80 ARROW-3692: [Gandiva][Ruby] Add Ruby bindings of Gandiva
new 67a436a ARROW-3666: [C++] Improve C++ parser performance
new 7011ae0 ARROW-3656: [C++] Allow whitespace in numeric CSV fields
new e26806e ARROW-3664: [Rust] Add benchmark for PrimitiveArrayBuilder
new 8d1db71 ARROW-1994: [Python] Test against Pandas master
new 16ffa3d ARROW-3506: [Packaging] Nightly tests for docker-compose images
new 0a89ee0 ARROW-3712: [CI] Quick fix for RAT failure
new c9c7f9c ARROW-3704: [Gandiva][C++] Add missing include
new c303cc9 ARROW-3708: [Packaging] Support CMake files in Linux packages
new 394b334 ARROW-3707: [C++] Fix test regression with zstd 1.3.7
new 713d82a ARROW-3477: [C++] fixes for 32 bit architectures
new b5dca1c ARROW-3715: [C++]: Fix typo in gflags_ep CMake config
new e75cbf9 ARROW-3646: [Python] High-level IO API
new 3ab4a0f ARROW-3615: [R] Support for NaN
new 8e9cb87 ARROW-3602: [Gandiva] [Python] Initial Gandiva Cython bindings
new d290538 ARROW-3587: [Python] Efficient serialization for Arrow Objects (array, table, tensor, etc)
new d194241 ARROW-3108: [C++] arrow::PrettyPrint for Table instances
new df4eb30 ARROW-3718: [Gandiva] Remove spurious gtest include
new 87e3307 ARROW-3720: [GLib] Use "indices" instead of "indexes"
new db4bd01 ARROW-3724: [GLib] Update .gitignore
new d5f7b02 ARROW-3711: [C++] Don't pass CXX_FLAGS to C_FLAGS
new 4e5c7af ARROW-3499: [R] Expose arrow::ipc::Message type
new 391ea2c [C#] Add link to C# libraries subdirectory to top level README
new 71cd559 ARROW-3719: [GLib] Support read/write table to/from Feather
new 202265f ARROW-3693: [R] Invalid buffer for empty characters with null data
new dccec7a ARROW-3601: [Rust] Add instructions for publishing to crates.io
new 14b8aa7 ARROW-3536: [C++] Add UTF8 validation functions
new a1c00ac ARROW-3725: [GLib] Add field readers to GArrowStructDataType
new a7df9c6 ARROW-3723: [Plasma][Ruby] Add Ruby bindings of Plasma
new e5e4490 ARROW-3733: [GLib] Add to_string() to GArrowTable and GArrowColumn
new 6af5ce9 ARROW-3734: [C++] Linking static zstd library fails on Arch x86-64
new 3ae4161 ARROW-3698: [Gandiva] Segmentation fault when using a large table in Gandiva
new a45fafa ARROW-3611: [Python] Give better error message when type_id has wrong type.
new 0a2ce9d ARROW-3721: [Gandiva] [Python] Support all Gandiva literals
new 1f79faf ARROW-3407: [C++] Add UTF8 handling to CSV conversion
new 3cf8f35 ARROW-3700: [C++] Ignore empty lines in CSV files
new baa0707 ARROW-3613: [Go] fix builder resize
new 97bfded ARROW-2673: [Python] Add documentation + docstring for ARROW-2661
new f63b3e2 ARROW-3716: [R] Missing cases for ChunkedArray conversion
new d8d07dc ARROW-3742: Fix pyarrow.types & gandiva cython bindings
new 2534cd7 ARROW-3749: [GLib] Fix typos
new 83c4dde ARROW-3744: [Ruby] Use garrow_table_to_string() in Arrow::Table#to_s
new 133c62d ARROW-3753: [Gandiva] Remove debug print
new d6a7600 ARROW-3743: [Ruby] Add support for saving/loading Feather
new 8a75297 ARROW-3752: [C++] Remove unused status::ArrowError
new 5d631da ARROW-3736: [CI/Docker] Ninja test in `docker-compose run cpp` hangs
new 2a69511 PARQUET-1439: Remove PARQUET_ARROW_LINKAGE option, clean up overall library linking configuration
new 455cde1 ARROW-3748: [GLib] Add GArrowCSVReader
new efb93b7 ARROW-3747: [C++] Switch order of struct members in Decimal128
new ffcc363 ARROW-3710: [CI/Python] Run nightly tests against pandas master
new 1ef6c26 ARROW-3746: [Gandiva] [Python] Print list of functions registered with gandiva
new dc5df8f ARROW-3614: [R] Support for timestamps
new b541888 ARROW-3439: [R] R language bindings for Feather format
new 9769644 ARROW-2807: [Python] [Parquet] Add memory_map= option to parquet.read_table, read_pandas, read_schema
new df54da2 ARROW-3722: [C++] Allow specifying types of CSV columns
new 736b5b0 PARQUET-1458: [C++] parquet::CompressionToString not recognizing brotli compression
new 217c28a ARROW-3751: [Gandiva][Python] Add more cython bindings for gandiva
new 0d092e0 ARROW-912: [Python] Recommend that Python developers use -DCMAKE_INSTALL_LIBDIR=lib when building Arrow C++ libraries
new 1c8f8fc ARROW-3796: [Rust] Add Example for PrimitiveArrayBuilder
new 281eb22 ARROW-3798: [GLib] Add support for column type CSV read option
new c604adb ARROW-3672 & ARROW-3673: [Go] add support for time32 and time64 array
new d5cfabf ARROW-3754: [C++] Enable Zstandard by default only when CMake is 3.7 or later
new 6e46bdc ARROW-3703: [Python] DataFrame.to_parquet crashes if datetime column has time zones
new 1a00fe5 ARROW-3797: [Rust] BinaryArray::value_offset incorrect in offset case
new d873261 ARROW-3186: [GLib][CI] Use the latest Meson again
new 3e84f99 [Gandiva] Add link to Gandiva codebase in top level README
new 948e0fb ARROW-3800: [C++] Vendor a string_view backport
new cc5b445 ARROW-3784: [R] Array with type fails with x is not a vector
new 5874af5 ARROW-3765: [Gandiva] Segfault when the validity bitmap has not been allocated
new f83454c ARROW-3788: [Ruby] Add support for CSV parser written in C++
new fd2c862 ARROW-3821: [Format/Documentation]: Fix typos and grammar issues in Flight.proto comments
new 074e2c2 ARROW-3194: [JAVA] Use split length in splitAndTransfer to set value count
new d911850 ARROW-2113/3768: [Python] set classpath to all hadoop jars when HADOOP_HOME present
new 69ffda1 ARROW-3781: [C++] Implement BufferedOutputStream::SetBufferSize. Allocate buffer from MemoryPool
new 29657d8 ARROW-3795: [R] Support for retrieving NAs from INT64 arrays
new 2326d3d ARROW-3825: [Python] Document how to run the Python unit tests in python/README.md
new 60305d5 ARROW-3644: [Rust] Implement ListArrayBuilder
new 5f3461e ARROW-3805: [Gandiva] Handle null validity bit-map in if-else
new b315aa0 ARROW-3830: [GLib] Add GArrowCodec
new d23f4f2 ARROW-3366: [R] Dockerfile for docker-compose setup
new 767ff8c ARROW-3793: [C++] TestScalarAppendUnsafe is not testing unsafe appends
new c07165d ARROW-3835: [C++] Add missing arrow::io::CompressedOutputStream::raw() implementation
new 9de2edc ARROW-3836: [C++] Add PREFIX, EXTRA_LINK_LIBS, DEPENDENCIES to ADD_ARROW_BENCHMARK
new 2c492a8 PARQUET-1271: [C++] Rename parquet_reader tool to parquet-reader for consistency
new 7200bdf ARROW-3837: [C++] Add GFLAGS_IS_A_DLL define to fix Windows build
new d7e3fb4 ARROW-3773: [C++] Remove redundant AssertArraysEqual function from before monorepo merge
new ba2b2ea ARROW-3437: [C++] Use older API for boost::optional, remove gtest include from prod code, remove -static-libstdc++ flags
new dedaee6 ARROW-3787: [Rust] Implement From<ListArray> for BinaryArray
new 0cc777f ARROW-3841: [C++] Suppress catching polymorphic type by value warning
new ee62840 ARROW-3609: [Gandiva] Convert Gandiva benchmark tests as gbenchmark t…
new beb1cb8 ARROW-3755: [GLib] Add GArrowCompressedInputStream and GArrowCompressedOutputStream
new 7e6bf41 ARROW-2720: [C++] Defer setting of -std=c++11 compiler option to CMAKE_CXX_STANDARD, use CMake option for -fPIC
new 074c9c2 ARROW-3847: [GLib] Remove unnecessary '\'
new 72598e4 ARROW-3766: [Python] pa.Table.from_pandas doesn't use schema ordering
new c04a62b ARROW-3726: [Rust] Add CSV reader with example
new 7cdab9b ARROW-3778: [C++] Compile parts of test-util.h that we can once, link with unit tests
new 2591454 ARROW-3845: [Gandiva] [GLib] Add GGandivaNode
new 9509220 ARROW-3785: [C++] Enable using double-conversion from $ARROW_BUILD_TOOLCHAIN
new d06d0d0 ARROW-3738: [C++] Parse ISO8601-like timestamps in CSV columns
new ac7c9cc ARROW-3852: [C++] Suppress used uninitialized warning
new 2db3d4c ARROW-3713: [Rust] Implement BinaryArrayBuilder
new b5d78a7 ARROW-3851: [C++] Run clang-format in parallel
new dc6da3a ARROW-3756: [CI/Docker/Java] Java tests are failing in docker-compose setup
new eaf8d32 ARROW-2653: [C++] Refactor hash table support
new 80238f2 ARROW-3856: [Ruby] Support compressed CSV save/load
new 1995e88 ARROW-3854: [GLib] Deprecate garrow_gio_{input,output}_stream_get_raw()
new 3692a23 ARROW-3858: [GLib] Use {class_name}_get_instance_private
new 3e0958a ARROW-3607: [Java] delete() method via JNI for plasma
new 6dbd1ec ARROW-3864: [GLib] Add support for allow-float-truncate cast option
new c3f968c ARROW-3070: [Packaging] Use Bintray
new 8d5bfc6 ARROW-3865: [Packaging] Add double-conversion dependency to conda forge recipes and the windows wheel build
new 8c52f4c ARROW-3782: [C++] Implement BufferedInputStream to pair with BufferedOutputStream
new 7281731 ARROW-3741: [R] Add support for arrow::compute::Cast to convert Arrow arrays from one type to anothe
new 10b204e ARROW-3728: [Python] Ignore differences in schema custom metadata when writing table to ParquetWriter
new d2256c5 ARROW-3677: [Go] Add fixed-length binary builder and array
new 1013a1d ARROW-3831: [C++] Add support for returning decompressed size
new 427a219 ARROW-2591: [Python] Add Parquet test case writing list-typed column with empty lists that caused segfault on 0.9.0
new 54b0af8 ARROW-1993: [Python] Add function for determining implied Arrow schema from pandas.DataFrame
new 1020e0b ARROW-3889: [Python] Crash when creating schema from invalid args
new c3612e8 ARROW-3860: [C++] Add ARROW_GANDIVA_STATIC_LIBSTDCPP option to restore hard-coded behavior prior to ARROW-3437
new 5a001e0 ARROW-3859: [Arrow][Java] Fixed backward incompatible change. (#3018)
new 49f3086 ARROW-3891: [Java] Remove Long.bitCount with simple bitmap operations
new 853ba6b ARROW-3493: [Java] Make sure bound checks are off
new 57b55de ARROW-3862: [C++] Improve third-party dependencies download script
new c98399d ARROW-3790: [C++] Fix erroneous safe casting
new 39d1e86 ARROW-3868: [Rust] Switch to nightly Rust for required build, stable is now allowed to fail
new 0066af8 ARROW-3855: [Rust] Schema/Field/Datatype now have derived serde traits
new 13c63bd ARROW-3823: [R] + buffer.complex
new 99b3b0a PARQUET-1463: [C++] Utilize common hashing machinery for dictionary encoding
new 87feee3 ARROW-3273: [Java] Fix checkstyle for Javadocs
new cd74741 ARROW-3807: [R] Missing Field API
new adbf763 ARROW-3895: [Rust] csv::Reader now returns Result<Option> instead of Option<Result>
new dd896c9 ARROW-3613: [Go] Fix builder downsize
new 7800684 ARROW-3844: [C++] Remove ARROW_USE_SSE and ARROW_SSE3
new 2d089a6 ARROW-3900: [GLib] Add garrow_mutable_buffer_set_data()
new ff5c37c ARROW-3169: [C++] Break up array-test into multiple compilation units
new 05c70b0 ARROW-3849: [C++] Leverage Armv8 crc32 extension instructions to accelerate the hash computation for Arm64
new 64e8ae7 ARROW-3888: [C++] Fix various compiler warnings
new a4951ed ARROW-3740: [C++] Builder should not downsize
new 69d207f ARROW-3905: [Ruby] Add StructDataType#[]
new 3c0d65e ARROW-3886: [C++] Add support for decompressed buffer size check for Snappy
new 2c34fe8 ARROW-3202: [C++] Fix compilation on Alpine Linux by using ARROW_WITH_BACKTRACE define
new e87110f ARROW-3893: [C++] Improve adaptive int builder performance
new cca2608 ARROW-3878: [Rust] Improve primitive types
new 2bc4d95 [Website] Update website for 0.11.1 bugfix release (#3060)
new a667fca ARROW-3920: [plasma] Fix reference counting in custom tensorflow plasma operator.
new 7aae9e9 ARROW-3924: [Packaging][Plasma] Add support for Plasma deb/rpm packages
new e453f5c ARROW-3898: [Example] parquet-arrow example has compilation errors
new d111a77 ARROW-3870: [C++] Add Peek to InputStream abstract interface
new bfedd3a ARROW-3925: [Python] Add autoconf to conda install instructions
new 8f02a1b ARROW-3912: [Plasma][GLib] Add support for creating and referring objects
new 67b9215 ARROW-3890: [Python] Handle NumPy binary arrays with UTF-8 validation when converting to StringArray
new fd2f798 ARROW-3684: [Go] Add chunking ability to CSV reader
new 1621868 ARROW-3922: [C++] Micro-optimizations to BitUtil::GetBit
new 98bdde8 PARQUET-1467: [C++] Remove defunct ChunkedAllocator code
new 681efd8 ARROW-3853: [C++] Implement string to timestamp cast
new d54a154 ARROW-3842: [R] RecordBatchStreamWriter api
new 67d9264 ARROW-3199: [Plasma] File descriptor send and receive retries
new 6045dd2 ARROW-2759: [Plasma] Export plasma notification socket
new e4693bb ARROW-3884: [Python] Add LLVM6 to manylinux1 base image
new c204d3e PARQUET-1469: [C++] Fix data corruption bug in parquet::internal::DefinitionLevelsToBitmap that was triggered through random data
new bb739b0 ARROW-3906: [C++] Break out builder.cc into multiple compilation units
new 0310d6a ARROW-3874: [C++] Add LLVM_DIR to find_package in FindLLVM.cmake
new 5b308ba ARROW-3914: [C++/Python/Packaging] Docker-compose setup for Alpine linux
new 0729cb7 ARROW-2909: [JS] Add convenience function for creating a table from a list of vectors
new 4a630f9 ARROW-3892: [JS] Remove any dependency on compromised NPM flatmap-stream package
new ffc5154 ARROW-3931: [C++] Make possible to build regardless of LANG
new 4ac4eb5 ARROW-3934: [Gandiva] Only add precompiled tests if ARROW_GANDIVA_BUILD_TESTS
new abdf7b4 ARROW-3629: [Python] Add write_to_dataset to Python Sphinx API listing
new a06624e ARROW-2993: [JS] Document minimum supported NodeJS version
new 2f4af26 ARROW-3937: [Rust] Fix Rust nightly build (formatting rules changed)
new 0afb975 ARROW-3936: [C++] Add _O_NOINHERIT to the file open flags on Windows
new 9c4c887 ARROW-3883: [Rust] Update README
new 286f59c ARROW-3939: [Rust] Remove macro definition for ListArrayBuilder
new 5b85981 ARROW-3318: [C++] Push down read-all-batches operation on RecordBatchReader into C++
new 8152916 ARROW-3894: [C++] Ensure that IPC file is properly initialized even if no record batches are written
new 42a52d7 ARROW-3871: [R] Replace usages of C++ GetValuesSafely with new methods on ArrayData
new 8db3748 ARROW-3441: [Gandiva] Use common unit test creation facilities, do not produce multiple executables for the same unit tests
new 137a69e ARROW-3681: [Go] Add benchmarks for CSV reader
new 0d0ff75 ARROW-3929: [Go] improve CSV reader memory usage
new 1f099c7 ARROW-3272: [Java][Docs] Add documentation about Java code style
new 187b98e ARROW-3291: [C++] Add string_view-based constructor for BufferReader
new 35f8a34 ARROW-3834: [Doc] Merge C++ and Python documentation
new 15be8e5 ARROW-3908: [Rust] Update rust dockerfile to use nightly toolchain
new 898e06c ARROW-3209: [C++] Rename libarrow_gpu to libarrow_cuda
new 6c29f00 ARROW-3921: [GLib][CI] Log Homebrew output
new dd1ab19 ARROW-3586: [Python] Add test ensuring no segfault
new f9e5818 ARROW-3945: [Website] Update website for Gandiva donation
new a084006 ARROW-3941: [R] RecordBatchStreamReader$schema
new b731b58 PARQUET-1471: [C++] TypedStatistics<T>::UpdateSpaced reads out of bounds value when there are more definition levels than spaced values
new 494ef75 ARROW-3946: [GLib] Add support for union
new 1cb1fa4 ARROW-3930: [C++] Avoid using Mersenne Twister for random test data
new 072df89 ARROW-3948: [GLib][CI] Set timeout to Homebrew
new 6bfd26a ARROW-3952: [Rust] Upgrade to Rust 2018 Edition
new 80989ef ARROW-3955: [GLib] Add (transfer full) to free when no longer needed
new 4d810b4 ARROW-3950: [Plasma] Make loading the TensorFlow op optional
new 03b4a27 ARROW-3961: [Python/Documentation] Fix wrong path in the pyarrow README
new 5704d8d ARROW-3940: [Python/Documentation] Add required packages to the development instruction
new 1dee3f4 ARROW-3303: [C++] API for creating arrays from simple JSON string
new d3d7669 ARROW-3969: [Rust] Format using stable rustfmt
new cc24218 ARROW-3963: [Packaging/Docker] Nightly test for building sphinx documentations
new 7a5631d ARROW-3964: [Go] Refactor examples of csv reader
new a4063ed ARROW-3967: [Gandiva] [C++] Make node.h public
new 1dc906e ARROW-3885: [Rust] Release prepare step should increment Rust version
new 612bdca ARROW-3970: [Gandiva][C++] Remove unnecessary boost dependencies.
new d6284cf ARROW-3792: [C++] Writing a list-type chunked column to Parquet fails if any chunk is 0-length
new e4761e0 ARROW-3727: [Python] Document use of foreign_buffer()
new fa5d5ad ARROW-3980: [C++] Fix CRTP use in json-simple.cc
new 7a296bd ARROW-3977: [Gandiva] fix label during ctest invoc
new e6b96aa ARROW-3687: [Rust] Anything measuring array slots should be `usize`
new 8973cfe ARROW-3983: [Gandiva][Crossbow] Link Boost statically in JAR packaging scripts
new 9da4584 ARROW-2624: [Python] Random schema generator for Arrow conversion and Parquet testing
new 9c8ddae ARROW-3942: [R] Feather api fixes
new 1220184 ARROW-3641: [Python] Remove unneeded public keyword from pyarrow public C APIs
new 24d00c0 ARROW-3248: [C++] Add "arrow" prefix to Arrow core unit tests, use PREFIX instead of file name for csv, io, ipc tests. Modular target cleanup
new 2428945 ARROW-3880: [Rust] Implement simple math operations for numeric arrays
new bb3fa4b ARROW-3993: [JS] CI Jobs Failing
new e734135 ARROW-3995: [CI] Use understandable names on Travis
new a1eff5f ARROW-3986: [C++] Write prose documentation
new c7e9860 ARROW-3960: [Rust] remove extern crate for Rust 2018
new 28d16c0 ARROW-3996: [C++] Add missing packages on Linux
new 527fed6 ARROW-3913: [Gandiva] [GLib] Add GGandivaLiteralNode
new c0ac97f ARROW-4004: [GLib] Replace GPU with CUDA
new c029b77 ARROW-3976: [Ruby] Try to upgrade git to avoid errors caused by Homebrew on older git
new 67506d9 ARROW-4002: [C++][Gandiva] Remove needless CMake version check
new a3ba1a2 ARROW-3988: [C++] Do not build unit tests by default, fix building Gandiva unit tests when ARROW_BUILD_TESTS=OFF
new aa8bb3c ARROW-3986: [C++] Document memory management and table APIs
new 7ddfba6 ARROW-3470: [C++] Fix row-wise example
new 4594041 ARROW-4008: [C++] Restore ARROW_BUILD_UTILITIES to fix integration tests
new 0005048 ARROW-4005: [Plasma] [GLib] Add gplasma_client_disconnect()
new 1882a07 ARROW-3866: [Python] Column metadata is not transferred to tables in pyarrow
new 2a726c1 PARQUET-1473: [C++] Add helper function that converts ParquetVersion to human-friendly string
new e34057c ARROW-4019: [C++] Fix Coverity issues
new b3bc338 ARROW-3958: [Plasma] Reduce number of IPCs
new 8c41303 ARROW-4015: [Plasma] remove unused interfaces for plasma manager
new 804502f ARROW-3184: [C++] Enable modular builds and installs with ARROW_OPTIONAL_INSTALL option. Remove ARROW_GANDIVA_BUILD_TESTS
new 73f94c9 ARROW-3762: [C++/Python] Support reading Parquet BYTE_ARRAY columns containing over 2GB of data
new ce12fb5 ARROW-1807: [Java] consolidate bufs to reduce heap (#3121)
new e098651 ARROW-3971: [Python] Remove deprecations in 0.11 and prior
new 537aa2f ARROW-4037: [Packaging] Remove workaround to verify 0.11.0
new 23dfc1c ARROW-4006: Add CODE_OF_CONDUCT.md
new 0936938 ARROW-4039: [Python] Update link to 'development.rst' page from Python README.md
new 2e8cfca ARROW-3230: [Python] Missing comparisons on ChunkedArray, Table
new d61ae4a ARROW-3449: [C++] Fixes to build with CMake 3.2. Document what requires newer CMake
new 784d1cd ARROW-4044: [Packaging/Python] Add hypothesis test dependency to pyarrow conda recipe
new 055496c ARROW-4029: [C++] Exclude headers with 'internal' from installation. Document header file conventions in README
new ec154d2 ARROW-2026: [C++] Enforce use_deprecated_int96_timestamps to all time…
new 1fd2a25 ARROW-3953: [Python] Compat with pandas 0.24 rename of MultiIndex labels -> codes
new 715cba5 ARROW-4035: [Ruby] Support msys2 mingw dependencies
new ac047b2 ARROW-4048: [GLib] Return ChunkedArray instead of Array in gparquet_arrow_file_reader_read_column
new 77d3a46 ARROW-4049: [C++] Arrow never use glog even though glog is linked.
new 5d1934f ARROW-4034: [Ruby] Add support :append option to FileOutputStream
new 63fd350 ARROW-4043: [Packaging/Docker] Python tests on alpine miss pytest dependency
new 51f5e94 ARROW-4041: [CI] Python 2.7 run uses Python 3.6
new 4cfd6d3 ARROW-4045: [Packaging/Python] Add hypothesis test dependency to wheel crossbow tests
new 0190e60 ARROW-4054: [Python] Update gtest, flatbuffers and OpenSSL in manylinux1 base image
new 3986157 ARROW-3879: [C++] Fix uninitialized member in CudaBufferWriter
new 836ad52 ARROW-4017: [C++] Move vendored libraries in dedicated directory
new a236464 ARROW-3674: [Go] Implement Date32 and Date64 array types
new c7cb1ce ARROW-3368: [Integration/CI/Python] Add dask integration test to docker-compose setup
new 5c48bdb ARROW-2637: [C++/Python] Build support and instructions for development on Alpine Linux
new 0b78f4b ARROW-4033: [C++] Use readlink -f instead of realpath in dependency download script
new 9fcce64 ARROW-4026: [C++] Add *-all, *-tests, *-benchmarks modular CMake targets. Use in Travis CI
new e9ed591 ARROW-4028: [Rust] Merge parquet-rs codebase
new 5a5d807 [C++] Make Doxygen less verbose (#3213)
new d432cb4 ARROW-2560: [Rust] The Rust README should include Rust-specific information on contributing
new 36ded49 ARROW-3058: [Python] Raise more helpful better error message when writing a pandas.DataFrame to Feather format that requires a chunked layout
new e832df3 ARROW-3387: [C++] Implement Binary to String cast
new 781e251 ARROW-4055: [Python] Fails to convert pytz.utc with versions 2018.3 and earlier
new 758bd55 ARROW-2919: [C++/Python] Improve HdfsFile error messages, fix Python unit test suite
new 1a5991c ARROW-4069: [Python] Add tests for casting binary -> string/utf8. Add pyarrow.utf8() type factory alias for readability
new bfa7f11 ARROW-4070: [C++] Enable use of ARROW_BOOST_VENDORED with ninja-build
new 25b6a6c ARROW-4073: [Python] Fix URI parsing on Windows. Also fix test for get_library_dirs when using ARROW_HOME to develop
new 944b9e3 ARROW-4051: [Gandiva] [GLib] Add support for null literal
new b8d4477 ARROW-3989: [Rust] [CSV] Cast bool string to lower case in reader
new cec8d23 ARROW-4066: [Doc] Instructions to create Sphinx documentation
new 6bfac93 ARROW-3979 : [Gandiva] fix all valgrind reported errors
new d089643 ARROW-3803: [C++/Python] Merge C++ builds and tests, run Python tests in separate CI entries
new 320621d ARROW-4030: [CI] Use travis_terminate in more script commands to fail faster
new f66fa80 ARROW-554: [C++] Add functions to unify dictionary types and arrays
new e39e364 ARROW-3545: [C++/Python] Use "field" terminology with StructType, specify behavior with duplicate field names
new cca9d28 ARROW-3620: [Python] Document pa.cpu_count() in Sphinx API docs
new 1a8c8f0 ARROW-4038: [Rust] Implement boolean AND, OR, NOT array ops
new 729cc3d ARROW-4085: [GLib] Use "field" for struct data type
new c39db63 ARROW-4082: [C++] Allow RelWithDebInfo, improve FindClangTools
new ce9c6e3 ARROW-4084: [C++] Make Status static method support variadic arguments
new 2ab97bc ARROW-4089: [Plasma] The tutorial is wrong regarding the parameter type of PlasmaClient.Create
new 398466e ARROW-4079: [C++] Add machine benchmark
new ff29319 ARROW-4087: [C++] Make CSV spellings of null values configurable
new 1a86ab5 ARROW-3982: [C++] Allow "binary" input in simple JSON format
new 700bd40 ARROW-4052: [C++] Linker errors with glog and gflags
new c6d97c5 ARROW-4093: [C++] Fix wrong suggested method name
new 747590a ARROW-4011: [Gandiva] Install irhelpers.bc and use it
new 48dd172 ARROW-2970: [Python] Support conversions of NumPy string arrays requiring chunked binary output
new 7ebd7b3 ARROW-4105: [Rust] Add rust-toolchain to enforce user to use nightly toolchain for building
new b23cedd ARROW-4075: [Rust] Reuse array builder after calling finish()
new ddc5e9a ARROW-4106: [Python] Tests fail to run because hypothesis update broke its API
new ffc8877 ARROW-4101: [C++] Identity BinaryType cast
new e179dda ARROW-4098: [Python] Deprecate open_file/open_stream top level APIs in favor of using ipc namespace
new 6578089 ARROW-2592: [Python] Add "ignore_metadata" option to Table.to_pandas
new 79d8bf2 ARROW-4107: [Python] Use ninja in pyarrow manylinux1 build
new 6b79887 ARROW-4109: [Packaging] Missing glog dependency from arrow-cpp conda recipe
new 385c438 ARROW-3938: [Packaging] Stop to refer java/pom.xml to get version information
new cd543b9 ARROW-2504: [Website] Add ApacheCon NA link
new cfaea42 PARQUET-1481: [C++] Throw exception when encountering bad Thrift metadata in RecordReader
new 49f93e0 ARROW-4114: [C++] Add python to requirements list for running on ubuntu
new 91c585d ARROW-4100: [Gandiva][C++] Fix regex for special character dot.
new 2849f46 ARROW-4115: [Gandiva] zero-init boolean data bufs
new 46ecbb6 ARROW-4103: [Docs] Move documentation build instructions from source/python/development.rst to docs/README.md
new 0c2f354 ARROW-4116: [Python] Add warning to development instructions to avoid virtualenv when using Anaconda/miniconda
new 9c76600 ARROW-4112: [Packaging] Add support for Gandiva .deb
new abde663 ARROW-4078: [CI] Detect changes in docs/ directory and build the Linux Python entry if so
new 5904eea ARROW-3324: [Python] Destroy temporary metadata builder classes more eagerly when building files to reduce memory usage
new a536529 ARROW-4102: [C++] Return common IdentityCast when casting to equal type
new 6781c2d ARROW-4088: [Python] Table.from_batches() fails when passed a schema with metadata
new 0696eb5 ARROW-3932: [Python] Include Benchmarks.md in Sphinx docs
new 9b03947 ARROW-3928: [Python] Deduplicate Python objects when converting binary, string, date, time types to object arrays
new 83a4e97 ARROW-4080: [Rust] Improving lengthy build times in Appveyor
new 0a631db ARROW-4113: [R] Fix version number
new 68daba2 ARROW-3020: [C++/Python] Allow empty arrow::Table objects to be written as empty Parquet row groups
new 8ed97cc ARROW-4129: [Python] Fix syntax problem in benchmark docs
new 7074889 ARROW-4132: [GLib] Add more GArrowTable constructors
new 7b12202 ARROW-4135: [Python] Can't reload a pandas dataframe containing a list of datetime.time
new 8d792b1 PARQUET-1484: [C++] Improve memory usage of FileMetaDataBuilder
new 8c26b77 ARROW-4134: [Packaging] Properly setup timezone in docker tests to prevent ORC adapter's abort
new 71ccba9 ARROW-4128: [C++] Update style guide to reflect NULLPTR and doxygen
new 9376d85 ARROW-3910: [Python] Set date_as_objects=True as default in to_pandas methods
new 1291274 ARROW-4137: [Rust] Move parquet code into a separate crate
new c71d27f ARROW-4141: [Ruby] Add support for creating schema from raw Ruby objects
new 6ca8fcd ARROW-4148: [CI/Python] Disable ORC on nightly Alpine builds
new 7f1fbf8 ARROW-4009: [CI] Run Valgrind and C++ code coverage in different builds
new 1ff7978 ARROW-4150: [C++] Ensure allocated buffers have non-null data pointer
new 95f6ecf ARROW-4152: [GLib] Remove an example to show Torch integration
new 1e9a236 ARROW-4149: [CI/C++] Parquet test misses ZSTD compression codec in CMake 3.2 nightly builds
new cc9e228 ARROW-4122: [C++] Initialize class members based on codebase static analysis
new c569a4c ARROW-4127: [Documentation][Python] Add instructions to build with Docker
new 7405406 ARROW-4156: [C++] Don't use object libs with Xcode
new 161d00f ARROW-4157: [C++] Fix clang documentation warnings on Ubuntu 18.04
new c322aec ARROW-4158: Allow committers to set ARROW_GITHUB_API_TOKEN for merge script, better debugging output
new 4057b5f PARQUET-690: [C++] Reuse Thrift resources when serializing metadata structures
new fba4f32 ARROW-3760: [R] Support Arrow CSV reader
new fa37ea3 [Documentation] Fix syntax error in building.rst (#3313)
new a4f4808 ARROW-4153: [GLib] Add builder_append_value() for consistency
new 857deae ARROW-4154: [GLib] Add GArrowDecimal128DataType
new 46b1bc7 ARROW-4160: [Rust] Add README and executable files to parquet
new 66f0d39 [Documentation][C++] Change build conda create command for Windows developer (#3316)
new 5723ada ARROW-3731: MVP to read parquet in R library
new 91a72ac ARROW-4130: [Go] offset not used when accessing binary array
new 4895340 ARROW-4155: [Rust] Implement array_ops::sum() for PrimitiveArray<T>
new 601498f ARROW-4161: [GLib] Add PlasmaClientOptions
new fa6e423 ARROW-4162: [Ruby] Add support for creating data types from description
new b95628f ARROW-4166: [Ruby] Add support for saving to and loading from buffer
new 5fad191 ARROW-4171: [Rust] fix parquet crate release version
new 0002630 Fix link to Intel SIMD docs
new 1eec9e8 ARROW-4168: [GLib] Use property to keep GArrowDataType passed in garrow_field_new()
new 16460d3 ARROW-4173: Fix JIRA library name in error message
new 84e10b6 ARROW-4174: [Ruby] Add support for building composite array from raw Ruby objects
new ed1d60d ARROW-3544: [Gandiva] [C++] Create function registry in multiple compilation units to reduce build times
new 1aecb98 ARROW-4179: [Python] Use more public API to determine whether a test has a pytest mark or not
new 72405a1 ARROW-4151: [Rust] Restructure project directories
new b92b1f5 ARROW-4125: [Python] Don't fail ASV if Plasma extension is not built (e.g. on Windows)
new 134081b ARROW-4126: [Go] offset not used when accessing boolean array
new 0eadd41 ARROW-3665: [Rust] Implement StructArrayBuilder
new a1ea48b ARROW-4185: [Rust] Change directory before running Rust examples on Windows
new 1143942 ARROW-4042: [Rust] Rename BinaryArray::get_value to value
new 2057859 ARROW-4188: [Rust] Move Rust README to top level rust directory
new 55848a3 ARROW-4104: [Java] fix a race condition in AllocationManager (#3246)
new 8704f8b ARROW-4183: [Ruby] Add Arrow::Struct as an element of Arrow::StructArray
new af07f75 ARROW-4060: [Rust] Add parquet arrow converter.
new 4f2f533 ARROW-4178: [C++] Fix TSan and UBSan errors
new d6ddcbf ARROW-3701: [Gandiva] add op for decimal 128
new ccec638 ARROW-4191: [C++] Use same CC and AR for jemalloc as for the main sources
new 326015c ARROW-4186: [C++] BitmapWriter shouldn't clobber data when length == 0
new ac45f32 ARROW-3839: [Rust] Add ability to infer schema in CSV reader
new bcca04a ARROW-4172: [Rust] more consistent naming in array builders
new a3aed3b ARROW-4184: [Ruby] Add Arrow::RecordBatch#to_table
new 420c949 ARROW-4175: [GLib] Add support for decimal compare operators
new bfe6865 ARROW-4147: [Java] reduce heap usage for varwidth vectors (#3298)
new cec7541 ARROW-4199: [GLib] Add garrow_seekable_input_stream_peek()
new 090a8c0 ARROW-4200: [C++/Python] Enable conda_env_python.yml to work on Windows, simplify python/development.rst
new af925d9 ARROW-2038: [Python] Strip s3:// scheme in S3FSWrapper isdir() and isfile()
new 361285d ARROW-4209: [Gandiva] Avoid struct return param in IR
new bcfacaa ARROW-3233: [Python] Add prose documentation for CUDA support
new 3330d66 ARROW-4118: [Python] Fix benchmark setup for "asv run"
new 6b496f7 ARROW-3997: [Documentation] Clarify dictionary index type
new b8aeb79 ARROW-854: [Format] Add tentative SparseTensor format
new 84b221d ARROW-4138: [Python] Fix setuptools_scm version customization on Windows
new a80c27e ARROW-4197: [C++] Better Emscripten support
new 87ceb3c ARROW-3959: [Rust] Add date/time data types
new b29ecdc ARROW-4177: [C++] Add ThreadPool and TaskGroup microbenchmarks
new db29723 ARROW-4215: [GLib] Fix typos in documentation
new 3b61349 ARROW-2968: [R] Multi-threaded conversion from Arrow table to R data.frame
new 2b361fb ARROW-3428: [Python] Fix from_pandas conversion from float to bool
new 8ab1493 ARROW-4065: [C++] arrowTargets.cmake is broken
new 7fcad2c ARROW-3126: [Python] Make Buffered* IO classes available to Python, incorporate into input_stream, output_stream factory functions
new ea69e8f ARROW-4207: [Gandiva] [GLib] Add support for IfNode
new 9c0e643 ARROW-4211: [GLib] Add GArrowFixedSizeBinaryDataType
new f67a515 ARROW-4214: [Ruby] Add support for building RecordBatch from raw Ruby objects
new b8d5913 ARROW-3916: [Python] Add support for `filesystem` kwarg in ParquetWriter
new bf34291 ARROW-4227: [GLib] Fix wrong data type in field of composite data type
new fc7b414 ARROW-4210: [Python] Mention boost-cpp directly in the conda meta.yaml for pyarrow
new 9d342ec ARROW-3819: [Packaging] Update conda variant files to conform with feedstock after compiler migration
new 5a502d2 ARROW-4216: [Python] Add CUDA API docs
new 76618f6 [Release/Java] Disable Flight test case
new d7a6833 ARROW-4229: [Packaging] Set crossbow target explicitly to enable building arbitrary arrow repo
new 54b35b4 ARROW-4233: [Packaging] Use Docker to build source archive
new 38a628d ARROW-4238: [Packaging] Fix RC version conflict between crossbow and rake
new 06de47a ARROW-4237: [Packaging] Fix CMAKE_INSTALL_LIBDIR in release verification script
new 9178ad8 ARROW-4241: [Packaging] Disable crossbow conda OSX clang builds
new 0a553b7 ARROW-4240: [Packaging] Add missing Plasma GLib and Gandiva GLib documents to souce archive
new f7eb1f7 ARROW-4239: [Packaging] Fix version update for the next version
new 3e97ca1 ARROW-4243: [Python] Fix test failures with pandas 0.24.0rc1
new be663c1 ARROW-4247: [Release] Update verify script for 0.12.0
new 5598d2f ARROW-2828: [JS] Refactor Data, Vectors, Visitor, Typings, build, tests, dependencies
new 85362b3 ARROW-4249: [Plasma] Clean up client namespace
new cf047fc ARROW-4256: [Release] Fix Windows verification script for 0.12 release
new a78a3d6 ARROW-4228: [GLib] Add garrow_list_data_type_get_field()
new 61b50dc ARROW-4257: [Release] Update release verification script to check binaries on Bintray
new 6f4d1a9 ARROW-4246: [Plasma][Python] PlasmaClient.list returns wrong information with CUDA enabled Plasma
new 87ac6fd ARROW-4246: [Plasma][Python][Follow-up] Ensure plasma::ObjectTableEntry always has the same size regardless of whether built with CUDA support
new 09d3496 ARROW-4260: [Python] NumPy buffer protocol failure
new 5a7507c ARROW-4266: [Python][CI] Disable ORC tests in dask integration test
new 18c0e82 ARROW-4258: [Python] Safe cast fails from numpy float64 array with nans to integer
new 143558e [CI] Temporary fix for conda-forge migration (#3406)
new fbfd1ff ARROW-4269: [Python] Fix serialization in pandas 0.22
new a1a9221 ARROW-4270: [Packaging][Conda] Update xcode version and remove toolchain builds
new 6c8c0c1 [Release] Update CHANGELOG.md for 0.12.0
new db508ee [Release] Update .deb/.rpm changelogs for 0.12.0
new 6fcd910 [Release] Update versions for 0.12.0
new 8ca4138 [maven-release-plugin] prepare release apache-arrow-0.12.0
new da899a3 ARROW-4276: [Release] Remove needless Bintray authentication
new cffd038 [Release] Build C++ unit tests in verify-release-candidate.bat
new c25db9c ARROW-4291: [Dev] Support selecting features in release verification scripts
new d75ffe3 ARROW-4298: [Java] Add javax.annotation-api dependency for JDK >= 9
new 77c2714 ARROW-4374: [C++] DictionaryBuilder does not correctly report length and null_count
new 61e6c27 ARROW-4367: [C++] StringDictionaryBuilder segfaults on Finish with only null entries
new b434fd9 ARROW-4255: [C++] Eagerly initialize name_to_index_ to avoid race
new e70c530 ARROW-4582: [Python/C++] Acquire the GIL on Py_INCREF
new e98dd0c [Release] Update versions for 0.12.1-SNAPSHOT
new a558d43 ARROW-3564: [C++] Fix dictionary encoding logic for Parquet 2.0
new e0bd2a6 ARROW-4323: [Packaging] Fix failing OSX clang conda forge builds
new 6b64322 ARROW-4373: [Packaging] Travis fails to deploy conda packages on OSX
new 1c9b52b ARROW-4629: [Python] Pandas arrow conversion slowed down by imports
new 3e96dd2 ARROW-4267: [C++/Parquet] Handle duplicate and struct columns in RowGroup reads
new cef7e71 ARROW-4501: Fix out-of-bounds read in DoubleCrcHash
The 3382 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.