You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] [Commented] (SPARK-18406) Race between end-of-task and completion iterator read lock release - posted by "Xingbo Jiang (JIRA)" <ji...@apache.org> on 2019/05/01 00:13:00 UTC, 4 replies.
- [jira] [Resolved] (SPARK-24422) Add JDK11 in our Jenkins' build servers - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/01 00:20:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-24422) Add JDK11 in our Jenkins' build servers - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/01 00:21:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27608) Upgrade Surefire plugin to 3.0.0-M3 - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/01 02:17:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-24422) Add JDK11 in our Jenkins' build servers - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/01 02:19:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27613) Caching an RDD composed of Row Objects produces some kind of key recombination - posted by "Andres Fernandez (JIRA)" <ji...@apache.org> on 2019/05/01 02:27:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27613) Caching an RDD composed of Row Objects produces some kind of key recombination - posted by "Andres Fernandez (JIRA)" <ji...@apache.org> on 2019/05/01 02:28:00 UTC, 7 replies.
- [jira] [Commented] (SPARK-27602) SparkSQL CBO can't get true size of partition table after partition pruning - posted by "angerszhu (JIRA)" <ji...@apache.org> on 2019/05/01 02:41:00 UTC, 3 replies.
- [jira] [Commented] (SPARK-24530) Sphinx doesn't render autodoc_docstring_signature correctly (with Python 2?) and pyspark.ml docs are broken - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/01 03:39:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-22796) Add multiple column support to PySpark QuantileDiscretizer - posted by "Dor Kedem (JIRA)" <ji...@apache.org> on 2019/05/01 05:24:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27593) CSV Parser returns 2 DataFrame - Valid and Malformed DFs - posted by "Ladislav Jech (JIRA)" <ji...@apache.org> on 2019/05/01 07:06:00 UTC, 5 replies.
- [jira] [Commented] (SPARK-27597) RuntimeConfig should be serializable - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/01 08:30:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27607) Improve performance of Row.toString() - posted by "Marco Gaido (JIRA)" <ji...@apache.org> on 2019/05/01 10:10:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27612) Creating a DataFrame in PySpark with ArrayType produces some Rows with Arrays of None - posted by "Marco Gaido (JIRA)" <ji...@apache.org> on 2019/05/01 10:46:00 UTC, 9 replies.
- [jira] [Commented] (SPARK-27332) Filter Pushdown duplicates expensive ScalarSubquery (discarding result) - posted by "Marco Gaido (JIRA)" <ji...@apache.org> on 2019/05/01 11:00:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27614) Executor shuffle fetch hang - posted by "weDataSphere (JIRA)" <ji...@apache.org> on 2019/05/01 12:01:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27615) Merge small files in the read stage - posted by "weDataSphere (JIRA)" <ji...@apache.org> on 2019/05/01 12:03:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27616) Standalone cluster management user resource allocation - posted by "weDataSphere (JIRA)" <ji...@apache.org> on 2019/05/01 12:04:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-17637) Packed scheduling for Spark tasks across executors - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/01 14:12:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27607) Improve performance of Row.toString() - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/01 14:41:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-26921) Fix CRAN hack as soon as Arrow is available on CRAN - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/01 14:42:00 UTC, 2 replies.
- [jira] [Assigned] (SPARK-27611) Redundant javax.activation dependencies in the Maven build - posted by "Cheng Lian (JIRA)" <ji...@apache.org> on 2019/05/01 15:28:00 UTC, 2 replies.
- [jira] [Updated] (SPARK-27612) Creating a DataFrame in PySpark with ArrayType produces some Rows with Arrays of None - posted by "Bryan Cutler (JIRA)" <ji...@apache.org> on 2019/05/01 16:07:00 UTC, 3 replies.
- [jira] [Assigned] (SPARK-27557) Add copybutton to spark Python API docs for easier copying of code-blocks - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/01 16:28:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27557) Add copybutton to spark Python API docs for easier copying of code-blocks - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/01 16:28:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27618) Unnecessary access to externalCatalog - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/01 18:41:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27617) Not able to specify LOCATION for internal table - posted by "Sujith Chacko (JIRA)" <ji...@apache.org> on 2019/05/01 18:41:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27618) Unnecessary access to externalCatalog - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/01 18:42:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-27618) Unnecessary access to externalCatalog - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/01 18:42:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27617) Not able to specify LOCATION for internal table - posted by "Sujith Chacko (JIRA)" <ji...@apache.org> on 2019/05/01 18:42:00 UTC, 4 replies.
- [jira] [Resolved] (SPARK-27618) Unnecessary access to externalCatalog - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/01 18:44:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27617) Not able to specify LOCATION for internal table - posted by "Sujith Chacko (JIRA)" <ji...@apache.org> on 2019/05/01 18:47:00 UTC, 3 replies.
- [jira] [Comment Edited] (SPARK-27617) Not able to specify LOCATION for internal table - posted by "Sujith Chacko (JIRA)" <ji...@apache.org> on 2019/05/01 18:49:00 UTC, 9 replies.
- [jira] [Commented] (SPARK-24935) Problem with Executing Hive UDF's from Spark 2.2 Onwards - posted by "Reza Safi (JIRA)" <ji...@apache.org> on 2019/05/01 20:57:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-24708) Document the default spark url of master in standalone is "spark://localhost:7070" - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/02 00:48:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-26921) Fix CRAN hack as soon as Arrow is available on CRAN - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/02 01:03:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27619) MapType should be prohibited in hash expressions - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/02 02:37:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27619) MapType should be prohibited in hash expressions - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/02 02:38:00 UTC, 3 replies.
- [jira] [Created] (SPARK-27620) Update jetty to 9.4.18.v20190429 - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/02 03:58:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27620) Update jetty to 9.4.18.v20190429 - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/02 04:06:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-27543) Support getRequiredJars and getRequiredFiles APIs for Hive UDFs - posted by "Chakravarthi (JIRA)" <ji...@apache.org> on 2019/05/02 04:27:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27541) Refresh class definitions for jars added via addJar() - posted by "Chakravarthi (JIRA)" <ji...@apache.org> on 2019/05/02 04:29:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27621) Calling transform() method on a LinearRegressionModel throws NoSuchElementException - posted by "Anca Sarb (JIRA)" <ji...@apache.org> on 2019/05/02 09:20:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27621) Calling transform() method on a LinearRegressionModel throws NoSuchElementException - posted by "Anca Sarb (JIRA)" <ji...@apache.org> on 2019/05/02 09:21:00 UTC, 2 replies.
- [jira] [Updated] (SPARK-27621) Calling transform() method on a LinearRegressionModel throws NoSuchElementException - posted by "Anca Sarb (JIRA)" <ji...@apache.org> on 2019/05/02 09:22:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27621) Calling transform() method on a LinearRegressionModel throws NoSuchElementException - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/02 09:24:00 UTC, 2 replies.
- [jira] [Issue Comment Deleted] (SPARK-27621) Calling transform() method on a LinearRegressionModel throws NoSuchElementException - posted by "Anca Sarb (JIRA)" <ji...@apache.org> on 2019/05/02 09:28:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27622) Avoiding network communication when block mangers are running on the host - posted by "Attila Zsolt Piros (JIRA)" <ji...@apache.org> on 2019/05/02 11:27:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27622) Avoiding network communication when block mangers are running on the host - posted by "Attila Zsolt Piros (JIRA)" <ji...@apache.org> on 2019/05/02 11:28:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27606) Deprecate `extended` field in ExpressionDescription/ExpressionInfo - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/02 12:11:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27606) Deprecate `extended` field in ExpressionDescription/ExpressionInfo - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/02 12:11:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-26924) Document Arrow optimization and vectorized R APIs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/02 12:45:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-26921) Fix CRAN hack as soon as Arrow is available on CRAN - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/02 12:45:01 UTC, 0 replies.
- [jira] [Updated] (SPARK-26924) Fix CRAN hack as soon as Arrow is available on CRAN - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/02 12:46:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-26921) Document Arrow optimization and vectorized R APIs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/02 12:46:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-26924) Fix CRAN hack as soon as Arrow is available on CRAN - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/02 12:47:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-26759) Arrow optimization in SparkR's interoperability - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/02 12:47:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-26759) Arrow optimization in SparkR's interoperability - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/02 12:48:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-23098) Migrate Kafka batch source to v2 - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/02 12:58:00 UTC, 4 replies.
- [jira] [Created] (SPARK-27623) Provider org.apache.spark.sql.avro.AvroFileFormat could not be instantiated - posted by "Alexandru Barbulescu (JIRA)" <ji...@apache.org> on 2019/05/02 14:16:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27607) Improve performance of Row.toString() - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/02 14:22:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27623) Provider org.apache.spark.sql.avro.AvroFileFormat could not be instantiated - posted by "Alexandru Barbulescu (JIRA)" <ji...@apache.org> on 2019/05/02 15:11:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-14083) Analyze JVM bytecode and turn closures into Catalyst expressions - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/02 17:08:01 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27194) Job failures when task attempts do not clean up spark-staging parquet files - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/02 17:10:00 UTC, 3 replies.
- [jira] [Created] (SPARK-27624) Fix CalenderInterval to show an empty interval correctly - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/02 18:50:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27624) Fix CalenderInterval to show an empty interval correctly - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/02 18:57:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27598) DStreams checkpointing does not work with the Spark Shell - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/02 18:57:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27601) Upgrade stream-lib to 2.9.6 - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/02 20:23:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27601) Upgrade stream-lib to 2.9.6 - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/02 20:23:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27625) ScalaReflection.serializerFor fails for annotated types - posted by "Patrick Grandjean (JIRA)" <ji...@apache.org> on 2019/05/02 21:28:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27625) ScalaReflection.serializerFor fails for annotated types - posted by "Patrick Grandjean (JIRA)" <ji...@apache.org> on 2019/05/02 21:30:01 UTC, 2 replies.
- [jira] [Created] (SPARK-27626) Fix `docker-image-tool.sh` to be robust in non-bash shell env - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/02 21:48:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27626) Fix `docker-image-tool.sh` to be robust in non-bash shell env - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/02 21:56:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27627) Make option "pathGlobFilter" as a general option for all file sources - posted by "Gengliang Wang (JIRA)" <ji...@apache.org> on 2019/05/02 23:51:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27627) Make option "pathGlobFilter" as a general option for all file sources - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/02 23:54:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27620) Update jetty to 9.4.18.v20190429 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/03 00:29:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27396) SPIP: Public APIs for extended Columnar Processing Support - posted by "Bryan Cutler (JIRA)" <ji...@apache.org> on 2019/05/03 00:51:00 UTC, 3 replies.
- [jira] [Assigned] (SPARK-27612) Creating a DataFrame in PySpark with ArrayType produces some Rows with Arrays of None - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/03 03:05:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27467) Upgrade Maven to 3.6.1 - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/03 03:05:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27586) Improve binary comparison: replace Scala's for-comprehension if statements with while loop - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/03 03:33:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27586) Improve binary comparison: replace Scala's for-comprehension if statements with while loop - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/03 03:33:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27612) Creating a DataFrame in PySpark with ArrayType produces some Rows with Arrays of None - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/03 05:41:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-20193) Selecting empty struct causes ExpressionEncoder error. - posted by "Terry Moschou (JIRA)" <ji...@apache.org> on 2019/05/03 07:39:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27623) Provider org.apache.spark.sql.avro.AvroFileFormat could not be instantiated - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/03 08:19:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27614) Executor shuffle fetch hang - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/03 08:25:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27615) Merge small files in the read stage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/03 08:25:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27615) Merge small files in the read stage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/03 08:25:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27613) Caching an RDD composed of Row Objects produces some kind of key recombination - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/03 08:27:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27609) [Documentation Issue?] from_json expects values of options dictionary to be - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/03 08:29:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27609) from_json expects values of options dictionary to be - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/03 08:30:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27609) from_json expects values of options dictionary to be - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/03 08:38:00 UTC, 3 replies.
- [jira] [Commented] (SPARK-22814) JDBC support date/timestamp type as partitionColumn - posted by "Shyama (JIRA)" <ji...@apache.org> on 2019/05/03 10:45:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27628) SortMergeJoin on a low-cardinality column results in heavy skew and large partitions - posted by "Michael Wu (JIRA)" <ji...@apache.org> on 2019/05/03 11:59:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27628) SortMergeJoin on a low-cardinality column results in heavy skew and large partitions - posted by "Michael Wu (JIRA)" <ji...@apache.org> on 2019/05/03 12:06:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27629) Prevent Unpickler from intervening each unpickling - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/03 16:04:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27629) Prevent Unpickler from intervening each unpickling - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/03 16:06:00 UTC, 2 replies.
- [jira] [Assigned] (SPARK-27629) Prevent Unpickler from intervening each unpickling - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/03 16:08:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-8682) Range Join for Spark SQL - posted by "Matthew Porter (JIRA)" <ji...@apache.org> on 2019/05/03 17:15:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27626) Fix `docker-image-tool.sh` to be robust in non-bash shell env - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/03 17:18:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-22974) CountVectorModel does not attach attributes to output column - posted by "William Zhang (JIRA)" <ji...@apache.org> on 2019/05/03 17:48:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-22974) CountVectorModel does not attach attributes to output column - posted by "yuhao yang (JIRA)" <ji...@apache.org> on 2019/05/03 17:50:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27630) Stage retry causes totalRunningTasks calculation to be negative - posted by "dzcxzl (JIRA)" <ji...@apache.org> on 2019/05/03 17:51:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27630) Stage retry causes totalRunningTasks calculation to be negative - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/03 17:54:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27630) Stage retry causes totalRunningTasks calculation to be negative - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/03 17:54:00 UTC, 1 replies.
- [jira] [Issue Comment Deleted] (SPARK-27612) Creating a DataFrame in PySpark with ArrayType produces some Rows with Arrays of None - posted by "Bryan Cutler (JIRA)" <ji...@apache.org> on 2019/05/03 18:11:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-26130) Change Event Timeline Display Functionality on the Stages Page to use either REST API or data from other tables - posted by "Parth Gandhi (JIRA)" <ji...@apache.org> on 2019/05/03 21:30:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-12312) JDBC connection to Kerberos secured databases fails on remote executors - posted by "shanyu zhao (JIRA)" <ji...@apache.org> on 2019/05/03 21:45:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27510) Master fall into dead loop while launching executor failed in Worker - posted by "Xingbo Jiang (JIRA)" <ji...@apache.org> on 2019/05/03 22:50:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27621) Calling transform() method on a LinearRegressionModel throws NoSuchElementException - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/03 23:20:01 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27555) cannot create table by using the hive default fileformat in both hive-site.xml and spark-defaults.conf - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/04 00:04:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27555) cannot create table by using the hive default fileformat in both hive-site.xml and spark-defaults.conf - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/04 00:04:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27577) Wrong thresholds selected by BinaryClassificationMetrics when downsampling - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/04 00:56:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27629) Prevent Unpickler from intervening each unpickling - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/04 04:23:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27631) Avoid repeating calculate table statistics when AUTO_SIZE_UPDATE_ENABLED is enabled - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/04 04:43:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27631) Avoid repeating calculate table statistics when AUTO_SIZE_UPDATE_ENABLED is enabled - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/04 04:50:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27631) Avoid repeating calculate table statistics when AUTO_SIZE_UPDATE_ENABLED is enabled - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/04 04:54:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27555) cannot create table by using the hive default fileformat in both hive-site.xml and spark-defaults.conf - posted by "Sandeep Katta (JIRA)" <ji...@apache.org> on 2019/05/04 05:40:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27553) Operation log is not closed when close session - posted by "pin_zhang (JIRA)" <ji...@apache.org> on 2019/05/04 11:20:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27632) More efficient Row.merge function - posted by "Phil (JIRA)" <ji...@apache.org> on 2019/05/04 14:58:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27592) Set the bucketed data source table SerDe correctly - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/04 15:53:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27450) Timestamp cast fails when the ISO8601 string omits minutes, seconds or milliseconds - posted by "Maxim Gekk (JIRA)" <ji...@apache.org> on 2019/05/04 17:58:00 UTC, 1 replies.
- [jira] [Comment Edited] (SPARK-22814) JDBC support date/timestamp type as partitionColumn - posted by "Al Johri (JIRA)" <ji...@apache.org> on 2019/05/04 19:38:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27602) SparkSQL CBO can't get true size of partition table after partition pruning - posted by "angerszhu (JIRA)" <ji...@apache.org> on 2019/05/05 03:47:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27633) Remove redundant aliases in NestedColumnAliasing - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/05 04:53:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27633) Remove redundant aliases in NestedColumnAliasing - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/05 04:56:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27634) deleteCheckpointOnStop should be configurable - posted by "Yu Wang (JIRA)" <ji...@apache.org> on 2019/05/05 08:34:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27603) Make ShuffleClient pluggable - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/05 08:35:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27634) deleteCheckpointOnStop should be configurable - posted by "Yu Wang (JIRA)" <ji...@apache.org> on 2019/05/05 08:38:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-27634) deleteCheckpointOnStop should be configurable - posted by "Genmao Yu (JIRA)" <ji...@apache.org> on 2019/05/05 09:14:00 UTC, 3 replies.
- [jira] [Issue Comment Deleted] (SPARK-27634) deleteCheckpointOnStop should be configurable - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/05 09:15:02 UTC, 0 replies.
- [jira] [Created] (SPARK-27635) Prevent from splitting too many partitions smaller than row group size in Parquet file format - posted by "Lantao Jin (JIRA)" <ji...@apache.org> on 2019/05/05 09:48:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27635) Prevent from splitting too many partitions smaller than row group size in Parquet file format - posted by "Lantao Jin (JIRA)" <ji...@apache.org> on 2019/05/05 09:49:00 UTC, 2 replies.
- [jira] [Assigned] (SPARK-27635) Prevent from splitting too many partitions smaller than row group size in Parquet file format - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/05 09:56:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27632) More efficient Row.merge function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/05 11:26:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27634) deleteCheckpointOnStop should be configurable - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/05 12:03:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27520) Introduce a global config system to replace hadoopConfiguration - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/05 16:10:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-25604) Reduce the overall time costs in Jenkins tests - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/05 16:41:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-25604) Reduce the overall time costs in Jenkins tests - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/05 16:41:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21309) Remove SQLConf parameters from the analyzer - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/05 16:44:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20089) Add DESC FUNCTION and DESC EXTENDED FUNCTION to SQLQueryTestSuite - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/05 16:44:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27636) Remove cached RDD blocks after PIC execution - posted by "shahid (JIRA)" <ji...@apache.org> on 2019/05/05 21:30:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27636) Remove cached RDD blocks after PIC execution - posted by "shahid (JIRA)" <ji...@apache.org> on 2019/05/05 21:31:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27636) Remove cached RDD blocks after PIC execution - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/05 22:15:00 UTC, 2 replies.
- [jira] [Assigned] (SPARK-27596) The JDBC 'query' option doesn't work for Oracle database - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/06 01:43:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27637) If exception occured while fetching blocks by netty block transfer service, check whether the relative executor is alive before retry - posted by "feiwang (JIRA)" <ji...@apache.org> on 2019/05/06 02:33:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27637) If exception occured while fetching blocks by netty block transfer service, check whether the relative executor is alive before retry - posted by "feiwang (JIRA)" <ji...@apache.org> on 2019/05/06 02:35:00 UTC, 4 replies.
- [jira] [Commented] (SPARK-27637) If exception occured while fetching blocks by netty block transfer service, check whether the relative executor is alive before retry - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/06 05:06:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27637) If exception occured while fetching blocks by netty block transfer service, check whether the relative executor is alive before retry - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/06 05:06:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27596) The JDBC 'query' option doesn't work for Oracle database - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/06 05:07:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27439) Use analyzed plan when explaining Dataset - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/06 06:22:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27439) Explainging Dataset should show correct resolved plans - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/06 06:23:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-24641) Spark-Mesos integration doesn't respect request to abort itself - posted by "Igor Berman (JIRA)" <ji...@apache.org> on 2019/05/06 06:47:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27227) Spark Runtime Filter - posted by "Song Jun (JIRA)" <ji...@apache.org> on 2019/05/06 07:12:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27227) Spark Runtime Filter - posted by "Song Jun (JIRA)" <ji...@apache.org> on 2019/05/06 07:20:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-27227) Spark Runtime Filter - posted by "Song Jun (JIRA)" <ji...@apache.org> on 2019/05/06 07:33:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27638) date format yyyy-M-dd comparison isn't handled properly - posted by "peng bo (JIRA)" <ji...@apache.org> on 2019/05/06 08:00:01 UTC, 0 replies.
- [jira] [Updated] (SPARK-27638) date format yyyy-M-dd comparison isn't handled properly - posted by "peng bo (JIRA)" <ji...@apache.org> on 2019/05/06 08:01:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27639) InMemoryTableScan should show the table name on UI - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/06 08:13:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27638) date format yyyy-M-dd comparison isn't handled properly - posted by "peng bo (JIRA)" <ji...@apache.org> on 2019/05/06 08:17:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27639) InMemoryTableScan should show the table name on UI - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/06 08:33:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27638) date format yyyy-M-dd comparison not handled properly - posted by "peng bo (JIRA)" <ji...@apache.org> on 2019/05/06 08:34:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27638) date format yyyy-M-dd string comparison not handled properly - posted by "peng bo (JIRA)" <ji...@apache.org> on 2019/05/06 08:35:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27639) InMemoryTableScan should show the table name on UI - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/06 08:51:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-5594) SparkException: Failed to get broadcast (TorrentBroadcast) - posted by "Jeffrey(Xilang) Yan (JIRA)" <ji...@apache.org> on 2019/05/06 09:20:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27622) Avoiding network communication when block mangers are running on the same host - posted by "Attila Zsolt Piros (JIRA)" <ji...@apache.org> on 2019/05/06 10:04:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27622) Avoiding network communication when block manger fetching from the same host - posted by "Attila Zsolt Piros (JIRA)" <ji...@apache.org> on 2019/05/06 10:09:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27622) Avoid network communication when block manger fetches from the same host - posted by "Attila Zsolt Piros (JIRA)" <ji...@apache.org> on 2019/05/06 10:20:01 UTC, 0 replies.
- [jira] [Created] (SPARK-27640) Avoid duplicate lookups for datasource through provider - posted by "jiaan.geng (JIRA)" <ji...@apache.org> on 2019/05/06 11:18:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27640) Avoid duplicate lookups for datasource through provider - posted by "jiaan.geng (JIRA)" <ji...@apache.org> on 2019/05/06 11:26:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27640) Avoid duplicate lookups for datasource through provider - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/06 11:28:00 UTC, 1 replies.
- [jira] [Comment Edited] (SPARK-27634) deleteCheckpointOnStop should be configurable - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/06 12:09:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-26839) on JDK11, IsolatedClientLoader must be able to load java.sql classes - posted by "Mihaly Toth (JIRA)" <ji...@apache.org> on 2019/05/06 12:32:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27579) remove BaseStreamingSource and BaseStreamingSink - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/06 12:43:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-23887) update query progress - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/06 13:04:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27641) Unregistering a single Metrics Source with no metrics leads to removing all the from other sources with the same name - posted by "Sergey Zhemzhitsky (JIRA)" <ji...@apache.org> on 2019/05/06 14:47:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27641) Unregistering a single Metrics Source with no metrics leads to removing all the metrics from other sources with the same name - posted by "Sergey Zhemzhitsky (JIRA)" <ji...@apache.org> on 2019/05/06 14:49:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27642) make v1 offset extends v2 offset - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/06 15:35:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27643) Add supported Hive version in doc - posted by "Zhichao Zhang (JIRA)" <ji...@apache.org> on 2019/05/06 15:41:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27643) Add supported Hive version list in doc - posted by "Zhichao Zhang (JIRA)" <ji...@apache.org> on 2019/05/06 15:41:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27638) date format yyyy-M-dd string comparison not handled properly - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/06 15:44:00 UTC, 6 replies.
- [jira] [Assigned] (SPARK-27642) make v1 offset extends v2 offset - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/06 15:53:00 UTC, 1 replies.
- [jira] [Comment Edited] (SPARK-27638) date format yyyy-M-dd string comparison not handled properly - posted by "Maxim Gekk (JIRA)" <ji...@apache.org> on 2019/05/06 15:58:00 UTC, 3 replies.
- [jira] [Updated] (SPARK-24935) Problem with Executing Hive UDF's from Spark 2.2 Onwards - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/06 16:35:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-23299) __repr__ broken for Rows instantiated with *args - posted by "Holden Karau (JIRA)" <ji...@apache.org> on 2019/05/06 17:05:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-19335) Spark should support doing an efficient DataFrame Upsert via JDBC - posted by "Darshan (JIRA)" <ji...@apache.org> on 2019/05/06 17:22:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-26555) Thread safety issue causes createDataset to fail with misleading errors - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/06 17:23:00 UTC, 2 replies.
- [jira] [Comment Edited] (SPARK-27622) Avoid network communication when block manger fetches from the same host - posted by "Attila Zsolt Piros (JIRA)" <ji...@apache.org> on 2019/05/06 18:23:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27622) Avoid network communication when block manager fetches disk persisted RDD blocks from the same host - posted by "Attila Zsolt Piros (JIRA)" <ji...@apache.org> on 2019/05/06 18:40:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27622) Avoid the network when block manager fetches disk persisted RDD blocks from the same host - posted by "Attila Zsolt Piros (JIRA)" <ji...@apache.org> on 2019/05/06 18:41:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27644) Enable spark.sql.optimizer.nestedSchemaPruning.enabled by default - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/06 18:46:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27644) Enable spark.sql.optimizer.nestedSchemaPruning.enabled by default - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/06 20:21:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27645) Cache result of count function to that RDD - posted by "Seungmin Lee (JIRA)" <ji...@apache.org> on 2019/05/06 20:53:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27646) Required refactoring for bytecode analysis work - posted by "DB Tsai (JIRA)" <ji...@apache.org> on 2019/05/06 21:50:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27646) Required refactoring for bytecode analysis - posted by "DB Tsai (JIRA)" <ji...@apache.org> on 2019/05/06 21:51:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27646) Required refactoring for bytecode analysis - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/06 21:54:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27466) LEAD function with 'ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING' causes exception in Spark - posted by "Bruce Robbins (JIRA)" <ji...@apache.org> on 2019/05/06 23:01:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-25139) PythonRunner#WriterThread released block after TaskRunner finally block which invoke BlockManager#releaseAllLocksForTask - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/06 23:39:00 UTC, 3 replies.
- [jira] [Commented] (SPARK-27643) Add supported Hive version list in doc - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/07 02:45:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27647) Metric Gauge not threadsafe - posted by "bettermouse (JIRA)" <ji...@apache.org> on 2019/05/07 04:21:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27648) In Spark2.4 Structured Streamingļ¼šThe executor storage memory increasing over time - posted by "tommy duan (JIRA)" <ji...@apache.org> on 2019/05/07 09:07:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27648) In Spark2.4 Structured Streamingļ¼šThe executor storage memory increasing over time - posted by "tommy duan (JIRA)" <ji...@apache.org> on 2019/05/07 09:08:00 UTC, 13 replies.
- [jira] [Commented] (SPARK-27549) Commit Kafka Source offsets to facilitate external tooling - posted by "Tarush Grover (JIRA)" <ji...@apache.org> on 2019/05/07 10:06:00 UTC, 3 replies.
- [jira] [Created] (SPARK-27649) Unify the way you use 'spark.network.timeout' - posted by "jiaan.geng (JIRA)" <ji...@apache.org> on 2019/05/07 10:15:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27649) Unify the way you use 'spark.network.timeout' - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/07 10:19:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27650) sepate the row iterator functionality from ColumnarBatch - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/07 10:40:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27650) sepate the row iterator functionality from ColumnarBatch - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/07 10:48:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27645) Cache result of count function to that RDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/07 12:16:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27647) Metric Gauge not threadsafe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/07 12:16:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27647) Metric Gauge not threadsafe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/07 12:16:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27645) Cache result of count function to that RDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/07 12:16:00 UTC, 3 replies.
- [jira] [Resolved] (SPARK-27645) Cache result of count function to that RDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/07 12:19:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27643) Add supported Hive version list in doc - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/07 12:20:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-23191) Workers registration failes in case of network drop - posted by "wuyi (JIRA)" <ji...@apache.org> on 2019/05/07 12:21:00 UTC, 4 replies.
- [jira] [Updated] (SPARK-27639) InMemoryTableScan shows the table name on UI if possible - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/07 13:23:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27577) Wrong thresholds selected by BinaryClassificationMetrics when downsampling - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/07 13:47:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27258) The value of "spark.app.name" or "--name" starts with number , which causes resourceName does not match regular expression - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/07 14:00:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24708) Document the default spark url of master in standalone is "spark://localhost:7070" - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/07 14:09:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27651) Avoid the network when block manager fetches shuffle blocks from the same host - posted by "Attila Zsolt Piros (JIRA)" <ji...@apache.org> on 2019/05/07 14:55:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27651) Avoid the network when block manager fetches shuffle blocks from the same host - posted by "Attila Zsolt Piros (JIRA)" <ji...@apache.org> on 2019/05/07 14:56:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27566) SIGSEV in Spark SQL during broadcast - posted by "Martin Studer (JIRA)" <ji...@apache.org> on 2019/05/07 14:56:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27563) automatically get the latest Spark versions in HiveExternalCatalogVersionsSuite - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/07 15:07:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27652) Caught Hive MetaException when query by partition (partition col start with underscore) - posted by "Tongqing Qiu (JIRA)" <ji...@apache.org> on 2019/05/07 16:47:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27652) Caught Hive MetaException when query by partition (partition col start with underscore) - posted by "Tongqing Qiu (JIRA)" <ji...@apache.org> on 2019/05/07 16:49:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27652) Caught Hive MetaException when query by partition (partition col start with underscore) - posted by "Tongqing Qiu (JIRA)" <ji...@apache.org> on 2019/05/07 16:55:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27590) do not consider skipped tasks when scheduling speculative tasks - posted by "Imran Rashid (JIRA)" <ji...@apache.org> on 2019/05/07 17:03:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27610) Yarn external shuffle service fails to start when spark.shuffle.io.mode=EPOLL - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/07 17:49:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27610) Yarn external shuffle service fails to start when spark.shuffle.io.mode=EPOLL - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/07 17:49:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27294) Multi-cluster Kafka delegation token support - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/07 18:42:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27294) Multi-cluster Kafka delegation token support - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/07 18:42:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-26944) Python unit-tests.log not available in artifacts for a build in Jenkins - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/07 19:09:00 UTC, 1 replies.
- [jira] [Comment Edited] (SPARK-26944) Python unit-tests.log not available in artifacts for a build in Jenkins - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/07 19:34:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27548) PySpark toLocalIterator does not raise errors from worker - posted by "Bryan Cutler (JIRA)" <ji...@apache.org> on 2019/05/07 21:49:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-23961) pyspark toLocalIterator throws an exception - posted by "Bryan Cutler (JIRA)" <ji...@apache.org> on 2019/05/07 21:49:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23961) pyspark toLocalIterator throws an exception - posted by "Bryan Cutler (JIRA)" <ji...@apache.org> on 2019/05/07 21:49:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27548) PySpark toLocalIterator does not raise errors from worker - posted by "Bryan Cutler (JIRA)" <ji...@apache.org> on 2019/05/07 21:50:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27653) Add max_by() / min_by() SQL aggregate functions - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/07 22:55:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27347) Fix supervised driver retry logic when agent crashes/restarts - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/07 22:58:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27654) spark unable to read parquet file- corrupt footer - posted by "Gautham Rajendiran (JIRA)" <ji...@apache.org> on 2019/05/08 02:11:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27654) spark unable to read parquet file- corrupt footer - posted by "Gautham Rajendiran (JIRA)" <ji...@apache.org> on 2019/05/08 02:13:00 UTC, 3 replies.
- [jira] [Comment Edited] (SPARK-23191) Workers registration failes in case of network drop - posted by "zuotingbing (JIRA)" <ji...@apache.org> on 2019/05/08 02:22:00 UTC, 1 replies.
- [jira] [Comment Edited] (SPARK-27647) Metric Gauge not threadsafe - posted by "bettermouse (JIRA)" <ji...@apache.org> on 2019/05/08 02:30:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27639) InMemoryTableScan shows the table name on UI if possible - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/08 04:02:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-26944) Python unit-tests.log not available in artifacts for a build in Jenkins - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/08 04:38:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27655) Persistent the table statistics to metadata after fall back to hdfs - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/08 04:42:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27655) Persistent the table statistics to metadata after fall back to hdfs - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/08 04:43:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27655) Persistent the table statistics to metadata after fall back to hdfs - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/08 04:54:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-25139) PythonRunner#WriterThread released block after TaskRunner finally block which invoke BlockManager#releaseAllLocksForTask - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/08 04:55:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27654) spark unable to read parquet file- corrupt footer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/08 05:08:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27654) spark unable to read parquet file- corrupt footer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/08 05:08:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27642) make v1 offset extends v2 offset - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/08 06:05:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27648) In Spark2.4 Structured Streamingļ¼šThe executor storage memory increasing over time - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/08 08:24:00 UTC, 21 replies.
- [jira] [Assigned] (SPARK-27622) Avoid the network when block manager fetches disk persisted RDD blocks from the same host - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/08 08:30:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27649) Unify the way you use 'spark.network.timeout' - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/08 08:34:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27600) Unable to start Spark Hive Thrift Server when multiple hive server server share the same metastore - posted by "pin_zhang (JIRA)" <ji...@apache.org> on 2019/05/08 08:59:00 UTC, 1 replies.
- [jira] [Comment Edited] (SPARK-27600) Unable to start Spark Hive Thrift Server when multiple hive server server share the same metastore - posted by "pin_zhang (JIRA)" <ji...@apache.org> on 2019/05/08 09:03:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27656) Safely register class for GraphX - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/08 10:19:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27656) Safely register class for GraphX - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/08 10:21:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-23805) support vector-size validation and Inference - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/08 10:32:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24664) Column support name getter - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/08 10:32:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21879) Should Scalers handel NaN values? - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/08 10:33:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19208) MultivariateOnlineSummarizer performance optimization - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/08 10:33:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20711) MultivariateOnlineSummarizer/Summarizer incorrect min/max for NaN value - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/08 10:33:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18518) HasSolver should support allowed values - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/08 10:34:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18757) Models in Pyspark support column setters - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/08 10:34:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16872) Include Gaussian Naive Bayes Classifier - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/08 10:35:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14174) Implement the Mini-Batch KMeans - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/08 10:35:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13385) Enable AssociationRules to generate consequents with user-defined lengths - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/08 10:35:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13677) Support Tree-Based Feature Transformation for ML - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/08 10:35:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7008) An implementation of Factorization Machine (LibFM) - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/08 10:36:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22320) ORC should support VectorUDT/MatrixUDT - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/08 10:36:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17906) MulticlassClassificationEvaluator support target label - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/08 10:37:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27641) Unregistering a single Metrics Source with no metrics leads to removing all the metrics from other sources with the same name - posted by "chunpinghe (JIRA)" <ji...@apache.org> on 2019/05/08 10:44:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-20166) Use XXX for ISO timezone instead of ZZ which is FastDateFormat specific in CSV/JSON time related options - posted by "Shyama (JIRA)" <ji...@apache.org> on 2019/05/08 12:46:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-25861) Remove unused refreshInterval parameter from the headerSparkPage method. - posted by "Artem Kalchenko (JIRA)" <ji...@apache.org> on 2019/05/08 12:53:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27641) Unregistering a single Metrics Source with no metrics leads to removing all the metrics from other sources with the same name - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/08 13:08:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27657) ml.util.Instrumentation.logFailure doesn't log error message - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/08 17:28:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27658) Catalog API to load functions - posted by "Ryan Blue (JIRA)" <ji...@apache.org> on 2019/05/08 17:33:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27659) Allow PySpark toLocalIterator to pre-fetch data - posted by "Bryan Cutler (JIRA)" <ji...@apache.org> on 2019/05/08 17:40:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27657) ml.util.Instrumentation.logFailure doesn't log error message - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/08 17:42:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27660) Allow PySpark toLocalIterator to pre-fetch data - posted by "Bryan Cutler (JIRA)" <ji...@apache.org> on 2019/05/08 17:42:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-27657) ml.util.Instrumentation.logFailure doesn't log error message - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/08 17:43:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27661) Add SupportsNamespaces interface for v2 catalogs - posted by "Ryan Blue (JIRA)" <ji...@apache.org> on 2019/05/08 17:46:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27660) Allow PySpark toLocalIterator to pre-fetch data - posted by "Bryan Cutler (JIRA)" <ji...@apache.org> on 2019/05/08 18:10:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-27660) Allow PySpark toLocalIterator to pre-fetch data - posted by "Bryan Cutler (JIRA)" <ji...@apache.org> on 2019/05/08 18:10:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27659) Allow PySpark toLocalIterator to prefetch data - posted by "Bryan Cutler (JIRA)" <ji...@apache.org> on 2019/05/08 18:12:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-25139) PythonRunner#WriterThread released block after TaskRunner finally block which invoke BlockManager#releaseAllLocksForTask - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/08 18:49:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27661) Add SupportsNamespaces interface for v2 catalogs - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/08 20:54:00 UTC, 3 replies.
- [jira] [Commented] (SPARK-27661) Add SupportsNamespaces interface for v2 catalogs - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/08 21:35:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27624) Fix CalenderInterval to show an empty interval correctly - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/08 21:36:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-26130) Change Event Timeline Display Functionality on the Stages Page to use either REST API or data from other tables - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/08 22:21:01 UTC, 1 replies.
- [jira] [Created] (SPARK-27662) SQL tab shows two jobs for one SQL command - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/08 23:38:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27662) SQL tab shows two jobs for one SQL command - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/08 23:38:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27627) Make option "pathGlobFilter" as a general option for all file sources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/08 23:44:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27662) SQL tab shows two jobs for one SQL command - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/08 23:53:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27359) Joins on some array functions can be optimized - posted by "Nikolas Vanderhoof (JIRA)" <ji...@apache.org> on 2019/05/09 00:57:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27359) Joins on some array functions can be optimized - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/09 00:57:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27663) Task accomplished incompletely but marked as success - posted by "Fan Yunbo (JIRA)" <ji...@apache.org> on 2019/05/09 03:01:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27663) Task accomplished incompletely but marked as success - posted by "Fan Yunbo (JIRA)" <ji...@apache.org> on 2019/05/09 03:03:00 UTC, 5 replies.
- [jira] [Commented] (SPARK-27663) Task accomplished incompletely but marked as success - posted by "Fan Yunbo (JIRA)" <ji...@apache.org> on 2019/05/09 03:12:00 UTC, 6 replies.
- [jira] [Comment Edited] (SPARK-27663) Task accomplished incompletely but marked as success - posted by "Fan Yunbo (JIRA)" <ji...@apache.org> on 2019/05/09 03:22:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27207) There exists a bug with SortBasedAggregator where merge()/update() operations get invoked on the aggregate buffer without calling initialize - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/09 03:22:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27658) Catalog API to load functions - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/09 03:29:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames or Arrow batches - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/09 03:30:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27653) Add max_by() / min_by() SQL aggregate functions - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/09 05:17:00 UTC, 2 replies.
- [jira] [Comment Edited] (SPARK-27648) In Spark2.4 Structured Streamingļ¼šThe executor storage memory increasing over time - posted by "tommy duan (JIRA)" <ji...@apache.org> on 2019/05/09 09:56:00 UTC, 8 replies.
- [jira] [Assigned] (SPARK-27617) Not able to specify LOCATION for internal table - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/09 10:10:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27625) ScalaReflection.serializerFor fails for annotated types - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/09 10:34:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27664) Performance issue with FileStatusCache, while reading from object stores. - posted by "Prashant Sharma (JIRA)" <ji...@apache.org> on 2019/05/09 11:11:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27664) Performance issue with FileStatusCache, while reading from object stores. - posted by "Prashant Sharma (JIRA)" <ji...@apache.org> on 2019/05/09 11:14:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27665) Split fetch shuffle blocks protocol from OpenBlocks - posted by "Yuanjian Li (JIRA)" <ji...@apache.org> on 2019/05/09 11:35:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27665) Split fetch shuffle blocks protocol from OpenBlocks - posted by "Yuanjian Li (JIRA)" <ji...@apache.org> on 2019/05/09 11:36:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27666) Stop python runner threads when task finishes - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/09 11:41:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27665) Split fetch shuffle blocks protocol from OpenBlocks - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/09 12:23:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27667) when hive.cli.print.current.db is set, spark cli is not working as expected - posted by "Sandeep Katta (JIRA)" <ji...@apache.org> on 2019/05/09 12:27:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27667) when hive.cli.print.current.db is set, spark cli is not working as expected - posted by "Sandeep Katta (JIRA)" <ji...@apache.org> on 2019/05/09 12:27:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27667) when hive.cli.print.current.db is set, spark cli is not working as expected - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/09 12:32:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27638) date format yyyy-M-dd string comparison not handled properly - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/09 12:56:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-27631) Avoid repeating calculate table statistics when AUTO_SIZE_UPDATE_ENABLED is enabled - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/09 13:17:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames or Arrow batches - posted by "Weichen Xu (JIRA)" <ji...@apache.org> on 2019/05/09 13:26:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27540) Add 'meanAveragePrecision_at_k' metric to RankingMetrics - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/09 13:49:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27540) Add 'meanAveragePrecision_at_k' metric to RankingMetrics - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/09 13:49:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27540) Add 'meanAveragePrecision_at_k' metric to RankingMetrics - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/09 13:49:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27492) GPU scheduling - High level user documentation - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/09 14:26:01 UTC, 0 replies.
- [jira] [Updated] (SPARK-27636) Remove cached RDD blocks after PIC execution - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/09 14:28:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27636) Remove cached RDD blocks after PIC execution - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/09 14:29:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames or Arrow batches - posted by "Weichen Xu (JIRA)" <ji...@apache.org> on 2019/05/09 15:19:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27089) Loss of precision during decimal division - posted by "Marco Gaido (JIRA)" <ji...@apache.org> on 2019/05/09 15:48:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-26182) Cost increases when optimizing scalaUDF - posted by "Marco Gaido (JIRA)" <ji...@apache.org> on 2019/05/09 15:54:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-23191) Workers registration failes in case of network drop - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/09 16:02:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-23986) CompileException when using too many avg aggregation after joining - posted by "Siddharth Dangi (JIRA)" <ji...@apache.org> on 2019/05/09 19:58:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27668) File source V2: support reporting statistics - posted by "Gengliang Wang (JIRA)" <ji...@apache.org> on 2019/05/09 21:50:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27520) Introduce a global config system to replace hadoopConfiguration - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/09 21:51:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27668) File source V2: support reporting statistics - posted by "Gengliang Wang (JIRA)" <ji...@apache.org> on 2019/05/09 21:54:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27668) File source V2: support reporting statistics - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/09 21:55:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27271) Migrate Text to File Data Source V2 - posted by "Gengliang Wang (JIRA)" <ji...@apache.org> on 2019/05/09 21:58:01 UTC, 0 replies.
- [jira] [Reopened] (SPARK-27600) Unable to start Spark Hive Thrift Server when multiple hive server server share the same metastore - posted by "pin_zhang (JIRA)" <ji...@apache.org> on 2019/05/10 00:05:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27669) Refactor DataFrameWriter to always go through Catalyst for analysis - posted by "Eric Liang (JIRA)" <ji...@apache.org> on 2019/05/10 00:55:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27669) Refactor DataFrameWriter to resolve datasources in a command - posted by "Eric Liang (JIRA)" <ji...@apache.org> on 2019/05/10 00:56:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27669) Refactor DataFrameWriter to resolve datasources in a command - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/10 00:59:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-26182) Cost increases when optimizing scalaUDF - posted by "bupt_ljy (JIRA)" <ji...@apache.org> on 2019/05/10 04:16:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-26437) Decimal data becomes bigint to query, unable to query - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/10 04:52:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-21172) EOFException reached end of stream in UnsafeRowSerializer - posted by "Lasantha Fernando (JIRA)" <ji...@apache.org> on 2019/05/10 05:44:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-17025) Cannot persist PySpark ML Pipeline model that includes custom Transformer - posted by "Yogesh Agrawal (JIRA)" <ji...@apache.org> on 2019/05/10 05:54:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-23986) CompileException when using too many avg aggregation after joining - posted by "Pedro Fernandes (JIRA)" <ji...@apache.org> on 2019/05/10 06:53:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27670) Add High available for Spark Hive thrift server. - posted by "jiaan.geng (JIRA)" <ji...@apache.org> on 2019/05/10 07:07:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27670) Add High available for Spark Hive thrift server. - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/10 07:28:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27671) Analysis exception thrown when casting from a nested null in a struct - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/10 10:40:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27672) Add since info to string expressions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/10 10:47:03 UTC, 0 replies.
- [jira] [Commented] (SPARK-27664) Performance issue with FileStatusCache, while reading from object stores. - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/10 10:48:00 UTC, 2 replies.
- [jira] [Assigned] (SPARK-27664) Performance issue with FileStatusCache, while reading from object stores. - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/10 10:48:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27671) Analysis exception thrown when casting from a nested null in a struct - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/10 10:48:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27672) Add since info to string expressions - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/10 10:50:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27673) Add since info to random. regex, null expressions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/10 11:09:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27673) Add since info to random. regex, null expressions - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/10 11:11:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-25075) Build and test Spark against Scala 2.13 - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/10 12:40:00 UTC, 7 replies.
- [jira] [Commented] (SPARK-18484) case class datasets - ability to specify decimal precision and scale - posted by "Bill Schneider (JIRA)" <ji...@apache.org> on 2019/05/10 13:18:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27674) the hint should not be dropped after cache lookup - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/10 13:37:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27674) the hint should not be dropped after cache lookup - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/10 13:42:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27675) do not use MutableColumnarRow in ColumnarBatch - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/10 14:39:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27675) do not use MutableColumnarRow in ColumnarBatch - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/10 14:42:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27650) sepate the row iterator functionality from ColumnarBatch - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/10 14:44:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27625) ScalaReflection.serializerFor fails for annotated types - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/10 14:51:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-14948) Exception when joining DataFrames derived form the same DataFrame - posted by "Graton M Gathright (JIRA)" <ji...@apache.org> on 2019/05/10 15:06:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/10 15:06:01 UTC, 2 replies.
- [jira] [Commented] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/10 15:11:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-25075) Build and test Spark against Scala 2.13 - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/10 15:54:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-24980) add support for pandas/arrow etc for python2.7 and pypy builds - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/10 16:28:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-24980) add support for pandas/arrow etc for python2.7 and pypy builds - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/10 16:28:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-21367) R older version of Roxygen2 on Jenkins - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/10 16:55:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-21367) R older version of Roxygen2 on Jenkins - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/10 16:56:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-26632) Separate Thread Configurations of Driver and Executor - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/10 17:44:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-26632) Separate Thread Configurations of Driver and Executor - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/10 17:44:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27347) Fix supervised driver retry logic when agent crashes/restarts - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/10 17:57:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27347) Fix supervised driver retry logic when agent crashes/restarts - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/10 17:57:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27660) Allow PySpark toLocalIterator to pre-fetch data - posted by "Holden Karau (JIRA)" <ji...@apache.org> on 2019/05/10 19:03:00 UTC, 1 replies.
- [jira] [Issue Comment Deleted] (SPARK-27660) Allow PySpark toLocalIterator to pre-fetch data - posted by "Holden Karau (JIRA)" <ji...@apache.org> on 2019/05/10 19:04:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27676) InMemoryFileIndex should hard-fail on missing files instead of logging and continuing - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/10 19:17:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27676) InMemoryFileIndex should hard-fail on missing files instead of logging and continuing - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/10 19:19:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-27676) InMemoryFileIndex should hard-fail on missing files instead of logging and continuing - posted by "Michael Armbrust (JIRA)" <ji...@apache.org> on 2019/05/10 19:38:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27677) Disk-persisted RDD blocks served by shuffle service, and ignored for Dynamic Allocation - posted by "Imran Rashid (JIRA)" <ji...@apache.org> on 2019/05/10 21:41:01 UTC, 0 replies.
- [jira] [Commented] (SPARK-27677) Disk-persisted RDD blocks served by shuffle service, and ignored for Dynamic Allocation - posted by "Imran Rashid (JIRA)" <ji...@apache.org> on 2019/05/10 21:42:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-25888) Service requests for persist() blocks via external service after dynamic deallocation - posted by "Imran Rashid (JIRA)" <ji...@apache.org> on 2019/05/10 21:47:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27678) Support Knox user impersonation in UI - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/10 22:08:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27678) Support Knox user impersonation in UI - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/10 22:35:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27679) Improve queries with LIKE expression - posted by "Achuth Narayan Rajagopal (JIRA)" <ji...@apache.org> on 2019/05/10 23:37:01 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27679) Improve queries with LIKE expression - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/10 23:41:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27680) Remove usage of Traversable - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/11 01:10:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27681) Use scala.collection.Seq explicitly instead of scala.Seq alias - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/11 01:13:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27682) Avoid use of Scala collection classes that are removed in 2.13 - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/11 01:14:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27680) Remove usage of Traversable - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/11 01:32:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27681) Use scala.collection.Seq explicitly instead of scala.Seq alias - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/11 04:09:00 UTC, 14 replies.
- [jira] [Comment Edited] (SPARK-27681) Use scala.collection.Seq explicitly instead of scala.Seq alias - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/11 04:29:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27677) Disk-persisted RDD blocks served by shuffle service, and ignored for Dynamic Allocation - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/11 14:01:00 UTC, 2 replies.
- [jira] [Updated] (SPARK-27682) Avoid use of Scala collection classes that are removed in 2.13 - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/11 18:33:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27682) Avoid use of Scala collection classes that are removed in 2.13 - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/11 18:34:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27682) Avoid use of Scala collection classes that are removed in 2.13 - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/11 18:37:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27683) Remove usage of TraversableOnce - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/11 18:40:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3289) Avoid job failures due to rescheduling of failing tasks on buggy machines - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/11 22:11:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8352) Affixed table of contents, similar to Bootstrap 3 docs - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/11 22:12:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8351) Umbella for improving Spark documentation CSS + JS - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/11 22:13:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27683) Remove usage of TraversableOnce - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/11 23:25:00 UTC, 5 replies.
- [jira] [Resolved] (SPARK-27675) do not use MutableColumnarRow in ColumnarBatch - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/12 11:01:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-27683) Remove usage of TraversableOnce - posted by "PJ Fanning (JIRA)" <ji...@apache.org> on 2019/05/12 15:02:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27074) Hive 3.1 metastore support HiveClientImpl.runHive - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/12 15:35:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27343) Use ConfigEntry for hardcoded configs for spark-sql-kafka - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/12 15:47:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27343) Use ConfigEntry for hardcoded configs for spark-sql-kafka - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/12 15:47:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27343) Use ConfigEntry for hardcoded configs for spark-sql-kafka - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/12 15:47:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27684) Reduce ScalaUDF conversion overheads for primitives - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/12 20:18:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27684) Reduce ScalaUDF conversion overheads for primitives - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/12 20:20:00 UTC, 3 replies.
- [jira] [Commented] (SPARK-27684) Reduce ScalaUDF conversion overheads for primitives - posted by "Felix Cheung (JIRA)" <ji...@apache.org> on 2019/05/12 22:59:00 UTC, 4 replies.
- [jira] [Commented] (SPARK-27335) cannot collect() from Correlation.corr - posted by "Michael Chirico (JIRA)" <ji...@apache.org> on 2019/05/13 03:45:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27685) `union` doesn't promote non-nullable columns of struct to nullable - posted by "Huon Wilson (JIRA)" <ji...@apache.org> on 2019/05/13 04:46:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27685) `union` doesn't promote non-nullable columns of struct to nullable - posted by "Huon Wilson (JIRA)" <ji...@apache.org> on 2019/05/13 04:50:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27686) Update - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/13 05:09:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27686) Update migration guide - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/13 05:10:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27668) File source V2: support reporting statistics - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/13 06:20:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-27546) Should repalce DateTimeUtils#defaultTimeZoneuse with sessionLocalTimeZone - posted by "Jiatao Tao (JIRA)" <ji...@apache.org> on 2019/05/13 07:19:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27687) Kafka consumer cache parameter rename and documentation - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/13 08:20:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27687) Kafka consumer cache parameter rename and documentation - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/13 08:30:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-21827) Task fail due to executor exception when enable Sasl Encryption - posted by "SĆ©bastien BARNOUD (JIRA)" <ji...@apache.org> on 2019/05/13 09:23:00 UTC, 3 replies.
- [jira] [Updated] (SPARK-27687) Kafka consumer cache parameter rename and documentation - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/13 10:22:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27688) Beeline should show database in the prompt - posted by "Sandeep Katta (JIRA)" <ji...@apache.org> on 2019/05/13 10:52:00 UTC, 5 replies.
- [jira] [Created] (SPARK-27688) Beeline should show database in the prompt - posted by "Sandeep Katta (JIRA)" <ji...@apache.org> on 2019/05/13 10:52:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-26601) Make broadcast-exchange thread pool keepalivetime and maxThreadNumber configurable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/13 11:42:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-26601) Make broadcast-exchange thread pool keepalivetime and maxThreadNumber configurable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/13 11:42:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-27688) Beeline should show database in the prompt - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/13 12:19:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27689) Error to execute hive views with spark - posted by "Juan Antonio (JIRA)" <ji...@apache.org> on 2019/05/13 13:41:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27653) Add max_by() / min_by() SQL aggregate functions - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/13 14:40:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27617) Not able to specify LOCATION for internal table - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/13 16:31:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-27617) Not able to specify LOCATION for internal table - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/13 16:31:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27671) Analysis exception thrown when casting from a nested null in a struct - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/13 16:40:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27689) Error to execute hive views with spark - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/13 16:50:00 UTC, 7 replies.
- [jira] [Created] (SPARK-27690) Refactor HiveClientImpl#reset() to remove materialized view first - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/13 16:53:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27690) Refactor HiveClientImpl#reset() to remove materialized view first - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/13 17:05:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27691) Issue when running queries using filter predicates on pandas GROUPED_AGG udfs - posted by "Michael Tong (JIRA)" <ji...@apache.org> on 2019/05/13 17:33:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27691) Issue when running queries using filter predicates on pandas GROUPED_AGG udafs - posted by "Michael Tong (JIRA)" <ji...@apache.org> on 2019/05/13 17:34:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27402) Fix hadoop-3.2 test issue(except the hive-thriftserver module) - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/13 17:37:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27681) Use scala.collection.Seq explicitly instead of scala.Seq alias - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/13 17:48:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-21367) R older version of Roxygen2 on Jenkins - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/13 18:17:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27671) Fix error when casting from a nested null in a struct - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/13 18:25:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27671) Fix error when casting from a nested null in a struct - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/13 18:36:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27692) Optimize evaluation of udf that is deterministic and has literal inputs - posted by "Sunitha Kambhampati (JIRA)" <ji...@apache.org> on 2019/05/13 18:47:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27692) Optimize evaluation of udf that is deterministic and has literal inputs - posted by "Sunitha Kambhampati (JIRA)" <ji...@apache.org> on 2019/05/13 18:50:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27692) Optimize evaluation of udf that is deterministic and has literal inputs - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/13 19:00:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27671) Fix error when casting from a nested null in a struct - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/13 19:43:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27693) DataSourceV2: Add default catalog property - posted by "Ryan Blue (JIRA)" <ji...@apache.org> on 2019/05/13 20:08:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27693) DataSourceV2: Add default catalog property - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/13 20:13:00 UTC, 1 replies.
- [jira] [Issue Comment Deleted] (SPARK-27682) Avoid use of Scala collection classes that are removed in 2.13 - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/13 21:43:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27690) Remove materialized views first in `HiveClientImpl.reset` - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/13 22:10:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27690) Remove materialized view first in `HiveClientImpl.reset` - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/13 22:10:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27694) Create a data source table using the result of a query should update statistics if spark.sql.statistics.size.autoUpdate.enabled is enabled - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/14 02:32:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27694) Create a data source table using the result of a query should update statistics if spark.sql.statistics.size.autoUpdate.enabled is enabled - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/14 02:50:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27689) Error to execute hive views with spark - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/14 04:27:00 UTC, 5 replies.
- [jira] [Created] (SPARK-27695) SELECT * returns null column when reading from Hive / ORC and spark.sql.hive.convertMetastoreOrc=true - posted by "Oscar Cassetti (JIRA)" <ji...@apache.org> on 2019/05/14 05:04:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27694) CTAS created data source table support collect statistics - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/14 05:26:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27696) kubernetes driver pod not deleted after finish. - posted by "Henry Yu (JIRA)" <ji...@apache.org> on 2019/05/14 06:10:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27696) kubernetes driver pod not deleted after finish. - posted by "Henry Yu (JIRA)" <ji...@apache.org> on 2019/05/14 06:12:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-20356) Spark sql group by returns incorrect results after join + distinct transformations - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/14 06:18:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27697) KubernetesClientApplication alway exit with 0 - posted by "Henry Yu (JIRA)" <ji...@apache.org> on 2019/05/14 06:20:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-6743) Join with empty projection on one side produces invalid results - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/14 06:22:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27697) KubernetesClientApplication alway exit with 0 - posted by "Henry Yu (JIRA)" <ji...@apache.org> on 2019/05/14 06:27:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-18578) Full outer join in correlated subquery returns incorrect results - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/14 06:28:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-19017) NOT IN subquery with more than one column may return incorrect results - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/14 06:28:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-18473) Correctness issue in INNER join result with window functions - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/14 06:29:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-18504) Scalar subquery with extra group by columns returning incorrect result - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/14 06:29:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27698) Add new method for getting pushed down filters in Parquet file reader - posted by "Gengliang Wang (JIRA)" <ji...@apache.org> on 2019/05/14 07:52:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27698) Add new method for getting pushed down filters in Parquet file reader - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/14 08:18:00 UTC, 2 replies.
- [jira] [Updated] (SPARK-27698) Add new method for getting pushed down filters in Parquet file reader - posted by "Gengliang Wang (JIRA)" <ji...@apache.org> on 2019/05/14 08:21:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-26278) V2 Streaming sources cannot be written to V1 sinks - posted by "Genmao Yu (JIRA)" <ji...@apache.org> on 2019/05/14 09:21:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-26302) retainedBatches configuration can eat up memory on driver - posted by "Genmao Yu (JIRA)" <ji...@apache.org> on 2019/05/14 09:28:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-26302) retainedBatches configuration can eat up memory on driver - posted by "Genmao Yu (JIRA)" <ji...@apache.org> on 2019/05/14 09:28:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27699) Partially push down disjunctive predicated in Parquet/Orc - posted by "Gengliang Wang (JIRA)" <ji...@apache.org> on 2019/05/14 09:42:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27699) Partially push down disjunctive predicated in Parquet/ORC - posted by "Gengliang Wang (JIRA)" <ji...@apache.org> on 2019/05/14 09:46:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27699) Partially push down disjunctive predicated in Parquet/ORC - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/14 09:55:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-26338) Use scala-xml explicitly - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/14 09:59:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27700) SparkSubmit closes with SocketTimeoutException in kubernetes mode. - posted by "Udbhav Agrawal (JIRA)" <ji...@apache.org> on 2019/05/14 10:08:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27700) SparkSubmit closes with SocketTimeoutException in kubernetes mode. - posted by "Udbhav Agrawal (JIRA)" <ji...@apache.org> on 2019/05/14 10:08:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27701) Extend NestedColumnAliasing to more nested field cases - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/14 10:09:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27700) SparkSubmit closes with SocketTimeoutException in kubernetes mode. - posted by "Udbhav Agrawal (JIRA)" <ji...@apache.org> on 2019/05/14 10:09:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27701) Extend NestedColumnAliasing to more nested field cases - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/14 10:11:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27700) SparkSubmit closes with SocketTimeoutException in kubernetes mode. - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/14 10:13:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27685) `union` doesn't promote non-nullable columns of struct to nullable - posted by "Marco Gaido (JIRA)" <ji...@apache.org> on 2019/05/14 10:21:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27685) `union` doesn't promote non-nullable columns of struct to nullable - posted by "Marco Gaido (JIRA)" <ji...@apache.org> on 2019/05/14 10:21:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27702) Allow using some alternatives for service accounts - posted by "Udbhav Agrawal (JIRA)" <ji...@apache.org> on 2019/05/14 10:46:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27702) Allow using some alternatives for service accounts - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/14 10:50:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27703) kubernetes jobs are filing with Unsatisfiedlink error - posted by "Udbhav Agrawal (JIRA)" <ji...@apache.org> on 2019/05/14 11:12:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27703) kubernetes jobs are failing with Unsatisfiedlink error - posted by "Udbhav Agrawal (JIRA)" <ji...@apache.org> on 2019/05/14 11:13:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27703) kubernetes jobs are failing with Unsatisfiedlink error - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/14 11:20:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27704) Change default class loader to ParallelGC - posted by "Mihaly Toth (JIRA)" <ji...@apache.org> on 2019/05/14 11:24:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27705) kubernetes integration test break on osx when test PVTestsSuite - posted by "Henry Yu (JIRA)" <ji...@apache.org> on 2019/05/14 12:12:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27705) kubernetes integration test break on osx when test PVTestsSuite - posted by "Henry Yu (JIRA)" <ji...@apache.org> on 2019/05/14 12:26:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-26278) V2 Streaming sources cannot be written to V1 sinks - posted by "Justin Polchlopek (JIRA)" <ji...@apache.org> on 2019/05/14 12:47:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27488) Driver interface to support GPU resources - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/14 13:02:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27638) date format yyyy-M-dd string comparison not handled properly - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/14 13:29:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27024) Executor interface for cluster managers to support GPU resources - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/14 13:49:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-25719) Search functionality in datatables in stages page should search over formatted data rather than the raw data - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/14 14:06:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-25719) Search functionality in datatables in stages page should search over formatted data rather than the raw data - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/14 14:06:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27680) Remove usage of Traversable - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/14 14:17:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-26812) PushProjectionThroughUnion nullability issue - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/14 14:31:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-26812) PushProjectionThroughUnion nullability issue - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/14 14:31:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27706) Add SQL metrics of numOutputRows for BroadcastExchangeExec - posted by "dzcxzl (JIRA)" <ji...@apache.org> on 2019/05/14 15:03:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27707) Performance issue using explode - posted by "Ohad Raviv (JIRA)" <ji...@apache.org> on 2019/05/14 15:05:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27706) Add SQL metrics of numOutputRows for BroadcastExchangeExec - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/14 15:08:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27707) Performance issue using explode - posted by "Ohad Raviv (JIRA)" <ji...@apache.org> on 2019/05/14 15:10:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27690) Remove materialized views first in `HiveClientImpl.reset` - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/14 16:08:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27708) Add documentation for v2 data sources - posted by "Ryan Blue (JIRA)" <ji...@apache.org> on 2019/05/14 17:07:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27709) AppStatusListener.cleanupExecutors should remove dead executors in an ordering that makes sense, not a random order - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/14 17:30:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27709) AppStatusListener.cleanupExecutors should remove dead executors in an ordering that makes sense, not a random order - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/14 17:31:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27710) ClassNotFoundException: $line196400984558.$read$ in OuterScopes - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/14 17:46:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27703) kubernetes jobs are failing with Unsatisfiedlink error - posted by "Udbhav Agrawal (JIRA)" <ji...@apache.org> on 2019/05/14 18:25:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27711) InputFileBlockHolder should be unset at the end of tasks - posted by "Jose Torres (JIRA)" <ji...@apache.org> on 2019/05/14 18:52:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27711) InputFileBlockHolder should be unset at the end of tasks - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/14 18:58:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-2387) Remove the stage barrier for better resource utilization - posted by "Hieu Tri Huynh (JIRA)" <ji...@apache.org> on 2019/05/14 20:18:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27712) createDataFrame() reorders row - posted by "Tim Ludwinski (JIRA)" <ji...@apache.org> on 2019/05/14 21:01:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-18484) case class datasets - ability to specify decimal precision and scale - posted by "Bill Schneider (JIRA)" <ji...@apache.org> on 2019/05/14 21:09:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27339) Decimal up cast to higher scale fails while reading parquet to Dataset - posted by "Bill Schneider (JIRA)" <ji...@apache.org> on 2019/05/14 21:12:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27708) Add documentation for v2 data sources - posted by "Jacek Laskowski (JIRA)" <ji...@apache.org> on 2019/05/14 22:43:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27708) Add documentation for v2 data sources - posted by "Jacek Laskowski (JIRA)" <ji...@apache.org> on 2019/05/14 22:46:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27036) Even Broadcast thread is timed out, BroadCast Job is not aborted. - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/14 22:49:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27713) Move RecordBinaryComparator and unsafe sorters from catalyst project to core - posted by "Xianyin Xin (JIRA)" <ji...@apache.org> on 2019/05/15 02:23:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27713) Move RecordBinaryComparator and unsafe sorters from catalyst project to core - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/15 02:31:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-27713) Move RecordBinaryComparator and unsafe sorters from catalyst project to core - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/15 02:31:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27714) Support Join Reorder based on Genetic algorithm when the # of joined tables > 12 - posted by "Xianyin Xin (JIRA)" <ji...@apache.org> on 2019/05/15 02:37:01 UTC, 0 replies.
- [jira] [Updated] (SPARK-27714) Support Join Reorder based on Genetic Algorithm when the # of joined tables > 12 - posted by "Xianyin Xin (JIRA)" <ji...@apache.org> on 2019/05/15 02:38:00 UTC, 2 replies.
- [jira] [Updated] (SPARK-27710) ClassNotFoundException: $line196400984558.$read$ in OuterScopes - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/15 02:51:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27712) createDataFrame() reorders row - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/15 03:14:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-15463) Support for creating a dataframe from CSV in Dataset[String] - posted by "Ruslan Dautkhanov (JIRA)" <ji...@apache.org> on 2019/05/15 05:15:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-24923) DataSourceV2: Add CTAS and RTAS logical operations - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/15 05:45:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24923) DataSourceV2: Add CTAS and RTAS logical operations - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/15 05:46:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27715) SQL query details in UI dose not show in correct format. - posted by "Genmao Yu (JIRA)" <ji...@apache.org> on 2019/05/15 06:06:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27715) SQL query details in UI dose not show in correct format. - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/15 06:10:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27716) Complete the transactions support for part of jdbc datasource operation - posted by "feiwang (JIRA)" <ji...@apache.org> on 2019/05/15 06:10:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27716) Complete the transactions support for part of jdbc datasource operations. - posted by "feiwang (JIRA)" <ji...@apache.org> on 2019/05/15 06:11:00 UTC, 3 replies.
- [jira] [Assigned] (SPARK-27716) Complete the transactions support for part of jdbc datasource operations. - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/15 06:13:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27717) support UNION in continuous processing - posted by "Genmao Yu (JIRA)" <ji...@apache.org> on 2019/05/15 06:30:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27717) support UNION in continuous processing - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/15 06:40:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27718) incorrect result from pagerank - posted by "De-En Lin (JIRA)" <ji...@apache.org> on 2019/05/15 06:52:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27718) incorrect result from pagerank - posted by "De-En Lin (JIRA)" <ji...@apache.org> on 2019/05/15 06:56:00 UTC, 3 replies.
- [jira] [Assigned] (SPARK-27718) incorrect result from pagerank - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/15 07:11:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27719) Set maxDisplayLogSize for spark history server - posted by "hao.li (JIRA)" <ji...@apache.org> on 2019/05/15 07:14:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27713) Move RecordBinaryComparator and unsafe sorters from catalyst project to core - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/15 07:28:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-24374) SPIP: Support Barrier Execution Mode in Apache Spark - posted by "Ruiguang Pei (JIRA)" <ji...@apache.org> on 2019/05/15 07:34:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27720) ConcurrentModificationException on closing DirectKafkaInputDStream - posted by "Vlad (JIRA)" <ji...@apache.org> on 2019/05/15 07:50:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27721) spark mvn test failed on aarch64 - posted by "huangtianhua (JIRA)" <ji...@apache.org> on 2019/05/15 08:17:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27721) spark mvn test failed on aarch64 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:20:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27721) spark mvn test failed on aarch64 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:20:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27714) Support Join Reorder based on Genetic Algorithm when the # of joined tables > 12 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:21:00 UTC, 4 replies.
- [jira] [Updated] (SPARK-27721) spark mvn test failed on aarch64 - posted by "huangtianhua (JIRA)" <ji...@apache.org> on 2019/05/15 08:21:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27721) spark ./build/mvn test failed on aarch64 - posted by "huangtianhua (JIRA)" <ji...@apache.org> on 2019/05/15 08:28:00 UTC, 1 replies.
- [jira] [Reopened] (SPARK-27721) spark ./build/mvn test failed on aarch64 - posted by "huangtianhua (JIRA)" <ji...@apache.org> on 2019/05/15 08:30:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27721) spark ./build/mvn test failed on aarch64 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:37:00 UTC, 3 replies.
- [jira] [Resolved] (SPARK-27721) spark ./build/mvn test failed on aarch64 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:37:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-26809) insert overwrite directory + concat function => error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:43:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-23739) Spark structured streaming long running problem - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:44:02 UTC, 0 replies.
- [jira] [Updated] (SPARK-23427) spark.sql.autoBroadcastJoinThreshold causing OOM exception in the driver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:44:02 UTC, 0 replies.
- [jira] [Updated] (SPARK-24400) Issue with spark while accessing managed table with partitions across multiple namespaces - HDFS Federation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:44:02 UTC, 0 replies.
- [jira] [Updated] (SPARK-25061) Spark SQL Thrift Server fails to not pick up hiveconf passing parameter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:44:02 UTC, 0 replies.
- [jira] [Updated] (SPARK-24437) Memory leak in UnsafeHashedRelation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:44:02 UTC, 0 replies.
- [jira] [Updated] (SPARK-26301) Consider switching from putting secret in environment variable directly to using secret reference - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:44:02 UTC, 1 replies.
- [jira] [Updated] (SPARK-22783) event log directory(spark-history) filled by large .inprogress files for spark streaming applications - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:44:02 UTC, 0 replies.
- [jira] [Updated] (SPARK-22714) Spark API Not responding when Fatal exception occurred in event loop - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:44:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-22118) Should prevent change epoch in success stage while there is some running stage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:45:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-18159) Stand-alone cluster, supervised app: restart of worker hosting the driver causes app to run twice - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:45:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-19280) Failed Recovery from checkpoint caused by the multi-threads issue in Spark Streaming scheduler - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:45:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-20693) Kafka+SSL: path for security related files needs to be different for driver and executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:45:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-17634) Spark job hangs when using dapply - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:46:01 UTC, 0 replies.
- [jira] [Updated] (SPARK-11115) Host verification is not correct for IPv6 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:46:02 UTC, 0 replies.
- [jira] [Updated] (SPARK-17468) Cluster workers crushed when master network bad more than one WORKER_TIMEOUT_MS! - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:46:02 UTC, 0 replies.
- [jira] [Updated] (SPARK-12826) Spark Workers do not attempt reconnect or exit on connection failure. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:46:02 UTC, 0 replies.
- [jira] [Updated] (SPARK-16996) Hive ACID delta files not seen - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:46:02 UTC, 0 replies.
- [jira] [Updated] (SPARK-16239) SQL issues with cast from date to string around daylight savings time - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:46:02 UTC, 0 replies.
- [jira] [Updated] (SPARK-17516) Current user info is not checked on STS in DML queries - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/15 08:46:02 UTC, 0 replies.
- [jira] [Created] (SPARK-27722) Remove UnsafeKeyValueSorter - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/15 08:51:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27722) Remove UnsafeKeyValueSorter - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/15 08:52:00 UTC, 2 replies.
- [jira] [Assigned] (SPARK-27549) Commit Kafka Source offsets to facilitate external tooling - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/15 09:22:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27723) Unable to pull the oracle table data using patitionColumn date/timeStamp - posted by "Shyama (JIRA)" <ji...@apache.org> on 2019/05/15 09:37:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-24374) SPIP: Support Barrier Execution Mode in Apache Spark - posted by "Ruiguang Pei (JIRA)" <ji...@apache.org> on 2019/05/15 09:40:01 UTC, 2 replies.
- [jira] [Updated] (SPARK-27720) ConcurrentModificationException on closing DirectKafkaInputDStream - posted by "ov7a (JIRA)" <ji...@apache.org> on 2019/05/15 11:02:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27712) createDataFrame() reorders row - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/15 11:19:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27723) Unable to pull the oracle table data using patitionColumn date/timeStamp - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/15 11:56:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-27723) Unable to pull the oracle table data using patitionColumn date/timeStamp - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/15 11:56:00 UTC, 8 replies.
- [jira] [Commented] (SPARK-27720) ConcurrentModificationException on closing DirectKafkaInputDStream - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/15 14:10:00 UTC, 3 replies.
- [jira] [Comment Edited] (SPARK-27720) ConcurrentModificationException on closing DirectKafkaInputDStream - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/15 14:13:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27682) Avoid use of Scala collection classes that are removed in 2.13 - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/15 14:30:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-19728) PythonUDF with multiple parents shouldn't be pushed down when used as a predicate - posted by "Krishna Prasanna Sistla (JIRA)" <ji...@apache.org> on 2019/05/15 15:00:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-24437) Memory leak in UnsafeHashedRelation - posted by "Eyal Farago (JIRA)" <ji...@apache.org> on 2019/05/15 15:15:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-15463) Support for creating a dataframe from CSV in Dataset[String] - posted by "Ruslan Dautkhanov (JIRA)" <ji...@apache.org> on 2019/05/15 16:01:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27631) Avoid repeating calculate table statistics - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/15 16:24:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27720) ConcurrentModificationException on operating with DirectKafkaInputDStream - posted by "ov7a (JIRA)" <ji...@apache.org> on 2019/05/15 16:44:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27678) Support Knox user impersonation in UI - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/15 16:59:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-24923) DataSourceV2: Add CTAS logical operation - posted by "Ryan Blue (JIRA)" <ji...@apache.org> on 2019/05/15 17:10:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27724) Add RTAS logical operation - posted by "Ryan Blue (JIRA)" <ji...@apache.org> on 2019/05/15 17:11:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27724) DataSourceV2: Add RTAS logical operation - posted by "Ryan Blue (JIRA)" <ji...@apache.org> on 2019/05/15 17:11:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27725) GPU Scheduling - add an example discovery Script - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/15 17:25:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27687) Kafka consumer cache parameter rename and documentation - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/15 17:43:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27488) Driver interface to support GPU resources - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/15 17:57:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27726) Performance of InMemoryStore suffers under load - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/15 18:13:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27726) Performance of InMemoryStore suffers under load - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/15 18:16:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27727) Asynchronous ElementStore cleanup should have only one pending cleanup per class - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/15 18:21:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27727) Asynchronous ElementStore cleanup should have only one pending cleanup per class - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/15 18:21:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27726) Performance of InMemoryStore suffers under load - posted by "Mark Hamstra (JIRA)" <ji...@apache.org> on 2019/05/15 18:22:00 UTC, 3 replies.
- [jira] [Created] (SPARK-27728) Address thread-safety of InMemoryStore and ElementTrackingStores. - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/15 18:25:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27728) Address thread-safety of InMemoryStore and ElementTrackingStores. - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/15 18:26:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27729) Extract deletion of the summaries from the stage deletion loop - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/15 18:34:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27729) Extract deletion of the summaries from the stage deletion loop - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/15 18:35:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27730) Add support for removeAllKeys - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/15 18:39:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27730) Add support for removeAllKeys - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/15 18:39:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27731) Cleanup some odd-looking typing choices and exception handling - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/15 18:49:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27731) Cleanup some non-compile time type checking and exception handling - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/15 18:50:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27726) Performance of InMemoryStore suffers under load - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/15 19:15:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27732) DataSourceV2: Add CreateTable logical operation - posted by "Ryan Blue (JIRA)" <ji...@apache.org> on 2019/05/15 20:00:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27733) Upgrade to Avro 1.9.x - posted by "Ismaƫl Mejƭa (JIRA)" <ji...@apache.org> on 2019/05/15 21:01:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27732) DataSourceV2: Add CreateTable logical operation - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/15 21:17:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-20774) BroadcastExchangeExec doesn't cancel the Spark job if broadcasting a relation timeouts. - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/15 21:48:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27036) Even Broadcast thread is timed out, BroadCast Job is not aborted. - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/15 21:49:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27354) Move incompatible code from the hive-thriftserver module to sql/hive-thriftserver/v1.2.1 - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/15 21:53:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27734) Add memory based thresholds for shuffle spill - posted by "Adrian Muraru (JIRA)" <ji...@apache.org> on 2019/05/15 22:00:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27734) Add memory based thresholds for shuffle spill - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/15 22:10:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27735) Interval string in upper case is not supported in Trigger - posted by "Shixiong Zhu (JIRA)" <ji...@apache.org> on 2019/05/15 22:38:01 UTC, 0 replies.
- [jira] [Created] (SPARK-27736) Improve handling of FetchFailures caused by ExternalShuffleService losing track of executor registrations - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/15 22:45:01 UTC, 0 replies.
- [jira] [Updated] (SPARK-27736) Improve handling of FetchFailures caused by ExternalShuffleService losing track of executor registrations - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/15 22:47:00 UTC, 3 replies.
- [jira] [Resolved] (SPARK-27674) the hint should not be dropped after cache lookup - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/15 22:49:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27737) Upgrade to 2.3.5 for Hive Metastore Client 2.3 - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/15 22:51:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27735) Interval string in upper case is not supported in Trigger - posted by "Shixiong Zhu (JIRA)" <ji...@apache.org> on 2019/05/15 22:56:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27735) Interval string in upper case is not supported in Trigger - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/15 22:57:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27737) Upgrade to 2.3.5 for Hive Metastore Client 2.3 - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/15 22:58:00 UTC, 3 replies.
- [jira] [Created] (SPARK-27738) Upgrade the built-in Hive to 2.3.5 for hadoop-3.2 - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/15 23:25:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-26192) MesosClusterScheduler reads options from dispatcher conf instead of submission conf - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/15 23:43:00 UTC, 2 replies.
- [jira] [Assigned] (SPARK-27738) Upgrade the built-in Hive to 2.3.5 for hadoop-3.2 - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/15 23:49:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27722) Remove UnsafeKeyValueSorter - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/16 00:30:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27739) CacheManager.cacheQuery should copy stats from optimized plan - posted by "John Zhuge (JIRA)" <ji...@apache.org> on 2019/05/16 00:56:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27739) Persist should use stats from optimized plan - posted by "John Zhuge (JIRA)" <ji...@apache.org> on 2019/05/16 01:16:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27739) df.persist should save stats from optimized plan - posted by "John Zhuge (JIRA)" <ji...@apache.org> on 2019/05/16 01:29:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27739) df.persist should save stats from optimized plan - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/16 01:35:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27740) JIRA status test - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/16 01:35:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27740) JIRA status test - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/16 01:36:00 UTC, 2 replies.
- [jira] [Reopened] (SPARK-27740) JIRA status test - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/16 01:36:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-22128) Update paranamer to 2.8 to avoid BytecodeReadingParanamer ArrayIndexOutOfBoundsException with Scala 2.12 + Java 8 lambda - posted by "Michael Heuer (JIRA)" <ji...@apache.org> on 2019/05/16 02:45:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27718) incorrect result from pagerank - posted by "De-En Lin (JIRA)" <ji...@apache.org> on 2019/05/16 02:54:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-27718) incorrect result from pagerank - posted by "De-En Lin (JIRA)" <ji...@apache.org> on 2019/05/16 02:56:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27741) Transitivity on predicate pushdown - posted by "U Shaw (JIRA)" <ji...@apache.org> on 2019/05/16 07:41:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27741) Transitivity on predicate pushdown - posted by "U Shaw (JIRA)" <ji...@apache.org> on 2019/05/16 07:42:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27741) Transitivity on predicate pushdown - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/16 08:10:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27733) Upgrade to Avro 1.9.x - posted by "Ismaƫl Mejƭa (JIRA)" <ji...@apache.org> on 2019/05/16 10:13:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27741) Transitivity on predicate pushdown - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/16 10:18:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27733) Upgrade to Avro 1.9.x - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/16 10:21:00 UTC, 4 replies.
- [jira] [Created] (SPARK-27742) Security Support in Sources and Sinks for SS and batch - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/16 10:38:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27742) Security Support in Sources and Sinks for SS and Batch - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/16 10:46:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27722) Remove UnsafeKeyValueSorter - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/16 12:09:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27743) alter table: bucketing - posted by "xzh_dz (JIRA)" <ji...@apache.org> on 2019/05/16 12:13:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27744) SubqueryExec thread pool does not preserve thread local properties - posted by "Onur Satici (JIRA)" <ji...@apache.org> on 2019/05/16 12:28:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27743) alter table: bucketing - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/16 12:29:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27744) SubqueryExec thread pool does not preserve thread local properties - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/16 12:37:01 UTC, 1 replies.
- [jira] [Commented] (SPARK-27744) SubqueryExec thread pool does not preserve thread local properties - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/16 12:37:01 UTC, 0 replies.
- [jira] [Commented] (SPARK-27334) Support specify scheduler name for executor pods when submit - posted by "Alexander Fedosov (JIRA)" <ji...@apache.org> on 2019/05/16 13:19:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27720) ConcurrentModificationException on operating with DirectKafkaInputDStream - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/16 13:19:00 UTC, 4 replies.
- [jira] [Commented] (SPARK-27742) Security Support in Sources and Sinks for SS and Batch - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/16 13:25:00 UTC, 4 replies.
- [jira] [Commented] (SPARK-18107) Insert overwrite statement runs much slower in spark-sql than it does in hive-client - posted by "KaiXu (JIRA)" <ji...@apache.org> on 2019/05/16 13:44:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27376) Design: YARN supports Spark GPU-aware scheduling - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/16 14:25:00 UTC, 2 replies.
- [jira] [Assigned] (SPARK-27376) Design: YARN supports Spark GPU-aware scheduling - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/16 14:27:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27377) Upgrade YARN to 3.1.2+ to support GPU - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/16 14:28:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27377) Upgrade YARN to 3.1.2+ to support GPU - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/16 14:29:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27379) YARN passes GPU info to Spark executor - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/16 14:30:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27379) YARN passes GPU info to Spark executor - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/16 14:30:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27378) spark-submit requests GPUs in YARN mode - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/16 14:31:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27745) build/mvn take wrong scala version when compile for scala 2.12 - posted by "Izek Greenfield (JIRA)" <ji...@apache.org> on 2019/05/16 14:32:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27746) add a logical plan link in the physical plan - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/16 14:37:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27747) add a logical plan link in the physical plan - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/16 14:37:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27747) add a logical plan link in the physical plan - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/16 14:46:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27748) Kafka consumer/producer password/token redaction - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/16 14:51:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27749) Fix hadoop-3.2 hive-thriftserver module test issue - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/16 14:57:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27748) Kafka consumer/producer password/token redaction - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/16 15:06:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27750) Standalone scheduler - ability to prioritize applications over drivers, many drivers act like Denial of Service - posted by "t oo (JIRA)" <ji...@apache.org> on 2019/05/16 15:59:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27373) Design: Kubernetes support for GPU-aware scheduling - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/16 16:05:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27373) Design: Kubernetes support for GPU-aware scheduling - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/16 16:20:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-27373) Design: Kubernetes support for GPU-aware scheduling - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/16 16:22:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27745) build/mvn take wrong scala version when compile for scala 2.12 - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/16 16:25:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27749) Fix hadoop-3.2 hive-thriftserver module test issue - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/16 16:32:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27751) buildReader is now protected - posted by "Geet Kumar (JIRA)" <ji...@apache.org> on 2019/05/16 16:57:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27752) Updata lz4-java from 1.5.2 to 1.6.0 - posted by "Kazuaki Ishizaki (JIRA)" <ji...@apache.org> on 2019/05/16 17:03:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27752) Updata lz4-java from 1.5.1 to 1.6.0 - posted by "Kazuaki Ishizaki (JIRA)" <ji...@apache.org> on 2019/05/16 17:05:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27753) Support SQL expressions for interval parameter in Structured Streaming - posted by "Shixiong Zhu (JIRA)" <ji...@apache.org> on 2019/05/16 18:02:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27736) Improve handling of FetchFailures caused by ExternalShuffleService losing track of executor registrations - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/16 18:07:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27752) Updata lz4-java from 1.5.1 to 1.6.0 - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/16 19:13:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27754) Introduce spark on k8s config for driver request cores - posted by "Arun Mahadevan (JIRA)" <ji...@apache.org> on 2019/05/16 20:34:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27754) Introduce spark on k8s config for driver request cores - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/16 20:39:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27735) Interval string in upper case is not supported in Trigger - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/16 21:25:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27576) table capabilty to skip the output column resolution - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/16 23:26:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-27733) Upgrade to Avro 1.9.x - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/16 23:35:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27751) buildReader is now protected - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/17 00:03:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27751) buildReader is now protected - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/17 00:03:00 UTC, 4 replies.
- [jira] [Resolved] (SPARK-27718) incorrect result from pagerank - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/17 02:28:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27752) Updata lz4-java from 1.5.1 to 1.6.0 - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/17 03:47:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27634) deleteCheckpointOnStop should be configurable - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/17 04:08:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27755) Update zstd-jni to 1.4.0-1 - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/17 05:58:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27755) Update zstd-jni to 1.4.0-1 - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/17 06:03:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27756) Add a shape property to DataFrame in pyspark - posted by "Louis Yang (JIRA)" <ji...@apache.org> on 2019/05/17 06:05:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-16060) Vectorized ORC reader - posted by "Atif Sharif (JIRA)" <ji...@apache.org> on 2019/05/17 06:47:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-26356) Remove SaveMode from data source v2 API - posted by "Atif Sharif (JIRA)" <ji...@apache.org> on 2019/05/17 06:50:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27757) Bump Jackson to 2.9.9 - posted by "Fokko Driesprong (JIRA)" <ji...@apache.org> on 2019/05/17 09:01:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27758) Features won't generate after 1M rows - posted by "Rakesh Partapsing (JIRA)" <ji...@apache.org> on 2019/05/17 09:02:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27757) Bump Jackson to 2.9.9 - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/17 09:06:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27699) Partially push down disjunctive predicated in Parquet/ORC - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/17 11:27:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27530) FetchFailedException: Received a zero-size buffer for block shuffle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/17 12:05:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27546) Should repalce DateTimeUtils#defaultTimeZoneuse with sessionLocalTimeZone - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/17 12:33:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27450) Timestamp cast fails when the ISO8601 string omits minutes, seconds or milliseconds - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/17 12:51:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27759) Do not auto cast array to np.array in vectorized udf - posted by "colin fang (JIRA)" <ji...@apache.org> on 2019/05/17 12:52:00 UTC, 0 replies.
- [jira] [Reopened] (SPARK-27450) Timestamp cast fails when the ISO8601 string omits minutes, seconds or milliseconds - posted by "Leandro Rosa (JIRA)" <ji...@apache.org> on 2019/05/17 15:17:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27760) Spark resources - user configs change .count to be .amount, and yarn configs should match - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/17 15:41:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27361) YARN support for GPU-aware scheduling - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/17 16:01:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27679) Improve queries with LIKE expression - posted by "Achuth Narayan Rajagopal (JIRA)" <ji...@apache.org> on 2019/05/17 17:22:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27679) Improve queries with LIKE expression - posted by "Achuth Narayan Rajagopal (JIRA)" <ji...@apache.org> on 2019/05/17 17:23:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27463) SPIP: Support Dataframe Cogroup via Pandas UDFs - posted by "Bryan Cutler (JIRA)" <ji...@apache.org> on 2019/05/17 17:29:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27761) Make UDF nondeterministic by default(?) - posted by "Sunitha Kambhampati (JIRA)" <ji...@apache.org> on 2019/05/17 18:07:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27761) Make UDF nondeterministic by default(?) - posted by "Sunitha Kambhampati (JIRA)" <ji...@apache.org> on 2019/05/17 18:13:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27552) The configuration `hive.exec.stagingdir` is invalid on Windows OS - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/17 19:01:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27552) The configuration `hive.exec.stagingdir` is invalid on Windows OS - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/17 19:01:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27762) Support user provided avro schema for writing fields with different ordering - posted by "DB Tsai (JIRA)" <ji...@apache.org> on 2019/05/17 19:58:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27762) Support user provided avro schema for writing fields with different ordering - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/17 20:02:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27712) createDataFrame() reorders row - posted by "Bryan Cutler (JIRA)" <ji...@apache.org> on 2019/05/17 21:47:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27755) Update zstd-jni to 1.4.0-1 - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/18 03:05:01 UTC, 0 replies.
- [jira] [Commented] (SPARK-27024) Executor interface for cluster managers to support GPU resources - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/18 03:10:01 UTC, 0 replies.
- [jira] [Updated] (SPARK-27024) Executor interface for cluster managers to support GPU resources - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/18 03:10:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27673) Add since info to random. regex, null expressions - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/18 03:12:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27672) Add since info to string expressions - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/18 03:13:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-27207) There exists a bug with SortBasedAggregator where merge()/update() operations get invoked on the aggregate buffer without calling initialize - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/18 03:16:02 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27510) Master fall into dead loop while launching executor failed in Worker - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/18 03:19:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27207) There exists a bug with SortBasedAggregator where merge()/update() operations get invoked on the aggregate buffer without calling initialize - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/18 03:20:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27684) Reduce ScalaUDF conversion overheads for primitives - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/18 14:01:00 UTC, 2 replies.
- [jira] [Assigned] (SPARK-27707) Performance issue using explode - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/18 15:54:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27746) add a logical plan link in the physical plan - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/18 18:12:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-27746) add a logical plan link in the physical plan - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/18 18:12:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-27712) createDataFrame() reorders row - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/18 18:15:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27463) SPIP: Support Dataframe Cogroup via Pandas UDFs - posted by "Chris Martin (JIRA)" <ji...@apache.org> on 2019/05/18 19:29:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27463) Support Dataframe Cogroup via Pandas UDFs - posted by "Chris Martin (JIRA)" <ji...@apache.org> on 2019/05/18 19:30:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27705) kubernetes integration test break on osx when test PVTestsSuite - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/18 20:45:01 UTC, 2 replies.
- [jira] [Created] (SPARK-27763) Port test cases from PostgreSQL to Spark SQL - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 00:26:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27764) Feature Parity between PostgreSQL and Spark - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 00:35:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27763) Port test cases from PostgreSQL to Spark SQL - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 00:42:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27765) Type Casts: expression::type - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 02:02:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27765) Type Casts: expression::type - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 02:03:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27765) Type Casts: expression::type - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 02:03:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-27765) Type Casts: expression::type - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 02:04:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27766) Data type: POINT(x, y) - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 03:09:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-27766) Data type: POINT(x, y) - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 03:10:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27766) Data type: POINT(x, y) - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 03:10:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27766) Data type: POINT(x, y) - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 03:10:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27767) Built-in function: generate_series - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 03:26:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27767) Built-in function: generate_series - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 03:27:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27767) Built-in function: generate_series - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 03:28:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-27767) Built-in function: generate_series - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 03:28:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27759) Do not auto cast array to np.array in vectorized udf - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/19 03:56:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27756) Add a shape property to DataFrame in pyspark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/19 03:58:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27756) Add a shape property to DataFrame in pyspark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/19 03:58:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27768) Incorrect statistical aggregate results when the inputs have constants infinity - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 04:00:01 UTC, 0 replies.
- [jira] [Updated] (SPARK-27768) Incorrect statistical aggregate results when the inputs have constants infinity - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 04:01:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27769) Handling of sublinks within outer-level aggregates. - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 04:25:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27754) Introduce spark on k8s config for driver request cores - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/19 04:30:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27754) Introduce spark on k8s config for driver request cores - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/19 04:30:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27770) Add aggregates.sql - Part1 - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 04:36:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27770) Add aggregates.sql - Part1 - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/19 04:45:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-27768) Incorrect statistical aggregate results when the inputs have constants infinity - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/19 06:16:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-21444) Fetch failure due to node reboot causes job failure - posted by "Igor Berman (JIRA)" <ji...@apache.org> on 2019/05/19 08:49:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27758) Features won't generate after 1M rows - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/19 09:09:00 UTC, 3 replies.
- [jira] [Resolved] (SPARK-27613) Caching an RDD composed of Row Objects produces some kind of key recombination - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/19 10:07:01 UTC, 0 replies.
- [jira] [Created] (SPARK-27771) Add documentation for grouping functions (cube, rollup, grouping and grouping_id) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/19 11:28:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27771) Add documentation for grouping functions (cube, rollup, grouping and grouping_id) - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/19 11:36:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27749) hadoop-3.2 support hive-thriftserver - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/19 14:00:03 UTC, 0 replies.
- [jira] [Assigned] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/19 15:41:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27772) SQLTestUtils Refactoring - posted by "William Wong (JIRA)" <ji...@apache.org> on 2019/05/19 15:49:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27772) SQLTestUtils Refactoring - posted by "William Wong (JIRA)" <ji...@apache.org> on 2019/05/19 15:52:00 UTC, 9 replies.
- [jira] [Comment Edited] (SPARK-26841) Timestamp pushdown on Kafka table - posted by "Richard Yu (JIRA)" <ji...@apache.org> on 2019/05/19 17:54:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-26841) Timestamp pushdown on Kafka table - posted by "Richard Yu (JIRA)" <ji...@apache.org> on 2019/05/19 17:54:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27768) Infinity, -Infinity, NaN should be recognized in a case insensitive manner - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/19 22:17:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27771) Add SQL description for grouping functions (cube, rollup, grouping and grouping_id) - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/20 02:29:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27771) Add SQL description for grouping functions (cube, rollup, grouping and grouping_id) - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/20 02:29:00 UTC, 0 replies.
- [jira] [Reopened] (SPARK-27402) Fix hadoop-3.2 test issue(except the hive-thriftserver module) - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/20 02:33:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27402) Fix hadoop-3.2 test issue(except the hive-thriftserver module) - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/20 02:38:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27768) Infinity, -Infinity, NaN should be recognized in a case insensitive manner - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/20 02:53:01 UTC, 9 replies.
- [jira] [Commented] (SPARK-27772) SQLTestUtils Refactoring - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/20 02:54:00 UTC, 5 replies.
- [jira] [Comment Edited] (SPARK-27768) Infinity, -Infinity, NaN should be recognized in a case insensitive manner - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/20 02:54:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27758) Features won't generate after 1M rows - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/20 03:29:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27716) Complete the transactions support for part of jdbc datasource operations. - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/20 04:38:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27694) Support auto-updating table statistics for data source CTAS command - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/20 05:27:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27694) Support auto-updating table statistics for data source CTAS command - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/20 05:31:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27693) DataSourceV2: Add default catalog property - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/20 05:32:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27702) Allow using some alternatives for service accounts - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/20 05:39:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27702) Allow using some alternatives for service accounts - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/20 05:42:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27773) Add shuffle service metric for number of exceptions caught in TransportChannelHandler - posted by "Steven Rand (JIRA)" <ji...@apache.org> on 2019/05/20 05:51:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27773) Add shuffle service metric for number of exceptions caught in TransportChannelHandler - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/20 06:17:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27774) Avoid hardcoded configs - posted by "wenxuanguan (JIRA)" <ji...@apache.org> on 2019/05/20 06:19:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27774) Avoid hardcoded configs - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/20 06:25:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27774) Avoid hardcoded configs - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/20 06:25:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27775) Support multiple return values for udf - posted by "Xianjin YE (JIRA)" <ji...@apache.org> on 2019/05/20 06:35:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27322) Select from multiple catalogs - posted by "John Zhuge (JIRA)" <ji...@apache.org> on 2019/05/20 06:50:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27757) Bump Jackson to 2.9.9 - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/20 08:22:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27776) Avoid duplicate Java reflection in DataSource - posted by "jiaan.geng (JIRA)" <ji...@apache.org> on 2019/05/20 09:02:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27776) Avoid duplicate Java reflection in DataSource - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/20 09:25:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27777) Eliminate uncessary sliding job in AreaUnderCurve - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/20 09:34:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27777) Eliminate uncessary sliding job in AreaUnderCurve - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/20 09:43:00 UTC, 2 replies.
- [jira] [Updated] (SPARK-19297) Add ability for --packages tag to pull latest version - posted by "Alexander Fedosov (JIRA)" <ji...@apache.org> on 2019/05/20 10:42:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-19297) Add ability for --packages tag to pull latest version - posted by "Alexander Fedosov (JIRA)" <ji...@apache.org> on 2019/05/20 10:42:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-19039) UDF ClosureCleaner bug when UDF, col applied in paste mode in REPL - posted by "Phillip Henry (JIRA)" <ji...@apache.org> on 2019/05/20 11:05:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-15348) Hive ACID - posted by "Jefferson Colares (JIRA)" <ji...@apache.org> on 2019/05/20 13:48:00 UTC, 6 replies.
- [jira] [Created] (SPARK-27778) toPandas with arrow enabled fails for DF with no partition - posted by "David Vogelbacher (JIRA)" <ji...@apache.org> on 2019/05/20 13:58:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27778) toPandas with arrow enabled fails for DF with no partitions - posted by "David Vogelbacher (JIRA)" <ji...@apache.org> on 2019/05/20 13:59:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27778) toPandas with arrow enabled fails for DF with no partitions - posted by "David Vogelbacher (JIRA)" <ji...@apache.org> on 2019/05/20 13:59:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27720) ConcurrentModificationException on operating with DirectKafkaInputDStream - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/20 14:12:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27778) toPandas with arrow enabled fails for DF with no partitions - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/20 14:16:00 UTC, 2 replies.
- [jira] [Assigned] (SPARK-27420) KinesisInputDStream should expose a way to disable CloudWatch metrics - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/20 15:06:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27420) KinesisInputDStream should expose a way to configure CloudWatch metrics - posted by "Kengo Seki (JIRA)" <ji...@apache.org> on 2019/05/20 15:18:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27420) KinesisInputDStream should expose a way to configure CloudWatch metrics - posted by "Kengo Seki (JIRA)" <ji...@apache.org> on 2019/05/20 15:20:01 UTC, 0 replies.
- [jira] [Created] (SPARK-27779) Regression when explode on map in Generate - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/20 15:37:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27779) Regression when explode on map in Generate - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/20 15:43:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-27725) GPU Scheduling - add an example discovery Script - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/20 16:00:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27374) Fetch assigned resources from TaskContext - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/20 16:01:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27780) Shuffle server & client should be versioned to enable smoother upgrade - posted by "Imran Rashid (JIRA)" <ji...@apache.org> on 2019/05/20 16:20:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27781) Tried to access method org.apache.avro.specific.SpecificData.()V - posted by "Michael Heuer (JIRA)" <ji...@apache.org> on 2019/05/20 18:28:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27782) Use '#' to mark expression id embedded in the subquery name in the SubqueryExec operator. - posted by "Dilip Biswal (JIRA)" <ji...@apache.org> on 2019/05/20 18:40:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27782) Use '#' to mark expression id embedded in the subquery name in the SubqueryExec operator. - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/20 18:50:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27781) Tried to access method org.apache.avro.specific.SpecificData.()V - posted by "Michael Heuer (JIRA)" <ji...@apache.org> on 2019/05/20 19:30:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-25762) Upgrade guava version in spark dependency lists due to CVE issue - posted by "Arun Mahadevan (JIRA)" <ji...@apache.org> on 2019/05/20 19:56:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-25728) SPIP: Structured Intermediate Representation (Tungsten IR) for generating Java code - posted by "Chen Li (JIRA)" <ji...@apache.org> on 2019/05/20 20:16:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-21492) Memory leak in SortMergeJoin - posted by "Xingbo Jiang (JIRA)" <ji...@apache.org> on 2019/05/20 21:25:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-25783) Spark shell fails because of jline incompatibility - posted by "koert kuipers (JIRA)" <ji...@apache.org> on 2019/05/20 22:02:00 UTC, 0 replies.
- [jira] [Reopened] (SPARK-27439) Explainging Dataset should show correct resolved plans - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/20 22:11:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27439) Explainging Dataset should show correct resolved plans - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/20 22:14:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27783) Add customizable hint error handler - posted by "Maryann Xue (JIRA)" <ji...@apache.org> on 2019/05/20 22:50:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27783) Add customizable hint error handler - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/20 22:59:00 UTC, 1 replies.
- [jira] [Closed] (SPARK-22996) update R to pass newest version of lintr checks - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/20 23:15:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22996) update R to pass newest version of lintr checks - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/20 23:15:01 UTC, 0 replies.
- [jira] [Assigned] (SPARK-22766) Install R linter package in spark lib directory - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/20 23:17:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22766) Install R linter package in spark lib directory - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/20 23:17:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-22766) Install R linter package in spark lib directory - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/20 23:17:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-19612) Tests failing with timeout - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/20 23:28:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19612) Tests failing with timeout - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/20 23:28:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-19612) Tests failing with timeout - posted by "shane knapp (JIRA)" <ji...@apache.org> on 2019/05/20 23:28:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27784) Alias ID reuse can break correctness when substituting foldable expressions - posted by "Ryan Blue (JIRA)" <ji...@apache.org> on 2019/05/20 23:37:01 UTC, 0 replies.
- [jira] [Updated] (SPARK-27784) Alias ID reuse can break correctness when substituting foldable expressions - posted by "Ryan Blue (JIRA)" <ji...@apache.org> on 2019/05/20 23:53:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27784) Alias ID reuse can break correctness when substituting foldable expressions - posted by "Ryan Blue (JIRA)" <ji...@apache.org> on 2019/05/21 00:11:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27785) Introduce .joinWith() overload for inner join of 3 or more tables - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/21 00:40:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27785) Introduce .joinWith() overloads for typed inner joins of 3 or more tables - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/21 00:41:00 UTC, 2 replies.
- [jira] [Updated] (SPARK-27774) Avoid hardcoded configs - posted by "wenxuanguan (JIRA)" <ji...@apache.org> on 2019/05/21 01:06:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-26596) sparksql "insert overwrite local directory" does not write back to driver node - posted by "ant_nebula (JIRA)" <ji...@apache.org> on 2019/05/21 01:38:01 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-27751) buildReader is now protected - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 02:31:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27439) Explainging Dataset should show correct resolved plans - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/21 03:56:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-19273) Stage is not retay when shuffle file is lost - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-20315) Set ScalaUDF's deterministic to true - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-9579) Improve Word2Vec unit tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-21425) LongAccumulator, DoubleAccumulator not threadsafe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-19322) Allow customizable column name in ScalaReflection.schemaFor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-7211) Improvements for FPGrowth - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-22474) cannot read a parquet file containing a Seq[Map[MyCaseClass, String]] - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-22411) Heuristic to combine splits in DataSourceScanExec isn't accurate when dynamic allocation is enabled - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-20013) merge renameTable to alterTable in ExternalCatalog - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-20178) Improve Scheduler fetch failures - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-22351) Support user-created custom Encoders for Datasets - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-22553) Drop FROM in nonReserved - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-10912) Improve Spark metrics executor.filesystem - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-19705) Preferred location supporting HDFS Cache for FileScanRDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-17048) ML model read for custom transformers in a pipeline does not work - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-6883) Fork pyspark's cloudpickle as a separate dependency - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-19225) Spark SQL round constant double return null - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-22277) Chi Square selector garbling Vector content. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-19390) Replace the unnecessary usages of hiveQlTable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-21086) CrossValidator, TrainValidationSplit should preserve all models after fitting - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-14887) Generated SpecificUnsafeProjection Exceeds JVM Code Size Limits - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-21349) Make TASK_SIZE_TO_WARN_KB configurable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-13573) Open SparkR APIs (R package) to allow better 3rd party usage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-17885) Spark Streaming deletes checkpointed RDD then tries to load it after restart - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-14236) UDAF does not use incomingSchema for update Method - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-22073) Add workaround for HIVE-15653 to avoid stats being accidentally wiped - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-22350) Select grouping__id from subquery - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-23810) Matrix Multiplication is so bad, file I/O to local python is better - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-14864) [MLLIB] Implement Doc2Vec - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-18725) Creating a datasource table with schema should not scan all files for table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-23086) Spark SQL cannot support high concurrency for lock in HiveMetastoreCatalog - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-21005) VectorIndexerModel does not prepare output column field correctly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-21874) Support changing database when rename table. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-19001) Worker will submit multiply CleanWorkDir and SendHeartbeat task with each RegisterWorker response - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-17154) Wrong result can be returned or AnalysisException can be thrown after self-join or similar operations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-20856) support statement using nested joins - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-21905) ClassCastException when call sqlContext.sql on temp table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-22213) Spark to detect slow executors on nodes with problematic hardware - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-13619) Jobs page UI shows wrong number of failed tasks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-21612) Allow unicode strings in __getitem__ of StructType - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-22923) Non-equi join(theta join) should use sort merge join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-12061) Persist for Map/filter with Lambda Functions don't always read from Cache - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-20734) Structured Streaming spark.sql.streaming.schemaInference not handling schema changes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-20846) Incorrect posgres sql array column schema inferred from table. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-22531) Migrate the implementation of IDF from MLLib to ML - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-5206) Accumulators are not re-registered during recovering from checkpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-17423) Support IGNORE NULLS option in Window functions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-23346) Failed tasks reported as success if the failure reason is not ExceptionFailure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-18736) CreateMap allows non-unique keys - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-20207) Add ablity to exclude current row in WindowSpec - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-22859) Permission of created table and database folder are not correct - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-21226) Save empty dataframe in pyspark prints nothing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-22104) Add new option to dataframe -> parquet ==> custom extension to file name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-20479) Performance degradation for large number of hash-aggregated columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-14649) DagScheduler re-starts all running tasks on fetch failure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-13225) [SQL] Support Intersect All/Distinct - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-4836) Web UI should display separate information for all stage attempts - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-19578) Poor pyspark performance - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-19783) Treat shorter/longer lengths of tokens as malformed records in CSV parser - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-11502) Word2VecSuite needs appropriate checks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-20810) ML LinearSVC vs MLlib SVMWithSGD output different solution - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-20248) Spark SQL add limit parameter to enhance the reliability. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-19929) SparkSQL can't show the hive Managed table's LOCATION property when using the command of 'show create table...' - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-19978) spark thrift server to switch to normative hadoop 2.2+ service lifecycle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-11520) RegressionMetrics should support instance weights - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-22685) Spark Streaming using Kinesis doesn't work if shard checkpoints exist in DynamoDB - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-19667) Create table with HiveEnabled in default database use warehouse path instead of the location of default database - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-23297) Spark job is finished but the stage process is error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-19111) S3 Mesos history upload fails silently if too large - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-23172) Expand the ReorderJoin rule to handle Project nodes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-22048) Show id, runId, batch in Description column in SQL tab for streaming queries (as in Jobs) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-18069) Many examples in Python docstrings are incomplete - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-22567) spark.mesos.executor.memoryOverhead equivalent for the Driver when running on Mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-21195) MetricSystem should pick up dynamically registered metrics in sources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-19615) Provide Dataset union convenience for divergent schema - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-22382) Spark on mesos: doesn't support public IP setup for agent and master. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-19588) Allow putting keytab file to HDFS location specified in spark.yarn.keytab - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-20122) Analyzer reports "unresolved operator 'Aggregate" for nonexistent window specifications - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-19119) Approximate percentile support for frequency distribution table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-21290) R document Programmatically Specifying the Schema in SQL guide - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-18727) Support schema evolution as new files are inserted into table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-22246) UnsafeRow, UnsafeArrayData, and UnsafeMapData use MemoryBlock - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-19767) API Doc pages for Streaming with Kafka 0.10 not current - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-6962) Netty BlockTransferService hangs in the middle of SQL query - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-18743) StreamingContext.textFileStream(directory) has no events shown in Web UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-19668) Multiple NGram sizes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-10473) EventLog will loss message in the long-running security application - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-21754) No Exception/Warn When Join Columns are Differing Types - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-14771) Python ML Param and UID issues - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-22691) Custom HttpFileSystem, issue with question-marks in path - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-20113) overwrite mode appends data on MySQL table that does not have a primary key - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-20485) Split ALS.scala into multiple files - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-22340) pyspark setJobGroup doesn't match java threads - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-10078) Vector-free L-BFGS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-22555) Possibly incorrect scaling of L2 regularization strength in LinearRegression - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-19962) add DictVectorizor for DataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-22270) Renaming DF column breaks sparkPlan.outputOrdering - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-22180) Allow IPv6 address in org.apache.spark.util.Utils.parseHostPort - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-14046) RandomForest improvement umbrella - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-12347) Write script to run all MLlib examples for testing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-22241) Apache spark giving InvalidSchemaException: Cannot write a schema with an empty group: optional group element { - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-18977) Heavy udf is not stopped by cancelJobGroup - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-17419) Mesos virtual network support - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-22611) Spark Kinesis ProvisionedThroughputExceededException leads to dropped records - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-18272) Test topic addition for subscribePattern on Kafka DStream and Structured Stream - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-21154) ParseException when Create View from another View in Spark SQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-21037) ignoreNulls does not working properly with window functions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-24203) Make executor's bindAddress configurable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-16087) Spark Hangs When Using Union With Persisted Hadoop RDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-20755) UDF registration should throw exception if UDF not found on classpath - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-20851) Drop spark table failed if a column name is a numeric string - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-19541) High Availability support for ThriftServer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-20382) fileSystem.closed when run load data on spark beeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-19979) [MLLIB] Multiple Estimators/Pipelines In CrossValidator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-14561) History Server does not see new logs in S3 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-23420) Datasource loading not handling paths with regex chars. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-16636) Missing documentation for CalendarIntervalType type in sql-programming-guide.md - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-21423) MODE average aggregate function. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-19090) Dynamic Resource Allocation not respecting spark.executor.cores - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-20828) Concatenated grouping sets scenario not supported - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-6527) sc.binaryFiles can not access files on s3 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-12191) Support "." character in DataFrame column name for ML - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-10294) When Parquet writer's close method throws an exception, we will call close again and trigger a NPE - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-23439) Ambiguous reference when selecting column inside StructType with same name that outer colum - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-20525) ClassCast exception when interpreting UDFs from a String in spark-shell - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-20332) Avro/Parquet GenericFixed decimal is not read into Spark correctly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-18023) Adam optimizer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-21526) Add support to ML LogisticRegression for setting initial model - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-22987) UnsafeExternalSorter cases OOM when invoking `getIterator` function. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-22096) use aggregateByKeyLocally to save one stage in calculating ItemFrequency in NaiveBayes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-21507) Exception when using spark.jars.packages - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-8544) PMML export for Gradient Boosted Trees - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-22314) Accessing Hive UDFs defined without 'USING JAR' from Spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-19230) View creation in Derby gets SQLDataException because definition gets very big - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-22504) Optimization in overwrite table in case of failure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-11075) Spark SQL Thrift Server authentication issue on kerberized yarn cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-23502) Support async init of spark context during spark-shell startup - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-10149) Locality Level is ANY on "Details for Stage" WebUI page - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-21882) OutputMetrics doesn't count written bytes correctly in the saveAsHadoopDataset function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-790) Implement the reregistered() callback in MesosScheduler to support master failover - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-20422) Worker registration retries should be configurable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-21916) Set isolationOn=true when create client to remote hive metastore - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-21518) Warnings if spark.mesos.task.labels is unset - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-6442) MLlib Local Linear Algebra Package - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-19908) Direct buffer memory OOM should not cause stage retries. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-22639) no rowcount estimation returned if groupby clause involves substring - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-20188) Catalog recoverPartitions should allow specifying the database name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-21841) Spark SQL doesn't pick up column added in hive when table created with saveAsTable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-17519) [MESOS] Enhance robustness when ExternalShuffleService is broken - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-21930) When the number of attempting to restart receiver greater than 0,spark do nothing in 'else' - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-22441) JDBC REAL type is mapped to Double instead of Float - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-20240) SparkSQL support limitations of max dynamic partitions when inserting hive table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-22966) Spark SQL should handle Python UDFs that return a datetime.date or datetime.datetime - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-21476) RandomForest classification model not using broadcast in transform - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-21484) Wrong query plans of Dataset after persist/unpersist - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-21659) FileStreamSink checks for _spark_metadata even if path has globs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-22343) Add support for publishing Spark metrics into Prometheus - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-20632) Allow 'Column.getItem()' API to accept Vector columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-22331) Make MLlib string params case-insensitive - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-23061) Support default write mode, settable via spark config - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-20847) Error reading NULL int[] element from postgres -- null pointer exception. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-12945) ERROR LiveListenerBus: Listener JobProgressListener threw an exception - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-11171) PMML for Pipelines API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-6143) Improve FP-Growth for mining closed-forms of frequent patterns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-23453) ToolBox compiled Spark UDAF causes java.lang.InternalError: Malformed class name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-14327) Scheduler holds locks which cause huge scheulder delays and executor timeouts - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-24019) AnalysisException for Window function expression to compute derivative - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-17966) Support Spark packages with R code on Mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-7129) Add generic boosting algorithm to spark.ml - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-22612) NullPointerException in AppendOnlyMap - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-19031) JDBC Streaming Source - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-17121) Support _HOST replacement for principal - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-19030) Dropped event errors being reported after SparkContext has been stopped - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-17565) Janino exception when calculating metrics for large generated class - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-5243) Spark will hang if (driver memory + executor memory) exceeds limit on a 1-worker cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-20263) create empty dataframes in sparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-12806) Support SQL expressions extracting values from VectorUDT - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-21715) History Server should not respond history page html content multiple times for only one http request - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-19243) Error when selecting from DataFrame containing parsed data from files larger than 1MB - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-21838) "Completed Applications" links not always working in cluster with spark.ui.reverseProxy=true - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-20006) Separate threshold for broadcast and shuffled hash join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-22250) Be less restrictive on type checking - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-21957) Add current_user function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-20800) Allow users to set job group when connecting through the SQL thrift server - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-14483) Display user name for each job and query - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-7953) Spark should cleanup output dir if job fails - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-23506) Add refreshByPath in HiveMetastoreCatalog and invalidByPath in FileStatusCache - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-12554) Standalone mode may hang if max cores is not a multiple of executor cores - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-19503) Execution Plan Optimizer: avoid sort or shuffle when it does not change end result such as df.sort(...).count() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-14760) Feature transformers should always invoke transformSchema in transform or fit - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-19663) Dynamic Batch Interval Adjustment - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-19909) Batches will fail in case that temporary checkpoint dir is on local file system while metadata dir is on HDFS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-13209) transitive closure on a dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-20680) Spark-sql do not support for void column datatype of view - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-22046) Streaming State cannot be scalable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-19568) Must include class/method documentation for CRAN check - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-22453) Zookeeper configuration for MesosClusterDispatcher is not documented - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-22679) It's slow to stop streaming context - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-18806) driverwrapper and executor doesn't exit when worker killed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-23270) FileInputDStream Streaming UI 's records should not be set to the default value of 0, it should be the total number of rows of new files. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-19144) Add test for GaussianMixture with distributed decompositions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-21560) Add hold mode for the LiveListenerBus - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-20996) Better handling AM reattempt based on exit code in yarn mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-12225) Support adding or replacing multiple columns at once in DataFrame API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-22432) Allow long creation site to be logged for RDDs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-11758) Missing Index column while creating a DataFrame from Pandas - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-20927) Add cache operator to Unsupported Operations in Structured Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-12965) Indexer setInputCol() doesn't resolve column names like DataFrame.col() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-23183) Failure caused by TaskContext is missing in the thread spawned by user code - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-23626) Spark DAGScheduler scheduling performance hindered on JobSubmitted Event - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:04 UTC, 2 replies.
- [jira] [Updated] (SPARK-18896) Suppress ScalaCheck warning -- Unknown ScalaCheck args provided when executing tests using sbt - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-19772) Flaky test: pyspark.streaming.tests.WindowFunctionTests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-22740) [SQL][JDBC] Reserved SQL words are not escaped for table names - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-21227) Unicode in Json field causes AnalysisException when selecting from Dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-21186) PySpark with --packages fails to import library due to lack of pythonpath to .ivy2/jars/*.jar - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-17300) ClosedChannelException caused by missing block manager when speculative tasks are killed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-21997) Spark shows different results on char/varchar columns on Parquet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-20281) Table-valued function range in SQL should use the same number of partitions as spark.range - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-20840) Misleading spurious errors when there are Javadoc (Unidoc) breaks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-19875) Map->filter on many columns gets stuck in constraint inference optimization code - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-19433) ML Pipeline with long stages takes long time to finish - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-22192) An RDD of nested POJO objects cannot be converted into a DataFrame using SQLContext.createDataFrame API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-19756) drop the table cache after inserting into a data source table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-18139) Dataset mapGroups with return typ Seq[Product] produces scala.ScalaReflectionException: object $line262.$read not found - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-19984) ERROR codegen.CodeGenerator: failed to compile: org.codehaus.commons.compiler.CompileException: File 'generated.java' - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-21346) Spark does not use SSL for HTTP File Server and Broadcast Server - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-21591) Implement treeAggregate on Dataset API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-19698) Race condition in stale attempt task completion vs current attempt task completion when task is doing persistent state changes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-22216) Improving PySpark/Pandas interoperability - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-2868) Support named accumulators in Python - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-20573) --packages fails when transitive dependency can only be resolved from repository specified in POM's tag - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-9612) Add instance weight support for GBTs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-20481) Wrong mapping for BooleanType in Spark SQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-20990) Multi-line support for JSON - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-20005) There is no "Newline" in UI in describtion - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-18591) Replace hash-based aggregates with sort-based ones if inputs already sorted - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-21419) Support Mesos failover_timeout in driver (Mesos cluster mode) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-11966) Spark API for UDTFs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-18397) cannot create table by using the hive default fileformat - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-20347) Provide AsyncRDDActions in Python - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-22196) Combine multiple input splits into a HadoopPartition - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-20660) Not able to merge Dataframes with different column orders - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-18454) Changes to improve Nearest Neighbor Search for LSH - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-14037) count(df) is very slow for dataframe constructed using SparkR::createDataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-12977) Factoring out StreamingListener and UI to support history UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-22070) Spark SQL filter comparisons failing with timestamps and ISO-8601 strings - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-22594) Handling spark-submit and master version mismatch - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-13801) DataFrame.col should return unresolved attribute - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-21533) "configure(...)" method not called when using Hive Generic UDFs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-21449) Hive client's SessionState was not closed properly in HiveExternalCatalog - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-19989) Flaky Test: org.apache.spark.sql.kafka010.KafkaSourceStressSuite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-18676) Spark 2.x query plan data size estimation can crash join queries versus 1.x - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-18599) Add the Spectral LDA algorithm - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-19742) When using SparkSession to write a dataset to Hive the schema is ignored - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-13610) Create a Transformer to disassemble vectors in DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-21054) Reset Command support reset specific property which is compatible with Hive - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-16551) Accumulator Examples should demonstrate different use case from UDAFs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-1548) Add Partial Random Forest algorithm to MLlib - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-21194) Fail the putNullmethod when containsNull=false. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-2141) Add sc.getPersistentRDDs() to PySpark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-16944) [MESOS] Improve data locality when launching new executors when dynamic allocation is enabled - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-761) Print a nicer error message when incompatible Spark binaries try to talk - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-23520) Add support for MapType fields in JSON schema inference - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-22532) Spark SQL function 'drop_duplicates' throws error when passing in a column that is an element of a struct - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-21317) Avoid unnecessary sort in FileFormatWriter if data is already bucketed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-7290) Add StringVectorizer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-17670) Spark DataFrame/Dataset no longer supports Option[Map] in case classes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-22117) TasksetManager echo not change after MapoutTracker epoch changed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-20827) cannot express HAVING without a GROUP BY clause - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-22191) Add hive serde example with serde properties - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-20170) Enhance spark framework to support failover in case mesos master failure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-22195) Add cosine similarity to org.apache.spark.ml.linalg.Vectors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-20870) Update the output of spark-sql -H - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-20002) Add support for unions between streaming and batch datasets - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-18887) Executor OOM due to tungsten memory leak in external sorter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-19577) insert into a partition datasource table with InMemoryCatalog after the partition location alter by alter command failed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-13868) Random forest accuracy exploration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-19102) Accuracy error of spark SQL results - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-20230) FetchFailedExceptions should invalidate file caches in MapOutputTracker even if newer stages are launched - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-14216) ML tree models should have a standardized, reusable feature importance test - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-20623) Application link returns redirect to SPARK_LOCAL_IP and disregards SPARK_PUBLIC_DNS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-20305) Master may keep in the state of "COMPELETING_RECOVERY",then all the application registered cannot get resources, when the leader master change. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-22013) Allow to read the results of a streaming query as non-streaming datasource - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-19341) Bucketing support for Structured Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-22633) spark-submit.cmd cannot handle long arguments - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-24220) java.lang.NullPointerException at org.apache.spark.sql.execution.UnsafeExternalRowSorter.(UnsafeExternalRowSorter.java:83) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-20028) Implement NGrams aggregate function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-19475) (ML|MLlib).linalg.DenseVector method delegation fails for __neg__ - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-22079) Serializer in HiveOutputWriter miss loading job configuration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-12832) Mesos cluster mode should handle constraints - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-17487) Configurable bucketing info extraction - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-23696) StructType.fromString swallows exceptions from DataType.fromJson - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-22339) Push epoch updates to executors on fetch failure to avoid fetch retries for missing executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-22742) Spark2.x does not support read data from Hive 2.2 and 2.3 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-19353) Support binary I/O in PipedRDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-17174) Provide support for Timestamp type Column in add_months function to return HH:mm:ss - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-8562) Annoying messages about executor lost after stopping SparkContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-18170) Confusing error message when using rangeBetween without specifying an "orderBy" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-23395) Add an option to return an empty DataFrame from an RDD generated by a Hadoop file when there are no usable paths - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-10908) ClassCastException in HadoopRDD.getJobConf - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-21236) Make the threshold of using HighlyCompressedStatus configurable. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-20418) multi-label classification support - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-22201) Dataframe describe includes string columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-20071) StringIndexer overflows Kryo serialization buffer when run on column with many long distinct values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-19771) Support OR-AND amplification in Locality Sensitive Hashing (LSH) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-5535) Add parameter for storage levels - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-19699) createOrReplaceTable does not always replace an existing table of the same name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-12823) Cannot create UDF with StructType input - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-18688) Interpolated time series join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-19487) Low latency execution for Spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-14652) pyspark streaming driver unable to cleanup metadata for cached RDDs leading to driver OOM - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-19575) Reading from or writing to a hive serde table with a non pre-existing location should succeed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-22240) S3 CSV number of partitions incorrectly computed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-20419) Support for Mesos Maintenance primitives - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-15798) Secondary sort in Dataset/DataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-22536) VectorizedParquetRecordReader doesn't use Parquet's dictionary filtering feature - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-20222) No Spark SQL UI when executing queries in Spark SQL CLI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-22182) Incorrect Date and Timestamp conversion beyon before 1000 year - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-22928) API Documentation issue for org.apache.spark.sql.streaming.Trigger - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-13337) DataFrame join-on-columns function should support null-safe equal - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-19416) Dataset.schema is inconsistent with Dataset in handling columns with periods - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-22630) Consolidate all configuration properties into one page - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-19420) Confusing error message when using outer join on two large tables - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-19071) Optimizations for ML Pipeline Tuning - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-21107) Pyspark: ISO-8859-1 column names inconsistently converted to UTF-8 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-18994) worker clean up app directory block the heartbeat sending - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-11373) Add metrics to the History Server and providers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-10846) Stray META-INF in directory spark-shell is launched from causes problems - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-22970) Getting Application ID from Submission ID - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-21184) QuantileSummaries implementation is wrong and QuantileSummariesSuite fails with larger n - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-22455) Provide an option to store the exception records/files and reasons in log files when reading data from a file-based data source. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-11421) Add the ability to add a jar to the current class loader - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-21757) Jobs page fails to load when executor removed event's reason contains single quote - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-18116) spark streaming ui show 0 events when recovering from checkpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-6407) Streaming ALS for Collaborative Filtering - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-16849) Improve subquery execution by deduplicating the subqueries with the same results - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-21792) Document Spark Streaming Dynamic Allocation Configuration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-14927) DataFrame. saveAsTable creates RDD partitions but not Hive partitions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-21465) array('L') support might lead to overflow error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-21245) Resolve code duplication for classification/regression summarizers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-22589) Subscribe to multiple roles in Mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-19346) Add further cold-start strategies for ALS prediction - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-20722) Replay event log that hasn't be replayed in current checking period in advance for request - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-21021) Reading partitioned parquet does not respect specified schema column order - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-22051) Explicit control of number of partitions after dataframe operations (join, order...) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-13303) Spark fails with pandas import error when pandas is not explicitly imported by user - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-22235) Can not kill job gracefully in spark standalone cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-22234) Distinct window functions are not supported - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-23519) Create View Commands Fails with The view output (col1,col1) contains duplicate column name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-17322) 'ANY n' clause for SQL queries to increase the ease of use of WHERE clause predicates - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-20201) Flaky Test: org.apache.spark.sql.catalyst.expressions.OrderingSuite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-21162) Cannot count rows in an empty Hive table stored as parquet when spark.sql.parquet.cacheMetadata is set to false - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-9213) Improve regular expression performance (via joni) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-20227) Job hangs when joining a lot of aggregated columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-20237) Spark-1.6 current and later versions of memory management issues - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-22619) Implement the CG method for ALS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-19854) Refactor file partitioning strategy to make it easier to extend / unit test - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-18392) LSH API, algorithm, and documentation follow-ups - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-8480) Add setName for Dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-20837) Spark SQL doesn't support escape of single/double quote as SQL standard. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-21301) Should abort active taskSets or kill all running Tasks when that stage success. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-19731) IN Operator should support arrays - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-21894) Some Netty errors do not propagate to the top level driver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-19051) test_hivecontext (pyspark.sql.tests.HiveSparkSubmitTests) fails in python/run-tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-16683) Group by does not work after multiple joins of the same dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-15809) PySpark SQL UDF default returnType - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-14045) DecisionTree improvement umbrella - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-19576) Task attempt paths exist in output path after saveAsNewAPIHadoopFile completes with speculation enabled - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-19856) Turn partitioning related test cases in FileSourceStrategySuite from integration tests into unit tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-4412) Parquet logger cannot be configured - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-22260) java.lang.RuntimeException: hdfs://HdfsHA/logrep/1/sspstatistic/_metadata is not a Parquet file (too small) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-18563) mapWithState: initialState should have a timeout setting per record - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-1866) Closure cleaner does not null shadowed fields when outer scope is referenced - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-18664) Don't respond to HTTP OPTIONS in HTTP-based UIs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-22353) ResultIterable should be indexable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-19011) ApplicationDescription should add the Submission ID for the standalone cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-19288) Failure (at test_sparkSQL.R#1300): date functions on a DataFrame in R/run-tests.sh - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-22336) When spark-submit cluster mode is run from a Mesos task, the job fails - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-23738) Memory usage for executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-22680) SparkSQL scan all partitions when the specified partitions are not exists in parquet formatted table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-18450) Add AND-amplification to Locality Sensitive Hashing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-22292) Add spark.mem.max to limit the amount of memory received from Mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-21252) The duration times showed by spark web UI are inaccurate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-20271) Add FuncTransformer to simplify custom transformer creation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-19052) the rest api don't support multiple standby masters on standalone cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-23613) Different Analyzed logical plan data types for the same table in different queries - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-22561) Dynamically update topics list for spark kafka consumer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-23895) Job continues to run even though some tasks have been failed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-21556) PySpark, Unable to save pipeline of non-spark transformers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-22037) Collapse Project if it is the child of Aggregate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-22527) Reuse coordinated ShuffleExchange if possible - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-15834) Time zone / locale sensitivity umbrella - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-21360) Spark failing to query SQL Server. Query contains a column having space in where clause - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-21242) Allow spark executors to function in mesos w/ container networking enabled - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-19063) Add parameter for storage levels to LDA - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-21863) while performing saveToMemSQL, get Exception in thread "Thread-23" java.lang.AssertionError: assertion failed: Task -1024 release lock on block rdd_5_2 more times than it acquired it - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-23480) NullPointerException in AppendOnlyMap.growTable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-18536) Failed to save to hive table when case class with empty field - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-18650) race condition in FileScanRDD.scala - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-20675) Support Index to skip when retrieval disk structure in CoGroupedRDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-22958) Spark is stuck when the only one executor fails to register with driver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-23479) struct() cannot be combined with alias(metadata={}) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-21427) Describe mapGroupsWithState and flatMapGroupsWithState for stateful aggregation in Structured Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-11010) Fixes and enhancements addressing UDTs' api and several usability concerns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-21482) Make LabeledPoint bean-compliant so it can be used in Encoders.bean(LabeledPoint.class) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-17716) Hidden Markov Model (HMM) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-18925) Reduce memory usage of mapWithState - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-19340) Opening a file in CSV format will result in an exception if the filename contains special characters - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-16428) Spark file system watcher not working on Windows - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-22205) Incorrect result with user defined agg function followed by a non deterministic function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-4899) Support Mesos features: roles and checkpoints - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-21483) Make org.apache.spark.ml.linalg.Vector bean-compliant so it can be used in Encoders.bean(Vector.class) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-21935) Pyspark UDF causing ExecutorLostFailure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-22448) Add functions like Mode(), NumNulls(), etc. in Summarizer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-23625) spark sql long-running mission will be dead - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-20658) spark.yarn.am.attemptFailuresValidityInterval doesn't seem to have an effect - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-20072) Clarify ALS-WR documentation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-21001) Staging folders from Hive table are not being cleared. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-24078) reduce with unionAll takes a long time - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-21955) OneForOneStreamManager may leak memory when network is poor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-22345) Sort-merge join generates incorrect code for CodegenFallback filter conditions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-20320) AnalysisException: Columns of grouping_id (count(value#17L)) does not match grouping columns (count(value#17L)) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-19108) Broadcast all shared parts of tasks (to reduce task serialization time) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-22261) Collect and show failed task metrics for ui - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-20469) Add a method to display DataFrame schema in PipelineStage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-20091) DagScheduler should allow running concurrent attempts of a stage in case of multiple fetch failure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-18233) Failed to deserialize the task - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-23474) mapWithState + async operations = no checkpointing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-16151) Make generated params non-final - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-15810) Aggregator doesn't play nice with Option - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-12764) XML Column type is not supported (JDBC connection to Postgres) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-20292) string representation of TreeNode is messy - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-14501) spark.ml parity for fpm - frequent items - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-19689) Job Details page doesn't show 'Tasks: Succeeded/Total' progress bar text properly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-19867) merge defaultTablePath logic when create table for InMemroyCatalog and HiveExternalCatalog - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-20142) Move RewriteDistinctAggregates later into query execution - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-22194) Allow namespacing of configs in spark.internal.config - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-19371) Cannot spread cached partitions evenly across executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-20138) Add imports to snippets in Spark SQL, DataFrames and Datasets Guide doc - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-18959) invalid resource statistics for standalone cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-22546) Allow users to update the dataType of a column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-19035) rand() function in case when cause failed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-16365) Ideas for moving "mllib-local" forward - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-20112) SIGSEGV in GeneratedIterator.sort_addToSorter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-19216) LogisticRegressionModel is missing getThreshold() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-12140) Support Streaming UI in HistoryServer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-20952) ParquetFileFormat should forward TaskContext to its forkjoinpool - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-22724) TakeOrderedAndProjectExec operator has poor performance when sorting on low cardinality keys - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-21691) Accessing canonicalized plan for query with limit throws exception - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-18724) Add TuningSummary for TrainValidationSplit and CountVectorizer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-19939) Add support for association rules in ML - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-21183) Unable to return Google BigQuery INTEGER data type into Spark via google BigQuery JDBC driver: java.sql.SQLDataException: [Simba][JDBC](10140) Error converting value to long. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-19356) Number of active tasks is negative even when there is no failed executor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-22878) Count totalDroppedEvents for LiveListenerBus - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-8487) Update reduceByKeyAndWindow docs to highlight that filtering Function must be used - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-22200) Kinesis Receivers stops if Kinesis stream was re-sharded - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-19360) Spark 2.X does not support stored by clause - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-23467) Enable way to create DataFrame from pre-partitioned files (Parquet/ORC/etc.) with each in-memory partition mapped to 1 physical file partition - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:01:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-11669) Python interface to SparkR GLM module - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-18011) SparkR serialize "NA" throws exception - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-19310) PySpark Window over function changes behaviour regarding Order-By - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-19046) Dataset checkpoint consumes too much disk space - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-4366) Aggregation Improvement - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-23610) Cast of ArrayType of NullType to ArrayType of nullable material type does not work - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-22482) Unreadable Parquet array columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-13025) Allow user to specify the initial model when training LogisticRegression - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-14401) Switch to stock sbt-pom-reader plugin - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-15605) ML JavaDeveloperApiExample is broken - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-19844) UDF in when control function is executed before the when clause is evaluated. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-22620) Deadlock in blacklisting when executor is removed due to a heartbeat timeout - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-19730) Predicate Subqueries do not push results of subqueries to data source - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-20447) spark mesos scheduler suppress call - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-20683) Make table uncache chaining optional - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-18496) java.lang.AssertionError: assertion failed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-20938) explain() for datasources implementing CatalystScan does not show pushed predicates correctly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-19315) StructType should support nested lookup; throws IllegalArgumentException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-22625) Properly cleanup inheritable thread-locals - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-23746) HashMap UserDefinedType giving cast exception in Spark 1.6.2 while implementing UDAF - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-15044) spark-sql will throw "input path does not exist" exception if it handles a partition which exists in hive table, but the path is removed manually - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-19643) Document how to use Spark/SparkR on Windows - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-19364) Stream Blocks in Storage Persists Forever when Kinesis Checkpoints are enabled and an exception is thrown - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-17684) 'null' appears in the data during aggregateByKey action. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-22056) Add subconcurrency for KafkaRDDPartition - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-16900) Complete-mode output for file sinks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-22457) Tables are supposed to be MANAGED only taking into account whether a path is provided - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-19863) Whether or not use CachedKafkaConsumer need to be configured, when you use DirectKafkaInputDStream to connect the kafka in a Spark Streaming application - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-8418) Add single- and multi-value support to ML Transformers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-10678) Specialize PrefixSpan for single-item patterns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-18831) SQL tab missing from UI when using ElasticSearch connector - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-20054) [Mesos] Detectability for resource starvation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-5256) Improving MLlib optimization APIs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-11782) Master Web UI should link to correct Application UI in cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-13525) SparkR: java.net.SocketTimeoutException: Accept timed out when running any dataframe function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-21867) Support async spilling in UnsafeShuffleWriter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-17977) DataFrameReader and DataStreamReader should have an ancestor class - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-22480) Dynamic Watermarking - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-19657) start-master.sh accidentally forces the use of a loopback address in master URL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-13333) DataFrame filter + randn + unionAll has bad interaction - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-16578) Configurable hostname for RBackend - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-4049) Storage web UI "fraction cached" shows as > 100% - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-17538) sqlContext.registerDataFrameAsTable is not working sometimes in pyspark 2.0.0 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-19973) StagePage should display the number of executors. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-21109) union two dataset[A] don't work as expected if one of the datasets is originated from a dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-20486) Encapsulate ALS in-block and out-block data structures and methods into a separate class - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-10815) Public API: Streaming Sources and Sinks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-21493) Add more metrics to External Shuffle Service - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-19596) After a Stage is completed, all Tasksets for the stage should be marked as zombie - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-16523) Support Row Based Aggregation HashMap - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-23969) Using FPGrowth with PipelinedRDD gives EOF Error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-18778) Fix the Scala classpath in the spark-shell - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-19524) newFilesOnly does not work according to docs. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-21695) Spark scheduler locality algorithm can take longer then expected - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-21496) Support codegen for TakeOrderedAndProjectExec - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-17169) To use scala macros to update code when SharedParamsCodeGen.scala changed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-19351) Support for obtaining file splits from underlying InputFormat - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-19790) OutputCommitCoordinator should not allow another task to commit after an ExecutorFailure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-21158) SparkSQL function SparkSession.Catalog.ListTables() does not handle spark setting for case-sensitivity - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-23621) DataFrame.insertInto() is persisting all columns for mixed structured data-type - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-21859) SparkFiles.get failed on driver in yarn-cluster and yarn-client mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-20249) Add summary for LinearSVCModel - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-21752) Config spark.jars.packages is ignored in SparkSession config - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-20139) Spark UI reports partial success for completed stage while log shows all tasks are finished - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-13964) Feature hashing improvements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-10881) Unable to use custom log4j appender in spark executor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-13158) Show the information of broadcast blocks in WebUI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-15571) Pipeline unit test improvements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-20867) Move individual hints from Statistics into HintInfo class - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-19477) [SQL] Datasets created from a Dataframe with extra columns retain the extra columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-20008) hiveContext.emptyDataFrame.except(hiveContext.emptyDataFrame).count() returns 1 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-15127) Column names are handled incorrectly when they originate from a single Dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-23282) Probable mistake in hasLaunchedTask condition - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-23768) Proxy configuration for extraJavaOptions in defaults conf - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-17459) Add Linear Discriminant to dimensionality reduction algorithms - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-20701) dataframe.show has wrong white space when containing Supplement Unicode character - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-23478) Inconsistent behaviour of union when columns have conflicting metadata - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-20765) Cannot load persisted PySpark ML Pipeline that includes 3rd party stage (Transformer or Estimator) if the package name of stage is not "org.apache.spark" and "pyspark" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-19234) AFTSurvivalRegression chokes silently or with confusing errors when any labels are zero - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-19890) Make MetastoreRelation statistics estimation more accurately - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-19485) Launch tasks async i.e. dont wait for the network - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-21336) Revise rand comparison in BatchEvalPythonExecSuite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-22316) Cannot Select ReducedAggregator Column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-21679) KMeans Clustering is Not Deterministic - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-22590) SparkContext's local properties missing from TaskContext properties - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-21117) Built-in SQL Function Support - WIDTH_BUCKET - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-22581) Catalog api does not allow to specify partitioning columns with create(external)table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-23320) RANDOM pseudo environment variable has low resolution under Windows - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-9103) Tracking spark's memory usage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-19879) Spark UI table sort breaks event timeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-21968) Improved KernelDensity support - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-24150) Race condition in FsHistoryProvider - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-22451) Reduce decision tree aggregate size for unordered features from O(2^numCategories) to O(numCategories) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-18691) Spark can hang if a node goes down during a shuffle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-18940) Percentile and approximate percentile support for frequency distribution table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-22459) EdgeDirection "Either" Does Not Considerate Real "Either" Direction - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-22942) Spark Sql UDF throwing NullPointer when adding a filter on a columns that uses that UDF - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-18616) Pure Python Implementation of MLWritable for use in Pipeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-21964) Enable splitting the Aggregate (on Expand) into a number of Aggregates for grouing analytics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-21529) Uniontype not supported when reading from Hive tables. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-22738) Spark YARN UI does not create fully qualified links for paging impacts use with Knox - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-20099) Add transformSchema to pyspark.ml - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-22435) Support processing array and map type using script - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-24065) Issue with the property IgnoreLeadingWhiteSpace - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-20045) Make sure SparkHadoopMapReduceWriter is resilient to failures of writers and committers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-22293) Avoid unnecessary traversal in ResolveReferences - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-20083) Change matrix toArray to not create a new array when matrix is already column major - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-18142) Spark Master tries to launch workers 145 times within 1 minute - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-18249) StackOverflowError when saving dataset to parquet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-19418) Dataset generated java code fails to compile as java.lang.Long does not accept UTF8String in constructor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-19703) Add Suppress/Revive support to the Mesos Spark Driver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-21179) Unable to return Hive INT data type into Spark via Hive JDBC driver: Caused by: java.sql.SQLDataException: [Simba][JDBC](10140) Error converting value to int. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-1359) SGD implementation is not efficient - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-22098) Add aggregateByKeyLocally in RDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-15785) Add initialModel param to Gaussian Mixture Model (GMM) in spark.ml - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-23180) RFormulaModel should have labels member - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-23168) Hints for fact tables and unique columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-19141) VectorAssembler metadata causing memory issues - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-21390) Dataset filter api inconsistency - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-21417) Detect transitive join conditions via expressions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-14351) Optimize ImpurityAggregator for decision trees - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-23512) Complex operations on Dataframe corrupts data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-18787) spark.shuffle.io.preferDirectBufs does not completely turn off direct buffer usage by Netty - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-18301) VectorAssembler does not support StructTypes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-23139) Read eventLog file with mixed encodings - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-22077) RpcEndpointAddress fails to parse spark URL if it is an ipv6 address. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-22828) Data corruption happens when same RDD being repeatedly used as parent RDD of a custom RDD which reads each parent RDD in concurrent threads - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-20803) KernelDensity.estimate in pyspark.mllib.stat.KernelDensity throws net.razorvine.pickle.PickleException when input data is normally distributed (no error when data is not normally distributed) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-16205) dict -> StructType conversion is undocumented - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-20902) Word2Vec implementations with Negative Sampling - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-20226) Call to sqlContext.cacheTable takes an incredibly long time in some cases - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-6685) Use DSYRK to compute AtA in ALS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-21944) Watermark on window column is wrong - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-20370) create external table on read only location fails - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-23855) Performing a Join after a CrossJoin can lead to data corruption - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-20982) Consider adding SSL support for Spark REST submission server and client - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-24226) while reading data from oracle 12c from spark and using the numofpartition more than 1 is not returning the exact count - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-20060) Support Standalone visiting secured HDFS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-21674) Support double/single-quoted strings for alias names in SQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-19504) clearCache fails to delete orphan RDDs, especially in pyspark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-19839) Fix memory leak in BytesToBytesMap - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-22137) Failed to insert VectorUDT to hive table with DataFrameWriter.insertInto(tableName: String) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-18859) Catalyst codegen does not mark column as nullable when it should. Causes NPE - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-19176) Change bin.xml to be compatible with groupId "org.spark-project.hive" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-5844) Optimize Pipeline.fit for ParamGrid - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-20001) Support PythonRunner executing inside a Conda env - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-4638) Spark's MLlib SVM classification to include Kernels like Gaussian / (RBF) to find non linear boundaries - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-20522) hostname -f does not work with coreutils - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-16394) Timestamp conversion error in pyspark.sql.Row because of timezones - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-12686) Support group-by push down into data sources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-15797) To expose groupingSets for DataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-11593) Replace catalyst converter with RowEncoder in ScalaUDF - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-5564) Support sparse LDA solutions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-3165) DecisionTree does not use sparsity in data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-21488) Make saveAsTable() and createOrReplaceTempView() return dataframe of created table/ created view - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-19553) Add GroupedData.countApprox() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-10335) GraphX Connected Components fail with large number of iterations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-20095) A code bug in CodegenContext.withSubExprEliminationExprs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-21606) HiveServer2/HiveThriftServer2 catches OOMs on request threads - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-23381) Murmur3 hash generates a different value from other implementations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-19091) createDataset(sc.parallelize(x: Seq)) should be equivalent to createDataset(x: Seq) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-12105) Add a DataFrame.show() with argument for output PrintStream - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-21257) LDA : create an Evaluator to enable cross validation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-23494) Expose InferSchema's functionalities to the outside - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-10764) Add optional caching to Pipelines - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-20029) LiR supports bound constrained optimization - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-21876) Idling Executors that never handled any tasks are not cleared from BlockManager after being removed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-23771) Uneven Rowgroup size after repartition - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-23487) Fix the failure in spark-branch-2.2-lint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-12648) UDF with Option[Double] throws ClassCastException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-4240) Refine Tree Predictions in Gradient Boosting to Improve Prediction Accuracy. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-21430) Add PMML support to SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-21340) Bring PySpark MLLib evaluation metrics to parity with Scala API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-20225) Spark Job hangs while writing parquet files to HDFS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-19738) Consider adding error handler to DataStreamWriter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-9272) Persist information of individual partitions when persisting partitioned data source tables to metastore - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-18404) RPC call from executor to driver blocks when getting map output locations (Netty Only) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-23766) Not able to execute multiple queries in spark structured streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-17136) Design optimizer interface for ML algorithms - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-19325) Running query hang-up 5min - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-20859) SQL Loader does not recognize multidimensional columns in postgresql (like integer[]][]) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-13911) Having condition and order by cannot both have aggregate functions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-22248) spark marks all columns as null when its unable to parse single column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-21048) Add an option --merged-properties-file to distinguish the configuration loading behavior - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-23492) Application shows up as running in history server even when latest attempt has completed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-20372) Word2Vec Continuous Bag Of Words model - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-20171) Analyzer should include the arity of a function when reporting "AnalysisException: Undefined function" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-21152) Use level 3 BLAS operations in LogisticAggregator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-19743) Exception when creating more than one implicit Encoder in REPL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-13007) Document where configuration / properties are read and applied - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-17654) Propagate bucketing information for Hive tables to / from Catalog - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-18974) FileInputDStream could not detected files which moved to the directory - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:02:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-12261) pyspark crash for large dataset - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-21136) Misleading error message for typo in SQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:05 UTC, 3 replies.
- [jira] [Updated] (SPARK-20699) The end of Python stdout/stderr streams may be lost by PythonRunner - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-19741) ClassCastException when using Dataset with type containing value types - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-8984) Developer documentation for ML Pipelines - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-21497) Pull non-deterministic joining keys from Join operator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-21368) TPCDSQueryBenchmark can't refer query files. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-12264) Add a typeTag or scalaTypeTag method to DataType - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-3383) DecisionTree aggregate size could be smaller - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-19177) SparkR Data Frame operation between columns elements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-22207) High memory usage when converting relational data to Hierarchical data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-19845) failed to uncache datasource table after the table location altered - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-12344) Remove env-based configurations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-18603) Support `OuterReference` in projection list of IN correlated subqueries - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-22510) Exceptions caused by 64KB JVM bytecode or 64K constant pool entry limit - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-20829) var_samp returns Nan while other vendors return a null value - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-13365) should coalesce do anything if coalescing to same number of partitions without shuffle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-18683) REST APIs for standalone Master态Workers and Applications - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-20939) Do not duplicate user-defined functions while optimizing logical query plans - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-23215) Dataset Grouping: Index out of bounds error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-21452) SessionState in HiveClientImpl is never closed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-21761) [Core] Add the application's final state for SparkListenerApplicationEnd event - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-18636) UnsafeShuffleWriter and DiskBlockObjectWriter do not consider encryption / compression in metrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-17968) Support using 3rd-party R packages on Mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-18005) optional binary Dataframe Column throws (UTF8) is not a group while loading a Dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-3976) Detect block matrix partitioning schemes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-21710) ConsoleSink causes OOM crashes with large inputs. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-15302) Implement FK/PK "rely novalidate" constraints for better CBO - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-11471) Improve the way that we plan shuffled join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-19870) Repeatable deadlock on BlockInfoManager and TorrentBroadcast - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-4862) Streaming | Setting checkpoint as a local directory results in Checkpoint RDD has different partitions error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-8529) Set metadata for MinMaxScaler - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-16844) Generate code for sort based aggregation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-21143) Fail to fetch blocks >1MB in size in presence of conflicting Netty version - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-21558) Kinesis lease failover time should be increased or made configurable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-22338) namedtuple serialization is inefficient - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-17302) Cannot set non-Spark SQL session variables in hive-site.xml, spark-defaults.conf, or using --conf - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-20622) Parquet partition discovery for non key=value named directories - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-20802) kolmogorovSmirnovTest in pyspark.mllib.stat.Statistics throws net.razorvine.pickle.PickleException when input data is normally distributed (no error when data is not normally distributed) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-21429) show on structured Dataset is equivalent to writeStream to console once - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-16571) DataFrame repartition leads to unexpected error during shuffle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-20530) "Cannot evaluate expression" when filtering on parquet partition column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-21262) Stop sending 'stream request' when shuffle blocks. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-15071) Check the result of all TPCDS queries - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-24008) SQL/Hive Context fails with NullPointerException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-22616) df.cache() / df.persist() should have an option blocking like df.unpersist() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-19648) Unable to access column containing '.' for approxQuantile function on DataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-19628) Duplicate Spark jobs in 2.1.0 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-19653) `Vector` Type Should Be A First-Class Citizen In Spark SQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-7420) Flaky test: o.a.s.streaming.JobGeneratorSuite "Do not clear received block data too soon" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-20110) Windowed aggregation do not work when the timestamp is a nested field - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-14047) GBT improvement umbrella - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-18920) Update outdated date formatting - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-21521) History service requires user is in any group - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-21684) df.write double escaping all the already escaped characters except the first one - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-23252) When NodeManager and CoarseGrainedExecutorBackend processes are killed, the job will be blocked - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-16824) Add API docs for VectorUDT - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:11 UTC, 2 replies.
- [jira] [Updated] (SPARK-22020) Support session local timezone - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-22563) Spark row_number() deterministic generation and materialization as a checkpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-21683) "TaskKilled (another attempt succeeded)" log message should be INFO level, not WARN - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-16073) Performance of Parquet encodings on saving primitive arrays - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-23286) Partial registration to external shuffle services on Mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-20324) Control itemSets length in PrefixSpan - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-20729) Reduce boilerplate in Spark ML models - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-20724) spark-submit verbose mode should list default settings values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-13493) json to DataFrame to parquet does not respect case sensitiveness - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-18955) Add ability to emit kafka events to DStream or KafkaDStream - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-22492) Spark master web ui server in standalone mode do not honor host option - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-17763) JacksonParser silently parses null as 0 when the field is not nullable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-22502) OnlineLDAOptimizer variationalTopicInference might be able to handle empty documents - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-21748) Migrate the implementation of HashingTF from MLlib to ML - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-18618) SparkR GLM model predict should support type as a argument - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-3728) RandomForest: Learn models too large to store in memory - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-21056) InMemoryFileIndex.listLeafFiles should create at most one spark job when listing files in parallel - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-21861) Add more details to PageRank illustration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-14822) Add lazy executor startup to Mesos Scheduler - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-8515) Improve ML attribute API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-20357) Expose Calendar.getWeekYear() as Spark SQL date function to be consistent with weekofyear() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-18611) mesos shuffle service on v2 isn't compatible with spark v1 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-21480) Memory leak in org.apache.hadoop.hive.metastore.MetaStoreDirectSql.executeNoResult - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-19222) Limit Query Performance issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-8118) Turn off noisy log output produced by Parquet 1.7.0 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-1924) Make local:/ scheme work in more deploy modes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-16882) Failures in JobGenerator Thread are Swallowed, Job Does Not Fail - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-11222) Add style checker rules to validate doc tests aren't included in docs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-12835) StackOverflowError when aggregating over column from window function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-17360) PySpark can create dataframe from a Python generator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-19135) SQL: Type inconsistencies with Structs, Arrays and Nulls - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-18889) Spark incorrectly reads default columns from a Hive view - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-21641) Combining windowing (groupBy) and mapGroupsWithState (groupByKey) in Spark Structured Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-19367) Hive metastore temporary configuration doesn't specify default filesystem - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-20325) Spark Structured Streaming documentation Update: checkpoint configuration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-15060) Fix stack overflow when executing long lineage transform without checkpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-21131) Fix batch gradient bug in SVDPlusPlus - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-22736) Consider caching decoded dictionaries in VectorizedColumnReader - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-18097) Can't drop a table from Hive if the schema is corrupt - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-22108) Logical Inconsistency in Timestamp Cast - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-22671) SortMergeJoin read more data when wholeStageCodegen is off compared with when it is on - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-20261) EventLoggingListener may not truly flush the logger when a compression codec is used - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-3528) Reading data from file:/// should be called NODE_LOCAL not PROCESS_LOCAL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-21038) Reduce redundant generated init code in Catalyst codegen - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-21061) GMM Error : Matrix is not symmetric - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-22299) Use OFFSET and LIMIT for JDBC DataFrameReader striping - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-19492) Dataset, filter and pattern matching on elements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-19902) Support more expression canonicalization: Add, Subtract, Multiply and Divide - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-18359) Let user specify locale in CSV parsing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:03:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-20169) Groupby Bug with Sparksql - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-20210) Scala tests aborted in Spark SQL on ppc64le - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-17430) Spark task Hangs after OOM while DAG scheduler tries to serialize a task - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-22198) Java incompatibility when extending UnaryTransformer or Transformer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-23130) Spark Thrift does not clean-up temporary files (/tmp/*_resources and /tmp/hive/*.pipeout) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-22420) Spark SQL return invalid json string for struct with date/datetime field - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-15176) Job Scheduling Within Application Suffers from Priority Inversion - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-20221) Port pyspark.mllib.linalg tests in pyspark/mllib/tests.py to pyspark.ml.linalg - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-18534) Datasets Aggregation with Maps - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-19215) Add necessary check for `RDD.checkpoint` to avoid potential mistakes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-20158) crash in Spark sql insert in partitioned hive tables - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-22220) Spark SQL: LATERAL VIEW OUTER null pointer exception with GROUP BY - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-16992) Pep8 code style - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-20128) MetricsSystem not always killed in SparkContext.stop() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-1503) Implement Nesterov's accelerated first-order method - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-19538) DAGScheduler and TaskSetManager can have an inconsistent view of whether a stage is complete. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-19892) Implement findAnalogies method for Word2VecModel - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-18502) Spark does not handle columns that contain backquote (`) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-18067) SortMergeJoin adds shuffle if join predicates have non partitioned columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-17608) Long type has incorrect serialization/deserialization - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-24261) Spark cannot read renamed managed Hive table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-14698) CREATE FUNCTION cloud not add function to hive metastore - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-20454) Improvement of ShortestPaths in Spark GraphX - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-21036) Truncate action and writes should be in one transaction. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-19044) PySpark dropna() can fail with AnalysisException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-22613) Make UNCACHE TABLE behaviour consistent with CACHE TABLE - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-11609) Resolve permanent Hive UDFs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-14706) Python ML persistence integration test - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-19370) Flaky test: MetadataCacheSuite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-15642) Metadata gets lost when selecting a field of a StructType - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-24116) SparkSQL inserting overwrite table has inconsistent behavior regarding HDFS trash - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-21386) ML LinearRegression supports warm start from user provided initial model - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-18409) LSH approxNearestNeighbors should use approxQuantile instead of sort - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-22955) Error generating jobs when Stopping JobGenerator gracefully - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-12519) "Managed memory leak detected" when using distinct on PySpark DataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-16180) Task hang on fetching blocks (cached RDD) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-21740) DataFrame.write does not work with Phoenix JDBC Driver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-21420) Support array type code 'q' and 'Q' - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-14141) Let user specify datatypes of pandas dataframe in toPandas() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-20215) ReuseExchange is boken in SparkSQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-18780) "org.apache.spark.sql.catalyst.errors.package$TreeNodeException: Binding attribute, tree fromunixtime(cast(ā€¦))" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-12042) Python API for mllib.stat.test.StreamingTest - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-20322) MinPattern length in PrefixSpan - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-22084) Performance regression in aggregation strategy - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-19928) Incorrect error message when grouping function used with wrong types - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-21500) Update Shuffle Fetch bookkeeping immediately after receiving remote block - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-16207) order guarantees for DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-14651) CREATE TEMPORARY TABLE is not supported yet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-14764) Spark SQL documentation should be more precise about which SQL features it supports - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-15772) Add missing parameter descriptions and examples in Java/Scala API docs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-22547) Don't include executor ID in metrics name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-20547) ExecutorClassLoader's findClass may not work correctly when a task is cancelled. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:18 UTC, 1 replies.
- [jira] [Updated] (SPARK-18704) CrossValidator should preserve more tuning statistics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-18451) Always set -XX:+HeapDumpOnOutOfMemoryError for Spark tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-15829) spark master webpage links to application UI broke when running in cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-24177) Spark returning inconsistent rows and data in a join query when run using Spark SQL (using SQLContext.sql(...)) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-2489) Unsupported parquet datatype optional fixed_len_byte_array - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-21706) Support Custom PartitionSpec Provider for Kinesis Firehose or similar - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-11844) can not read class org.apache.parquet.format.PageHeader: don't know what type: 13 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-20973) insert table fail caused by unable to fetch data definition file from remote hdfs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-22706) Cannot read Teradata CLOB column type correctly in Spark 2.2.0 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-13228) HiveSparkSubmitSuite is flaky - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-18978) Spark streaming ClassCastException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-9279) Spark Master Refuses to Bind WebUI to a Privileged Port - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-21651) Detect MapType in Json InferSchema - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-20062) Inconsistent checking on ML estimator/model copy in the unit tests. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-16938) Cannot resolve column name after a join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-22183) Inconsistency in LIKE escaping between literal values and column-based ones - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-21697) NPE & ExceptionInInitializerError trying to load UTF from HDFS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-20414) avoid creating only 16 reducers when calling topByKey() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-12635) More efficient (column batch) serialization for Python/R - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-8971) Support balanced class labels when splitting train/cross validation sets - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-20411) New features for expression.scalalang.typed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-17804) Pandas dtypes are not correctly inferred by pyspark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-22215) Add a configuration parameter to set max size for generated classes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-19233) Inconsistent Behaviour of Spark Streaming Checkpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-19100) Schedule tasks in descending order of estimated input size / estimated task duration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-16738) Queryable state for Spark State Store - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:21 UTC, 3 replies.
- [jira] [Updated] (SPARK-21537) toPandas() should handle nested columns (as a Pandas MultiIndex) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-19651) ParallelCollectionRDD.collect should not issue a Spark job - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-19833) remove SQLConf.HIVE_VERIFY_PARTITION_PATH, we always return empty when the path does not exists - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-2926) Add MR-style (merge-sort) SortShuffleReader for sort-based shuffle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-20299) NullPointerException when null and string are in a tuple while encoding Dataset - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-18131) Support returning Vector/Dense Vector from backend - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-18884) Support Array[_] in ScalaUDF - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-18255) SQLContext.getOrCreate always returns a SQLContext even if a user originally created a HiveContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-19522) --executor-memory flag doesn't work in local-cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-20811) GBT Classifier failed with mysterious StackOverflowError - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-20116) Remove task-level functionality from the DAGScheduler - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-19486) Investigate using multiple threads for task serialization - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-19847) port hive read to FileFormat API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-18621) PySQL SQL Types (aka Dataframa Schema) have __repr__() with Scala and not Python representation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-18512) FileNotFoundException on _temporary directory with Spark Streaming 2.0.1 and S3A - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-18833) Changing partition location using the 'ALTER TABLE .. SET LOCATION' command via beeline doesn't get reflected in Spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-22910) Wrong results in Spark Job because failed to move to Trash - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-12157) Support numpy types as return values of Python UDFs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-17762) invokeJava fails when serialized argument list is larger than INT_MAX (2,147,483,647) bytes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-21014) Support get fields with schema name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-19747) Consolidate code in ML aggregators - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-18689) Support prioritized apps utilizing linux cgroups - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-21821) Support to force kill the CoarseGrainedExecutorBackend process which is likely be orphaned. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-21564) TaskDescription decoding failure should fail the task - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-23511) Catalyst: Implement GetField - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-7839) Augment build environment to support native libraries with SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-23552) Dataset.withColumn does not allow overriding of a struct field - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-21458) Tear down the framework when failover_timeout > 0 (Mesos cluster mode) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-18825) Eliminate duplicate links in SparkR API doc index - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-17934) Support percentile scale in ml.feature - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-3434) Distributed block matrix - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-18713) using SparkR build step wise regression model (glm) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-19419) Unable to detect all the cases of cartesian products when spark.sql.crossJoin.enabled is false - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-17820) Spark sqlContext.sql() performs only first insert for HiveQL "FROM target INSERT INTO dest" command to insert into multiple target tables from same source - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-18115) Custom metrics Sink/Source prevent Executor from starting - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-23370) Spark receives a size of 0 for an Oracle Number field and defaults the field type to be BigDecimal(30,10) instead of the actual precision and scale - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-20234) Improve Framework for Basic ML Tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-14151) Propose to refactor and expose Metrics Sink and Source interface - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-18728) Consider using Algebird's Aggregator instead of org.apache.spark.sql.expressions.Aggregator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-20970) Deprecate TaskMetrics._updatedBlockStatuses - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-11976) Support "." character in DataFrame column name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-22629) incorrect handling of calls to random in UDFs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-22237) Spark submit script should use downloaded files in standalone/local client mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-20562) Support Maintenance by having a threshold for unavailability - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-24223) ExternalShuffleService looses registrations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-22321) Improve logging in the mesos scheduler - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-22427) StackOverFlowError when using FPGrowth - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-8643) local-cluster may not shutdown SparkContext gracefully - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-21363) Prevent column name duplication in temporary view - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-15572) ML persistence in R format: compatibility with other languages - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-10780) Set initialModel in KMeans in Pipelines API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-12650) No means to specify Xmx settings for spark-submit in cluster deploy mode for Spark on YARN - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-20723) Random Forest Classifier should expose intermediateRDDStorageLevel similar to ALS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-20965) Support PREPARE/EXECUTE/DECLARE/FETCH statements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-21598) Collect usability/events information from Spark History Server - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-9139) Add backwards-compatibility tests for DataType.fromJson() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-22491) union all can't execute parallel with group by - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-17133) Improvements to linear methods in Spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-19462) when spark.sql.adaptive.enabled is enabled, DF is not resilient to node/container failure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-22383) Generate code to directly get value of primitive type array from ColumnVector for table cache - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-20016) SparkLauncher submit job failed after setConf with special charaters under windows - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-20313) Possible lack of join optimization when partitions are in the join condition - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-19259) spark locality in CNI context - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-14146) Imported implicits can't be found in Spark REPL in some cases - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-21682) Caching 100k-task RDD GC-kills driver (due to updatedBlockStatuses?) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-22524) Subquery on UI appear as the same node even if it's not reused - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-18731) Task size in K-means is so large - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-12052) DataFrame with self-join fails unless toDF() column aliases provided - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-19819) Use concrete data in SparkR DataFrame examples - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-21259) More rules for scalastyle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-20745) Data gets wrongly copied from one row to others, possibly related to named structs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-20663) Data missing after insert overwrite table partition which is created on specific location - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-15682) Hive partition write looks for root hdfs folder for existence - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-23641) Wrong username when making relative path to Hive LOAD DATA absolute - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-22116) Should ignore fetchFaileException if caused by kill event - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-19621) R Windows AppVeyor test should run CRAN checks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-19685) PipedRDD tasks should not hang on interruption / errors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-23216) Multiclass LogisticRegression could have methods like NCE, NEG, Hierarchical SoftMax, Blackout or IS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-22342) refactor schedulerDriver registration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-22558) SparkHiveDynamicPartition fails when trying to write data from kafka to hive using spark streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-19248) Regex_replace works in 1.6 but not in 2.0 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:36 UTC, 3 replies.
- [jira] [Updated] (SPARK-19428) Ability to select first row of groupby - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-22539) Add second order for rangepartitioner since partition number may be small if the specified key is skewed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-19258) Spark DataFrame saveAsTable not working properly after Sentry turned on - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-19374) java.security.KeyManagementException: Default SSLContext is initialized automatically - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-9338) Aliases from SELECT not available in GROUP BY - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-11620) parquet.hadoop.ParquetOutputCommitter.commitJob() throws parquet.io.ParquetEncodingException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-22879) LogisticRegression inconsistent prediction when proba == threshold - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-19917) qualified partition location stored in catalog - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-22599) Avoid extra reading for cached table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-23437) [ML] Distributed Gaussian Process Regression for MLlib - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-13572) HiveContext reads avro Hive tables incorrectly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-21378) Spark Poll timeout when specific offsets are passed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-15328) Word2Vec import for original binary format - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-19551) Theme for PySpark documenation could do with improving - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-23764) Utils.tryWithSafeFinally swollows fatal exceptions in the finally block - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-18712) keep the order of sql expression and support short circuit - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-19098) Shuffled data leak/size doubling in ConnectedComponents/Pregel iterations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-15816) SQL server based on Postgres protocol - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-18626) Concurrent write to table fails from spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-22876) spark.yarn.am.attemptFailuresValidityInterval does not work correctly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-7200) Tungsten test suites should fail if memory leak is detected - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-19023) Memory leak on GraphX with an iterative algorithm and checkpoint on the graph - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-18839) Executor is active on web, but actually is dead - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-16454) Consider adding a per-batch transform for structured streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-23876) OR condition in joins causes results to come back to driver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-21251) Add Kafka consumer metrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-10719) SQLImplicits.rddToDataFrameHolder is not thread safe when using Scala 2.10 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:40 UTC, 1 replies.
- [jira] [Updated] (SPARK-17434) Flaky test: HiveSparkSubmitSuite's "SPARK-9757 Persist Parquet relation with decimal column" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-22468) subtract creating empty DataFrame that isn't initialised properly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-17878) Support for multiple null values when reading CSV data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-23460) PySpark concurrency python egg cache directory - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-20219) Schedule tasks based on size of input from ScheduledRDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-19209) "No suitable driver" on first try - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-3255) Faster algorithms for logistic regression - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-18348) Improve tree ensemble model summary - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-21178) Add support for label specific metrics in MulticlassClassificationEvaluator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-18996) Spark SQL support for post hooks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-20340) Size estimate very wrong in ExternalAppendOnlyMap from CoGroupedRDD, cause OOM - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:42 UTC, 0 replies.
- [jira] [Updated] (SPARK-19926) Make pyspark exception more readable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:42 UTC, 0 replies.
- [jira] [Updated] (SPARK-19781) Bucketizer's handleInvalid leave null values untouched unlike the NaNs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:42 UTC, 0 replies.
- [jira] [Updated] (SPARK-18610) greatest/leatest fails to run with string aginst date/timestamp - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:42 UTC, 0 replies.
- [jira] [Updated] (SPARK-18226) SparkR displaying vector columns in incorrect way - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:42 UTC, 0 replies.
- [jira] [Updated] (SPARK-23287) Spark scheduler does not remove initial executor if not one job submitted - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:42 UTC, 0 replies.
- [jira] [Updated] (SPARK-18693) BinaryClassificationEvaluator, RegressionEvaluator, and MulticlassClassificationEvaluator should use sample weight data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:42 UTC, 0 replies.
- [jira] [Updated] (SPARK-20034) IndexedRowMatrix should have toLocalMatrix defined on it. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:42 UTC, 0 replies.
- [jira] [Updated] (SPARK-19366) Dataset should have getNumPartitions method - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:43 UTC, 0 replies.
- [jira] [Updated] (SPARK-21719) Enable complex expression in Column.getItem(...) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:43 UTC, 0 replies.
- [jira] [Updated] (SPARK-10496) Efficient DataFrame cumulative sum - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:43 UTC, 0 replies.
- [jira] [Updated] (SPARK-23964) why does Spillable wait for 32 elements? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:43 UTC, 0 replies.
- [jira] [Updated] (SPARK-14040) Null-safe and equality join produces incorrect result with filtered dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:43 UTC, 0 replies.
- [jira] [Updated] (SPARK-24174) Expose Hadoop config as part of /environment API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:43 UTC, 0 replies.
- [jira] [Updated] (SPARK-15491) JSON serialization fails for JDBC DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:44 UTC, 0 replies.
- [jira] [Updated] (SPARK-22765) Create a new executor allocation scheme based on that of MR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:44 UTC, 0 replies.
- [jira] [Updated] (SPARK-18122) Fallback to Kryo for unknown classes in ExpressionEncoder - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:44 UTC, 0 replies.
- [jira] [Updated] (SPARK-18178) Importing Pandas Tables with Missing Values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:44 UTC, 0 replies.
- [jira] [Updated] (SPARK-21209) Implement Incremental PCA algorithm for ML - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:44 UTC, 0 replies.
- [jira] [Updated] (SPARK-23659) Spark Job gets stuck during shuffle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:45 UTC, 0 replies.
- [jira] [Updated] (SPARK-20805) updated updateP in SVD++ is error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:45 UTC, 0 replies.
- [jira] [Updated] (SPARK-22155) Generates a simple default alias for the function in select - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:45 UTC, 0 replies.
- [jira] [Updated] (SPARK-6634) Allow replacing columns in Transformers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:45 UTC, 0 replies.
- [jira] [Updated] (SPARK-12964) SparkContext.localProperties leaked - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:45 UTC, 0 replies.
- [jira] [Updated] (SPARK-14172) Hive table partition predicate not passed down correctly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:45 UTC, 0 replies.
- [jira] [Updated] (SPARK-20797) mllib lda's LocalLDAModel's save: out of memory. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:46 UTC, 0 replies.
- [jira] [Updated] (SPARK-16366) Time comparison failures in SparkR unit tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:46 UTC, 0 replies.
- [jira] [Updated] (SPARK-13860) TPCDS query 39 returns wrong results compared to TPC official result set - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:46 UTC, 0 replies.
- [jira] [Updated] (SPARK-19855) Create an internal FilePartitionStrategy interface - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:46 UTC, 0 replies.
- [jira] [Updated] (SPARK-21716) The time-range window can't be applied on the reduce operator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:47 UTC, 0 replies.
- [jira] [Updated] (SPARK-21314) ByteArrayMethods.arrayEquals could use some optimizations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:47 UTC, 0 replies.
- [jira] [Updated] (SPARK-16280) Implement histogram_numeric SQL function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:47 UTC, 0 replies.
- [jira] [Updated] (SPARK-22906) External shuffle IP different from Host ip - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:47 UTC, 0 replies.
- [jira] [Updated] (SPARK-6810) Performance benchmarks for SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:47 UTC, 0 replies.
- [jira] [Updated] (SPARK-11237) PMML export for ML KMeans - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:47 UTC, 0 replies.
- [jira] [Updated] (SPARK-19982) JavaDatasetSuite.testJavaBeanEncoder sometimes fails with "Unable to generate an encoder for inner class" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:48 UTC, 0 replies.
- [jira] [Updated] (SPARK-11003) Allowing UserDefinedTypes to extend primatives - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:48 UTC, 0 replies.
- [jira] [Updated] (SPARK-19147) netty throw NPE - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:48 UTC, 0 replies.
- [jira] [Updated] (SPARK-20959) Add a parameter to UnsafeExternalSorter to configure filebuffersize - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:48 UTC, 0 replies.
- [jira] [Updated] (SPARK-21220) Use outputPartitioning's bucketing if possible on write - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:48 UTC, 0 replies.
- [jira] [Updated] (SPARK-19114) Backpressure could support non-integral rates (< 1) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:04:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10912) Improve Spark metrics executor.filesystem - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23282) Probable mistake in hasLaunchedTask condition - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19325) Running query hang-up 5min - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21336) Revise rand comparison in BatchEvalPythonExecSuite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19663) Dynamic Batch Interval Adjustment - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17133) Improvements to linear methods in Spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9213) Improve regular expression performance (via joni) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19477) [SQL] Datasets created from a Dataframe with extra columns retain the extra columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21695) Spark scheduler locality algorithm can take longer then expected - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6962) Netty BlockTransferService hangs in the middle of SQL query - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13573) Open SparkR APIs (R package) to allow better 3rd party usage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22451) Reduce decision tree aggregate size for unordered features from O(2^numCategories) to O(numCategories) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21748) Migrate the implementation of HashingTF from MLlib to ML - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19962) add DictVectorizor for DataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20660) Not able to merge Dataframes with different column orders - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20734) Structured Streaming spark.sql.streaming.schemaInference not handling schema changes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19098) Shuffled data leak/size doubling in ConnectedComponents/Pregel iterations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13303) Spark fails with pandas import error when pandas is not explicitly imported by user - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22051) Explicit control of number of partitions after dataframe operations (join, order...) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22108) Logical Inconsistency in Timestamp Cast - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21220) Use outputPartitioning's bucketing if possible on write - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23478) Inconsistent behaviour of union when columns have conflicting metadata - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22581) Catalog api does not allow to specify partitioning columns with create(external)table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19730) Predicate Subqueries do not push results of subqueries to data source - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22630) Consolidate all configuration properties into one page - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19790) OutputCommitCoordinator should not allow another task to commit after an ExecutorFailure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19030) Dropped event errors being reported after SparkContext has been stopped - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22680) SparkSQL scan all partitions when the specified partitions are not exists in parquet formatted table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22020) Support session local timezone - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18502) Spark does not handle columns that contain backquote (`) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24150) Race condition in FsHistoryProvider - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14236) UDAF does not use incomingSchema for update Method - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22196) Combine multiple input splits into a HadoopPartition - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14864) [MLLIB] Implement Doc2Vec - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20325) Spark Structured Streaming documentation Update: checkpoint configuration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15302) Implement FK/PK "rely novalidate" constraints for better CBO - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19522) --executor-memory flag doesn't work in local-cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16280) Implement histogram_numeric SQL function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14401) Switch to stock sbt-pom-reader plugin - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:11:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22928) API Documentation issue for org.apache.spark.sql.streaming.Trigger - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12648) UDF with Option[Double] throws ClassCastException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13337) DataFrame join-on-columns function should support null-safe equal - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18069) Many examples in Python docstrings are incomplete - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19230) View creation in Derby gets SQLDataException because definition gets very big - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22351) Support user-created custom Encoders for Datasets - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18170) Confusing error message when using rangeBetween without specifying an "orderBy" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7290) Add StringVectorizer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21465) array('L') support might lead to overflow error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19928) Incorrect error message when grouping function used with wrong types - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19772) Flaky test: pyspark.streaming.tests.WindowFunctionTests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16551) Accumulator Examples should demonstrate different use case from UDAFs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15834) Time zone / locale sensitivity umbrella - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22765) Create a new executor allocation scheme based on that of MR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21152) Use level 3 BLAS operations in LogisticAggregator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21558) Kinesis lease failover time should be increased or made configurable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18780) "org.apache.spark.sql.catalyst.errors.package$TreeNodeException: Binding attribute, tree fromunixtime(cast(ā€¦))" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8971) Support balanced class labels when splitting train/cross validation sets - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22625) Properly cleanup inheritable thread-locals - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19856) Turn partitioning related test cases in FileSourceStrategySuite from integration tests into unit tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12140) Support Streaming UI in HistoryServer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11222) Add style checker rules to validate doc tests aren't included in docs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22137) Failed to insert VectorUDT to hive table with DataFrameWriter.insertInto(tableName: String) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17360) PySpark can create dataframe from a Python generator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9338) Aliases from SELECT not available in GROUP BY - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13333) DataFrame filter + randn + unionAll has bad interaction - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18978) Spark streaming ClassCastException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9279) Spark Master Refuses to Bind WebUI to a Privileged Port - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18713) using SparkR build step wise regression model (glm) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21001) Staging folders from Hive table are not being cleared. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20112) SIGSEGV in GeneratedIterator.sort_addToSorter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20029) LiR supports bound constrained optimization - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20469) Add a method to display DataFrame schema in PipelineStage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19738) Consider adding error handler to DataStreamWriter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19651) ParallelCollectionRDD.collect should not issue a Spark job - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19819) Use concrete data in SparkR DataFrame examples - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21719) Enable complex expression in Column.getItem(...) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19621) R Windows AppVeyor test should run CRAN checks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20800) Allow users to set job group when connecting through the SQL thrift server - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18512) FileNotFoundException on _temporary directory with Spark Streaming 2.0.1 and S3A - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-2141) Add sc.getPersistentRDDs() to PySpark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24223) ExternalShuffleService looses registrations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20479) Performance degradation for large number of hash-aggregated columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19984) ERROR codegen.CodeGenerator: failed to compile: org.codehaus.commons.compiler.CompileException: File 'generated.java' - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18683) REST APIs for standalone Master态Workers and Applications - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18131) Support returning Vector/Dense Vector from backend - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24203) Make executor's bindAddress configurable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22561) Dynamically update topics list for spark kafka consumer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20902) Word2Vec implementations with Negative Sampling - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19051) test_hivecontext (pyspark.sql.tests.HiveSparkSubmitTests) fails in python/run-tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22878) Count totalDroppedEvents for LiveListenerBus - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13964) Feature hashing improvements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20332) Avro/Parquet GenericFixed decimal is not read into Spark correctly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21683) "TaskKilled (another attempt succeeded)" log message should be INFO level, not WARN - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20973) insert table fail caused by unable to fetch data definition file from remote hdfs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20138) Add imports to snippets in Spark SQL, DataFrames and Datasets Guide doc - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22246) UnsafeRow, UnsafeArrayData, and UnsafeMapData use MemoryBlock - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22546) Allow users to update the dataType of a column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20201) Flaky Test: org.apache.spark.sql.catalyst.expressions.OrderingSuite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15772) Add missing parameter descriptions and examples in Java/Scala API docs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20054) [Mesos] Detectability for resource starvation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20411) New features for expression.scalalang.typed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20486) Encapsulate ALS in-block and out-block data structures and methods into a separate class - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23502) Support async init of spark context during spark-shell startup - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20091) DagScheduler should allow running concurrent attempts of a stage in case of multiple fetch failure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18233) Failed to deserialize the task - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20485) Split ALS.scala into multiple files - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11966) Spark API for UDTFs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20573) --packages fails when transitive dependency can only be resolved from repository specified in POM's tag - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23381) Murmur3 hash generates a different value from other implementations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17885) Spark Streaming deletes checkpointed RDD then tries to load it after restart - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18884) Support Array[_] in ScalaUDF - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-2489) Unsupported parquet datatype optional fixed_len_byte_array - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20263) create empty dataframes in sparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19485) Launch tasks async i.e. dont wait for the network - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18359) Let user specify locale in CSV parsing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21641) Combining windowing (groupBy) and mapGroupsWithState (groupByKey) in Spark Structured Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20128) MetricsSystem not always killed in SparkContext.stop() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20142) Move RewriteDistinctAggregates later into query execution - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22876) spark.yarn.am.attemptFailuresValidityInterval does not work correctly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21606) HiveServer2/HiveThriftServer2 catches OOMs on request threads - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23625) spark sql long-running mission will be dead - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20158) crash in Spark sql insert in partitioned hive tables - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22532) Spark SQL function 'drop_duplicates' throws error when passing in a column that is an element of a struct - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18887) Executor OOM due to tungsten memory leak in external sorter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22345) Sort-merge join generates incorrect code for CodegenFallback filter conditions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19875) Map->filter on many columns gets stuck in constraint inference optimization code - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22205) Incorrect result with user defined agg function followed by a non deterministic function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22742) Spark2.x does not support read data from Hive 2.2 and 2.3 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22619) Implement the CG method for ALS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23297) Spark job is finished but the stage process is error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19141) VectorAssembler metadata causing memory issues - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20072) Clarify ALS-WR documentation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12823) Cannot create UDF with StructType input - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23641) Wrong username when making relative path to Hive LOAD DATA absolute - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20237) Spark-1.6 current and later versions of memory management issues - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22013) Allow to read the results of a streaming query as non-streaming datasource - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22539) Add second order for rangepartitioner since partition number may be small if the specified key is skewed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22116) Should ignore fetchFaileException if caused by kill event - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21423) MODE average aggregate function. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20797) mllib lda's LocalLDAModel's save: out of memory. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18536) Failed to save to hive table when case class with empty field - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23286) Partial registration to external shuffle services on Mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20723) Random Forest Classifier should expose intermediateRDDStorageLevel similar to ALS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19926) Make pyspark exception more readable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11237) PMML export for ML KMeans - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24220) java.lang.NullPointerException at org.apache.spark.sql.execution.UnsafeExternalRowSorter.(UnsafeExternalRowSorter.java:83) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22970) Getting Application ID from Submission ID - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22906) External shuffle IP different from Host ip - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22527) Reuse coordinated ShuffleExchange if possible - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21867) Support async spilling in UnsafeShuffleWriter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15328) Word2Vec import for original binary format - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20315) Set ScalaUDF's deterministic to true - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21591) Implement treeAggregate on Dataset API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20320) AnalysisException: Columns of grouping_id (count(value#17L)) does not match grouping columns (count(value#17L)) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20110) Windowed aggregation do not work when the timestamp is a nested field - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21056) InMemoryFileIndex.listLeafFiles should create at most one spark job when listing files in parallel - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20530) "Cannot evaluate expression" when filtering on parquet partition column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21360) Spark failing to query SQL Server. Query contains a column having space in where clause - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19147) netty throw NPE - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20227) Job hangs when joining a lot of aggregated columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19524) newFilesOnly does not work according to docs. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22299) Use OFFSET and LIMIT for JDBC DataFrameReader striping - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13801) DataFrame.col should return unresolved attribute - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19315) StructType should support nested lookup; throws IllegalArgumentException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18825) Eliminate duplicate links in SparkR API doc index - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15682) Hive partition write looks for root hdfs folder for existence - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16992) Pep8 code style - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11421) Add the ability to add a jar to the current class loader - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21564) TaskDescription decoding failure should fail the task - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11782) Master Web UI should link to correct Application UI in cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21480) Memory leak in org.apache.hadoop.hive.metastore.MetaStoreDirectSql.executeNoResult - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12945) ERROR LiveListenerBus: Listener JobProgressListener threw an exception - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18023) Adam optimizer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18450) Add AND-amplification to Locality Sensitive Hashing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5206) Accumulators are not re-registered during recovering from checkpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14887) Generated SpecificUnsafeProjection Exceeds JVM Code Size Limits - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19341) Bucketing support for Structured Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18397) cannot create table by using the hive default fileformat - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21048) Add an option --merged-properties-file to distinguish the configuration loading behavior - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20328) HadoopRDDs create a MapReduce JobConf, but are not MapReduce jobs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20169) Groupby Bug with Sparksql - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20562) Support Maintenance by having a threshold for unavailability - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18996) Spark SQL support for post hooks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3434) Distributed block matrix - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21340) Bring PySpark MLLib evaluation metrics to parity with Scala API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19917) qualified partition location stored in catalog - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6442) MLlib Local Linear Algebra Package - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21054) Reset Command support reset specific property which is compatible with Hive - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20982) Consider adding SSL support for Spark REST submission server and client - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20547) ExecutorClassLoader's findClass may not work correctly when a task is cancelled. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:58 UTC, 1 replies.
- [jira] [Resolved] (SPARK-22383) Generate code to directly get value of primitive type array from ColumnVector for table cache - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22331) Make MLlib string params case-insensitive - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:12:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18889) Spark incorrectly reads default columns from a Hive view - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18404) RPC call from executor to driver blocks when getting map output locations (Netty Only) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19845) failed to uncache datasource table after the table location altered - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19731) IN Operator should support arrays - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21317) Avoid unnecessary sort in FileFormatWriter if data is already bucketed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20724) spark-submit verbose mode should list default settings values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22382) Spark on mesos: doesn't support public IP setup for agent and master. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20851) Drop spark table failed if a column name is a numeric string - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19783) Treat shorter/longer lengths of tokens as malformed records in CSV parser - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22441) JDBC REAL type is mapped to Double instead of Float - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13007) Document where configuration / properties are read and applied - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18301) VectorAssembler does not support StructTypes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6143) Improve FP-Growth for mining closed-forms of frequent patterns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12832) Mesos cluster mode should handle constraints - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14172) Hive table partition predicate not passed down correctly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19340) Opening a file in CSV format will result in an exception if the filename contains special characters - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22910) Wrong results in Spark Job because failed to move to Trash - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12686) Support group-by push down into data sources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19892) Implement findAnalogies method for Word2VecModel - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19225) Spark SQL round constant double return null - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22182) Incorrect Date and Timestamp conversion beyon before 1000 year - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22691) Custom HttpFileSystem, issue with question-marks in path - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21533) "configure(...)" method not called when using Hive Generic UDFs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19135) SQL: Type inconsistencies with Structs, Arrays and Nulls - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18392) LSH API, algorithm, and documentation follow-ups - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7129) Add generic boosting algorithm to spark.ml - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22048) Show id, runId, batch in Description column in SQL tab for streaming queries (as in Jobs) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23395) Add an option to return an empty DataFrame from an RDD generated by a Hadoop file when there are no usable paths - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21529) Uniontype not supported when reading from Hive tables. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19418) Dataset generated java code fails to compile as java.lang.Long does not accept UTF8String in constructor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18454) Changes to improve Nearest Neighbor Search for LSH - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20045) Make sure SparkHadoopMapReduceWriter is resilient to failures of writers and committers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22958) Spark is stuck when the only one executor fails to register with driver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21859) SparkFiles.get failed on driver in yarn-cluster and yarn-client mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21290) R document Programmatically Specifying the Schema in SQL guide - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18959) invalid resource statistics for standalone cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20210) Scala tests aborted in Spark SQL on ppc64le - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20005) There is no "Newline" in UI in describtion - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20827) cannot express HAVING without a GROUP BY clause - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23626) Spark DAGScheduler scheduling performance hindered on JobSubmitted Event - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19756) drop the table cache after inserting into a data source table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3165) DecisionTree does not use sparsity in data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18859) Catalyst codegen does not mark column as nullable when it should. Causes NPE - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10780) Set initialModel in KMeans in Pipelines API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23492) Application shows up as running in history server even when latest attempt has completed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21021) Reading partitioned parquet does not respect specified schema column order - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14141) Let user specify datatypes of pandas dataframe in toPandas() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16900) Complete-mode output for file sinks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20116) Remove task-level functionality from the DAGScheduler - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21195) MetricSystem should pick up dynamically registered metrics in sources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23768) Proxy configuration for extraJavaOptions in defaults conf - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17174) Provide support for Timestamp type Column in add_months function to return HH:mm:ss - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20372) Word2Vec Continuous Bag Of Words model - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14698) CREATE FUNCTION cloud not add function to hive metastore - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16849) Improve subquery execution by deduplicating the subqueries with the same results - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22277) Chi Square selector garbling Vector content. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22240) S3 CSV number of partitions incorrectly computed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23511) Catalyst: Implement GetField - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22536) VectorizedParquetRecordReader doesn't use Parquet's dictionary filtering feature - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21005) VectorIndexerModel does not prepare output column field correctly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16944) [MESOS] Improve data locality when launching new executors when dynamic allocation is enabled - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21861) Add more details to PageRank illustration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22104) Add new option to dataframe -> parquet ==> custom extension to file name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8515) Improve ML attribute API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21894) Some Netty errors do not propagate to the top level driver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22343) Add support for publishing Spark metrics into Prometheus - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21226) Save empty dataframe in pyspark prints nothing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20271) Add FuncTransformer to simplify custom transformer creation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21684) df.write double escaping all the already escaped characters except the first one - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21507) Exception when using spark.jars.packages - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23061) Support default write mode, settable via spark config - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18728) Consider using Algebird's Aggregator instead of org.apache.spark.sql.expressions.Aggregator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21349) Make TASK_SIZE_TO_WARN_KB configurable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19615) Provide Dataset union convenience for divergent schema - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23130) Spark Thrift does not clean-up temporary files (/tmp/*_resources and /tmp/hive/*.pipeout) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11010) Fixes and enhancements addressing UDTs' api and several usability concerns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19364) Stream Blocks in Storage Persists Forever when Kinesis Checkpoints are enabled and an exception is thrown - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19568) Must include class/method documentation for CRAN check - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17565) Janino exception when calculating metrics for large generated class - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22220) Spark SQL: LATERAL VIEW OUTER null pointer exception with GROUP BY - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17423) Support IGNORE NULLS option in Window functions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18618) SparkR GLM model predict should support type as a argument - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15044) spark-sql will throw "input path does not exist" exception if it handles a partition which exists in hive table, but the path is removed manually - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19234) AFTSurvivalRegression chokes silently or with confusing errors when any labels are zero - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21612) Allow unicode strings in __getitem__ of StructType - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22270) Renaming DF column breaks sparkPlan.outputOrdering - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22260) java.lang.RuntimeException: hdfs://HdfsHA/logrep/1/sspstatistic/_metadata is not a Parquet file (too small) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17934) Support percentile scale in ml.feature - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13911) Having condition and order by cannot both have aggregate functions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19367) Hive metastore temporary configuration doesn't specify default filesystem - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18691) Spark can hang if a node goes down during a shuffle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21710) ConsoleSink causes OOM crashes with large inputs. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-2926) Add MR-style (merge-sort) SortShuffleReader for sort-based shuffle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20663) Data missing after insert overwrite table partition which is created on specific location - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19487) Low latency execution for Spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17977) DataFrameReader and DataStreamReader should have an ancestor class - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19982) JavaDatasetSuite.testJavaBeanEncoder sometimes fails with "Unable to generate an encoder for inner class" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19371) Cannot spread cached partitions evenly across executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8418) Add single- and multi-value support to ML Transformers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19867) merge defaultTablePath logic when create table for InMemroyCatalog and HiveExternalCatalog - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21968) Improved KernelDensity support - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18704) CrossValidator should preserve more tuning statistics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19979) [MLLIB] Multiple Estimators/Pipelines In CrossValidator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18621) PySQL SQL Types (aka Dataframa Schema) have __repr__() with Scala and not Python representation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13365) should coalesce do anything if coalescing to same number of partitions without shuffle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23766) Not able to execute multiple queries in spark structured streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20219) Schedule tasks based on size of input from ScheduledRDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19209) "No suitable driver" on first try - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12225) Support adding or replacing multiple columns at once in DataFrame API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21143) Fail to fetch blocks >1MB in size in presence of conflicting Netty version - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15816) SQL server based on Postgres protocol - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18833) Changing partition location using the 'ALTER TABLE .. SET LOCATION' command via beeline doesn't get reflected in Spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21841) Spark SQL doesn't pick up column added in hive when table created with saveAsTable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20952) ParquetFileFormat should forward TaskContext to its forkjoinpool - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9139) Add backwards-compatibility tests for DataType.fromJson() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14146) Imported implicits can't be found in Spark REPL in some cases - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13158) Show the information of broadcast blocks in WebUI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20221) Port pyspark.mllib.linalg tests in pyspark/mllib/tests.py to pyspark.ml.linalg - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24261) Spark cannot read renamed managed Hive table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16683) Group by does not work after multiple joins of the same dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15605) ML JavaDeveloperApiExample is broken - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18725) Creating a datasource table with schema should not scan all files for table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20970) Deprecate TaskMetrics._updatedBlockStatuses - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22316) Cannot Select ReducedAggregator Column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18994) worker clean up app directory block the heartbeat sending - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3976) Detect block matrix partitioning schemes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21245) Resolve code duplication for classification/regression summarizers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21957) Add current_user function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21363) Prevent column name duplication in temporary view - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20008) hiveContext.emptyDataFrame.except(hiveContext.emptyDataFrame).count() returns 1 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23552) Dataset.withColumn does not allow overriding of a struct field - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13572) HiveContext reads avro Hive tables incorrectly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22589) Subscribe to multiple roles in Mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22340) pyspark setJobGroup doesn't match java threads - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20699) The end of Python stdout/stderr streams may be lost by PythonRunner - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20454) Improvement of ShortestPaths in Spark GraphX - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17966) Support Spark packages with R code on Mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12519) "Managed memory leak detected" when using distinct on PySpark DataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16205) dict -> StructType conversion is undocumented - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19973) StagePage should display the number of executors. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20034) IndexedRowMatrix should have toLocalMatrix defined on it. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20996) Better handling AM reattempt based on exit code in yarn mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19100) Schedule tasks in descending order of estimated input size / estimated task duration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17762) invokeJava fails when serialized argument list is larger than INT_MAX (2,147,483,647) bytes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22923) Non-equi join(theta join) should use sort merge join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16394) Timestamp conversion error in pyspark.sql.Row because of timezones - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-1548) Add Partial Random Forest algorithm to MLlib - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20675) Support Index to skip when retrieval disk structure in CoGroupedRDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13619) Jobs page UI shows wrong number of failed tasks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23519) Create View Commands Fails with The view output (col1,col1) contains duplicate column name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20522) hostname -f does not work with coreutils - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22558) SparkHiveDynamicPartition fails when trying to write data from kafka to hive using spark streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20357) Expose Calendar.getWeekYear() as Spark SQL date function to be consistent with weekofyear() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21194) Fail the putNullmethod when containsNull=false. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21500) Update Shuffle Fetch bookkeeping immediately after receiving remote block - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22555) Possibly incorrect scaling of L2 regularization strength in LinearRegression - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21493) Add more metrics to External Shuffle Service - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16738) Queryable state for Spark State Store - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19643) Document how to use Spark/SparkR on Windows - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23613) Different Analyzed logical plan data types for the same table in different queries - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18731) Task size in K-means is so large - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18496) java.lang.AssertionError: assertion failed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22079) Serializer in HiveOutputWriter miss loading job configuration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21162) Cannot count rows in an empty Hive table stored as parquet when spark.sql.parquet.cacheMetadata is set to false - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21242) Allow spark executors to function in mesos w/ container networking enabled - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18409) LSH approxNearestNeighbors should use approxQuantile instead of sort - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17487) Configurable bucketing info extraction - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24177) Spark returning inconsistent rows and data in a join query when run using Spark SQL (using SQLContext.sql(...)) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18116) spark streaming ui show 0 events when recovering from checkpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24116) SparkSQL inserting overwrite table has inconsistent behavior regarding HDFS trash - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21518) Warnings if spark.mesos.task.labels is unset - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20207) Add ablity to exclude current row in WindowSpec - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19351) Support for obtaining file splits from underlying InputFormat - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18736) CreateMap allows non-unique keys - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20340) Size estimate very wrong in ExternalAppendOnlyMap from CoGroupedRDD, cause OOM - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20171) Analyzer should include the arity of a function when reporting "AnalysisException: Undefined function" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18616) Pure Python Implementation of MLWritable for use in Pipeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23139) Read eventLog file with mixed encodings - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21521) History service requires user is in any group - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12105) Add a DataFrame.show() with argument for output PrintStream - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23480) NullPointerException in AppendOnlyMap.growTable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21876) Idling Executors that never handled any tasks are not cleared from BlockManager after being removed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23370) Spark receives a size of 0 for an Oracle Number field and defaults the field type to be BigDecimal(30,10) instead of the actual precision and scale - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18636) UnsafeShuffleWriter and DiskBlockObjectWriter do not consider encryption / compression in metrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21944) Watermark on window column is wrong - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22966) Spark SQL should handle Python UDFs that return a datetime.date or datetime.datetime - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21497) Pull non-deterministic joining keys from Join operator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22613) Make UNCACHE TABLE behaviour consistent with CACHE TABLE - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22432) Allow long creation site to be logged for RDDs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16578) Configurable hostname for RBackend - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12964) SparkContext.localProperties leaked - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19428) Ability to select first row of groupby - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15176) Job Scheduling Within Application Suffers from Priority Inversion - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21651) Detect MapType in Json InferSchema - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19044) PySpark dropna() can fail with AnalysisException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22740) [SQL][JDBC] Reserved SQL words are not escaped for table names - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22207) High memory usage when converting relational data to Hierarchical data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19503) Execution Plan Optimizer: avoid sort or shuffle when it does not change end result such as df.sort(...).count() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22706) Cannot read Teradata CLOB column type correctly in Spark 2.2.0 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19420) Confusing error message when using outer join on two large tables - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20419) Support for Mesos Maintenance primitives - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19374) java.security.KeyManagementException: Default SSLContext is initialized automatically - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23895) Job continues to run even though some tasks have been failed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10678) Specialize PrefixSpan for single-item patterns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8487) Update reduceByKeyAndWindow docs to highlight that filtering Function must be used - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21682) Caching 100k-task RDD GC-kills driver (due to updatedBlockStatuses?) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22070) Spark SQL filter comparisons failing with timestamps and ISO-8601 strings - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-1866) Closure cleaner does not null shadowed fields when outer scope is referenced - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22987) UnsafeExternalSorter cases OOM when invoking `getIterator` function. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19090) Dynamic Resource Allocation not respecting spark.executor.cores - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22468) subtract creating empty DataFrame that isn't initialised properly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21038) Reduce redundant generated init code in Catalyst codegen - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22155) Generates a simple default alias for the function in select - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:13:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21301) Should abort active taskSets or kill all running Tasks when that stage success. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19577) insert into a partition datasource table with InMemoryCatalog after the partition location alter by alter command failed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21715) History Server should not respond history page html content multiple times for only one http request - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13209) transitive closure on a dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9103) Tracking spark's memory usage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17430) Spark task Hangs after OOM while DAG scheduler tries to serialize a task - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20188) Catalog recoverPartitions should allow specifying the database name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10815) Public API: Streaming Sources and Sinks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19114) Backpressure could support non-integral rates (< 1) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21458) Tear down the framework when failover_timeout > 0 (Mesos cluster mode) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12264) Add a typeTag or scalaTypeTag method to DataType - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7953) Spark should cleanup output dir if job fails - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20139) Spark UI reports partial success for completed stage while log shows all tasks are finished - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23494) Expose InferSchema's functionalities to the outside - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20418) multi-label classification support - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19648) Unable to access column containing '.' for approxQuantile function on DataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21955) OneForOneStreamManager may leak memory when network is poor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19360) Spark 2.X does not support stored by clause - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12344) Remove env-based configurations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17136) Design optimizer interface for ML algorithms - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20099) Add transformSchema to pyspark.ml - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19667) Create table with HiveEnabled in default database use warehouse path instead of the location of default database - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14046) RandomForest improvement umbrella - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21061) GMM Error : Matrix is not symmetric - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8643) local-cluster may not shutdown SparkContext gracefully - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4899) Support Mesos features: roles and checkpoints - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:08 UTC, 0 replies.
- [jira] [Commented] (SPARK-21484) Wrong query plans of Dataset after persist/unpersist - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12635) More efficient (column batch) serialization for Python/R - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19743) Exception when creating more than one implicit Encoder in REPL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22235) Can not kill job gracefully in spark standalone cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8529) Set metadata for MinMaxScaler - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22859) Permission of created table and database folder are not correct - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20701) dataframe.show has wrong white space when containing Supplement Unicode character - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21251) Add Kafka consumer metrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7200) Tungsten test suites should fail if memory leak is detected - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21821) Support to force kill the CoarseGrainedExecutorBackend process which is likely be orphaned. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19176) Change bin.xml to be compatible with groupId "org.spark-project.hive" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22046) Streaming State cannot be scalable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18688) Interpolated time series join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22955) Error generating jobs when Stopping JobGenerator gracefully - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10496) Efficient DataFrame cumulative sum - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14047) GBT improvement umbrella - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22453) Zookeeper configuration for MesosClusterDispatcher is not documented - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21314) ByteArrayMethods.arrayEquals could use some optimizations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8544) PMML export for Gradient Boosted Trees - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19576) Task attempt paths exist in output path after saveAsNewAPIHadoopFile completes with speculation enabled - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12052) DataFrame with self-join fails unless toDF() column aliases provided - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8984) Developer documentation for ML Pipelines - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21740) DataFrame.write does not work with Phoenix JDBC Driver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22590) SparkContext's local properties missing from TaskContext properties - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20525) ClassCast exception when interpreting UDFs from a String in spark-shell - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15127) Column names are handled incorrectly when they originate from a single Dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16454) Consider adding a per-batch transform for structured streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11609) Resolve permanent Hive UDFs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15810) Aggregator doesn't play nice with Option - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14501) spark.ml parity for fpm - frequent items - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13868) Random forest accuracy exploration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22234) Distinct window functions are not supported - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23460) PySpark concurrency python egg cache directory - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22502) OnlineLDAOptimizer variationalTopicInference might be able to handle empty documents - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21757) Jobs page fails to load when executor removed event's reason contains single quote - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17419) Mesos virtual network support - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21484) Wrong query plans of Dataset after persist/unpersist - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23439) Ambiguous reference when selecting column inside StructType with same name that outer colum - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23467) Enable way to create DataFrame from pre-partitioned files (Parquet/ORC/etc.) with each in-memory partition mapped to 1 physical file partition - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21086) CrossValidator, TrainValidationSplit should preserve all models after fitting - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19939) Add support for association rules in ML - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22314) Accessing Hive UDFs defined without 'USING JAR' from Spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20447) spark mesos scheduler suppress call - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16882) Failures in JobGenerator Thread are Swallowed, Job Does Not Fail - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18712) keep the order of sql expression and support short circuit - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5243) Spark will hang if (driver memory + executor memory) exceeds limit on a 1-worker cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19588) Allow putting keytab file to HDFS location specified in spark.yarn.keytab - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11520) RegressionMetrics should support instance weights - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19366) Dataset should have getNumPartitions method - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15571) Pipeline unit test improvements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7839) Augment build environment to support native libraries with SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23621) DataFrame.insertInto() is persisting all columns for mixed structured data-type - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20113) overwrite mode appends data on MySQL table that does not have a primary key - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3528) Reading data from file:/// should be called NODE_LOCAL not PROCESS_LOCAL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13225) [SQL] Support Intersect All/Distinct - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19747) Consolidate code in ML aggregators - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11075) Spark SQL Thrift Server authentication issue on kerberized yarn cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10335) GraphX Connected Components fail with large number of iterations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22482) Unreadable Parquet array columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14216) ML tree models should have a standardized, reusable feature importance test - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19578) Poor pyspark performance - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22198) Java incompatibility when extending UnaryTransformer or Transformer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19741) ClassCastException when using Dataset with type containing value types - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21752) Config spark.jars.packages is ignored in SparkSession config - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21496) Support codegen for TakeOrderedAndProjectExec - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21598) Collect usability/events information from Spark History Server - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22342) refactor schedulerDriver registration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13025) Allow user to specify the initial model when training LogisticRegression - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19703) Add Suppress/Revive support to the Mesos Spark Driver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20261) EventLoggingListener may not truly flush the logger when a compression codec is used - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19370) Flaky test: MetadataCacheSuite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19322) Allow customizable column name in ScalaReflection.schemaFor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12347) Write script to run all MLlib examples for testing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20013) merge renameTable to alterTable in ExternalCatalog - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21427) Describe mapGroupsWithState and flatMapGroupsWithState for stateful aggregation in Structured Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20803) KernelDensity.estimate in pyspark.mllib.stat.KernelDensity throws net.razorvine.pickle.PickleException when input data is normally distributed (no error when data is not normally distributed) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19657) start-master.sh accidentally forces the use of a loopback address in master URL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14045) DecisionTree improvement umbrella - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11669) Python interface to SparkR GLM module - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21425) LongAccumulator, DoubleAccumulator not threadsafe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19698) Race condition in stale attempt task completion vs current attempt task completion when task is doing persistent state changes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21346) Spark does not use SSL for HTTP File Server and Broadcast Server - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17716) Hidden Markov Model (HMM) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22879) LogisticRegression inconsistent prediction when proba == threshold - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21488) Make saveAsTable() and createOrReplaceTempView() return dataframe of created table/ created view - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22216) Improving PySpark/Pandas interoperability - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11844) can not read class org.apache.parquet.format.PageHeader: don't know what type: 13 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17654) Propagate bucketing information for Hive tables to / from Catalog - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19419) Unable to detect all the cases of cartesian products when spark.sql.crossJoin.enabled is false - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20837) Spark SQL doesn't support escape of single/double quote as SQL standard. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16366) Time comparison failures in SparkR unit tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22096) use aggregateByKeyLocally to save one stage in calculating ItemFrequency in NaiveBayes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22073) Add workaround for HIVE-15653 to avoid stats being accidentally wiped - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19668) Multiple NGram sizes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22633) spark-submit.cmd cannot handle long arguments - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20249) Add summary for LinearSVCModel - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21183) Unable to return Google BigQuery INTEGER data type into Spark via google BigQuery JDBC driver: java.sql.SQLDataException: [Simba][JDBC](10140) Error converting value to long. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19416) Dataset.schema is inconsistent with Dataset in handling columns with periods - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8480) Add setName for Dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22261) Collect and show failed task metrics for ui - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21716) The time-range window can't be applied on the reduce operator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19575) Reading from or writing to a hive serde table with a non pre-existing location should succeed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21916) Set isolationOn=true when create client to remote hive metastore - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18727) Support schema evolution as new files are inserted into table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19890) Make MetastoreRelation statistics estimation more accurately - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19071) Optimizations for ML Pipeline Tuning - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19119) Approximate percentile support for frequency distribution table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22180) Allow IPv6 address in org.apache.spark.util.Utils.parseHostPort - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23506) Add refreshByPath in HiveMetastoreCatalog and invalidByPath in FileStatusCache - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6407) Streaming ALS for Collaborative Filtering - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17322) 'ANY n' clause for SQL queries to increase the ease of use of WHERE clause predicates - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22339) Push epoch updates to executors on fetch failure to avoid fetch retries for missing executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20016) SparkLauncher submit job failed after setConf with special charaters under windows - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18226) SparkR displaying vector columns in incorrect way - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17608) Long type has incorrect serialization/deserialization - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18348) Improve tree ensemble model summary - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21679) KMeans Clustering is Not Deterministic - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16571) DataFrame repartition leads to unexpected error during shuffle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23487) Fix the failure in spark-branch-2.2-lint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23320) RANDOM pseudo environment variable has low resolution under Windows - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21674) Support double/single-quoted strings for alias names in SQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20299) NullPointerException when null and string are in a tuple while encoding Dataset - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21429) show on structured Dataset is equivalent to writeStream to console once - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22459) EdgeDirection "Either" Does Not Considerate Real "Either" Direction - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21368) TPCDSQueryBenchmark can't refer query files. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19346) Add further cold-start strategies for ALS prediction - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15642) Metadata gets lost when selecting a field of a StructType - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11471) Improve the way that we plan shuffled join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21136) Misleading error message for typo in SQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21109) union two dataset[A] don't work as expected if one of the datasets is originated from a dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15798) Secondary sort in Dataset/DataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17459) Add Linear Discriminant to dimensionality reduction algorithms - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21179) Unable to return Hive INT data type into Spark via Hive JDBC driver: Caused by: java.sql.SQLDataException: [Simba][JDBC](10140) Error converting value to int. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18611) mesos shuffle service on v2 isn't compatible with spark v1 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20230) FetchFailedExceptions should invalidate file caches in MapOutputTracker even if newer stages are launched - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20083) Change matrix toArray to not create a new array when matrix is already column major - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22195) Add cosine similarity to org.apache.spark.ml.linalg.Vectors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16180) Task hang on fetching blocks (cached RDD) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:14:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-1359) SGD implementation is not efficient - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23876) OR condition in joins causes results to come back to driver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14649) DagScheduler re-starts all running tasks on fetch failure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17804) Pandas dtypes are not correctly inferred by pyspark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19462) when spark.sql.adaptive.enabled is enabled, DF is not resilient to node/container failure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11502) Word2VecSuite needs appropriate checks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4049) Storage web UI "fraction cached" shows as > 100% - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22504) Optimization in overwrite table in case of failure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22420) Spark SQL return invalid json string for struct with date/datetime field - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23086) Spark SQL cannot support high concurrency for lock in HiveMetastoreCatalog - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21036) Truncate action and writes should be in one transaction. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21186) PySpark with --packages fails to import library due to lack of pythonpath to .ivy2/jars/*.jar - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20414) avoid creating only 16 reducers when calling topByKey() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10294) When Parquet writer's close method throws an exception, we will call close again and trigger a NPE - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21262) Stop sending 'stream request' when shuffle blocks. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20722) Replay event log that hasn't be replayed in current checking period in advance for request - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22250) Be less restrictive on type checking - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7211) Improvements for FPGrowth - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20292) string representation of TreeNode is messy - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22336) When spark-submit cluster mode is run from a Mesos task, the job fails - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19215) Add necessary check for `RDD.checkpoint` to avoid potential mistakes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20959) Add a parameter to UnsafeExternalSorter to configure filebuffersize - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15809) PySpark SQL UDF default returnType - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18689) Support prioritized apps utilizing linux cgroups - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22942) Spark Sql UDF throwing NullPointer when adding a filter on a columns that uses that UDF - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14351) Optimize ImpurityAggregator for decision trees - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18925) Reduce memory usage of mapWithState - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22457) Tables are supposed to be MANAGED only taking into account whether a path is provided - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22183) Inconsistency in LIKE escaping between literal values and column-based ones - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12650) No means to specify Xmx settings for spark-submit in cluster deploy mode for Spark on YARN - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22563) Spark row_number() deterministic generation and materialization as a checkpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9612) Add instance weight support for GBTs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23855) Performing a Join after a CrossJoin can lead to data corruption - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10473) EventLog will loss message in the long-running security application - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19108) Broadcast all shared parts of tasks (to reduce task serialization time) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21417) Detect transitive join conditions via expressions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17121) Support _HOST replacement for principal - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23270) FileInputDStream Streaming UI 's records should not be set to the default value of 0, it should be the total number of rows of new files. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19492) Dataset, filter and pattern matching on elements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18974) FileInputDStream could not detected files which moved to the directory - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4638) Spark's MLlib SVM classification to include Kernels like Gaussian / (RBF) to find non linear boundaries - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21378) Spark Poll timeout when specific offsets are passed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5535) Add parameter for storage levels - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22553) Drop FROM in nonReserved - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21905) ClassCastException when call sqlContext.sql on temp table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12806) Support SQL expressions extracting values from VectorUDT - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18142) Spark Master tries to launch workers 145 times within 1 minute - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18664) Don't respond to HTTP OPTIONS in HTTP-based UIs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21227) Unicode in Json field causes AnalysisException when selecting from Dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22671) SortMergeJoin read more data when wholeStageCodegen is off compared with when it is on - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22474) cannot read a parquet file containing a Seq[Map[MyCaseClass, String]] - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20867) Move individual hints from Statistics into HintInfo class - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23474) mapWithState + async operations = no checkpointing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5844) Optimize Pipeline.fit for ParamGrid - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13493) json to DataFrame to parquet does not respect case sensitiveness - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23479) struct() cannot be combined with alias(metadata={}) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23659) Spark Job gets stuck during shuffle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13610) Create a Transformer to disassemble vectors in DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22594) Handling spark-submit and master version mismatch - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20805) updated updateP in SVD++ is error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18005) optional binary Dataframe Column throws (UTF8) is not a group while loading a Dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12764) XML Column type is not supported (JDBC connection to Postgres) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22237) Spark submit script should use downloaded files in standalone/local client mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13228) HiveSparkSubmitSuite is flaky - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18626) Concurrent write to table fails from spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19929) SparkSQL can't show the hive Managed table's LOCATION property when using the command of 'show create table...' - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11976) Support "." character in DataFrame column name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17300) ClosedChannelException caused by missing block manager when speculative tasks are killed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19689) Job Details page doesn't show 'Tasks: Succeeded/Total' progress bar text properly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22620) Deadlock in blacklisting when executor is removed due to a heartbeat timeout - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22215) Add a configuration parameter to set max size for generated classes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21131) Fix batch gradient bug in SVDPlusPlus - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20240) SparkSQL support limitations of max dynamic partitions when inserting hive table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19356) Number of active tasks is negative even when there is no failed executor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24226) while reading data from oracle 12c from spark and using the numofpartition more than 1 is not returning the exact count - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18115) Custom metrics Sink/Source prevent Executor from starting - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20095) A code bug in CodegenContext.withSubExprEliminationExprs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20225) Spark Job hangs while writing parquet files to HDFS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16636) Missing documentation for CalendarIntervalType type in sql-programming-guide.md - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21754) No Exception/Warn When Join Columns are Differing Types - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19486) Investigate using multiple threads for task serialization - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-761) Print a nicer error message when incompatible Spark binaries try to talk - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19553) Add GroupedData.countApprox() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-1924) Make local:/ scheme work in more deploy modes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19908) Direct buffer memory OOM should not cause stage retries. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20481) Wrong mapping for BooleanType in Spark SQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22213) Spark to detect slow executors on nodes with problematic hardware - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14760) Feature transformers should always invoke transformSchema in transform or fit - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15071) Check the result of all TPCDS queries - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10078) Vector-free L-BFGS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18451) Always set -XX:+HeapDumpOnOutOfMemoryError for Spark tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20324) Control itemSets length in PrefixSpan - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19699) createOrReplaceTable does not always replace an existing table of the same name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20623) Application link returns redirect to SPARK_LOCAL_IP and disregards SPARK_PUBLIC_DNS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22194) Allow namespacing of configs in spark.internal.config - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19909) Batches will fail in case that temporary checkpoint dir is on local file system while metadata dir is on HDFS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22338) namedtuple serialization is inefficient - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20322) MinPattern length in PrefixSpan - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14151) Propose to refactor and expose Metrics Sink and Source interface - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10881) Unable to use custom log4j appender in spark executor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15785) Add initialModel param to Gaussian Mixture Model (GMM) in spark.ml - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16938) Cannot resolve column name after a join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22480) Dynamic Watermarking - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18122) Fallback to Kryo for unknown classes in ExpressionEncoder - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24078) reduce with unionAll takes a long time - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14771) Python ML Param and UID issues - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20028) Implement NGrams aggregate function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20222) No Spark SQL UI when executing queries in Spark SQL CLI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18534) Datasets Aggregation with Maps - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19310) PySpark Window over function changes behaviour regarding Order-By - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11373) Add metrics to the History Server and providers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21257) LDA : create an Evaluator to enable cross validation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19771) Support OR-AND amplification in Locality Sensitive Hashing (LSH) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9579) Improve Word2Vec unit tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18743) StreamingContext.textFileStream(directory) has no events shown in Web UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21390) Dataset filter api inconsistency - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14651) CREATE TEMPORARY TABLE is not supported yet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18831) SQL tab missing from UI when using ElasticSearch connector - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17968) Support using 3rd-party R packages on Mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20729) Reduce boilerplate in Spark ML models - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21117) Built-in SQL Function Support - WIDTH_BUCKET - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18563) mapWithState: initialState should have a timeout setting per record - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12061) Persist for Map/filter with Lambda Functions don't always read from Cache - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21252) The duration times showed by spark web UI are inaccurate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19833) remove SQLConf.HIVE_VERIFY_PARTITION_PATH, we always return empty when the path does not exists - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4862) Streaming | Setting checkpoint as a local directory results in Checkpoint RDD has different partitions error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15829) spark master webpage links to application UI broke when running in cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23180) RFormulaModel should have labels member - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22639) no rowcount estimation returned if groupby clause involves substring - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22200) Kinesis Receivers stops if Kinesis stream was re-sharded - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4240) Refine Tree Predictions in Gradient Boosting to Improve Prediction Accuracy. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18139) Dataset mapGroups with return typ Seq[Product] produces scala.ScalaReflectionException: object $line262.$read not found - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13860) TPCDS query 39 returns wrong results compared to TPC official result set - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22248) spark marks all columns as null when its unable to parse single column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18011) SparkR serialize "NA" throws exception - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22350) Select grouping__id from subquery - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22353) ResultIterable should be indexable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17763) JacksonParser silently parses null as 0 when the field is not nullable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16073) Performance of Parquet encodings on saving primitive arrays - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22492) Spark master web ui server in standalone mode do not honor host option - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17169) To use scala macros to update code when SharedParamsCodeGen.scala changed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3383) DecisionTree aggregate size could be smaller - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20305) Master may keep in the state of "COMPELETING_RECOVERY",then all the application registered cannot get resources, when the leader master change. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:15:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19273) Stage is not retay when shuffle file is lost - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19023) Memory leak on GraphX with an iterative algorithm and checkpoint on the graph - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16428) Spark file system watcher not working on Windows - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19653) `Vector` Type Should Be A First-Class Citizen In Spark SQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22616) df.cache() / df.persist() should have an option blocking like df.unpersist() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21420) Support array type code 'q' and 'Q' - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20856) support statement using nested joins - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20178) Improve Scheduler fetch failures - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18255) SQLContext.getOrCreate always returns a SQLContext even if a user originally created a HiveContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18249) StackOverflowError when saving dataset to parquet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15060) Fix stack overflow when executing long lineage transform without checkpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19844) UDF in when control function is executed before the when clause is evaluated. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23771) Uneven Rowgroup size after repartition - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21452) SessionState in HiveClientImpl is never closed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15797) To expose groupingSets for DataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24019) AnalysisException for Window function expression to compute derivative - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21154) ParseException when Create View from another View in Spark SQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19433) ML Pipeline with long stages takes long time to finish - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21419) Support Mesos failover_timeout in driver (Mesos cluster mode) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24008) SQL/Hive Context fails with NullPointerException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18896) Suppress ScalaCheck warning -- Unknown ScalaCheck args provided when executing tests using sbt - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21997) Spark shows different results on char/varchar columns on Parquet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19258) Spark DataFrame saveAsTable not working properly after Sentry turned on - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23252) When NodeManager and CoarseGrainedExecutorBackend processes are killed, the job will be blocked - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20370) create external table on read only location fails - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23346) Failed tasks reported as success if the failure reason is not ExceptionFailure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19879) Spark UI table sort breaks event timeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4366) Aggregation Improvement - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19288) Failure (at test_sparkSQL.R#1300): date functions on a DataFrame in R/run-tests.sh - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21964) Enable splitting the Aggregate (on Expand) into a number of Aggregates for grouing analytics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12261) pyspark crash for large dataset - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16824) Add API docs for VectorUDT - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20234) Improve Framework for Basic ML Tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22448) Add functions like Mode(), NumNulls(), etc. in Summarizer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19046) Dataset checkpoint consumes too much disk space - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11171) PMML for Pipelines API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20829) var_samp returns Nan while other vendors return a null value - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10149) Locality Level is ANY on "Details for Stage" WebUI page - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6883) Fork pyspark's cloudpickle as a separate dependency - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14764) Spark SQL documentation should be more precise about which SQL features it supports - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21526) Add support to ML LogisticRegression for setting initial model - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20755) UDF registration should throw exception if UDF not found on classpath - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5256) Improving MLlib optimization APIs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20990) Multi-line support for JSON - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12977) Factoring out StreamingListener and UI to support history UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18591) Replace hash-based aggregates with sort-based ones if inputs already sorted - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19854) Refactor file partitioning strategy to make it easier to extend / unit test - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22611) Spark Kinesis ProvisionedThroughputExceededException leads to dropped records - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16087) Spark Hangs When Using Union With Persisted Hadoop RDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22292) Add spark.mem.max to limit the amount of memory received from Mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20001) Support PythonRunner executing inside a Conda env - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23738) Memory usage for executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18610) greatest/leatest fails to run with string aginst date/timestamp - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20811) GBT Classifier failed with mysterious StackOverflowError - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3255) Faster algorithms for logistic regression - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14483) Display user name for each job and query - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16844) Generate code for sort based aggregation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21882) OutputMetrics doesn't count written bytes correctly in the saveAsHadoopDataset function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22321) Improve logging in the mesos scheduler - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14706) Python ML persistence integration test - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19063) Add parameter for storage levels to LDA - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20847) Error reading NULL int[] element from postgres -- null pointer exception. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14327) Scheduler holds locks which cause huge scheulder delays and executor timeouts - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21930) When the number of attempting to restart receiver greater than 0,spark do nothing in 'else' - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10846) Stray META-INF in directory spark-shell is launched from causes problems - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19001) Worker will submit multiply CleanWorkDir and SendHeartbeat task with each RegisterWorker response - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22629) incorrect handling of calls to random in UDFs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19847) port hive read to FileFormat API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4412) Parquet logger cannot be configured - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10719) SQLImplicits.rddToDataFrameHolder is not thread safe when using Scala 2.10 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23969) Using FPGrowth with PipelinedRDD gives EOF Error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23964) why does Spillable wait for 32 elements? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22524) Subquery on UI appear as the same node even if it's not reused - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23610) Cast of ArrayType of NullType to ArrayType of nullable material type does not work - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17302) Cannot set non-Spark SQL session variables in hive-site.xml, spark-defaults.conf, or using --conf - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21014) Support get fields with schema name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19052) the rest api don't support multiple standby masters on standalone cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19111) S3 Mesos history upload fails silently if too large - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22056) Add subconcurrency for KafkaRDDPartition - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18955) Add ability to emit kafka events to DStream or KafkaDStream - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22077) RpcEndpointAddress fails to parse spark URL if it is an ipv6 address. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18806) driverwrapper and executor doesn't exit when worker killed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23764) Utils.tryWithSafeFinally swollows fatal exceptions in the finally block - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19538) DAGScheduler and TaskSetManager can have an inconsistent view of whether a stage is complete. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22117) TasksetManager echo not change after MapoutTracker epoch changed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21386) ML LinearRegression supports warm start from user provided initial model - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22293) Avoid unnecessary traversal in ResolveReferences - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17670) Spark DataFrame/Dataset no longer supports Option[Map] in case classes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17048) ML model read for custom transformers in a pipeline does not work - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17684) 'null' appears in the data during aggregateByKey action. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-1503) Implement Nesterov's accelerated first-order method - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17519) [MESOS] Enhance robustness when ExternalShuffleService is broken - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22411) Heuristic to combine splits in DataSourceScanExec isn't accurate when dynamic allocation is enabled - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16207) order guarantees for DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22491) union all can't execute parallel with group by - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14037) count(df) is very slow for dataframe constructed using SparkR::createDataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22612) NullPointerException in AppendOnlyMap - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23215) Dataset Grouping: Index out of bounds error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19353) Support binary I/O in PipedRDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19685) PipedRDD tasks should not hang on interruption / errors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22567) spark.mesos.executor.memoryOverhead equivalent for the Driver when running on Mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21483) Make org.apache.spark.ml.linalg.Vector bean-compliant so it can be used in Encoders.bean(Vector.class) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5564) Support sparse LDA solutions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19475) (ML|MLlib).linalg.DenseVector method delegation fails for __neg__ - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17820) Spark sqlContext.sql() performs only first insert for HiveQL "FROM target INSERT INTO dest" command to insert into multiple target tables from same source - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16365) Ideas for moving "mllib-local" forward - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22037) Collapse Project if it is the child of Aggregate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19144) Add test for GaussianMixture with distributed decompositions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13525) SparkR: java.net.SocketTimeoutException: Accept timed out when running any dataframe function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23512) Complex operations on Dataframe corrupts data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19870) Repeatable deadlock on BlockInfoManager and TorrentBroadcast - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19259) spark locality in CNI context - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24174) Expose Hadoop config as part of /environment API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11003) Allowing UserDefinedTypes to extend primatives - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21259) More rules for scalastyle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14822) Add lazy executor startup to Mesos Scheduler - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20060) Support Standalone visiting secured HDFS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21560) Add hold mode for the LiveListenerBus - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20170) Enhance spark framework to support failover in case mesos master failure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19902) Support more expression canonicalization: Add, Subtract, Multiply and Divide - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19705) Preferred location supporting HDFS Cache for FileScanRDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20846) Incorrect posgres sql array column schema inferred from table. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19504) clearCache fails to delete orphan RDDs, especially in pyspark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18603) Support `OuterReference` in projection list of IN correlated subqueries - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20422) Worker registration retries should be configurable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19978) spark thrift server to switch to normative hadoop 2.2+ service lifecycle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14652) pyspark streaming driver unable to cleanup metadata for cached RDDs leading to driver OOM - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23810) Matrix Multiplication is so bad, file I/O to local python is better - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20745) Data gets wrongly copied from one row to others, possibly related to named structs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18724) Add TuningSummary for TrainValidationSplit and CountVectorizer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21935) Pyspark UDF causing ExecutorLostFailure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19233) Inconsistent Behaviour of Spark Streaming Checkpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21863) while performing saveToMemSQL, get Exception in thread "Thread-23" java.lang.AssertionError: assertion failed: Task -1024 release lock on block rdd_5_2 more times than it acquired it - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21430) Add PMML support to SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20215) ReuseExchange is boken in SparkSQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22828) Data corruption happens when same RDD being repeatedly used as parent RDD of a custom RDD which reads each parent RDD in concurrent threads - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19243) Error when selecting from DataFrame containing parsed data from files larger than 1MB - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15491) JSON serialization fails for JDBC DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23453) ToolBox compiled Spark UDAF causes java.lang.InternalError: Malformed class name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20938) explain() for datasources implementing CatalystScan does not show pushed predicates correctly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12965) Indexer setInputCol() doesn't resolve column names like DataFrame.col() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22738) Spark YARN UI does not create fully qualified links for paging impacts use with Knox - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14040) Null-safe and equality join produces incorrect result with filtered dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21537) toPandas() should handle nested columns (as a Pandas MultiIndex) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10764) Add optional caching to Pipelines - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22510) Exceptions caused by 64KB JVM bytecode or 64K constant pool entry limit - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8562) Annoying messages about executor lost after stopping SparkContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22435) Support processing array and map type using script - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20765) Cannot load persisted PySpark ML Pipeline that includes 3rd party stage (Transformer or Estimator) if the package name of stage is not "org.apache.spark" and "pyspark" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23696) StructType.fromString swallows exceptions from DataType.fromJson - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20965) Support PREPARE/EXECUTE/DECLARE/FETCH statements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20622) Parquet partition discovery for non key=value named directories - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22547) Don't include executor ID in metrics name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20927) Add cache operator to Unsupported Operations in Structured Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19855) Create an internal FilePartitionStrategy interface - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19628) Duplicate Spark jobs in 2.1.0 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18787) spark.shuffle.io.preferDirectBufs does not completely turn off direct buffer usage by Netty - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12554) Standalone mode may hang if max cores is not a multiple of executor cores - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20122) Analyzer reports "unresolved operator 'Aggregate" for nonexistent window specifications - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12157) Support numpy types as return values of Python UDFs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6685) Use DSYRK to compute AtA in ALS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18599) Add the Spectral LDA algorithm - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21209) Implement Incremental PCA algorithm for ML - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23437) [ML] Distributed Gaussian Process Regression for MLlib - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19551) Theme for PySpark documenation could do with improving - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20281) Table-valued function range in SQL should use the same number of partitions as spark.range - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17434) Flaky test: HiveSparkSubmitSuite's "SPARK-9757 Persist Parquet relation with decimal column" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22427) StackOverFlowError when using FPGrowth - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22531) Migrate the implementation of IDF from MLLib to ML - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18272) Test topic addition for subscribePattern on Kafka DStream and Structured Stream - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22241) Apache spark giving InvalidSchemaException: Cannot write a schema with an empty group: optional group element { - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20062) Inconsistent checking on ML estimator/model copy in the unit tests. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15572) ML persistence in R format: compatibility with other languages - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21482) Make LabeledPoint bean-compliant so it can be used in Encoders.bean(LabeledPoint.class) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11620) parquet.hadoop.ParquetOutputCommitter.commitJob() throws parquet.io.ParquetEncodingException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22679) It's slow to stop streaming context - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22191) Add hive serde example with serde properties - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18650) race condition in FileScanRDD.scala - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18097) Can't drop a table from Hive if the schema is corrupt - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21449) Hive client's SessionState was not closed properly in HiveExternalCatalog - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16523) Support Row Based Aggregation HashMap - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20939) Do not duplicate user-defined functions while optimizing logical query plans - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18067) SortMergeJoin adds shuffle if join predicates have non partitioned columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22599) Avoid extra reading for cached table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-790) Implement the reregistered() callback in MesosScheduler to support master failover - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-2868) Support named accumulators in Python - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17878) Support for multiple null values when reading CSV data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20680) Spark-sql do not support for void column datatype of view - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21838) "Completed Applications" links not always working in cluster with spark.ui.reverseProxy=true - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17154) Wrong result can be returned or AnalysisException can be thrown after self-join or similar operations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9272) Persist information of individual partitions when persisting partitioned data source tables to metastore - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21107) Pyspark: ISO-8859-1 column names inconsistently converted to UTF-8 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18778) Fix the Scala classpath in the spark-shell - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:16:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20859) SQL Loader does not recognize multidimensional columns in postgresql (like integer[]][]) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:17:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19091) createDataset(sc.parallelize(x: Seq)) should be equivalent to createDataset(x: Seq) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:17:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19863) Whether or not use CachedKafkaConsumer need to be configured, when you use DirectKafkaInputDStream to connect the kafka in a Spark Streaming application - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:17:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4836) Web UI should display separate information for all stage attempts - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:17:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20870) Update the output of spark-sql -H - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:17:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19102) Accuracy error of spark SQL results - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:17:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11593) Replace catalyst converter with RowEncoder in ScalaUDF - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:17:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20810) ML LinearSVC vs MLlib SVMWithSGD output different solution - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:17:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19781) Bucketizer's handleInvalid leave null values untouched unlike the NaNs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19248) Regex_replace works in 1.6 but not in 2.0 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14561) History Server does not see new logs in S3 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20313) Possible lack of join optimization when partitions are in the join condition - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22201) Dataframe describe includes string columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16151) Make generated params non-final - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18920) Update outdated date formatting - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21792) Document Spark Streaming Dynamic Allocation Configuration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20248) Spark SQL add limit parameter to enhance the reliability. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23183) Failure caused by TaskContext is missing in the thread spawned by user code - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6810) Performance benchmarks for SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19031) JDBC Streaming Source - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21706) Support Custom PartitionSpec Provider for Kinesis Firehose or similar - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23168) Hints for fact tables and unique columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20683) Make table uncache chaining optional - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14927) DataFrame. saveAsTable creates RDD partitions but not Hive partitions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20347) Provide AsyncRDDActions in Python - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23420) Datasource loading not handling paths with regex chars. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20632) Allow 'Column.getItem()' API to accept Vector columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21476) RandomForest classification model not using broadcast in transform - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19839) Fix memory leak in BytesToBytesMap - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19989) Flaky Test: org.apache.spark.sql.kafka010.KafkaSourceStressSuite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7420) Flaky test: o.a.s.streaming.JobGeneratorSuite "Do not clear received block data too soon" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22098) Add aggregateByKeyLocally in RDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23746) HashMap UserDefinedType giving cast exception in Spark 1.6.2 while implementing UDAF - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19596) After a Stage is completed, all Tasksets for the stage should be marked as zombie - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21761) [Core] Add the application's final state for SparkListenerApplicationEnd event - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22685) Spark Streaming using Kinesis doesn't work if shard checkpoints exist in DynamoDB - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6634) Allow replacing columns in Transformers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20002) Add support for unions between streaming and batch datasets - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17538) sqlContext.registerDataFrameAsTable is not working sometimes in pyspark 2.0.0 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18676) Spark 2.x query plan data size estimation can crash join queries versus 1.x - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19011) ApplicationDescription should add the Submission ID for the standalone cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23287) Spark scheduler does not remove initial executor if not one job submitted - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22724) TakeOrderedAndProjectExec operator has poor performance when sorting on low cardinality keys - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12835) StackOverflowError when aggregating over column from window function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20802) kolmogorovSmirnovTest in pyspark.mllib.stat.Statistics throws net.razorvine.pickle.PickleException when input data is normally distributed (no error when data is not normally distributed) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20382) fileSystem.closed when run load data on spark beeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21236) Make the threshold of using HighlyCompressedStatus configurable. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20071) StringIndexer overflows Kryo serialization buffer when run on column with many long distinct values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12191) Support "." character in DataFrame column name for ML - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19742) When using SparkSession to write a dataset to Hive the schema is ignored - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19767) API Doc pages for Streaming with Kafka 0.10 not current - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21556) PySpark, Unable to save pipeline of non-spark transformers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19177) SparkR Data Frame operation between columns elements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18940) Percentile and approximate percentile support for frequency distribution table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19035) rand() function in case when cause failed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21037) ignoreNulls does not working properly with window functions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10908) ClassCastException in HadoopRDD.getJobConf - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20226) Call to sqlContext.cacheTable takes an incredibly long time in some cases - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22192) An RDD of nested POJO objects cannot be converted into a DataFrame using SQLContext.createDataFrame API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3728) RandomForest: Learn models too large to store in memory - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20658) spark.yarn.am.attemptFailuresValidityInterval doesn't seem to have an effect - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24065) Issue with the property IgnoreLeadingWhiteSpace - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19222) Limit Query Performance issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21691) Accessing canonicalized plan for query with limit throws exception - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23216) Multiclass LogisticRegression could have methods like NCE, NEG, Hierarchical SoftMax, Blackout or IS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22455) Provide an option to store the exception records/files and reasons in log files when reading data from a file-based data source. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19541) High Availability support for ThriftServer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21874) Support changing database when rename table. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20006) Separate threshold for broadcast and shuffled hash join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18693) BinaryClassificationEvaluator, RegressionEvaluator, and MulticlassClassificationEvaluator should use sample weight data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20828) Concatenated grouping sets scenario not supported - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23172) Expand the ReorderJoin rule to handle Project nodes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19390) Replace the unnecessary usages of hiveQlTable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21178) Add support for label specific metrics in MulticlassClassificationEvaluator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18178) Importing Pandas Tables with Missing Values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22736) Consider caching decoded dictionaries in VectorizedColumnReader - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11758) Missing Index column while creating a DataFrame from Pandas - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6527) sc.binaryFiles can not access files on s3 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21697) NPE & ExceptionInInitializerError trying to load UTF from HDFS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21659) FileStreamSink checks for _spark_metadata even if path has globs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-19216) LogisticRegressionModel is missing getThreshold() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-20840) Misleading spurious errors when there are Javadoc (Unidoc) breaks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8118) Turn off noisy log output produced by Parquet 1.7.0 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21184) QuantileSummaries implementation is wrong and QuantileSummariesSuite fails with larger n - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18839) Executor is active on web, but actually is dead - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-22084) Performance regression in aggregation strategy - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23520) Add support for MapType fields in JSON schema inference - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-21158) SparkSQL function SparkSession.Catalog.ListTables() does not handle spark setting for case-sensitivity - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12042) Python API for mllib.stat.test.StreamingTest - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18977) Heavy udf is not stopped by cancelJobGroup - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:18:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-13975) Cannot specify extra libs for executor from /extra-lib - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-7641) Add subsampling of frequent words for Word2Vec - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-13588) Unable to map Parquet file to Hive Table using HiveContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-9953) ML Vector, Matrix semantic equality + hashcode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-17325) Inconsistent Spillable threshold and AppendOnlyMap growing threshold may trigger out-of-memory errors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-14784) Build SQL for EXISTS/IN subquery - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-9438) restarting leader zookeeper causes spark master to die when the spark master election is assigned to zookeeper - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-9333) Tables are not listed in jdbc tools when connected to hivethrift server - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-9697) Project Tungsten (Phase 2) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-7877) Support non-persistent cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-11698) Add option to ignore kafka messages that are out of limit rate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-17156) Add multiclass logistic regression Scala Example - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-17785) Find a more robust way to detect the existing of the initialModel - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-14695) Error occurs when using OFF_HEAP persistent level - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-15906) Complementary Naive Bayes Algorithm Implementation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-16089) Spark2.0 doesn't support the certain static partition SQL statment as "insert overwrite table targetTB PARTITION (partition field=xx) select field1,field2,...,partition field from sourceTB where partition field=xx" while Spark 1.6 supports - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-9461) Possibly slightly flaky PySpark StreamingLinearRegressionWithTests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-16258) Automatically append the grouping keys in SparkR's gapply - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-13743) Adding configurable support for Spark Streaming gracefull timeout - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-10569) Kryo serialization fails on sortByKey operation on registered RDDs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-17307) Document what all access is needed on S3 bucket when trying to save a model - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-10839) SPARK_DAEMON_MEMORY has effect on heap size of thriftserver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-14417) Cleanup Scala deprecation warnings once we drop 2.10.X - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-17168) CSV with header is incorrectly read if file is partitioned - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-10510) Add documentation for how to register a custom Kryo serializer in Spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-8716) Write tests for executor shared cache feature - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-15873) JdbcRDD to support more bound types other than Long and allow multiple bound occurrence for subqueries. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-7525) Could not read data from write ahead log record when Receiver failed and WAL is stored in Tachyon - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-10870) Criteo Display Advertising Challenge - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-16937) Confusing behaviors when View and Temp View sharing the same names - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-12943) spark should distribute truststore if used in yarn-cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-17601) SparkSQL vectorization cannot handle schema evolution for parquet tables when parquet files use Int whereas DataFrame uses Long - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-14597) Streaming Listener timing metrics should include time spent in JobGenerator's graph.generateJobs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-11444) Allow batch seqOp combination in treeAggregate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-18033) Deprecate TaskContext.partitionId - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-13434) Reduce Spark RandomForest memory footprint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-10935) Avito Context Ad Clicks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-17397) Show example of what to do when awaitTermination() throws an Exception - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-8055) Spark Launcher Improvements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-13856) Support initialModel in ALS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-7831) Mesos dispatcher doesn't deregister as a framework from Mesos when stopped - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-8842) Spark SQL - Insert into table Issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-14820) Reduce shuffle data by pushing filter toward storage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-14622) Retain lost executors status - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-9603) Re-enable complex R package test in SparkSubmitSuite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-9111) Dumping the memory info when an executor dies abnormally - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-16623) Datasources should expose ability to define schema conversions they're performing to save data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-13155) add runtime null check when convert catalyst array to external array - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:10 UTC, 2 replies.
- [jira] [Updated] (SPARK-16592) Improving ml.Logistic Regression on speed and scalability - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-16840) Please save the aggregate term frequencies as part of the NaiveBayesModel - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-8517) Improve the organization and style of MLlib's user guide - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-18158) Submit app in standalone cluster mode supervised with HA: all masters have to be up and running - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-8985) Create a test harness to improve Spark's combinatorial test coverage of non-default configurations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-16517) can't add columns on the table create by spark'writer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-14333) Duration of task should be the total time (not just computation time) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-17467) Spark SQL: Return incorrect result for the data files on Swift - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-14192) Executor is dead in Driver but alive in AM when driver losts rpc with executor, but executor is alive. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-18241) If Spark Launcher fails to startApplication then handle's state does not change - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-13128) API for building arrays / lists encoders - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-11203) UDF doesn't support charType column and lit function doesn't allow charType as argument - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-16244) Failed job/stage couldn't stop JobGenerator immediately. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-13216) Spark streaming application not honoring --num-executors in restarting of an application from a checkpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-17950) Match SparseVector behavior with DenseVector - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-17933) Shuffle fails when driver is on one of the same machines as executor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-16160) Last remembered metadata window per dstream is not cleared upon context graceful stop - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-11308) Change spark streaming's job scheduler logic to ensuer guaranteed order of batch processing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-13960) JAR/File HTTP Server doesn't respect "spark.driver.host" and there is no "spark.fileserver.host" option - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-12178) Expose reporting of StreamInputInfo for custom made streams - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-8789) improve SQLQuerySuite resilience by dropping tables in setup - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-10042) Use consistent behavior for Internal Accumulators across stage retries - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-11753) Understand why allowNonNumericNumbers JSON option doesn't work - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-11285) Infinite TaskCommitDenied loop - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-18532) Code generation memory issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-7053) KafkaUtils.createStream leaks resources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-15729) Clarify that saveAs*File doesn't make sense with local FS in cluster context - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-7597) Make default doc build avoid search engine indexing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-7386) Spark application level metrics application.$AppName.$number.cores doesn't reset on Standalone Master deployment - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-10583) Correctness test for Multilayer Perceptron using Weka Reference - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-16149) API consistency discussion: CountVectorizer.{minDF -> minDocFreq, minTF -> minTermFreq} - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-15769) Add Encoder for input type to Aggregator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-10774) Put different event log to different directory according to different conditions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-13532) Spark yarn executor container fails if yarn.nodemanager.local-dirs starts with file:// - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-17497) Preserve order when scanning ordered buckets over multiple partitions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-18079) CollectLimitExec.executeToIterator() should perform per-partition limits - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-17836) Use cross validation to determine the number of clusters for EM or KMeans algorithms - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-14026) Subquery not brodcasted - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-7492) Convert LocalDataFrame to LocalMatrix - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-8460) Distinguish between shuffle and non-shuffle spills in metrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-13585) addPyFile behavior change between 1.6 and before - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-18299) Allow more aggregations on KeyValueGroupedDataset - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-8500) Support for array types in JDBCRDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-12430) Temporary folders do not get deleted after Task completes causing problems with disk space. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-12493) Can't open "details" span of ExecutionsPage in IE11 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-8442) FitTransform method for pipeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-10793) Make spark's use/subclassing of hive more maintainable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-18102) Failed to deserialize the result of task - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-11013) SparkPlan may mistakenly register child plan's accumulators for SQL metrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-15634) SQL repl is bricked if a function is registered with a non-existent jar - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-15588) Paginate Stage Table in Stages tab, Job Table in Jobs tab, and Query Table in SQL tab - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-14068) Pluggable DiskBlockManager - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-13051) Do not maintain global singleton map for accumulators - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-14330) Spark SQL does not infer correct type for joda DateTime - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-15699) Add chi-squared test statistic as a split quality metric for decision trees - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-9879) OOM in LIMIT clause with large number - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-14041) Locate possible duplicates and group them into subtasks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-7706) Allow setting YARN_CONF_DIR from spark argument - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-9195) RDD/Storage metrics don't update cached partition counts when executors are removed/lost - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-10363) DataframeWriter write failed when multi applications write the same partitioned directory - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-10597) MultivariateOnlineSummarizer for weighted instances - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-12419) FetchFailed = false Executor lost should not allowed re-registered in BlockManager Master again? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-11255) R Test build should run on R 3.1.2 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-8665) Update ALS documentation to include performance tips - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-14559) Netty RPC didn't check channel is active before sending message - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-16860) UDT Stringification Incorrect in PySpark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-10586) BlockManager ca't be removed when it is re-registered, then disassociats - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-10797) RDD's coalesce should not write out the temporary key - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-12262) describe extended doesn't return table on detail info tabled stored as PARQUET format - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-15554) Duplicated executors in Spark UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-18041) activedrivers section in http:sparkMasterurl/json is missing Main class information - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-17453) Broadcast block already exists in MemoryStore - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-8391) showDagViz throws OutOfMemoryError, cause the whole jobPage dies - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-17059) Allow FileFormat to specify partition pruning strategy - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-16013) Add option to disable HiveContext in spark-shell/pyspark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-15546) HiveContext : Connecting to MySQL metastore db - Always creates a DERBY database in first attempt - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-15917) Define the number of executors in standalone mode with an easy-to-use property - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-12581) Support case-sensitive table names in postgresql - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-10784) Flaky Streaming ML test umbrella - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-12474) support deserialization for physical plan and hive logical plan from JSON string - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-15385) Jobs never complete for ClusterManagers that don't implement killTask - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-17135) Consolidate code in linear/logistic regression where possible - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-13437) Add InternalColumn - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-10448) Parquet schema merging should NOT merge UDT - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-11986) background thread spill the context to disk - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-17152) Spark Flume sink fails with begin() called when transaction is OPEN - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-10882) Add the ability to connect to secured mqtt brokers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-15161) Consider moving featureImportances into TreeEnsemble models base class - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-17695) Deserialization error when using DataFrameReader.json on JSON line that contains an empty JSON object - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-10961) Specified metastore 0.12.0 but spark-shell still using metastore classes for 0.13+ - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-11666) Find the best `k` by cutting bisecting k-means cluster tree without recomputation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-8514) LU factorization on BlockMatrix - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-14531) Flume streaming should respect maxRate (and backpressure) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-15855) dataframe.R example fails with "java.io.IOException: No input paths specified in job" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-6821) Refactor SerDe API in SparkR to be more developer friendly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-7043) KryoSerializer cannot be used with REPL to interpret code in which case class definition and its shipping are in the same line - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-13111) Spark UI is showing negative number of sessions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-18032) Spark test failed as OOM in jenkins - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-16204) Row() interface - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-10153) Unable to query Avro data from Flume using SparkSQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-13083) Small spark sql queries get blocked if there is a long running query over a lot a partitions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-13731) expression evaluation for NaN in select statement - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-12947) Spark with Swift throws EOFException when reading parquet file - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-17221) Build File-based Test Cases for Using Join and Left-Semi Join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-18246) Throws an exception before execution for unsupported types in Json, CSV and text functionailities - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-11143) SparkMesosDispatcher can not launch driver in docker - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-13073) creating R like summary for logistic Regression in Spark - Scala - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-7653) ML Pipeline and meta-algs should take random seed param - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-14800) Dealing with null as a value in options for each internal data source - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-15564) App name is the main class name in Spark streaming jobs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-15374) Spark created Parquet files cause NPE when a column has only NULL values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-9563) Remove repartition operators when they are the child of Exchange and shuffle=True - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-16235) "evaluateEachIteration" is returning wrong results when calculated for classification model. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-16490) Python mllib example for chi-squared feature selector - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-17553) On Master Change the running Apps goes to wait state even though resource are available - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-9474) Consistent hadoop config for core - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-9313) Enable a "docker run" invocation in place of PYSPARK_PYTHON - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-7412) Designing distributed prediction model abstractions for spark.ml - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-11794) Improve error message when HDFS/S3 access is misconfigured when using Parquet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-17655) Remove unused variables declarations and definations in a WholeStageCodeGened stage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-10010) Decide on a name for generating Linear/LogisticRegressionSummary on test set data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-12675) Executor dies because of ClassCastException and causes timeout - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-8946) Intermittent SnappyCompressionCodec. IllegalArgumentException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-16423) Inconsistent settings on the first day of a week - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-8423) More informative DecisionTreeModel.toDebugString - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-13795) ClassCast Exception while attempting to show() a DataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-8724) Need documentation on how to deploy or use SparkR in Spark 1.4.0+ - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-10347) Investigate the usage of normalizePath() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-16717) Dataframe (jdbc) is missing a way to link and external function to get a connection - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-16417) spark 1.5.2 receiver store(single-record) with ahead log enabled makes executor crash if there is an exception when BlockGenerator storing block - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-8986) GaussianMixture should take smoothing param - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-8502) One character switches into uppercase, causing failures [serialization? shuffle?] - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-17377) Joining Datasets read and aggregated from a partitioned Parquet file gives wrong results - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-10992) Partial Aggregation Support for Hive UDAF - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-18253) ML Instrumentation logging requires too much manual implementation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-7786) Allow StreamingListener to be specified in SparkConf and loaded when creating StreamingContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-12338) Support dropping duplicated rows on selected columns in DataFrame in R style - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-9503) Mesos dispatcher NullPointerException (MesosClusterScheduler) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-17476) Proper handling for unseen labels in logistic regression training. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-6840) SparkR: private package functions unavailable when using lapplyPartition in package - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-10854) MesosExecutorBackend: Received launchTask but executor was null - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-16741) spark.speculation causes duplicate rows in df.write.jdbc() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-12472) OOM when sort a table and save as parquet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-16616) Allow Catalyst to take Advantage of Hash Partitioned DataSources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-16413) The JDBC UI does not show the job id when spark.sql.thriftServer.incrementalCollect=true - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-16823) One dimensional typed select on DataFrame does not work as expected - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-9717) Document persistence recommendation for MulticlassMetrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-11397) PySpark Streaming uses threadcontext class loader, may cause issues on mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-11250) Generate different alias for columns with same name during join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-11525) Support spark packages containing R source code in Standalone mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-17435) RowEncoder should be documented and publicly accessable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-17935) Add KafkaForeachWriter in external kafka-0.8.0 for structured streaming module - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-18096) Spark on have - 'Update' save mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-13909) DataFrames DISK_ONLY persistence leads to OOME - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-15191) createDataFrame() should mark fields that are known not to be null as not nullable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-10659) DataFrames and SparkSQL saveAsParquetFile does not preserve REQUIRED (not nullable) flag in schema - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-9761) Inconsistent metadata handling with ALTER TABLE - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-14530) When click the job in the webUI, I get a problem about "Access denied." - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-7221) Expose the current processed file name of FileInputDStream to the users - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-12373) Type coercion rule of dividing two decimal values may choose an intermediate precision that does not have enough number of digits at the left of decimal point - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-17030) Remove/Cleanup HiveMetastoreCatalog.scala - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-12726) ParquetConversions doesn't always propagate metastore table identifier to ParquetRelation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-8858) Common interface for Frequent Itemsets - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-10808) LDA user guide: discuss running time of LDA - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-10832) sometimes No event logs found for application using same JavaSparkSQL example - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-16641) Add an Option to Create a Dataset With a Case Class, Ignoring Column Names (Using ordinal instead) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-7618) Word2VecModel cache normalized wordVectors to speed up findSynonyms - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-13654) get_json_object fails with java.io.CharConversionException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-9919) Matrices should respect Java's equals and hashCode contract - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-17125) Allow to specify spark config using non-string type in SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-7925) Address inconsistencies in capturing appName in different Metrics Sources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-14053) Merge absTol and relTol into one in MLlib tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-9739) Execution visualizer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-12431) add local checkpointing to GraphX - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-14588) Consider getting column stats from files (wherever feasible) to get better stats for joins - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-12445) Fix null exception when passing null as array in toCatalystArray - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-17050) Improve initKMeansParallel with treeAggregate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-16004) Improve CatalogTable information - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-18037) Event listener should be aware of multiple tries of same stage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-10404) Worker should terminate previous executor before launch new one - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-15846) Allow passing a PrintStream to DataFrame.explain - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-11886) R function name conflicts with base or stats package ones - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-11366) binary functions (methods) such as rlike in pyspark.sql.column only accept strings but should also accept another Column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-12571) AWS credentials not available for read.parquet in SQLContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-12117) Column Aliases are Ignored in callUDF while using struct() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-10607) Scheduler should include defensive measures against infinite loops due to task commit denial - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-8494) ClassNotFoundException when running with sbt, scala 2.10.4, spray 1.3.3 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-13623) Relaxed mode for querying Dataframes, so columns that don't exist or have an incompatible schema return null rather than error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-10493) reduceByKey not returning distinct results - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-17600) Cannot set public address for Worker and Master Web UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-13683) Finalize the public interface for OutputWriter[Factory] - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-11543) spark.ml LogisticRegressionModel should prefer thresholds over threshold - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-15804) Manually added metadata not saving with parquet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-12763) Spark gets stuck executing SSB query - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-16408) SparkSQL Added file get Exception: is a directory and recursive is not turned on - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-18100) Improve the performance of get_json_object using Gson - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-12853) Update query planner to use only bucketed reads if it is useful - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-14606) Different maxBins value for categorical and continuous features in RandomForest implementation. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-11228) Job stuck in Executor failure loop when NettyTransport failed to bind - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-10945) GraphX computes Pagerank with NaN (with some datasets) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-11907) Allowing errors as values in DataFrames (like 'Either Left/Right') - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-6874) Add support for SQL:2003 array type declaration syntax - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-12565) Document standalone mode jar distribution more clearly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-16614) DirectJoin with DataSource for SparkSQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-14354) Let Expand take name expressions and infer output attributes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-12257) Non partitioned insert into a partitioned Hive table doesn't fail - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-17937) Clarify Kafka offset semantics for Structured Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-9696) Add random seed Param to PySpark ML Pipeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-13800) Hive conf will be modified on multi-beeline connect to thriftserver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-15592) Paginate query table in SQL tab - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-11632) Filter out empty partition for KafkaRDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-14180) Deadlock in CoarseGrainedExecutorBackend Shutdown - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-7499) Investigate how to specify columns in SparkR without $ or strings - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-16889) Add formatMessage Column expression for formatting strings in java.text.MessageFormat style in Scala API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-15337) Unable to Make Run-time Changes on Hive-related Conf - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-15233) Spark task metrics should include hdfs read write latency - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-9721) TreeTests.checkEqual should compare predictions on data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-16424) Add support for Structured Streaming to the ML Pipeline API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-14905) create conda environments w/locked package versions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-8663) Dirver will be hang if there is a job submit during SparkContex stop Interval - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-9317) Print DataFrame entries in the shell for small DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-18174) Avoid Implicit Type Cast in Arguments of Expressions Extending String2StringExpression - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-14390) Make initialization step in Pregel optional. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-9009) SPARK Encryption FileNotFoundException for truststore - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-12567) Add aes_encrypt and aes_decrypt UDFs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-16075) Make VectorUDT/MatrixUDT singleton under spark.ml package - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-11465) Support multiple eigenvectors in power iteration clustering - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-15700) Spark 2.0 dataframes using more memory (reading/writing parquet) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-18411) Add Argument Types and Test Cases for String Functions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-14707) Linear algebra: clarify light vs heavy constructors and accessors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-17189) [MINOR] Looses the interface from UnsafeRow to InternalRow in AggregationIterator if UnsafeRow specific method is not used - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-11321) Allow addition of non-nullable UDFs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-11823) HiveThriftBinaryServerSuite tests timing out, leaves hanging processes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-17146) Add RandomizedSearch to the CrossValidator API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-11411) Use the space at end of pages - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-6823) Add a model.matrix like capability to DataFrames (modelDataFrame) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-13375) PySpark API Utils missing item: kFold - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-11558) Get ML feature names from column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-14361) Support EXCLUDE clause in Window function framing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-10193) Eliminate "skipped" stages for shared shuffle dependencies by reusing associated Stage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-12185) Add Histogram support to Spark SQL/DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-13811) No Push Down for Null Filtering of Compound Expressions Generated from Constraints - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-17858) Provide option for Spark SQL to skip corrupt files - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-14849) shuffle broken when accessing standalone cluster through NAT - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-10333) Add user guide for linear-methods.md columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-10501) support UUID as an atomic type - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-11240) PMML export for SVM models in ML pipeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-11384) Wrong duration times for (cached) iterative tasks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-15002) Calling unpersist can cause spark to hang indefinitely when writing out a result - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-15920) Fix incorrect DataFrame references in Pyspark docs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-16788) Investigate JSR-310 & scala-time alternatives to our own datetime utils - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-8454) Update ShuffleBlockFetcherIterator.next to close previous stream when returning a new one - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-8519) Blockify distance computation in k-means - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-13161) Extend MLlib LDA to include options for Author Topic Modeling - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-11202) Unsupported dataType - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-17691) Add aggregate function to collect list with maximum number of elements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-15930) Add Row count property to FPGrowth model - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-10054) Add a timeout for launching Receiver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-8569) Documentation for SPARK_HISTORY_OPTS unclear - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-13691) Scala and Python generate inconsistent results - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-10555) Add INotifyDStream to Spark Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-15201) Handle integer overflow correctly in hash code computation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-10860) Bivariate Statistics: Chi-Squared independence test - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-8672) throws NPE when running spark sql thrift server with session state authenticator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-11354) Expose custom log4j to executor page in Spark standalone cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-17730) Show task size (including the broadcast variable for the task) in web UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-7947) Serdes Command not working - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-13272) Clean-up CatalystQl - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-15499) Add python testsuite with remote debug and single test parameter to help developer debug code easier. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-12909) Spark on Mesos accessing Secured HDFS w/Kerberos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-9315) SparkR DataFrame improvements to be more R-friendly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-6915) VectorIndexer improvements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-8132) Race condition if task is cancelled with interruption while fetching file dependencies - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-8361) Session of ThriftServer is still alive after I exit beeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-8987) Increase test coverage of DAGScheduler - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-17831) registerTempTable is ignoring database clarifications - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-11454) DB2 dialect - map DB2 ROWID and TIMESTAMP with TIMEZONE types into valid Spark types - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-16767) existsRecursively method in UserDefinedType is not correct - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-17170) Enable whole partition pruning for InMemoryTableScanExec - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-7341) Fix the flaky test: org.apache.spark.streaming.InputStreamsSuite.socket input stream - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-10478) Improve spark.ml.ann implementations for MLP - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-12209) spark.streaming.concurrentJobs and spark.streaming.backpressure.enabled - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-12688) Spill size metric does not update for tungsten aggregation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-16093) Spark2.0 take no effect after set spark.sql.autoBroadcastJoinThreshold = 1 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-16657) Replace children by innerChildren in InsertIntoHadoopFsRelationCommand and CreateHiveTableAsSelectCommand - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-14859) [PYSPARK] Make Lambda Serializer Configurable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:21:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-13497) PMML export for logistic regression does not conform to the PMML standard - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:03 UTC, 0 replies.
- [jira] [Updated] (SPARK-18340) Inconsistent error messages in launching scripts and hanging in sparkr script for wrong options - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:03 UTC, 0 replies.
- [jira] [Updated] (SPARK-13662) [SQL][Hive] Have SHOW TABLES return additional fields from Hive MetaStore - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:03 UTC, 0 replies.
- [jira] [Updated] (SPARK-10407) Possible Stack-overflow using InheritableThreadLocal nested-properties for SparkContext.localProperties - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-16820) Sparse - Sparse matrix multiplication - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-13639) Statistics.colStats(rdd).mean and variance should handle NaN in the input vectors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-17416) Add Dataset.groupByKey overload that takes a value selector function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-18068) Spark SQL doesn't parse some ISO 8601 formatted dates - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-14248) Get the path hierarchy from root to leaf in the BisectingKMeansModel - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-17481) Flaky test: org.apache.spark.DistributedSuite.passing environment variables to cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-17517) Improve generated Code for BroadcastHashJoinExec - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-13634) Assigning spark context to variable results in serialization error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-9937) GraphX Performance: Partition overhead scales quadratically - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-10218) Adding aggregated metrics by stage or job in history server - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-18449) Name option is being ignored when submitting an R application via spark-submit - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-14593) Make currentVars work with splitExpressions to enable whole stage codegen for large input columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-12448) Add UserDefinedType support to Cast - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-9488) pyspark.sql.types.Row very slow when using named arguments - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-9623) RandomForestRegressor: provide variance of predictions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-14233) NativeThread.signal(Native Method): No such file or directory - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-17550) DataFrameWriter.partitionBy() should throw exception if column is not present in Dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-13259) SPARK_HOME should not be used as the CWD in docker executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-13946) PySpark DataFrames allows you to silently use aggregate expressions derived from different table expressions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-13756) Reuse Query Fragments - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-8722) PR merge script should warn when merging a PR that has failed tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-17457) Spark SQL shows poor performance for group by and sort by on multiple columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-17872) aggregate function on dataset with tuples grouped by non sequential fields - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-17784) Add fromCenters method for KMeans - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-17995) Use new attributes for columns from outer joins - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-10713) SPARK_DIST_CLASSPATH ignored on Mesos executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-17201) Investigate numerical instability for MLOR without regularization - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-6854) No support for get and eval-quote - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-12930) NullPointerException running hive query with array dereference in select and where clause - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-7495) Improve ML attribute documentation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-11688) UDF's doesn't work when it has a default arguments - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-18491) Spark uses mutable classes for date/time types mapping - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-17759) Avoid adding duplicate schedulables - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-7348) DAG visualization: add links to RDD page - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-10787) Consider replacing ObjectOutputStream for serialization to prevent OOME - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-11280) Mesos cluster deployment using only one node - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-17789) Don't force users to set k for KMeans if initial model is set - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-17226) Allow defining multiple date formats per column in csv - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-17754) DataFrame reader and writer don't show Input/Output metrics in Spark UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-16227) Json schema inference fails when `:` exists in file path - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-14539) Fetching delegation tokens in Hive-Thriftserver fails when hive.server2.enable.doAs = True - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-10517) Console "Output" field is empty when using DataFrameWriter.json - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-11664) Add methods to get bisecting k-means cluster structure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-12800) Subtle bug on Spark Yarn Client under Kerberos Security Mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-16338) Streaming driver running on standalone cluster mode with supervise goes into bad state when application is killed from the UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-16480) Streaming checkpointing does not work well with SIGTERM - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-12083) java.lang.IllegalArgumentException: requirement failed: Overflowed precision (q98) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-11408) Display scientific notation properly in DataFrame.show() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-12147) Off heap storage and dynamicAllocation operation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-13978) [GSoC 2016] Build monitoring UI and related infrastructure for Spark SQL and structured streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-15143) CSV data source is not being tested as HadoopFsRelation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-17247) when fall back to hdfs is enabled for stats calculation, the hdfs listing and size calcuation should be terminated as soon as total size > broadcast threshold - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-16141) Compress complex-typed data (e.g., Array) in InMemoryRelation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-16269) Support null handling for vectorized hashmap during hash aggregate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-12208) Abstract the examples into a common place - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-14256) Remove parameter sqlContext from as.DataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-16484) Incremental Cardinality estimation operations with Hyperloglog - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-6837) SparkR failure in processClosure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-6838) Explore using Reference Classes instead of S4 objects - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-12313) getPartitionsByFilter doesnt handle predicates on all / multiple Partition Columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-13283) Spark doesn't escape column names when creating table on JDBC - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-9873) Cap the amount of executors launched in Mesos fine grain mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-18605) Spark Streaming ERROR TransportResponseHandler: Still have 1 requests outstanding when connection - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-8557) Successful Jobs marked as KILLED Spark 1.4 Standalone - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-15833) REST API for streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-16891) arrayFilter function for filtering elements of array column based on a predicate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-10572) Investigate the contentions bewteen tasks in the same executor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-12402) Memory leak in pyspark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-10936) UDAF "mode" for categorical variables - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-18078) Add option for customize zipPartition task preferred locations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-11154) make specificaition spark.yarn.executor.memoryOverhead consistent with typical JVM options - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-15090) Spark Hive thriftserver can get 413 errors in Kerberos+AD deployments - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-10444) Remove duplication in Mesos schedulers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-13174) Add API and options for csv data sources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-13005) Don't require spark.shuffle.service.port to be set on job conf - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-15521) Add high level APIs based on dapply and gapply for easier usage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-17954) FetchFailedException executor cannot connect to another worker executor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-11268) Non-daemon startup scripts - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-14035) Flaky spark.mllib.classification.NaiveBayesSuite test - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-11801) Notify driver when OOM is thrown before executor JVM is killed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-17630) jvm-exit-on-fatal-error handler for spark.rpc.netty like there is available for akka - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-8545) PMML improvement umbrella - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-17036) Hadoop config caching could lead to memory pressure and high CPU usage in thrift server - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-14920) Location should not be Specified in Creating Non-external Table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-17357) Simplified predicates can't be pushed down through operators because of the rule order in Optimizer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-18334) What hashDistance should MinHash use? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-7546) Example code for ML Pipelines feature transformations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-10776) Pass location of SparkR source files from R process to JVM - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-15859) Optimize the Partition Pruning with Disjunction - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-16161) Ambiguous error message for unsupported correlated predicate subqueries - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-9269) Add Set to the matching type in ArrayConverter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-9325) Support `collect` on DataFrame columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-14980) Spark is not picking up classes in the MutableURLClassLoader causing errors with drools - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-12097) How to do a cached, batched JDBC-lookup in Spark Streaming? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-9932) Data source API improvement (Spark 1.6) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-12099) Standalone and Mesos Should use OnOutOfMemoryError handlers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-16635) Provide Session support in the Spark UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-14326) Can't specify "long" type in structField - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:22:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-11238) SparkR: Documentation change for merge function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-16504) UDAF should be typed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-18202) Spark throws a mysterious system error when a Hive command has at least 100,000 results - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-7609) Add standardized checks for (Model, Estimator) unit tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-9857) Add expression functions into SparkR which conflict with the existing R's generic - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-12787) Dataset to support custom encoder - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-6816) Add SparkConf API to configure SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-11529) Add section in user guide for StreamingLogisticRegressionWithSGD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-13872) Memory leak in SortMergeOuterJoin - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-15845) Expose metrics for sub-stage transformations and action - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-10919) Association rules class should return the support of each rule - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-7791) Set user for executors in standalone-mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-8734) Expose all Mesos DockerInfo options to Spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-11167) Incorrect type resolution on heterogeneous data structures - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-15828) YARN is not aware of Spark's External Shuffle Service - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-15272) DirectKafkaInputDStream doesn't work with window operation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-11576) [SQL] Incorrect results when using the nested self-join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-10250) Scala PairRDDFunctions.groupByKey() should be fault-tolerant of single large groups - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-6853) Contents of .globalenv in workers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-17604) Support purging aged file entry for FileStreamSource metadata log - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-9038) Missing TaskEnd event when task attempt is superseded by another (speculative) attempt - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-12013) Add a Hive context method to retrieve the database Names - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-7610) Design clustering abstractions for Pipelines API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-16518) Schema Compatibility of Parquet Data Source - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-8133) sticky partitions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-10658) Could pyspark provide addJars() as scala spark API? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-8009) [Mesos] Allow provisioning of executor logging configuration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-14166) Add deterministic sampling like in Hive - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-17177) Make grouping columns accessible from RelationalGroupedDataset - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-12899) Spark Data Security - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-8523) Spark streaming static files not found when running web ui under virtual host - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-10105) Adding most k frequent words parameter to Word2Vec implementation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-15324) Add the takeSample function to the Dataset - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-12473) Reuse serializer instances for performance - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-9442) java.lang.ArithmeticException: / by zero when reading Parquet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-12782) reindex() columns in DataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-10561) Provide tooling for auto-generating Spark SQL reference manual - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-10950) ApplicationHistoryInfo to include spark version; History Server to report incompatibility with later versions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-13653) Split DiskBlockObjectWriter to separate object- and byte-based write interfaces - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-15350) Add unit test function for LogisticRegressionWithLBFGS in JavaLogisticRegressionSuite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-10183) Expose the SparkR backend api - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-12372) Document limitations of MLlib local linear algebra - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-16999) Convert Keyword `path` of Data Source Tables to Lower Case - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-15366) Add Application Detail UI uri to Spark Json API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-7680) Add a fake Receiver that generates random strings, useful for prototyping - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-12506) Push down WHERE clause arithmetic operator to JDBC layer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-15673) Indefinite hanging issue with combination of cache, sort and unionAll - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-12385) Push projection into Join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-17603) Utilize Hive-generated Statistics For Partitioned Tables - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-16871) Support getting HBase tokens from multiple clusters dynamically - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-8110) DAG visualizations sometimes look weird in Python - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-14427) Support persisting partitioned data source relations in Hive compatible format - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-12825) Spark-submit Jar URL loading fails on redirect - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-13513) add some tests for leap year handling in catalyst - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-15338) Run-time Change on the conf spark.sql.warehouse.dir Does not Affect the conf hive.metastore.warehouse.dir - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-14788) Initial number of executors should honor min number if streaming dynamic allocation is enabled - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-14683) Configure external links in ScalaDoc - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-12601) worker output a large number of log when size RollingPolicy shouldRollover method use loginfo - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-15896) Clean shuffle files after finish the SQL query - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-13979) Killed executor is respawned without AWS keys in standalone spark cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-12270) JDBC Where clause comparison doesn't work for DB2 char(n) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-11185) Add more task metrics to the "all Stages Page" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-15027) ALS.train should use DataFrame instead of RDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-9372) For a join operator, rows with null equal join key expression can be filtered out early - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-14332) Support Hadoop Input/OutputFormat Counters as Metrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-16728) migrate internal API for MLlib trees from spark.mllib to spark.ml - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-6834) Failed with error: ā€˜invalid package nameā€™ Error in as.name(name) : attempt to use zero-length variable name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-10385) Bivariate statistics in DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-14805) accumulator values are not escaped when written to event logs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-11770) Spark SQL field resolution error in GROUP BY HAVING clause - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-9744) Add RDD method to map with lag and lead - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-13729) Reimplement the planning tests on SimpleTextRelation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-7434) DSL for Pipeline assembly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-14924) Tuning estimatorParamMaps with OneVsRest.classifier fails during persistence - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-12815) Compute Wilcoxon-Mann-Whitney rank sum statistic - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-13273) Improve test coverage of CatalystQl - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-16532) Provide a REST API for submitting and tracking status of jobs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-16126) Better Error Message When using DataFrameReader without `path` - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-15432) Two executors with same id in Spark UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-11152) Streaming UI: Input sizes are 0 for makeup batches started from a checkpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-14708) Repl Serialization Issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-10401) spark-submit --unsupervise - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-18017) Changing Hadoop parameter through sparkSession.sparkContext.hadoopConfiguration doesn't work - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-16378) HiveContext doesn't release resources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-9881) Support additional KryoRegistrators via an SPI lookup - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-11168) Explore precise behavior of treeAggregate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-11570) ambiguous hostname resolving during startup - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-7379) pickle.loads expects a string instead of bytes in Python 3. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-8778) The item of "Scheduler delay" is not consistent between "Event Timeline" and "Task List" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-8534) Gini for regression metrics and evaluator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-14890) DAGScheduler should not accept the result of a previous task attempt, since its stage attempt has been completed. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-12214) Spark to provide an API to save user-defined metadata as part of Sequence File header - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-10187) Sometimes Web UI reports Application history not found - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-11231) join returns schema with duplicated and ambiguous join columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-17045) Moving Auto_Joins from HiveCompatibilitySuite to SQLQueryTestSuite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-12319) ExchangeCoordinatorSuite fails on big-endian platforms - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-16997) Allow loading of JSON float values as TimestampType - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-16076) Dataset - outer join nulls can sometimes combinate to default values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-7682) Size of distributed grids still limited by cPickle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-17781) datetime is serialized as double inside dapply() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-15582) Support for Groovy closures - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-16593) Provide a pre-fetch mechanism to accelerate shuffle stage. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-10928) Spark Mesos finegrain mode with single core CPUs, rendered useless - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-8912) Documentation on the effective objective function in LoR when `standardization` is true/false - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-10826) MasterSource should not access Master.workers, apps, waitingApps directly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-10406) Document spark on yarn distributed cache symlink functionality - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-8546) PMML export for Naive Bayes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-12918) Support R SQL UDF in SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-8983) ML Tuning Cross-Validation Improvements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-15993) PySpark RuntimeConfig should be immutable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-7038) [Streaming] Spark Sink requires spark assembly in classpath - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-15872) Dataset of Array of Custom case class throws MissingRequirementError - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-14148) Kmeans Sum of squares - Within cluster, between clusters and total - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-13116) TungstenAggregate though it is supposedly capable of all processing unsafe & safe rows, fails if the input is safe rows - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-16790) Warn (or fail) when user query reads no data due to pruning - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-7809) MultivariateOnlineSummarizer should allow users to configure what to compute - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-9235) PYSPARK_DRIVER_PYTHON env variable is not set on the YARN Node manager acting as driver in yarn-cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-18571) pyspark: UTF-8 not written correctly (as CSV) when locale is not UTF-8 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-16679) Move `private[sql]` methods in public APIs used for Python/R into a single ā€˜helper classā€™ - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-15909) PySpark classpath uri incorrectly set - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-17824) QR solver for WeightedLeastSquares - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-15219) [Spark SQL] it don't support to detect runtime temporary table for enabling broadcast hash join optimization - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-10568) Error thrown in stopping one component in SparkContext.stop() doesn't allow other components to be stopped - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-10293) Add support for oversubscription in Mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-8137) Improve treeAggregate to combine all data on one machine first - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-17749) Unresolved columns when nesting SQL join clauses - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-12942) Provide option to allow control the precision of numerical type for DataFrameWriter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-7167) Receivers are not distributed efficiently when starting from checkpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-11196) Support for equality and pushdown of filters on some UDTs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-16363) Spark-submit doesn't work with IAM Roles - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-13374) make it possible to create recoverable accumulator for streaming application - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-16537) Extracting partition columns from the hive table. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-9781) KCL Workers should be configurable from Spark configuration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-15331) Disallow All the Unsupported CLI Commands - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-15800) Accessing kerberised hdfs from Spark running with Resource Manager - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-13641) getModelFeatures of ml.api.r.SparkRWrapper cannot (always) reveal the original column names - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-16744) spark.yarn.appMasterEnv handling assumes values should be appended - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-15566) Expose null checking function to Python land. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-11838) Spark SQL query fragment RDD reuse across queries - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-11703) Improve the docker-mesos image - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-7409) Designing multilabel abstractions for spark.ml - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-13006) Use Log4j for Spark Standalone Executor Logging - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-9137) Unified label verification for Classifier - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-11517) Calc partitions in parallel for multiple partitions table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-11400) BroadcastNestedLoopJoin should support LeftSemi join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-18225) job will miss when driver removed by master in spark streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-12163) FPGrowth unusable on some datasets without extensive tweaking of the support threshold - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-7872) Transaction stack trace with Spark Streaming and Flume - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-8526) Provide a way to define custom metrics and custom metric sink in Spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-8108) Build Hive module by default (i.e. remove -Phive profile) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-18421) Dynamic disk allocation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-9941) Try ML pipeline API on Kaggle competitions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-11160) CloudPickeSerializer conflicts with xmlrunner - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-10972) UDFs in SQL joins - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-15194) Add Python ML API for MultivariateGaussian - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-8566) lateral view query blows up unless pushed down to a subquery - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-8082) Functionality to Reset DF Schemas/Cast Multiple Columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-15987) PostgreSQL CITEXT type JDBC support - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-15568) TimSort and RadixSort can't support more than 1 billions elements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-10069) Python's ReduceByKeyAndWindow DStream Keeps Growing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-16486) Python API parity issues from 2.0 QA - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-16655) Spark thrift server application is not stopped if its in ACCEPTED stage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-7966) add Spreading Activation algorithm to GraphX - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-14186) Size awareness during spill and shuffle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-13650) Usage of the window() function on DStream - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-14985) Update LinearRegression, LogisticRegression summary internals to handle model copy - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-8469) Application timeline view unreadable with many executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-14003) Multi-session can not work when one session is moving files for "INSERT ... SELECT" clause - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-15507) ClassCastException: SomeCaseClass cannot be cast to org.apache.spark.sql.Row - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-12778) Use of Java Unsafe should take endianness into account - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-17856) JVM Crash during tests: pyspark.mllib.linalg.distributed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-13428) Pushdown Aggregate Functions in Sort into Aggregate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-10646) Bivariate Statistics: Pearson's Chi-Squared goodness of fit test - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-16001) request that spark history server write a log entry whenever it (1) tries cleaning old event logs and (2) has found and deleted old event logs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-12067) Fix usage of isnan, isnull, isnotnull of Column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-16262) Impossible to remake new SparkContext using SparkSession API in Pyspark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-15460) Issue Exceptions from Thrift Server and Spark-SQL Cli when Users Inputting hive.metastore.warehouse.dir - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-10951) Support private S3 repositories using spark-submit via --repositories flag - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-9588) spark sql cache: partition level cache eviction - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-13409) Log the stacktrace when stopping a SparkContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-15048) when running Thriftserver with yarn on a secure cluster it will pass the wrong keytab location. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-9003) Add map/update function to MLlib/Vector - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-10806) Following val redefinition, sometimes the old value is still visible - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-15575) Remove breeze from dependencies? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-15666) Join on two tables generated from a same table throwing query analyzer issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-16830) Executors Keep Trying to Fetch Blocks from a Bad Host - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-9431) TimeIntervalType for for time intervals - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-11741) Process doctests using TextTestRunner/XMLTestRunner - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-13345) Adding one way ANOVA to Spark ML stat - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-15015) Log statements lack file name/number - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-8503) SizeEstimator returns negative value for recursive data structures - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-7708) Incorrect task serialization with Kryo closure serializer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-12845) During join Spark should pushdown predicates on joining column to both tables - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-7885) add config to control map aggregation in spark sql - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-15169) Consider improving HasSolver to allow generalization - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-13222) checkpoint latest stateful RDD on graceful shutdown - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-15970) WARNing message related to persisting table to Hive Megastore while Spark SQL is running in-memory catalog mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-13707) Streaming UI tab misleading for window operations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-10837) TimeStamp could not work on sparksql very well - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-12528) Make Apache Sparkā€™s gateway hidden REST API (in standalone cluster mode) public API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-12622) spark-submit fails on executors when jar has a space in it - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-8298) Sliding Window CrossValidator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-9108) Expose Kryo serializer buffer size - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-15039) Kinesis reciever does not work in Yarn - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-10995) Graceful shutdown drops processing in Spark Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-11067) Spark SQL thrift server fails to handle decimal value - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-8543) PMML export for Random Forest - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-11911) spark-perf test for MultilayerPerceptron - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-14464) Logistic regression performs poorly for very large vectors, even when the number of non-zero features is small - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-10628) Add support for arbitrary RandomRDD generation to PySparkAPI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-16407) Allow users to supply custom StreamSinkProviders - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-14534) Should SparkContext.parallelize(List) take an Iterable instead? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-16680) Set spark.driver.userClassPathFirst=true, and run spark-sql failed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-17256) spark-submit.cmd cannot work if path has space and cut off double-quoted arguments - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-16039) Spark SQL - Number of rows inserted by Insert Sql - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-16314) Spark application got stuck when NM running executor is restarted - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-7172) Worker includes ExecutorRunner and DriverRunner in Akka messages - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-11441) HadoopFsRelation is not scalable in number of files read/written - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-8459) Add import/export to spark.mllib bisecting k-means - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-12283) Use UnsafeRow as the buffer in SortBasedAggregation to avoid Unsafe/Safe conversion - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-8113) SQL module test cleanup - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-15835) The read path of json doesn't support write path when schema contains Options - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-13516) Dataframe inconsistency after aggregation+union+projection. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-13836) IsNotNull Constraints for the BinaryComparison inside Not - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-9300) histogram_numeric aggregate function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-10055) San Francisco Crime Classification - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-8333) Spark failed to delete temp directory created by HiveContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-8449) HDF5 read/write support for Spark MLlib - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-16810) Refactor registerSinks with multiple constructos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-10910) spark.{executor,driver}.userClassPathFirst don't work for kryo (probably others) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-13799) Add memory back pressure for Spark Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-7874) Add a global setting for the fine-grained mesos scheduler that limits the number of concurrent tasks of a job - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-10897) Custom job/stage names - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-16954) UDFs should allow output type to be specified in terms of the input type - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-12923) Optimize successive dapply() calls in SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-9957) Spark ML trees should filter out 1-category features - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:41 UTC, 0 replies.
- [jira] [Updated] (SPARK-13388) PySpark Pipeline and PipelineModel should take advantages of its Scala companion - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:42 UTC, 0 replies.
- [jira] [Updated] (SPARK-16727) SparkR unit test fails - incorrect expected output - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:42 UTC, 0 replies.
- [jira] [Updated] (SPARK-14140) Futures timeout exception in executor logs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:42 UTC, 0 replies.
- [jira] [Updated] (SPARK-9842) Push down Spark SQL UDF to datasource UDF - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:42 UTC, 0 replies.
- [jira] [Updated] (SPARK-12469) Data Property Accumulators for Spark (formerly Consistent Accumulators) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:42 UTC, 0 replies.
- [jira] [Updated] (SPARK-13630) Add optimizer rule to collapse sorts - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:42 UTC, 0 replies.
- [jira] [Updated] (SPARK-10519) Investigate if we should encode timezone information to a timestamp value stored in JSON - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:42 UTC, 0 replies.
- [jira] [Updated] (SPARK-18499) Add back support for custom Spark SQL dialects - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:42 UTC, 0 replies.
- [jira] [Updated] (SPARK-9578) Stemmer feature transformer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:43 UTC, 0 replies.
- [jira] [Updated] (SPARK-9427) Add expression functions in SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:43 UTC, 0 replies.
- [jira] [Updated] (SPARK-15486) dropTempTable does not work with backticks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:43 UTC, 0 replies.
- [jira] [Updated] (SPARK-6832) Handle partial reads in SparkR JVM to worker communication - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:43 UTC, 0 replies.
- [jira] [Updated] (SPARK-10803) Allow users to write and query Parquet user-defined key-value metadata directly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:43 UTC, 0 replies.
- [jira] [Updated] (SPARK-16737) ListingFileCatalog comments about RPC calls in object store isn't correct - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:43 UTC, 0 replies.
- [jira] [Updated] (SPARK-11943) Rapidly starting and stopping SparkContexts in local-cluster mode may cause JVM to exit - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:44 UTC, 0 replies.
- [jira] [Updated] (SPARK-16833) [Spark2.0]when creating temporary function,command "add jar" doesn't work unless restart spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:44 UTC, 0 replies.
- [jira] [Updated] (SPARK-10898) Setting spark.streaming.concurrentJobs causes blocks to be deleted before read - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:44 UTC, 0 replies.
- [jira] [Updated] (SPARK-9377) Shuffle tuning should discuss task size optimisation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:44 UTC, 0 replies.
- [jira] [Updated] (SPARK-11365) consolidate aggregates for summary statistics in weighted least squares - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:44 UTC, 0 replies.
- [jira] [Updated] (SPARK-14723) A new way to support dynamic allocation in Spark Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:44 UTC, 0 replies.
- [jira] [Updated] (SPARK-7413) Time to write shuffle spill files is not captured in ShuffleWriteMetrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:44 UTC, 0 replies.
- [jira] [Updated] (SPARK-14112) Unique Constraints over a Set of AttributeReferences - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:44 UTC, 0 replies.
- [jira] [Updated] (SPARK-11309) Clean up hacky use of MemoryManager inside of HashedRelation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:45 UTC, 0 replies.
- [jira] [Updated] (SPARK-6826) `hashCode` support for arbitrary R objects - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:45 UTC, 0 replies.
- [jira] [Updated] (SPARK-14715) Provide a way to mask partitions of a Dataset/Dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:23:45 UTC, 0 replies.
- [jira] [Updated] (SPARK-13392) KafkaSink for Metrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-9203) Make filesystem pluggable in BlockStore and BlockManager - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-14336) Add more unit tests for buffer release in off-heap-caching codepaths - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-14262) Correct app state after master leader changed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-12206) Streaming WebUI shows incorrect batch statistics when using Window operations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-10370) After a stages map outputs are registered, all running attempts should be marked as zombies - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-16108) Why is KMeansModel (scala) private? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-13332) Decimal datatype support for SQL pow - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-18405) Add yarn-cluster mode support to Spark Thrift Server - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:04 UTC, 0 replies.
- [jira] [Updated] (SPARK-13777) Weighted Least Squares fails when there are features with identical values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-7825) Poor performance in Cross Product due to no combine operations for small files. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-12045) Use joda's DateTime to replace Calendar - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:05 UTC, 1 replies.
- [jira] [Updated] (SPARK-13605) Bean encoder cannot handle nonbean properties - no way to Encode nonbean Java objects with columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-15258) Nested/Chained case statements generate codegen over 64k exception - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-16629) UDTs can not be compared to DataTypes in dataframes. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:05 UTC, 0 replies.
- [jira] [Updated] (SPARK-17083) Make the use of the default database more explicit - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-9004) Add s3 bytes read/written metrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-7177) Create standard way to wrap Spark CLI scripts for external projects - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-15890) Support Stata-like tabulation of values in a single column, optionally with weights - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-7338) Survival Modelling - Cox proportional hazards - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-18596) Add checking and caching to ML BisectingKmeans - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-14947) Showtable Command - Can't List Tables Using JDBC Connector - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-8142) Spark Job Fails with ResultTask ClassCastException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-12161) SparkSQL Cache Matching Improvement - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-11288) Specify the return type for UDF in Scala - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-6822) lapplyPartition passes empty list to function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-16594) Physical Plan Differences when Table Scan Having Duplicate Columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-16481) Spark does not update statistics when making use of Hive partitions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:06 UTC, 0 replies.
- [jira] [Updated] (SPARK-18500) Make GenericStrategy be able to prune plans by itself after placeholders are replaced. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-14586) SparkSQL doesn't parse decimal like Hive - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-13102) Run query using ThriftServer, and open web using IE11, i click ā€+detail" in SQLPage, but not response - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-14283) Avoid sort in randomSplit when possible - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-10031) Join two UnsafeRows in SortMergeJoin if possible - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-8520) Improve GLM's scalability on number of features - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-17522) [MESOS] More even distribution of executors on Mesos cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-15877) DataSource executed twice when using ORDER BY - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-16684) Standalone mode local dirs not properly cleaned if job is killed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-14730) Expose ColumnPruner as feature transformer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:07 UTC, 0 replies.
- [jira] [Updated] (SPARK-8369) Support dependency jar and files on HDFS in standalone cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-9595) Adding API to SparkConf for kryo serializers registration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-11651) LinearRegressionSummary should support get residuals by type - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-14054) Support parameters for UDTs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-11535) StringIndexer should handle empty String specially - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-12886) Expose params to control how feature names are generated - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-10062) Use tut for typechecking and running code in user guides - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-11941) JSON representation of nested StructTypes could be more uniform - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-14276) Allow each HiveContext to have distinct users that they will use to connect to the HiveMetastore - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-11665) Support other distance metrics for bisecting k-means - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-17959) spark.sql.join.preferSortMergeJoin has no effect for simple join due to calculated size of LogicalRdd - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:08 UTC, 0 replies.
- [jira] [Updated] (SPARK-10627) Regularization for artificial neural networks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-15038) Add ability to do broadcasts in SQL at execution time - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-11151) Use Long internally for DecimalType with precision <= 18 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-12607) spark-class.cmd produced null command strings for "exec" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-13317) SPARK_LOCAL_IP does not bind to public IP on Slaves - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:09 UTC, 0 replies.
- [jira] [Updated] (SPARK-9778) remove unnecessary evaluation from SortOrder - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-18167) Flaky test when hive partition pruning is enabled - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-14193) Skip unnecessary sorts if input data have been already ordered in InMemoryRelation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:10 UTC, 0 replies.
- [jira] [Updated] (SPARK-12072) python dataframe ._jdf.schema().json() breaks on large metadata dataframes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-18095) There is a display problem in spark UI storage tab when rdd was persisted in multiple replicas - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-15912) Replace getPartitionsByFilter by getPartitions in inputFiles of MetastoreRelation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-18556) Suboptimal number of tasks when writing partitioned data with desired number of files per directory - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-9237) Added Top N Column Values for DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-16539) When worker is killed driver continues to run causing issues in supervise mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-12699) R driver process should start in a clean state - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-13900) Spark SQL queries with OR condition is not optimized properly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-15831) Kryo 2.21 TreeMap serialization bug causes random job failures with RDDs of HBase puts - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-9184) spark.task.cpus under Mesos should accept floating point numbers (partial CPU) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:11 UTC, 0 replies.
- [jira] [Updated] (SPARK-15941) Netty RPC implementation ignores the executor bind address - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-15482) ClassCast exception when join two tables. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-15376) DataFrame write.jdbc() inserts more rows than acutal - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-12861) Changes to support KMeans with large feature space - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-18014) Filters are incorrectly being grouped together when there is processing in between - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-9820) NullPointerException that causes failure to request executors. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-10551) Successful task-end event after task failed due to executor loss - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-15897) Function Registry should just take in FunctionIdentifier for type safety and avoid duplicating - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-9961) ML prediction abstractions should have defaultEvaluator fields - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-11022) Spark Worker need improve the executor garbage while the app has massive failures - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-17386) Default polling and trigger intervals cause excessive RPC calls - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-11458) add word count example for Dataset - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-15389) DataFrame filter by isNotNull fails in complex, large case - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-17775) pyspark: take(num) failed, but collect() worked for big dataset - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-16327) The two subquery in join operator can't be execute in parallel - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-14759) After join one cannot drop dynamically added column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-16807) Optimize some ABS() statements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-16263) SparkSession caches configuration in an unituitive global way - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-17132) binaryFiles method can't handle paths with embedded commas - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:24:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-13432) Add the origin of the source code into a generated Java code - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-10306) sbt hive/update issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-11579) Method SGDOptimizer and LBFGSOptimizer in FeedForwardTrainer should not create new optimizer every time they got invoked - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-15145) port binary classification evaluator to spark.ml - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:12 UTC, 0 replies.
- [jira] [Updated] (SPARK-14032) Eliminate Unnecessary Distinct/Aggregate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-11364) HadoopFsRelation doesn't reload the hadoop configuration for each execution - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-10376) Once/When YARN permits it, only use POST for kill action - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-12836) spark enable both driver run executor & write to HDFS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-18018) Specify alternate escape character in 'LIKE' expression - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-9599) Dynamic partitioning based on key-distribution - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-14297) DAG visualization cropped in SparkUI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-16246) Too many block-manager-slave-async-thread opened (TIMED_WAITING) for spark Kafka streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-12609) Make R to JVM timeout configurable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-10977) SQL injection bugs in JdbcUtils and DataFrameWriter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:13 UTC, 0 replies.
- [jira] [Updated] (SPARK-9475) Consistent hadoop config for external/* - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-13999) Run 'group by' before building cube - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-10387) Code generation for decision tree - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-13753) Column nullable is derived incorrectly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-9035) Spark on Mesos Thread Context Class Loader issues - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-15976) Make Stage Numbering determinstic - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-12360) Support using 64-bit long type in SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-14428) [SQL] Allow more flexibility when parsing dates and timestamps in json datasources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-11478) ML StringIndexer return inconsistent schema - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-13463) Support Column pruning for Dataset logical plan - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-17254) Filter operator should have ā€œstop if falseā€ semantics for sorted data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-15746) SchemaUtils.checkColumnType with VectorUDT prints instance details in error message - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:14 UTC, 0 replies.
- [jira] [Updated] (SPARK-13943) The behavior of sum(booleantype) in Spark DataFrames is not intuitive - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-8587) Return cost and cluster index KMeansModel.predict - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-12605) Pushing Join Predicates Through Union All - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-14926) OneVsRest labelMetadata uses incorrect name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-9134) LDA Asymmetric topic-word prior - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-15503) Not able to join two hive tables having partition using HiveContext.sql - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-16682) pyspark 1.6.0 not handling multiple level import when the necessary files are zipped - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-15720) MLLIB Word2Vec loading large number of vectors in the model results in java.lang.NegativeArraySizeException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-13886) ArrayType of BinaryType not supported in Row.equals method - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-18248) spark on mesos memory sizing with offheap - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-17249) java.lang.IllegalStateException: Did not find registered driver with class org.apache.spark.sql.execution.datasources.jdbc.DriverWrapper - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-13377) binaryFileRDD preferredLocations issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-17520) Create a more performant __eq__ for SparseMatrix - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:15 UTC, 0 replies.
- [jira] [Updated] (SPARK-16237) PySpark gapply - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-12358) Spark SQL query with lots of small tables under broadcast threshold leading to java.lang.OutOfMemoryError - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-15567) Refactor ExecutorAllocationManager - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-17020) Materialization of RDD via DataFrame.rdd forces a poor re-distribution of data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-16631) Stopping sparkcontext does not shutdown fileserver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-13853) QueryPlan sub-classes should override producedAttributes to fix missingInput - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-18277) na.fill() and friends should work on struct fields - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-17151) Decide how to handle inferring number of classes in Multinomial logistic regression - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-18040) Improve R handling or messaging of JVM exception - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-16070) DataFrame/Parquet issues with primitive arrays - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-13003) Provide a option to trim InputRelation metadata in QueryPlan.toJSON - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-13077) Expose BlockGenerator functionality in the public API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-14956) Spark dependencies conflicts - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-13212) Provide a way to unregister data sources from a SQLContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-9346) Conversion is applied three times on partitioned data sources that require conversion - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-7857) IDF w/ minDocFreq on SparseVectors results in literal zeros - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-16743) converter and access code out of sync: createDataFrame on RDD[Option[C]] fails with MatchError - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-13908) Limit not pushed down - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-7960) Serialization problem when multiple receivers are specified in a loop - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-13940) Predicate Transitive Closure Transformation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-8341) Significant selector feature transformation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-10625) Spark SQL JDBC read/write is unable to handle JDBC Drivers that adds unserializable objects into connection properties - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-9345) Failure to cleanup on exceptions causes persistent I/O problems later on - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-7002) Persist on RDD fails the second time if the action is called on a child RDD without showing a FAILED message - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-17067) Revocable resource support - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-17911) Scheduler does not need messageScheduler for ResubmitFailedStages - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-14736) Deadlock in registering applications while the Master is in the RECOVERING mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-9525) Optimize SparseVector initializations in linalg - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-11524) Support SparkR with Mesos cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-15336) Detect and take action about executors running on corrupted nodes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-14583) SparkSQL doesn't apply TBLPROPERTIES('serialization.null.format'='') when Hive Table has partitions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-10313) Support HA stateful driver on Mesos cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-7126) For spark.ml Classifiers, automatically index labels if they are not yet indexed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-12514) Spark MetricsSystem can fill disks/cause OOMs when using GangliaSink - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-10297) When save data to a data source table, we should bound the size of a saved file - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-11028) When planning queries without partial aggregation support, we should try to use TungstenAggregate. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-9695) Add random seed Param to ML Pipeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-15907) Issue Exception when Not Enough Input Columns for Dynamic Partitioning - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-16607) Aggregator with null initialisation will result in null - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-12277) Use sparkIMain to compile and interpret string throw java.lang.ClassNotFoundException. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-15147) Catalog should have a property to indicate case-sensitivity - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-16969) GBTClassifier needs a raw prediction column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-10520) Dates cannot be summarised - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-8453) Unioning two RDDs in PySpark doesn't spill to disk - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-10645) Bivariate Statistics: Spearman's Correlation in DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-16506) Subsequent dataframe join dont work - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-18056) Update KafkaDStreams from 10.0.1 to 10.1.0 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-17539) Streaming Backpressure Starves DirectStream When Used In Combination With Receivers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-14751) SparkR fails on Cassandra map with numeric key - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-13773) UDF being applied to filtered data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-13284) Cannot submit app from Windows java.io.FileNotFoundException: /C: - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-17871) Dataset joinwith syntax should support specifying the condition in a compile-time safe way - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-7871) Improve the outputPartitioning for HashOuterJoin(full outer join) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-13145) checkAnswer in SQL query suites should tolerate small float number error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-9189) Takes locality and the sum of partition length into account when partition is instance of HadoopPartition in operator coalesce - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-8513) _temporary may be left undeleted when a write job committed with FileOutputCommitter fails due to a race condition - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-10420) Implementing Reactive Streams based Spark Streaming Receiver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-16471) Remove Hive-specific CreateHiveTableAsSelectLogicalPlan - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-10857) SQL injection bug in JdbcDialect.getTableExistsQuery() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-9559) Worker redundancy/failover in spark stand-alone mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-15747) Support SPARK_CONF_DIR/spark-defaults.d/*.conf drop-in style config files - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-8761) Master.removeApplication is not thread-safe but is called from multiple threads - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-8516) ML attribute API in PySpark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-8295) SparkContext shut down in Spark Project Streaming test suite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-15727) Add UPSERT/MERGE mode to DataFrameWriter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-17288) Spark sbin script on windows support - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-10869) Auto-normalization of semi-structured schema from a dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-10150) --force=true option is not working in beeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-12113) Add timing metrics to blocking phases for spark sql - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-7764) Add negative sampling to Word2Vec - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-16147) Add package docs to packages under spark.ml - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-14207) Transformer for splitting a Vector/Array column into individual columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-6920) Be more explicit about references to "executor" and "task" in Spark on Mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-15005) Usage of Temp Table twice in Hive query fails with bad error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-10687) Discuss nonparametric survival analysis model - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-15712) Proper temp table support - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-14533) RowMatrix.computeCovariance inaccurate when values are very large - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-13762) support only column names in schema string at createDataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-17482) Analyzer should be able run on top of optimized rule - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-16718) gbm-style treeboost - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-12929) Adding IBM Platform Conductor for Spark as new Supplemental Project - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-9484) Word2Vec import/export for original binary format - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-11241) Add a common trait for PMML exportable ML pipeline models - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-14857) Table/Database Name Validation in SessionCatalog - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-17501) Re-register BlockManager again and again - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-17443) SparkLauncher should allow stoppingApplication and need not rely on SparkSubmit binary - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-7398) Add back-pressure to Spark Streaming (umbrella JIRA) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-16701) Make parameters configurable in BlockManager - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-14755) Dynamic proxy class cause a ClassNotFoundException on task deserialization - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-15006) Generated JavaDoc should hide package private objects - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-16092) Spark2.0 take no effect after set hive.exec.dynamic.partition.mode=nonstrict as a global variable in Spark2.0 configuration file while Spark1.6 does - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-7852) Set the initial weights based on the previous when GLMs are run with multiple regParams - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-15358) Rename HiveDDLCommandSuite and DDLCommandSuite to HivePlanParserSuite and PlanParserSuite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-12628) SparkUI: weird formatting on additional metrics tooltip - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-15619) spark builds filling up /tmp - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-16298) spark.yarn.principal not working - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-8982) Worker hostnames not showing in Master web ui when launched with start-slaves.sh - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-11106) Should ML Models contains single models or Pipelines? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-11387) minimize shuffles during joins by using existing partitions and bundling messages - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-11095) Simplify Netty RPC implementation by using a separate thread pool for each endpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-8697) MatchIterator not serializable exception in RegexTokenizer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-15130) PySpark shared params should include default values to match Scala - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-15830) Spark application should get hive tokens only when it is required - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-9473) Consistent hadoop config for SQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-11822) Add docs for new Netty RPC configuration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-11132) Mean Shift algorithm integration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-12308) Better onDisconnected logic for Master - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-11102) Uninformative exception when specifing non-exist input for JSON data source - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-11708) 20-25% performance regression in TeraSort - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-11977) Support accessing a DataFrame column using its name without backticks if the name contains '.' - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-13461) Merge and clean up duplicated MLlib example code - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-12170) Deprecate the JAVA-specific deserialized storage levels - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-11156) Web UI doesn't count or show info about replicated blocks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-13239) Click-Through Rate Prediction - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-6987) Node Locality is determined with String Matching instead of Inet Comparison - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-13611) import Aggregator doesn't work in Spark Shell - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-11704) Optimize the Cartesian Join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-15903) Support AllColumn expression in UDF functions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-14443) parse_url() does not escape query parameters - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-15105) Remove HiveSessionHook from ThriftServer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-14325) some strange name conflicts in `group_by` - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-14557) Reading textfile (created though CTAS) doesn't work when pathFilter is enabled. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-13717) Let RandomSampler can sample with Java iterator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-9532) Standardize checks for Predictor and other prediction abstractions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-16913) [SQL] Better codegen where querying nested struct - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-14405) broadcast variables can cause lots of warning messages on shutdown - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-11159) Nested SQL UDF raises java.lang.UnsupportedOperationException: Cannot evaluate expression - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-14111) Correct output nullability with constraints for logical plans - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-13336) Add non-numerical summaries to DataFrame.describe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-10702) Dynamic Allocation in Standalone Breaking Parallelism - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-17774) Add support for head on DataFrame Column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-17334) Provide management tools for broadcasted variables - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-18595) Handling ignoreIfExists in HiveExternalCatalog createTable API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-13934) SqlParser.parseTableIdentifier cannot recognize table name start with scientific notation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-9017) More timers for MLlib algorithms - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-10086) Flaky StreamingKMeans test in PySpark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-12428) Write a script to run all PySpark MLlib examples for testing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-12843) Spark should avoid scanning all partitions when limit is set - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-17319) Move addJar from HiveSessionState to HiveSharedState - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-16739) GBTClassifier should be a Classifier, not Predictor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-15540) RFormula and R feature processing improvement umbrella - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-13243) jersey WADL generator complains about Seq[] type in REST APIs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-16595) Spark History server Rest Api gives Application not found error for yarn-cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-15589) Anaylze simple PySpark closures and generate SQL expressions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-13059) Sort inputsplits by size in HadoopRDD to avoid long tails - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-12674) Spark on Mesos executor exit incorrect - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-14641) Specify worker log dir separately from scratch space dir - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-13536) Missing Sort Attributes in Transform Clause - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-17395) Queries on CSV partition table result in frequent GC - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-13720) SQLBuilder.toSQL is broken when there is a sort by after having - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-16869) Wrong projection when join on columns with the same name which are derived from the same dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-10609) Improve task distribution strategy in taskSetManager - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-15119) DecisionTreeParams.minInfoGain does not have a validator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-17606) New batches are not created when there are 1000 created after restarting streaming from checkpoint. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-10895) Add pushdown string filters for Parquet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-17489) Improve filtering for bucketed tables - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-7867) Support "revoke role ..." - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-16362) Suport ArrayType and StructType in vectorization Parquet reader - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-11038) Consolidate the format of UnsafeArrayData and UnsafeMapData - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-10312) Enhance SerDe to handle atomic vector - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-7494) spark.ml Model should call copyValues in construction - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-14425) SQL/dataframe join error: mixes up the columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-18083) Locality Sensitive Hashing (LSH) - BitSampling - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-18469) Cannot make MLlib model predictions in Spark streaming with checkpointing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-15418) SparkSQL does not support using a UDAF in a CREATE VIEW clause - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-16066) Right now we don't have provision to pass custom param to executor dockers(Something like spark.mesos.executor.docker.parameters). - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-10638) spark streaming stop gracefully keeps the spark context - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-11733) Allow shuffle readers to request data from just one mapper - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-12141) Use Jackson to serialize all events when writing event log - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-16921) RDD/DataFrame persist() and cache() should return Python context managers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-11401) PMML export for Logistic Regression Multiclass Classification - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-18059) Spark not respecting partitions in partitioned Hive views - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-13288) [1.6.0] Memory leak in Spark streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-17116) Allow params to be a {string, value} dict at fit time - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-13618) Make Streaming web UI display rate-limit lines in the statistics graph - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-17815) Report committed offsets - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:25:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16227) Json schema inference fails when `:` exists in file path - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10055) San Francisco Crime Classification - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10218) Adding aggregated metrics by stage or job in history server - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16737) ListingFileCatalog comments about RPC calls in object store isn't correct - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10031) Join two UnsafeRows in SortMergeJoin if possible - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7925) Address inconsistencies in capturing appName in different Metrics Sources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8500) Support for array types in JDBCRDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8716) Write tests for executor shared cache feature - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8523) Spark streaming static files not found when running web ui under virtual host - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9237) Added Top N Column Values for DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7809) MultivariateOnlineSummarizer should allow users to configure what to compute - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15896) Clean shuffle files after finish the SQL query - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14390) Make initialization step in Pregel optional. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15002) Calling unpersist can cause spark to hang indefinitely when writing out a result - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9488) pyspark.sql.types.Row very slow when using named arguments - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13336) Add non-numerical summaries to DataFrame.describe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18014) Filters are incorrectly being grouped together when there is processing in between - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17858) Provide option for Spark SQL to skip corrupt files - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:34 UTC, 1 replies.
- [jira] [Resolved] (SPARK-9919) Matrices should respect Java's equals and hashCode contract - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11911) spark-perf test for MultilayerPerceptron - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8113) SQL module test cleanup - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18299) Allow more aggregations on KeyValueGroupedDataset - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8526) Provide a way to define custom metrics and custom metric sink in Spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12763) Spark gets stuck executing SSB query - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9842) Push down Spark SQL UDF to datasource UDF - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10776) Pass location of SparkR source files from R process to JVM - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11159) Nested SQL UDF raises java.lang.UnsupportedOperationException: Cannot evaluate expression - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12628) SparkUI: weird formatting on additional metrics tooltip - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16871) Support getting HBase tokens from multiple clusters dynamically - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13886) ArrayType of BinaryType not supported in Row.equals method - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16607) Aggregator with null initialisation will result in null - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13513) add some tests for leap year handling in catalyst - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11160) CloudPickeSerializer conflicts with xmlrunner - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8912) Documentation on the effective objective function in LoR when `standardization` is true/false - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8361) Session of ThriftServer is still alive after I exit beeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18469) Cannot make MLlib model predictions in Spark streaming with checkpointing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8663) Dirver will be hang if there is a job submit during SparkContex stop Interval - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7947) Serdes Command not working - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11570) ambiguous hostname resolving during startup - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10313) Support HA stateful driver on Mesos cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16013) Add option to disable HiveContext in spark-shell/pyspark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11688) UDF's doesn't work when it has a default arguments - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17950) Match SparseVector behavior with DenseVector - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15038) Add ability to do broadcasts in SQL at execution time - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14926) OneVsRest labelMetadata uses incorrect name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9345) Failure to cleanup on exceptions causes persistent I/O problems later on - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11524) Support SparkR with Mesos cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8513) _temporary may be left undeleted when a write job committed with FileOutputCommitter fails due to a race condition - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16298) spark.yarn.principal not working - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15350) Add unit test function for LogisticRegressionWithLBFGS in JavaLogisticRegressionSuite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12277) Use sparkIMain to compile and interpret string throw java.lang.ClassNotFoundException. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7874) Add a global setting for the fine-grained mesos scheduler that limits the number of concurrent tasks of a job - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18078) Add option for customize zipPartition task preferred locations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14336) Add more unit tests for buffer release in off-heap-caching codepaths - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16001) request that spark history server write a log entry whenever it (1) tries cleaning old event logs and (2) has found and deleted old event logs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9325) Support `collect` on DataFrame columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17067) Revocable resource support - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13900) Spark SQL queries with OR condition is not optimized properly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17824) QR solver for WeightedLeastSquares - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16506) Subsequent dataframe join dont work - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15130) PySpark shared params should include default values to match Scala - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14956) Spark dependencies conflicts - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8672) throws NPE when running spark sql thrift server with session state authenticator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17836) Use cross validation to determine the number of clusters for EM or KMeans algorithms - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13536) Missing Sort Attributes in Transform Clause - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8566) lateral view query blows up unless pushed down to a subquery - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14533) RowMatrix.computeCovariance inaccurate when values are very large - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6920) Be more explicit about references to "executor" and "task" in Spark on Mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7653) ML Pipeline and meta-algs should take random seed param - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7885) add config to control map aggregation in spark sql - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9473) Consistent hadoop config for SQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12319) ExchangeCoordinatorSuite fails on big-endian platforms - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15582) Support for Groovy closures - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14723) A new way to support dynamic allocation in Spark Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14035) Flaky spark.mllib.classification.NaiveBayesSuite test - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17522) [MESOS] More even distribution of executors on Mesos cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14530) When click the job in the webUI, I get a problem about "Access denied." - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8724) Need documentation on how to deploy or use SparkR in Spark 1.4.0+ - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16075) Make VectorUDT/MatrixUDT singleton under spark.ml package - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17655) Remove unused variables declarations and definations in a WholeStageCodeGened stage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8110) DAG visualizations sometimes look weird in Python - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16631) Stopping sparkcontext does not shutdown fileserver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8055) Spark Launcher Improvements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17489) Improve filtering for bucketed tables - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15272) DirectKafkaInputDStream doesn't work with window operation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12262) describe extended doesn't return table on detail info tabled stored as PARQUET format - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13212) Provide a way to unregister data sources from a SQLContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11143) SparkMesosDispatcher can not launch driver in docker - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7002) Persist on RDD fails the second time if the action is called on a child RDD without showing a FAILED message - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7398) Add back-pressure to Spark Streaming (umbrella JIRA) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15835) The read path of json doesn't support write path when schema contains Options - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14531) Flume streaming should respect maxRate (and backpressure) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16092) Spark2.0 take no effect after set hive.exec.dynamic.partition.mode=nonstrict as a global variable in Spark2.0 configuration file while Spark1.6 does - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10687) Discuss nonparametric survival analysis model - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11013) SparkPlan may mistakenly register child plan's accumulators for SQL metrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16263) SparkSession caches configuration in an unituitive global way - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17334) Provide management tools for broadcasted variables - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8543) PMML export for Random Forest - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16743) converter and access code out of sync: createDataFrame on RDD[Option[C]] fails with MatchError - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11168) Explore precise behavior of treeAggregate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14539) Fetching delegation tokens in Hive-Thriftserver fails when hive.server2.enable.doAs = True - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9195) RDD/Storage metrics don't update cached partition counts when executors are removed/lost - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16823) One dimensional typed select on DataFrame does not work as expected - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18248) spark on mesos memory sizing with offheap - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12338) Support dropping duplicated rows on selected columns in DataFrame in R style - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17759) Avoid adding duplicate schedulables - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16070) DataFrame/Parquet issues with primitive arrays - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7038) [Streaming] Spark Sink requires spark assembly in classpath - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12360) Support using 64-bit long type in SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10645) Bivariate Statistics: Spearman's Correlation in DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10797) RDD's coalesce should not write out the temporary key - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18100) Improve the performance of get_json_object using Gson - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13317) SPARK_LOCAL_IP does not bind to public IP on Slaves - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10646) Bivariate Statistics: Pearson's Chi-Squared goodness of fit test - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12923) Optimize successive dapply() calls in SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16741) spark.speculation causes duplicate rows in df.write.jdbc() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9721) TreeTests.checkEqual should compare predictions on data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15540) RFormula and R feature processing improvement umbrella - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17226) Allow defining multiple date formats per column in csv - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14140) Futures timeout exception in executor logs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18040) Improve R handling or messaging of JVM exception - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12930) NullPointerException running hive query with array dereference in select and where clause - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15912) Replace getPartitionsByFilter by getPartitions in inputFiles of MetastoreRelation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12688) Spill size metric does not update for tungsten aggregation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9623) RandomForestRegressor: provide variance of predictions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9932) Data source API improvement (Spark 1.6) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12402) Memory leak in pyspark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7867) Support "revoke role ..." - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13641) getModelFeatures of ml.api.r.SparkRWrapper cannot (always) reveal the original column names - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10406) Document spark on yarn distributed cache symlink functionality - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13005) Don't require spark.shuffle.service.port to be set on job conf - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13975) Cannot specify extra libs for executor from /extra-lib - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14166) Add deterministic sampling like in Hive - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14597) Streaming Listener timing metrics should include time spent in JobGenerator's graph.generateJobs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10627) Regularization for artificial neural networks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17059) Allow FileFormat to specify partition pruning strategy - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8108) Build Hive module by default (i.e. remove -Phive profile) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11632) Filter out empty partition for KafkaRDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17045) Moving Auto_Joins from HiveCompatibilitySuite to SQLQueryTestSuite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12601) worker output a large number of log when size RollingPolicy shouldRollover method use loginfo - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15846) Allow passing a PrintStream to DataFrame.explain - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10347) Investigate the usage of normalizePath() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14054) Support parameters for UDTs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18449) Name option is being ignored when submitting an R application via spark-submit - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17606) New batches are not created when there are 1000 created after restarting streaming from checkpoint. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11708) 20-25% performance regression in TeraSort - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11517) Calc partitions in parallel for multiple partitions table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11444) Allow batch seqOp combination in treeAggregate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15385) Jobs never complete for ClusterManagers that don't implement killTask - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17397) Show example of what to do when awaitTermination() throws an Exception - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15486) dropTempTable does not work with backticks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12313) getPartitionsByFilter doesnt handle predicates on all / multiple Partition Columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13634) Assigning spark context to variable results in serialization error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18079) CollectLimitExec.executeToIterator() should perform per-partition limits - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15233) Spark task metrics should include hdfs read write latency - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13073) creating R like summary for logistic Regression in Spark - Scala - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12605) Pushing Join Predicates Through Union All - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13979) Killed executor is respawned without AWS keys in standalone spark cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18421) Dynamic disk allocation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15336) Detect and take action about executors running on corrupted nodes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12147) Off heap storage and dynamicAllocation operation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17254) Filter operator should have ā€œstop if falseā€ semantics for sorted data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12567) Add aes_encrypt and aes_decrypt UDFs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8469) Application timeline view unreadable with many executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18500) Make GenericStrategy be able to prune plans by itself after placeholders are replaced. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8987) Increase test coverage of DAGScheduler - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:33:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12372) Document limitations of MLlib local linear algebra - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15700) Spark 2.0 dataframes using more memory (reading/writing parquet) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8534) Gini for regression metrics and evaluator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14405) broadcast variables can cause lots of warning messages on shutdown - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10387) Code generation for decision tree - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12419) FetchFailed = false Executor lost should not allowed re-registered in BlockManager Master again? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16161) Ambiguous error message for unsupported correlated predicate subqueries - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17170) Enable whole partition pruning for InMemoryTableScanExec - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17789) Don't force users to set k for KMeans if initial model is set - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8082) Functionality to Reset DF Schemas/Cast Multiple Columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14586) SparkSQL doesn't parse decimal like Hive - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15833) REST API for streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12099) Standalone and Mesos Should use OnOutOfMemoryError handlers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11400) BroadcastNestedLoopJoin should support LeftSemi join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11268) Non-daemon startup scripts - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6834) Failed with error: ā€˜invalid package nameā€™ Error in as.name(name) : attempt to use zero-length variable name - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11411) Use the space at end of pages - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7126) For spark.ml Classifiers, automatically index labels if they are not yet indexed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7167) Receivers are not distributed efficiently when starting from checkpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13516) Dataframe inconsistency after aggregation+union+projection. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6826) `hashCode` support for arbitrary R objects - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11151) Use Long internally for DecimalType with precision <= 18 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17395) Queries on CSV partition table result in frequent GC - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8569) Documentation for SPARK_HISTORY_OPTS unclear - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10882) Add the ability to connect to secured mqtt brokers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15143) CSV data source is not being tested as HadoopFsRelation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11822) Add docs for new Netty RPC configuration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16592) Improving ml.Logistic Regression on speed and scalability - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15976) Make Stage Numbering determinstic - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12045) Use joda's DateTime to replace Calendar - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:06 UTC, 1 replies.
- [jira] [Resolved] (SPARK-16840) Please save the aggregate term frequencies as part of the NaiveBayesModel - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16684) Standalone mode local dirs not properly cleaned if job is killed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17774) Add support for head on DataFrame Column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18605) Spark Streaming ERROR TransportResponseHandler: Still have 1 requests outstanding when connection - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13392) KafkaSink for Metrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12853) Update query planner to use only bucketed reads if it is useful - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10857) SQL injection bug in JdbcDialect.getTableExistsQuery() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16727) SparkR unit test fails - incorrect expected output - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14695) Error occurs when using OFF_HEAP persistent level - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6816) Add SparkConf API to configure SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10784) Flaky Streaming ML test umbrella - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8137) Improve treeAggregate to combine all data on one machine first - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18174) Avoid Implicit Type Cast in Arguments of Expressions Extending String2StringExpression - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12918) Support R SQL UDF in SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17476) Proper handling for unseen labels in logistic regression training. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16679) Move `private[sql]` methods in public APIs used for Python/R into a single ā€˜helper classā€™ - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15521) Add high level APIs based on dapply and gapply for easier usage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13909) DataFrames DISK_ONLY persistence leads to OOME - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9315) SparkR DataFrame improvements to be more R-friendly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10404) Worker should terminate previous executor before launch new one - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7412) Designing distributed prediction model abstractions for spark.ml - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16593) Provide a pre-fetch mechanism to accelerate shuffle stage. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12514) Spark MetricsSystem can fill disks/cause OOMs when using GangliaSink - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14985) Update LinearRegression, LogisticRegression summary internals to handle model copy - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13623) Relaxed mode for querying Dataframes, so columns that don't exist or have an incompatible schema return null rather than error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13003) Provide a option to trim InputRelation metadata in QueryPlan.toJSON - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17050) Improve initKMeansParallel with treeAggregate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15727) Add UPSERT/MERGE mode to DataFrameWriter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15903) Support AllColumn expression in UDF functions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10928) Spark Mesos finegrain mode with single core CPUs, rendered useless - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18277) na.fill() and friends should work on struct fields - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17482) Analyzer should be able run on top of optimized rule - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14683) Configure external links in ScalaDoc - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12013) Add a Hive context method to retrieve the database Names - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17288) Spark sbin script on windows support - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12622) spark-submit fails on executors when jar has a space in it - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11943) Rapidly starting and stopping SparkContexts in local-cluster mode may cause JVM to exit - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16833) [Spark2.0]when creating temporary function,command "add jar" doesn't work unless restart spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13272) Clean-up CatalystQl - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14736) Deadlock in registering applications while the Master is in the RECOVERING mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7857) IDF w/ minDocFreq on SparseVectors results in literal zeros - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18068) Spark SQL doesn't parse some ISO 8601 formatted dates - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10919) Association rules class should return the support of each rule - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7960) Serialization problem when multiple receivers are specified in a loop - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12886) Expose params to control how feature names are generated - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16039) Spark SQL - Number of rows inserted by Insert Sql - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14332) Support Hadoop Input/OutputFormat Counters as Metrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13978) [GSoC 2016] Build monitoring UI and related infrastructure for Spark SQL and structured streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13691) Scala and Python generate inconsistent results - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11703) Improve the docker-mesos image - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18595) Handling ignoreIfExists in HiveExternalCatalog createTable API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10519) Investigate if we should encode timezone information to a timestamp value stored in JSON - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12206) Streaming WebUI shows incorrect batch statistics when using Window operations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16237) PySpark gapply - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11941) JSON representation of nested StructTypes could be more uniform - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10561) Provide tooling for auto-generating Spark SQL reference manual - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11288) Specify the return type for UDF in Scala - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15389) DataFrame filter by isNotNull fails in complex, large case - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15877) DataSource executed twice when using ORDER BY - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17730) Show task size (including the broadcast variable for the task) in web UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17177) Make grouping columns accessible from RelationalGroupedDataset - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7338) Survival Modelling - Cox proportional hazards - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14751) SparkR fails on Cassandra map with numeric key - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12209) spark.streaming.concurrentJobs and spark.streaming.backpressure.enabled - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18056) Update KafkaDStreams from 10.0.1 to 10.1.0 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16076) Dataset - outer join nulls can sometimes combinate to default values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8519) Blockify distance computation in k-means - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16969) GBTClassifier needs a raw prediction column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13707) Streaming UI tab misleading for window operations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12943) spark should distribute truststore if used in yarn-cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8587) Return cost and cluster index KMeansModel.predict - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10551) Successful task-end event after task failed due to executor loss - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7386) Spark application level metrics application.$AppName.$number.cores doesn't reset on Standalone Master deployment - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13605) Bean encoder cannot handle nonbean properties - no way to Encode nonbean Java objects with columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11308) Change spark streaming's job scheduler logic to ensuer guaranteed order of batch processing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11280) Mesos cluster deployment using only one node - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9317) Print DataFrame entries in the shell for small DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12448) Add UserDefinedType support to Cast - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15746) SchemaUtils.checkColumnType with VectorUDT prints instance details in error message - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16407) Allow users to supply custom StreamSinkProviders - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6853) Contents of .globalenv in workers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15191) createDataFrame() should mark fields that are known not to be null as not nullable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13161) Extend MLlib LDA to include options for Author Topic Modeling - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17995) Use new attributes for columns from outer joins - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9761) Inconsistent metadata handling with ALTER TABLE - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13388) PySpark Pipeline and PipelineModel should take advantages of its Scala companion - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17781) datetime is serialized as double inside dapply() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11666) Find the best `k` by cutting bisecting k-means cluster tree without recomputation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18225) job will miss when driver removed by master in spark streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10910) spark.{executor,driver}.userClassPathFirst don't work for kryo (probably others) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13999) Run 'group by' before building cube - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14032) Eliminate Unnecessary Distinct/Aggregate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8697) MatchIterator not serializable exception in RegexTokenizer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10520) Dates cannot be summarised - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16258) Automatically append the grouping keys in SparkR's gapply - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7618) Word2VecModel cache normalized wordVectors to speed up findSynonyms - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11022) Spark Worker need improve the executor garbage while the app has massive failures - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9559) Worker redundancy/failover in spark stand-alone mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15432) Two executors with same id in Spark UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8423) More informative DecisionTreeModel.toDebugString - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18556) Suboptimal number of tasks when writing partitioned data with desired number of files per directory - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8295) SparkContext shut down in Spark Project Streaming test suite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10597) MultivariateOnlineSummarizer for weighted instances - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10895) Add pushdown string filters for Parquet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8546) PMML export for Naive Bayes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11250) Generate different alias for columns with same name during join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10897) Custom job/stage names - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12825) Spark-submit Jar URL loading fails on redirect - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16147) Add package docs to packages under spark.ml - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14707) Linear algebra: clarify light vs heavy constructors and accessors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6822) lapplyPartition passes empty list to function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9461) Possibly slightly flaky PySpark StreamingLinearRegressionWithTests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13102) Run query using ThriftServer, and open web using IE11, i click ā€+detail" in SQLPage, but not response - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15920) Fix incorrect DataFrame references in Pyspark docs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6838) Explore using Reference Classes instead of S4 objects - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11454) DB2 dialect - map DB2 ROWID and TIMESTAMP with TIMEZONE types into valid Spark types - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10568) Error thrown in stopping one component in SparkContext.stop() doesn't allow other components to be stopped - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11535) StringIndexer should handle empty String specially - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15358) Rename HiveDDLCommandSuite and DDLCommandSuite to HivePlanParserSuite and PlanParserSuite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15575) Remove breeze from dependencies? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9431) TimeIntervalType for for time intervals - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14622) Retain lost executors status - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9346) Conversion is applied three times on partitioned data sources that require conversion - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6854) No support for get and eval-quote - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7610) Design clustering abstractions for Pipelines API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14920) Location should not be Specified in Creating Non-external Table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10837) TimeStamp could not work on sparksql very well - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9184) spark.task.cpus under Mesos should accept floating point numbers (partial CPU) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16921) RDD/DataFrame persist() and cache() should return Python context managers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16869) Wrong projection when join on columns with the same name which are derived from the same dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16537) Extracting partition columns from the hive table. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10898) Setting spark.streaming.concurrentJobs causes blocks to be deleted before read - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13639) Statistics.colStats(rdd).mean and variance should handle NaN in the input vectors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12800) Subtle bug on Spark Yarn Client under Kerberos Security Mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9134) LDA Asymmetric topic-word prior - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17249) java.lang.IllegalStateException: Did not find registered driver with class org.apache.spark.sql.execution.datasources.jdbc.DriverWrapper - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7825) Poor performance in Cross Product due to no combine operations for small files. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9474) Consistent hadoop config for core - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13051) Do not maintain global singleton map for accumulators - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17325) Inconsistent Spillable threshold and AppendOnlyMap growing threshold may trigger out-of-memory errors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13872) Memory leak in SortMergeOuterJoin - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12506) Push down WHERE clause arithmetic operator to JDBC layer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14800) Dealing with null as a value in options for each internal data source - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17030) Remove/Cleanup HiveMetastoreCatalog.scala - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13731) expression evaluation for NaN in select statement - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15366) Add Application Detail UI uri to Spark Json API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10293) Add support for oversubscription in Mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10062) Use tut for typechecking and running code in user guides - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16728) migrate internal API for MLlib trees from spark.mllib to spark.ml - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7597) Make default doc build avoid search engine indexing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11231) join returns schema with duplicated and ambiguous join columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18083) Locality Sensitive Hashing (LSH) - BitSampling - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9427) Add expression functions in SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14425) SQL/dataframe join error: mixes up the columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9333) Tables are not listed in jdbc tools when connected to hivethrift server - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11664) Add methods to get bisecting k-means cluster structure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8665) Update ALS documentation to include performance tips - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9695) Add random seed Param to ML Pipeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11408) Display scientific notation properly in DataFrame.show() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11579) Method SGDOptimizer and LBFGSOptimizer in FeedForwardTrainer should not create new optimizer every time they got invoked - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9009) SPARK Encryption FileNotFoundException for truststore - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16891) arrayFilter function for filtering elements of array column based on a predicate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10972) UDFs in SQL joins - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10376) Once/When YARN permits it, only use POST for kill action - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18018) Specify alternate escape character in 'LIKE' expression - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10493) reduceByKey not returning distinct results - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14849) shuffle broken when accessing standalone cluster through NAT - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9111) Dumping the memory info when an executor dies abnormally - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16089) Spark2.0 doesn't support the certain static partition SQL statment as "insert overwrite table targetTB PARTITION (partition field=xx) select field1,field2,...,partition field from sourceTB where partition field=xx" while Spark 1.6 supports - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10193) Eliminate "skipped" stages for shared shuffle dependencies by reusing associated Stage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16889) Add formatMessage Column expression for formatting strings in java.text.MessageFormat style in Scala API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17539) Streaming Backpressure Starves DirectStream When Used In Combination With Receivers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13432) Add the origin of the source code into a generated Java code - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9879) OOM in LIMIT clause with large number - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17691) Add aggregate function to collect list with maximum number of elements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13611) import Aggregator doesn't work in Spark Shell - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17189) [MINOR] Looses the interface from UnsafeRow to InternalRow in AggregationIterator if UnsafeRow specific method is not used - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8722) PR merge script should warn when merging a PR that has failed tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15873) JdbcRDD to support more bound types other than Long and allow multiple bound occurrence for subqueries. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16717) Dataframe (jdbc) is missing a way to link and external function to get a connection - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18571) pyspark: UTF-8 not written correctly (as CSV) when locale is not UTF-8 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8298) Sliding Window CrossValidator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13811) No Push Down for Null Filtering of Compound Expressions Generated from Constraints - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10609) Improve task distribution strategy in taskSetManager - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10950) ApplicationHistoryInfo to include spark version; History Server to report incompatibility with later versions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7680) Add a fake Receiver that generates random strings, useful for prototyping - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11102) Uninformative exception when specifing non-exist input for JSON data source - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15568) TimSort and RadixSort can't support more than 1 billions elements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10583) Correctness test for Multilayer Perceptron using Weka Reference - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14464) Logistic regression performs poorly for very large vectors, even when the number of non-zero features is small - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17959) spark.sql.join.preferSortMergeJoin has no effect for simple join due to calculated size of LogicalRdd - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14068) Pluggable DiskBlockManager - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9696) Add random seed Param to PySpark ML Pipeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12787) Dataset to support custom encoder - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17775) pyspark: take(num) failed, but collect() worked for big dataset - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12726) ParquetConversions doesn't always propagate metastore table identifier to ParquetRelation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7043) KryoSerializer cannot be used with REPL to interpret code in which case class definition and its shipping are in the same line - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16539) When worker is killed driver continues to run causing issues in supervise mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17630) jvm-exit-on-fatal-error handler for spark.rpc.netty like there is available for akka - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9857) Add expression functions into SparkR which conflict with the existing R's generic - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14148) Kmeans Sum of squares - Within cluster, between clusters and total - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11038) Consolidate the format of UnsafeArrayData and UnsafeMapData - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12778) Use of Java Unsafe should take endianness into account - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11907) Allowing errors as values in DataFrames (like 'Either Left/Right') - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12843) Spark should avoid scanning all partitions when limit is set - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10995) Graceful shutdown drops processing in Spark Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13083) Small spark sql queries get blocked if there is a long running query over a lot a partitions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7221) Expose the current processed file name of FileInputDStream to the users - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13284) Cannot submit app from Windows java.io.FileNotFoundException: /C: - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16954) UDFs should allow output type to be specified in terms of the input type - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6823) Add a model.matrix like capability to DataFrames (modelDataFrame) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17911) Scheduler does not need messageScheduler for ResubmitFailedStages - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9881) Support additional KryoRegistrators via an SPI lookup - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8789) improve SQLQuerySuite resilience by dropping tables in setup - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15219) [Spark SQL] it don't support to detect runtime temporary table for enabling broadcast hash join optimization - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9781) KCL Workers should be configurable from Spark configuration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10153) Unable to query Avro data from Flume using SparkSQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8517) Improve the organization and style of MLlib's user guide - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16635) Provide Session support in the Spark UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12283) Use UnsafeRow as the buffer in SortBasedAggregation to avoid Unsafe/Safe conversion - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13155) add runtime null check when convert catalyst array to external array - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:56 UTC, 1 replies.
- [jira] [Resolved] (SPARK-10501) support UUID as an atomic type - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11651) LinearRegressionSummary should support get residuals by type - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7495) Improve ML attribute documentation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16595) Spark History server Rest Api gives Application not found error for yarn-cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12208) Abstract the examples into a common place - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11478) ML StringIndexer return inconsistent schema - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17604) Support purging aged file entry for FileStreamSource metadata log - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13077) Expose BlockGenerator functionality in the public API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11384) Wrong duration times for (cached) iterative tasks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9778) remove unnecessary evaluation from SortOrder - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14534) Should SparkContext.parallelize(List) take an Iterable instead? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16532) Provide a REST API for submitting and tracking status of jobs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:34:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7609) Add standardized checks for (Model, Estimator) unit tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11543) spark.ml LogisticRegressionModel should prefer thresholds over threshold - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7791) Set user for executors in standalone-mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11886) R function name conflicts with base or stats package ones - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10401) spark-submit --unsupervise - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14193) Skip unnecessary sorts if input data have been already ordered in InMemoryRelation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9442) java.lang.ArithmeticException: / by zero when reading Parquet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11741) Process doctests using TextTestRunner/XMLTestRunner - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18158) Submit app in standalone cluster mode supervised with HA: all masters have to be up and running - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11354) Expose custom log4j to executor page in Spark standalone cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13753) Column nullable is derived incorrectly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17785) Find a more robust way to detect the existing of the initialModel - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16327) The two subquery in join operator can't be execute in parallel - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15830) Spark application should get hive tokens only when it is required - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10363) DataframeWriter write failed when multi applications write the same partitioned directory - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16378) HiveContext doesn't release resources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18102) Failed to deserialize the result of task - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12909) Spark on Mesos accessing Secured HDFS w/Kerberos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7525) Could not read data from write ahead log record when Receiver failed and WAL is stored in Tachyon - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6832) Handle partial reads in SparkR JVM to worker communication - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12674) Spark on Mesos executor exit incorrect - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11152) Streaming UI: Input sizes are 0 for makeup batches started from a checkpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16657) Replace children by innerChildren in InsertIntoHadoopFsRelationCommand and CreateHiveTableAsSelectCommand - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11458) add word count example for Dataset - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12141) Use Jackson to serialize all events when writing event log - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12571) AWS credentials not available for read.parquet in SQLContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10870) Criteo Display Advertising Challenge - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9035) Spark on Mesos Thread Context Class Loader issues - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17856) JVM Crash during tests: pyspark.mllib.linalg.distributed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10569) Kryo serialization fails on sortByKey operation on registered RDDs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17457) Spark SQL shows poor performance for group by and sort by on multiple columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17871) Dataset joinwith syntax should support specifying the condition in a compile-time safe way - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13174) Add API and options for csv data sources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17937) Clarify Kafka offset semantics for Structured Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15566) Expose null checking function to Python land. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13756) Reuse Query Fragments - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8986) GaussianMixture should take smoothing param - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13630) Add optimizer rule to collapse sorts - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17497) Preserve order when scanning ordered buckets over multiple partitions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6915) VectorIndexer improvements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15828) YARN is not aware of Spark's External Shuffle Service - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15592) Paginate query table in SQL tab - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10702) Dynamic Allocation in Standalone Breaking Parallelism - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12782) reindex() columns in DataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7877) Support non-persistent cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16338) Streaming driver running on standalone cluster mode with supervise goes into bad state when application is killed from the UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16997) Allow loading of JSON float values as TimestampType - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18532) Code generation memory issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16314) Spark application got stuck when NM running executor is restarted - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11365) consolidate aggregates for summary statistics in weighted least squares - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16424) Add support for Structured Streaming to the ML Pipeline API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11095) Simplify Netty RPC implementation by using a separate thread pool for each endpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:13 UTC, 1 replies.
- [jira] [Resolved] (SPARK-10832) sometimes No event logs found for application using same JavaSparkSQL example - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17416) Add Dataset.groupByKey overload that takes a value selector function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14557) Reading textfile (created though CTAS) doesn't work when pathFilter is enabled. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11441) HadoopFsRelation is not scalable in number of files read/written - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15090) Spark Hive thriftserver can get 413 errors in Kerberos+AD deployments - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12469) Data Property Accumulators for Spark (formerly Consistent Accumulators) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11203) UDF doesn't support charType column and lit function doesn't allow charType as argument - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12445) Fix null exception when passing null as array in toCatalystArray - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9603) Re-enable complex R package test in SparkSubmitSuite - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14248) Get the path hierarchy from root to leaf in the BisectingKMeansModel - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13128) API for building arrays / lists encoders - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18241) If Spark Launcher fails to startApplication then handle's state does not change - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15499) Add python testsuite with remote debug and single test parameter to help developer debug code easier. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14112) Unique Constraints over a Set of AttributeReferences - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15906) Complementary Naive Bayes Algorithm Implementation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15331) Disallow All the Unsupported CLI Commands - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12373) Type coercion rule of dividing two decimal values may choose an intermediate precision that does not have enough number of digits at the left of decimal point - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10306) sbt hive/update issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17247) when fall back to hdfs is enabled for stats calculation, the hdfs listing and size calcuation should be terminated as soon as total size > broadcast threshold - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17135) Consolidate code in linear/logistic regression where possible - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8946) Intermittent SnappyCompressionCodec. IllegalArgumentException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17036) Hadoop config caching could lead to memory pressure and high CPU usage in thrift server - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11366) binary functions (methods) such as rlike in pyspark.sql.column only accept strings but should also accept another Column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7852) Set the initial weights based on the previous when GLMs are run with multiple regParams - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10935) Avito Context Ad Clicks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10961) Specified metastore 0.12.0 but spark-shell still using metastore classes for 0.13+ - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12581) Support case-sensitive table names in postgresql - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10385) Bivariate statistics in DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17221) Build File-based Test Cases for Using Join and Left-Semi Join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8333) Spark failed to delete temp directory created by HiveContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13777) Weighted Least Squares fails when there are features with identical values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16913) [SQL] Better codegen where querying nested struct - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13332) Decimal datatype support for SQL pow - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17116) Allow params to be a {string, value} dict at fit time - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16830) Executors Keep Trying to Fetch Blocks from a Bad Host - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15027) ALS.train should use DataFrame instead of RDD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15507) ClassCastException: SomeCaseClass cannot be cast to org.apache.spark.sql.Row - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9697) Project Tungsten (Phase 2) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8009) [Mesos] Allow provisioning of executor logging configuration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15917) Define the number of executors in standalone mode with an easy-to-use property - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14053) Merge absTol and relTol into one in MLlib tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17831) registerTempTable is ignoring database clarifications - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9377) Shuffle tuning should discuss task size optimisation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11794) Improve error message when HDFS/S3 access is misconfigured when using Parquet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15005) Usage of Temp Table twice in Hive query fails with bad error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18340) Inconsistent error messages in launching scripts and hanging in sparkr script for wrong options - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17146) Add RandomizedSearch to the CrossValidator API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15194) Add Python ML API for MultivariateGaussian - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7409) Designing multilabel abstractions for spark.ml - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9532) Standardize checks for Predictor and other prediction abstractions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14111) Correct output nullability with constraints for logical plans - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15201) Handle integer overflow correctly in hash code computation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7966) add Spreading Activation algorithm to GraphX - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15567) Refactor ExecutorAllocationManager - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15712) Proper temp table support - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13463) Support Column pruning for Dataset logical plan - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13960) JAR/File HTTP Server doesn't respect "spark.driver.host" and there is no "spark.fileserver.host" option - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11465) Support multiple eigenvectors in power iteration clustering - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9475) Consistent hadoop config for external/* - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16235) "evaluateEachIteration" is returning wrong results when calculated for classification model. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13773) UDF being applied to filtered data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18095) There is a display problem in spark UI storage tab when rdd was persisted in multiple replicas - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18411) Add Argument Types and Test Cases for String Functions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16363) Spark-submit doesn't work with IAM Roles - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15374) Spark created Parquet files cause NPE when a column has only NULL values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13934) SqlParser.parseTableIdentifier cannot recognize table name start with scientific notation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11028) When planning queries without partial aggregation support, we should try to use TungstenAggregate. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11753) Understand why allowNonNumericNumbers JSON option doesn't work - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14186) Size awareness during spill and shuffle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14443) parse_url() does not escape query parameters - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8453) Unioning two RDDs in PySpark doesn't spill to disk - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15015) Log statements lack file name/number - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11255) R Test build should run on R 3.1.2 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9578) Stemmer feature transformer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15589) Anaylze simple PySpark closures and generate SQL expressions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12117) Column Aliases are Ignored in callUDF while using struct() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17377) Joining Datasets read and aggregated from a partitioned Parquet file gives wrong results - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10105) Adding most k frequent words parameter to Word2Vec implementation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14026) Subquery not brodcasted - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15588) Paginate Stage Table in Stages tab, Job Table in Jobs tab, and Query Table in SQL tab - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15941) Netty RPC implementation ignores the executor bind address - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13650) Usage of the window() function on DStream - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13145) checkAnswer in SQL query suites should tolerate small float number error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14606) Different maxBins value for categorical and continuous features in RandomForest implementation. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12385) Push projection into Join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10992) Partial Aggregation Support for Hive UDAF - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17152) Spark Flume sink fails with begin() called when transaction is OPEN - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13497) PMML export for logistic regression does not conform to the PMML standard - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12431) add local checkpointing to GraphX - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18405) Add yarn-cluster mode support to Spark Thrift Server - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15482) ClassCast exception when join two tables. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12565) Document standalone mode jar distribution more clearly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11576) [SQL] Incorrect results when using the nested self-join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9588) spark sql cache: partition level cache eviction - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14859) [PYSPARK] Make Lambda Serializer Configurable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12430) Temporary folders do not get deleted after Task completes causing problems with disk space. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11801) Notify driver when OOM is thrown before executor JVM is killed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13717) Let RandomSampler can sample with Java iterator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16614) DirectJoin with DataSource for SparkSQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7831) Mesos dispatcher doesn't deregister as a framework from Mesos when stopped - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10586) BlockManager ca't be removed when it is re-registered, then disassociats - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15039) Kinesis reciever does not work in Yarn - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14361) Support EXCLUDE clause in Window function framing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15554) Duplicated executors in Spark UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15970) WARNing message related to persisting table to Hive Megastore while Spark SQL is running in-memory catalog mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16682) pyspark 1.6.0 not handling multiple level import when the necessary files are zipped - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18596) Add checking and caching to ML BisectingKmeans - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7348) DAG visualization: add links to RDD page - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15147) Catalog should have a property to indicate case-sensitivity - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13006) Use Log4j for Spark Standalone Executor Logging - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18037) Event listener should be aware of multiple tries of same stage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14947) Showtable Command - Can't List Tables Using JDBC Connector - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17933) Shuffle fails when driver is on one of the same machines as executor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17125) Allow to specify spark config using non-string type in SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16810) Refactor registerSinks with multiple constructos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9873) Cap the amount of executors launched in Mesos fine grain mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8442) FitTransform method for pipeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11387) minimize shuffles during joins by using existing partitions and bundling messages - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15720) MLLIB Word2Vec loading large number of vectors in the model results in java.lang.NegativeArraySizeException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14759) After join one cannot drop dynamically added column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9599) Dynamic partitioning based on key-distribution - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8133) sticky partitions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10444) Remove duplication in Mesos schedulers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18253) ML Instrumentation logging requires too much manual implementation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6987) Node Locality is determined with String Matching instead of Inet Comparison - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10407) Possible Stack-overflow using InheritableThreadLocal nested-properties for SparkContext.localProperties - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16004) Improve CatalogTable information - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13283) Spark doesn't escape column names when creating table on JDBC - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14330) Spark SQL does not infer correct type for joda DateTime - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8494) ClassNotFoundException when running with sbt, scala 2.10.4, spray 1.3.3 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14588) Consider getting column stats from files (wherever feasible) to get better stats for joins - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11156) Web UI doesn't count or show info about replicated blocks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18491) Spark uses mutable classes for date/time types mapping - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16739) GBTClassifier should be a Classifier, not Predictor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16486) Python API parity issues from 2.0 QA - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10150) --force=true option is not working in beeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11309) Clean up hacky use of MemoryManager inside of HashedRelation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17517) Improve generated Code for BroadcastHashJoinExec - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17749) Unresolved columns when nesting SQL join clauses - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11238) SparkR: Documentation change for merge function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12493) Can't open "details" span of ExecutionsPage in IE11 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6840) SparkR: private package functions unavailable when using lapplyPartition in package - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7341) Fix the flaky test: org.apache.spark.streaming.InputStreamsSuite.socket input stream - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9372) For a join operator, rows with null equal join key expression can be filtered out early - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11285) Infinite TaskCommitDenied loop - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17319) Move addJar from HiveSessionState to HiveSharedState - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8545) PMML improvement umbrella - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13377) binaryFileRDD preferredLocations issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7492) Convert LocalDataFrame to LocalMatrix - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10297) When save data to a data source table, we should bound the size of a saved file - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12474) support deserialization for physical plan and hive logical plan from JSON string - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13618) Make Streaming web UI display rate-limit lines in the statistics graph - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12675) Executor dies because of ClassCastException and causes timeout - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8557) Successful Jobs marked as KILLED Spark 1.4 Standalone - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7872) Transaction stack trace with Spark Streaming and Flume - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13800) Hive conf will be modified on multi-beeline connect to thriftserver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9203) Make filesystem pluggable in BlockStore and BlockManager - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9300) histogram_numeric aggregate function - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18202) Spark throws a mysterious system error when a Hive command has at least 100,000 results - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8761) Master.removeApplication is not thread-safe but is called from multiple threads - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14180) Deadlock in CoarseGrainedExecutorBackend Shutdown - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9189) Takes locality and the sum of partition length into account when partition is instance of HadoopPartition in operator coalesce - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7764) Add negative sampling to Word2Vec - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15769) Add Encoder for input type to Aggregator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14428) [SQL] Allow more flexibility when parsing dates and timestamps in json datasources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6874) Add support for SQL:2003 array type declaration syntax - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17501) Re-register BlockManager again and again - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13856) Support initialModel in ALS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9525) Optimize SparseVector initializations in linalg - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9717) Document persistence recommendation for MulticlassMetrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14820) Reduce shuffle data by pushing filter toward storage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12163) FPGrowth unusable on some datasets without extensive tweaking of the support threshold - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10951) Support private S3 repositories using spark-submit via --repositories flag - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10806) Following val redefinition, sometimes the old value is still visible - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17083) Make the use of the default database more explicit - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17935) Add KafkaForeachWriter in external kafka-0.8.0 for structured streaming module - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11525) Support spark packages containing R source code in Standalone mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15634) SQL repl is bricked if a function is registered with a non-existent jar - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:35:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16269) Support null handling for vectorized hashmap during hash aggregate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17754) DataFrame reader and writer don't show Input/Output metrics in Spark UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17168) CSV with header is incorrectly read if file is partitioned - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12178) Expose reporting of StreamInputInfo for custom made streams - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14326) Can't specify "long" type in structField - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16244) Failed job/stage couldn't stop JobGenerator immediately. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8514) LU factorization on BlockMatrix - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17256) spark-submit.cmd cannot work if path has space and cut off double-quoted arguments - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16616) Allow Catalyst to take Advantage of Hash Partitioned DataSources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16594) Physical Plan Differences when Table Scan Having Duplicate Columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10945) GraphX computes Pagerank with NaN (with some datasets) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10860) Bivariate Statistics: Chi-Squared independence test - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11364) HadoopFsRelation doesn't reload the hadoop configuration for each execution - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16517) can't add columns on the table create by spark'writer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16790) Warn (or fail) when user query reads no data due to pruning - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9235) PYSPARK_DRIVER_PYTHON env variable is not set on the YARN Node manager acting as driver in yarn-cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15993) PySpark RuntimeConfig should be immutable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8982) Worker hostnames not showing in Master web ui when launched with start-slaves.sh - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10936) UDAF "mode" for categorical variables - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8460) Distinguish between shuffle and non-shuffle spills in metrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10312) Enhance SerDe to handle atomic vector - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18246) Throws an exception before execution for unsupported types in Json, CSV and text functionailities - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15048) when running Thriftserver with yarn on a secure cluster it will pass the wrong keytab location. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:36:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9137) Unified label verification for Classifier - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17435) RowEncoder should be documented and publicly accessable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11401) PMML export for Logistic Regression Multiclass Classification - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10607) Scheduler should include defensive measures against infinite loops due to task commit denial - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10448) Parquet schema merging should NOT merge UDT - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18334) What hashDistance should MinHash use? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16484) Incremental Cardinality estimation operations with Hyperloglog - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11167) Incorrect type resolution on heterogeneous data structures - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13720) SQLBuilder.toSQL is broken when there is a sort by after having - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14730) Expose ColumnPruner as feature transformer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7641) Add subsampling of frequent words for Word2Vec - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10250) Scala PairRDDFunctions.groupByKey() should be fault-tolerant of single large groups - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7177) Create standard way to wrap Spark CLI scripts for external projects - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15105) Remove HiveSessionHook from ThriftServer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10839) SPARK_DAEMON_MEMORY has effect on heap size of thriftserver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10787) Consider replacing ObjectOutputStream for serialization to prevent OOME - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14980) Spark is not picking up classes in the MutableURLClassLoader causing errors with drools - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13662) [SQL][Hive] Have SHOW TABLES return additional fields from Hive MetaStore - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15673) Indefinite hanging issue with combination of cache, sort and unionAll - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9038) Missing TaskEnd event when task attempt is superseded by another (speculative) attempt - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8778) The item of "Scheduler delay" is not consistent between "Event Timeline" and "Task List" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9563) Remove repartition operators when they are the child of Exchange and shuffle=True - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7434) DSL for Pipeline assembly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10774) Put different event log to different directory according to different conditions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15800) Accessing kerberised hdfs from Spark running with Resource Manager - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17132) binaryFiles method can't handle paths with embedded commas - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14857) Table/Database Name Validation in SessionCatalog - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12609) Make R to JVM timeout configurable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10854) MesosExecutorBackend: Received launchTask but executor was null - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14354) Let Expand take name expressions and infer output attributes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13654) get_json_object fails with java.io.CharConversionException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17784) Add fromCenters method for KMeans - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12942) Provide option to allow control the precision of numerical type for DataFrameWriter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14233) NativeThread.signal(Native Method): No such file or directory - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15376) DataFrame write.jdbc() inserts more rows than acutal - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8391) showDagViz throws OutOfMemoryError, cause the whole jobPage dies - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12270) JDBC Where clause comparison doesn't work for DB2 char(n) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16480) Streaming checkpointing does not work well with SIGTERM - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11321) Allow addition of non-nullable UDFs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10420) Implementing Reactive Streams based Spark Streaming Receiver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17695) Deserialization error when using DataFrameReader.json on JSON line that contains an empty JSON object - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12836) spark enable both driver run executor & write to HDFS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15546) HiveContext : Connecting to MySQL metastore db - Always creates a DERBY database in first attempt - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6837) SparkR failure in processClosure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15804) Manually added metadata not saving with parquet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14784) Build SQL for EXISTS/IN subquery - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16160) Last remembered metadata window per dstream is not cleared upon context graceful stop - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9269) Add Set to the matching type in ArrayConverter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17553) On Master Change the running Apps goes to wait state even though resource are available - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15907) Issue Exception when Not Enough Input Columns for Dynamic Partitioning - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14924) Tuning estimatorParamMaps with OneVsRest.classifier fails during persistence - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12214) Spark to provide an API to save user-defined metadata as part of Sequence File header - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14805) accumulator values are not escaped when written to event logs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11986) background thread spill the context to disk - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13836) IsNotNull Constraints for the BinaryComparison inside Not - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11823) HiveThriftBinaryServerSuite tests timing out, leaves hanging processes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13437) Add InternalColumn - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12947) Spark with Swift throws EOFException when reading parquet file - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9503) Mesos dispatcher NullPointerException (MesosClusterScheduler) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16490) Python mllib example for chi-squared feature selector - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16149) API consistency discussion: CountVectorizer.{minDF -> minDocFreq, minTF -> minTermFreq} - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9937) GraphX Performance: Partition overhead scales quadratically - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13943) The behavior of sum(booleantype) in Spark DataFrames is not intuitive - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11770) Spark SQL field resolution error in GROUP BY HAVING clause - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14641) Specify worker log dir separately from scratch space dir - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13273) Improve test coverage of CatalystQl - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:37 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15845) Expose metrics for sub-stage transformations and action - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16246) Too many block-manager-slave-async-thread opened (TIMED_WAITING) for spark Kafka streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17550) DataFrameWriter.partitionBy() should throw exception if column is not present in Dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10086) Flaky StreamingKMeans test in PySpark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16999) Convert Keyword `path` of Data Source Tables to Lower Case - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15897) Function Registry should just take in FunctionIdentifier for type safety and avoid duplicating - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16937) Confusing behaviors when View and Temp View sharing the same names - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7786) Allow StreamingListener to be specified in SparkConf and loaded when creating StreamingContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14003) Multi-session can not work when one session is moving files for "INSERT ... SELECT" clause - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12161) SparkSQL Cache Matching Improvement - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16481) Spark does not update statistics when making use of Hive partitions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17600) Cannot set public address for Worker and Master Web UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13434) Reduce Spark RandomForest memory footprint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10793) Make spark's use/subclassing of hive more maintainable - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16108) Why is KMeansModel (scala) private? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13461) Merge and clean up duplicated MLlib example code - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15418) SparkSQL does not support using a UDAF in a CREATE VIEW clause - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8369) Support dependency jar and files on HDFS in standalone cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13259) SPARK_HOME should not be used as the CWD in docker executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12083) java.lang.IllegalArgumentException: requirement failed: Overflowed precision (q98) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12473) Reuse serializer instances for performance - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8983) ML Tuning Cross-Validation Improvements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16504) UDAF should be typed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9941) Try ML pipeline API on Kaggle competitions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15987) PostgreSQL CITEXT type JDBC support - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13762) support only column names in schema string at createDataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16413) The JDBC UI does not show the job id when spark.sql.thriftServer.incrementalCollect=true - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9108) Expose Kryo serializer buffer size - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16518) Schema Compatibility of Parquet Data Source - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15258) Nested/Chained case statements generate codegen over 64k exception - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17872) aggregate function on dataset with tuples grouped by non sequential fields - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15747) Support SPARK_CONF_DIR/spark-defaults.d/*.conf drop-in style config files - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15338) Run-time Change on the conf spark.sql.warehouse.dir Does not Affect the conf hive.metastore.warehouse.dir - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14276) Allow each HiveContext to have distinct users that they will use to connect to the HiveMetastore - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8459) Add import/export to spark.mllib bisecting k-means - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11106) Should ML Models contains single models or Pipelines? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12607) spark-class.cmd produced null command strings for "exec" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12358) Spark SQL query with lots of small tables under broadcast threshold leading to java.lang.OutOfMemoryError - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13222) checkpoint latest stateful RDD on graceful shutdown - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:44 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10370) After a stages map outputs are registered, all running attempts should be marked as zombies - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15666) Join on two tables generated from a same table throwing query analyzer issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8503) SizeEstimator returns negative value for recursive data structures - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10572) Investigate the contentions bewteen tasks in the same executor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11240) PMML export for SVM models in ML pipeline - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:45 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15119) DecisionTreeParams.minInfoGain does not have a validator - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12308) Better onDisconnected logic for Master - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10803) Allow users to write and query Parquet user-defined key-value metadata directly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16471) Remove Hive-specific CreateHiveTableAsSelectLogicalPlan - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:46 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16417) spark 1.5.2 receiver store(single-record) with ahead log enabled makes executor crash if there is an exception when BlockGenerator storing block - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9484) Word2Vec import/export for original binary format - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12170) Deprecate the JAVA-specific deserialized storage levels - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15831) Kryo 2.21 TreeMap serialization bug causes random job failures with RDDs of HBase puts - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10659) DataFrames and SparkSQL saveAsParquetFile does not preserve REQUIRED (not nullable) flag in schema - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10054) Add a timeout for launching Receiver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:47 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12929) Adding IBM Platform Conductor for Spark as new Supplemental Project - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15460) Issue Exceptions from Thrift Server and Spark-SQL Cli when Users Inputting hive.metastore.warehouse.dir - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10042) Use consistent behavior for Internal Accumulators across stage retries - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15909) PySpark classpath uri incorrectly set - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12845) During join Spark should pushdown predicates on joining column to both tables - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13585) addPyFile behavior change between 1.6 and before - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13288) [1.6.0] Memory leak in Spark streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:48 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14417) Cleanup Scala deprecation warnings once we drop 2.10.X - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8520) Improve GLM's scalability on number of features - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15729) Clarify that saveAs*File doesn't make sense with local FS in cluster context - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11154) make specificaition spark.yarn.executor.memoryOverhead consistent with typical JVM options - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:49 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9953) ML Vector, Matrix semantic equality + hashcode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16860) UDT Stringification Incorrect in PySpark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7871) Improve the outputPartitioning for HashOuterJoin(full outer join) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12472) OOM when sort a table and save as parquet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16141) Compress complex-typed data (e.g., Array) in InMemoryRelation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17467) Spark SQL: Return incorrect result for the data files on Swift - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13111) Spark UI is showing negative number of sessions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10625) Spark SQL JDBC read/write is unable to handle JDBC Drivers that adds unserializable objects into connection properties - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:50 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16126) Better Error Message When using DataFrameReader without `path` - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18059) Spark not respecting partitions in partitioned Hive views - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11397) PySpark Streaming uses threadcontext class loader, may cause issues on mesos - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10510) Add documentation for how to register a custom Kryo serializer in Spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11067) Spark SQL thrift server fails to handle decimal value - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7682) Size of distributed grids still limited by cPickle - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12072) python dataframe ._jdf.schema().json() breaks on large metadata dataframes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18033) Deprecate TaskContext.partitionId - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:51 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13588) Unable to map Parquet file to Hive Table using HiveContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16408) SparkSQL Added file get Exception: is a directory and recursive is not turned on - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12257) Non partitioned insert into a partitioned Hive table doesn't fail - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10638) spark streaming stop gracefully keeps the spark context - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15324) Add the takeSample function to the Dataset - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:52 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10628) Add support for arbitrary RandomRDD generation to PySparkAPI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17151) Decide how to handle inferring number of classes in Multinomial logistic regression - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10713) SPARK_DIST_CLASSPATH ignored on Mesos executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8516) ML attribute API in PySpark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13795) ClassCast Exception while attempting to show() a DataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12113) Add timing metrics to blocking phases for spark sql - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:53 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17815) Report committed offsets - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15699) Add chi-squared test statistic as a split quality metric for decision trees - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9739) Execution visualizer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10808) LDA user guide: discuss running time of LDA - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11185) Add more task metrics to the "all Stages Page" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8502) One character switches into uppercase, causing failures [serialization? shuffle?] - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:54 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9438) restarting leader zookeeper causes spark master to die when the spark master election is assigned to zookeeper - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10826) MasterSource should not access Master.workers, apps, waitingApps directly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12067) Fix usage of isnan, isnull, isnotnull of Column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13729) Reimplement the planning tests on SimpleTextRelation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11241) Add a common trait for PMML exportable ML pipeline models - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11202) Unsupported dataType - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16655) Spark thrift server application is not stopped if its in ACCEPTED stage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:55 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10517) Console "Output" field is empty when using DataFrameWriter.json - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14583) SparkSQL doesn't apply TBLPROPERTIES('serialization.null.format'='') when Hive Table has partitions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12899) Spark Data Security - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:56 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9820) NullPointerException that causes failure to request executors. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11132) Mean Shift algorithm integration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13239) Click-Through Rate Prediction - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9961) ML prediction abstractions should have defaultEvaluator fields - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:57 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12861) Changes to support KMeans with large feature space - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10977) SQL injection bugs in JdbcUtils and DataFrameWriter - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16066) Right now we don't have provision to pass custom param to executor dockers(Something like spark.mesos.executor.docker.parameters). - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18167) Flaky test when hive partition pruning is enabled - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:58 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9744) Add RDD method to map with lag and lead - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14207) Transformer for splitting a Vector/Array column into individual columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:37:59 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7053) KafkaUtils.createStream leaks resources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17443) SparkLauncher should allow stoppingApplication and need not rely on SparkSubmit binary - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12097) How to do a cached, batched JDBC-lookup in Spark Streaming? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17307) Document what all access is needed on S3 bucket when trying to save a model - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16807) Optimize some ABS() statements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7413) Time to write shuffle spill files is not captured in ShuffleWriteMetrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17601) SparkSQL vectorization cannot handle schema evolution for parquet tables when parquet files use Int whereas DataFrame uses Long - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7379) pickle.loads expects a string instead of bytes in Python 3. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11665) Support other distance metrics for bisecting k-means - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14715) Provide a way to mask partitions of a Dataset/Dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14325) some strange name conflicts in `group_by` - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10187) Sometimes Web UI reports Application history not found - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17520) Create a more performant __eq__ for SparseMatrix - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:02 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9004) Add s3 bytes read/written metrics - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7708) Incorrect task serialization with Kryo closure serializer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12699) R driver process should start in a clean state - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7172) Worker includes ExecutorRunner and DriverRunner in Akka messages - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:03 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15006) Generated JavaDoc should hide package private objects - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8449) HDF5 read/write support for Spark MLlib - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14755) Dynamic proxy class cause a ClassNotFoundException on task deserialization - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13908) Limit not pushed down - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18096) Spark on have - 'Update' save mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:04 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18499) Add back support for custom Spark SQL dialects - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6821) Refactor SerDe API in SparkR to be more developer friendly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16788) Investigate JSR-310 & scala-time alternatives to our own datetime utils - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17201) Investigate numerical instability for MLOR without regularization - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9003) Add map/update function to MLlib/Vector - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:05 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13853) QueryPlan sub-classes should override producedAttributes to fix missingInput - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16820) Sparse - Sparse matrix multiplication - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14427) Support persisting partitioned data source relations in Hive compatible format - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16204) Row() interface - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:06 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17357) Simplified predicates can't be pushed down through operators because of the rule order in Optimizer - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13532) Spark yarn executor container fails if yarn.nodemanager.local-dirs starts with file:// - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:07 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16744) spark.yarn.appMasterEnv handling assumes values should be appended - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18032) Spark test failed as OOM in jenkins - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10658) Could pyspark provide addJars() as scala spark API? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9595) Adding API to SparkConf for kryo serializers registration - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8454) Update ShuffleBlockFetcherIterator.next to close previous stream when returning a new one - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14905) create conda environments w/locked package versions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:08 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17386) Default polling and trigger intervals cause excessive RPC calls - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14256) Remove parameter sqlContext from as.DataFrame - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16641) Add an Option to Create a Dataset With a Case Class, Ignoring Column Names (Using ordinal instead) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8985) Create a test harness to improve Spark's combinatorial test coverage of non-default configurations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14708) Repl Serialization Issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:09 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16362) Suport ArrayType and StructType in vectorization Parquet reader - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12815) Compute Wilcoxon-Mann-Whitney rank sum statistic - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13409) Log the stacktrace when stopping a SparkContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13345) Adding one way ANOVA to Spark ML stat - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15337) Unable to Make Run-time Changes on Hive-related Conf - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:10 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13743) Adding configurable support for Spark Streaming gracefull timeout - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13940) Predicate Transitive Closure Transformation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14593) Make currentVars work with splitExpressions to enable whole stage codegen for large input columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11196) Support for equality and pushdown of filters on some UDTs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:11 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13653) Split DiskBlockObjectWriter to separate object- and byte-based write interfaces - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17453) Broadcast block already exists in MemoryStore - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9017) More timers for MLlib algorithms - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16262) Impossible to remake new SparkContext using SparkSession API in Pyspark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:12 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14333) Duration of task should be the total time (not just computation time) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10183) Expose the SparkR backend api - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15145) port binary classification evaluator to spark.ml - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13683) Finalize the public interface for OutputWriter[Factory] - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:13 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18017) Changing Hadoop parameter through sparkSession.sparkContext.hadoopConfiguration doesn't work - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16423) Inconsistent settings on the first day of a week - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15872) Dataset of Array of Custom case class throws MissingRequirementError - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11228) Job stuck in Executor failure loop when NettyTransport failed to bind - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10555) Add INotifyDStream to Spark Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12528) Make Apache Sparkā€™s gateway hidden REST API (in standalone cluster mode) public API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:14 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11704) Optimize the Cartesian Join - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:15 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13428) Pushdown Aggregate Functions in Sort into Aggregate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7706) Allow setting YARN_CONF_DIR from spark argument - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10069) Python's ReduceByKeyAndWindow DStream Keeps Growing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16623) Datasources should expose ability to define schema conversions they're performing to save data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:16 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18041) activedrivers section in http:sparkMasterurl/json is missing Main class information - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14297) DAG visualization cropped in SparkUI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12428) Write a script to run all PySpark MLlib examples for testing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10010) Decide on a name for generating Linear/LogisticRegressionSummary on test set data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11698) Add option to ignore kafka messages that are out of limit rate - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:17 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14890) DAGScheduler should not accept the result of a previous task attempt, since its stage attempt has been completed. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16701) Make parameters configurable in BlockManager - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8142) Spark Job Fails with ResultTask ClassCastException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10333) Add user guide for linear-methods.md columns - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:18 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13799) Add memory back pressure for Spark Streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16767) existsRecursively method in UserDefinedType is not correct - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13059) Sort inputsplits by size in HadoopRDD to avoid long tails - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15564) App name is the main class name in Spark streaming jobs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14283) Avoid sort in randomSplit when possible - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11558) Get ML feature names from column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14041) Locate possible duplicates and group them into subtasks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17481) Flaky test: org.apache.spark.DistributedSuite.passing environment variables to cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16680) Set spark.driver.userClassPathFirst=true, and run spark-sql failed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15169) Consider improving HasSolver to allow generalization - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9957) Spark ML trees should filter out 1-category features - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:19 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11977) Support accessing a DataFrame column using its name without backticks if the name contains '.' - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14559) Netty RPC didn't check channel is active before sending message - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15859) Optimize the Partition Pruning with Disjunction - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16093) Spark2.0 take no effect after set spark.sql.autoBroadcastJoinThreshold = 1 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7494) spark.ml Model should call copyValues in construction - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10478) Improve spark.ml.ann implementations for MLP - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15930) Add Row count property to FPGrowth model - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16629) UDTs can not be compared to DataTypes in dataframes. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-16718) gbm-style treeboost - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15619) spark builds filling up /tmp - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13216) Spark streaming application not honoring --num-executors in restarting of an application from a checkpoint - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11529) Add section in user guide for StreamingLogisticRegressionWithSGD - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14788) Initial number of executors should honor min number if streaming dynamic allocation is enabled - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15890) Support Stata-like tabulation of values in a single column, optionally with weights - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13375) PySpark API Utils missing item: kFold - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7499) Investigate how to specify columns in SparkR without $ or strings - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-12185) Add Histogram support to Spark SQL/DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:20 UTC, 0 replies.
- [jira] [Resolved] (SPARK-10869) Auto-normalization of semi-structured schema from a dataframe - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-9313) Enable a "docker run" invocation in place of PYSPARK_PYTHON - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17020) Materialization of RDD via DataFrame.rdd forces a poor re-distribution of data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15503) Not able to join two hive tables having partition using HiveContext.sql - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11733) Allow shuffle readers to request data from just one mapper - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8132) Race condition if task is cancelled with interruption while fetching file dependencies - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-11838) Spark SQL query fragment RDD reuse across queries - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8734) Expose all Mesos DockerInfo options to Spark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13243) jersey WADL generator complains about Seq[] type in REST APIs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8858) Common interface for Frequent Itemsets - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13374) make it possible to create recoverable accumulator for streaming application - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17603) Utilize Hive-generated Statistics For Partitioned Tables - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15161) Consider moving featureImportances into TreeEnsemble models base class - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14192) Executor is dead in Driver but alive in AM when driver losts rpc with executor, but executor is alive. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13116) TungstenAggregate though it is supposedly capable of all processing unsafe & safe rows, fails if the input is safe rows - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17954) FetchFailedException executor cannot connect to another worker executor - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-15855) dataframe.R example fails with "java.io.IOException: No input paths specified in job" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-13946) PySpark DataFrames allows you to silently use aggregate expressions derived from different table expressions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-14262) Correct app state after master leader changed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-7546) Example code for ML Pipelines feature transformations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8341) Significant selector feature transformation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-17156) Add multiclass logistic regression Scala Example - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-8842) Spark SQL - Insert into table Issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:38:22 UTC, 0 replies.
- [jira] [Created] (SPARK-27786) SHA1, MD5, and Base64 expression codegen doesn't work when commons-codec is shaded - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/21 05:30:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27786) SHA1, MD5, and Base64 expression codegen doesn't work when commons-codec is shaded - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/21 05:34:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-3655) Support sorting of values in addition to keys (i.e. secondary sort) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-3514) Provide a utility function for returning the hosts (and number) of live executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:16 UTC, 0 replies.
- [jira] [Updated] (SPARK-3504) KMeans optimization: track distances and unmoved cluster centers across iterations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-5337) respect spark.task.cpus when launch executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-6396) Add timeout control for broadcast - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-5046) Update KinesisReceiver to use updated Receiver API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-5392) Shuffle spill size is shown as negative - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:17 UTC, 0 replies.
- [jira] [Updated] (SPARK-2253) [Core] Disable partial aggregation automatically when reduction factor is low - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-4800) RDD Preview Feature in WebUI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-2673) Improve Enable to attach Debugger to Executors easily - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-5431) SparkSubmitSuite and DriverSuite hang indefinitely if Master fails - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-2296) Refactor util.JsonProtocol for evolvability - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-3031) Create JsonSerializable and move JSON serialization from JsonProtocol into each class - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-5225) Support coalesed Input Metrics from different sources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:18 UTC, 0 replies.
- [jira] [Updated] (SPARK-1486) Support multi-model training in MLlib - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-4698) Data-locality aware Partitioners - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-5091) Hooks for PySpark tasks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-3750) Log ulimit settings at warning if they are too low - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:19 UTC, 0 replies.
- [jira] [Updated] (SPARK-5575) Artificial neural networks for MLlib deep learning - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-5272) Refactor NaiveBayes to support discrete and continuous labels,features - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-5915) Spillable should check every N bytes rather than every 32 elements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-1642) Upgrade FlumeInputDStream's FlumeReceiver to support FLUME-2083 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:20 UTC, 0 replies.
- [jira] [Updated] (SPARK-1747) check for Spark on Yarn ApplicationMaster split brain - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-4524) Add documentation on packaging Python dependencies / installing them on clusters - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-3153) shuffle will run out of space when disks have different free space - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-4716) Avoid shuffle when all-to-all operation has single input and output partition - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-2475) Check whether #cores > #receivers in local mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-765) Test suite should run Spark example programs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:21 UTC, 0 replies.
- [jira] [Updated] (SPARK-3306) Addition of external resource dependency in executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-5104) Distributed Representations of Sentences and Documents - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-6764) Add wheel package support for PySpark - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-6378) srcAttr in graph.triplets don't update when the size of graph is huge - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-4853) Automatically adjust the number of connections between two peers to achieve good performance - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:22 UTC, 0 replies.
- [jira] [Updated] (SPARK-1546) Add AdaBoost algorithm to Spark MLlib - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-4160) Standalone cluster mode does not upload all needed jars to driver node - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-4246) Add testsuite with end-to-end testing of driver failure - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:23 UTC, 0 replies.
- [jira] [Updated] (SPARK-4206) BlockManager warnings in local mode: "Block $blockId already exists on this machine; not re-adding it - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-1107) Add shutdown hook on executor stop to stop running tasks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-6798) Fix Date serialization in SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-3717) DecisionTree, RandomForest: Partition by feature - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-5142) Possibly data may be ruined in Spark Streaming's WAL mechanism. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-5480) GraphX pageRank: java.lang.ArrayIndexOutOfBoundsException: - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-3601) Kryo NPE for output operations on Avro complex Objects even after registering. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:24 UTC, 0 replies.
- [jira] [Updated] (SPARK-4144) Support incremental model training of Naive Bayes classifier - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-799) Windows versions of the deploy scripts - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-4911) Report the inputs and outputs of Spark jobs so that external systems can track data lineage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-4940) Support more evenly distributing cores for Mesos mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-5079) Detect failed jobs / batches in Spark Streaming unit tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:25 UTC, 0 replies.
- [jira] [Updated] (SPARK-3380) DecisionTree: overflow and precision in aggregation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-6312) ChiSqTest should check for too few counts - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-4229) Create hadoop configuration in a consistent way - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-6815) Support accumulators in R - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-5674) Spark Job Explain Plan Proof of Concept - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-1823) ExternalAppendOnlyMap can still OOM if one key is very large - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-5150) Strange implicit resolution behavior in Spark REPL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-1910) Add onBlockComplete API to receiver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-3631) Add docs for checkpoint usage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:26 UTC, 0 replies.
- [jira] [Updated] (SPARK-5044) Update ReliableKafkaReceiver to use updated Receiver API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-3916) recognize appended data in textFileStream() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-3270) Spark API for Application Extensions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:27 UTC, 0 replies.
- [jira] [Updated] (SPARK-2584) Do not mutate block storage level on the UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-5004) PySpark does not handle SOCKS proxy - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-4823) rowSimilarities - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-6183) Skip bad workers when re-launching executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-6160) ChiSqSelector should keep test statistic info - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-3115) Improve task broadcast latency for small tasks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-5045) Update FlumePollingReceiver to use updated Receiver API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-4440) Enhance the job progress API to expose more information - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-2280) Java & Scala reference docs should describe function reference behavior. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-4684) Add a script to run JDBC server on Windows - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:28 UTC, 0 replies.
- [jira] [Updated] (SPARK-3451) spark-submit should support specifying glob wildcards in the --jars CLI option - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-636) Add mechanism to run system management/configuration tasks on all workers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-3251) Clarify learning interfaces - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-5205) Inconsistent behaviour between Streaming job and others, when click kill link in WebUI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-3220) K-Means clusterer should perform K-Means initialization in parallel - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-4723) To abort the stages which have attempted some times - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-1655) In naive Bayes, store conditional probabilities distributively. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:29 UTC, 0 replies.
- [jira] [Updated] (SPARK-6637) Test lambda weighting in implicit ALS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-3703) Ensemble learning methods - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-3137) Use finer grained locking in TorrentBroadcast.readObject - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-5748) Improve Vectors.sqdist implementation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-3257) Enable :cp to add JARs in spark-shell (Scala 2.11) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-3000) Drop old blocks to disk in parallel when memory is not large enough for caching new blocks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-4488) Add control over map-side aggregation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-4555) Add forward compatibility tests to JsonProtocol - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-3957) Broadcast variable memory usage not reflected in UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-1921) Allow duplicate jar files among the app jar and secondary jars in yarn-cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-3134) Update block locations asynchronously in TorrentBroadcast - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:30 UTC, 0 replies.
- [jira] [Updated] (SPARK-6373) Add SSL/TLS for the Netty based BlockTransferService - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-6089) Size of task result fetched can't be found in UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-5037) support dynamic loading of input DStreams in pyspark streaming - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-5490) KMeans costs can be incorrect if tasks need to be rerun - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-5782) Python Worker / Pyspark Daemon Memory Issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:31 UTC, 0 replies.
- [jira] [Updated] (SPARK-1863) Allowing user jars to take precedence over Spark jars does not work as expected - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-2245) VertexRDD can not be materialized for checkpointing - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-5316) DAGScheduler may make shuffleToMapStage leak if getParentStages failes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-3835) Spark applications that are killed should show up as "KILLED" or "CANCELLED" in the Spark UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-6624) Convert filters into CNF for data sources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-2545) Add a diagnosis mode for closures to figure out what they're bringing in - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-5685) Show warning when users open text files compressed with non-splittable algorithms like gzip - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-3218) K-Means clusterer can fail on degenerate data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-3155) Support DecisionTree pruning - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:32 UTC, 0 replies.
- [jira] [Updated] (SPARK-4174) Streaming: Optionally provide notifications to Receivers when DStream has been generated - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-6619) Improve Jar caching on executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-4782) Add inferSchema support for RDD[Map[String, Any]] - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-4287) Add TTL-based cleanup in external shuffle service - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-2723) Block Manager should catch exceptions in putValues - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-5646) Record output metrics for cache - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-5264) Support `drop temporary table [if exists]` DDL command - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-5918) Spark Thrift server reports metadata for VARCHAR column as STRING in result set schema - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-4868) Twitter DStream.map() throws "Task not serializable" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-5961) Allow specific nodes in a Spark Streaming cluster to be dedicated/preferred as Receiver Worker Nodes versus regular Spark Worker Nodes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-6760) Sketch algorithms for SQL/DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-2913) Spark's log4j.properties should always appear ahead of Hadoop's on classpath - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:33 UTC, 0 replies.
- [jira] [Updated] (SPARK-6415) Spark Streaming fail-fast: Stop scheduling jobs when a batch fails, and kills the app - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-6026) Eliminate the bypassMergeThreshold parameter and associated hash-ish shuffle within the Sort shuffle code - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-4024) Remember user preferences for metrics to show in the UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-6497) Class is not registered: scala.reflect.ManifestFactory$$anon$9 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:34 UTC, 0 replies.
- [jira] [Updated] (SPARK-3166) Custom serialisers can't be shipped in application jars - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-5713) Support python serialization for RandomForest - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-3244) Add fate sharing across related files in Jenkins - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-1844) Support maven-style dependency resolution in sbt build - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:35 UTC, 0 replies.
- [jira] [Updated] (SPARK-4112) Have a reserved copy of Sorter/SortDataFormat - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-4125) Serializer should work on ManagedBuffers as well - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-6398) Improve utility of GaussianMixture for higher dimensional data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-4960) Interceptor pattern in receivers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-1739) Close PR's after period of inactivity - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:36 UTC, 0 replies.
- [jira] [Updated] (SPARK-3210) Flume Polling Receiver must be more tolerant to connection failures. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-6441) Add Deflation/Schur Complement to Power Iteration Clustering for improved resilience to inter-class collisions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-4886) Support cache control for each partition of a Hive partitioned table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-5114) Should Evaluator be a PipelineStage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-3735) Sending the factor directly or AtA based on the cost in ALS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-6476) Spark fileserver not started on same IP as using spark.driver.host - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:37 UTC, 0 replies.
- [jira] [Updated] (SPARK-1272) Don't fail job if some local directories are buggy - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-3963) Support getting task-scoped properties from TaskContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-6808) Checkpointing after zipPartitions results in NODE_LOCAL execution - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-4540) Improve Executor ID Logging - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-6377) Set the number of map output partitions for Exchange operator automatically based on the size of input tables and the reduce-side operation. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-6165) Aggregate and reduce should be able to work with very large number of tasks. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-3847) Enum.hashCode is only consistent within the same JVM - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-5506) java.lang.ClassCastException using lambda expressions in combination of spark and Servlet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-5497) start-all script not working properly on Standalone HA cluster (with Zookeeper) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-6617) Word2Vec is nondeterministic - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-4500) Improve exact stratified sampling implementation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-4489) JavaPairRDD.collectAsMap from checkpoint RDD may fail with ClassCastException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-6106) Support user group mapping and groups in view, modify and admin acls - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-6380) Resolution of equi-join key in post-join projection - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:38 UTC, 0 replies.
- [jira] [Updated] (SPARK-5372) Change the default storage level of window operators - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-6208) executor-memory does not work when using local cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-5043) Implement updated Receiver API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-2610) When spark.serializer is set as org.apache.spark.serializer.KryoSerializer, importing a method causes multiple spark applications creations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-4885) Enable fetched blocks to exceed 2 GB - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-4621) Shuffle index can be cached for SortShuffleManager in ExternalShuffle in order to reduce indexFile's io - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-5077) Map output statuses can still exceed spark.akka.frameSize - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-2541) Standalone mode can't access secure HDFS anymore - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-6139) Allow pre-populate sliding window with initial data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-4628) Put external projects and examples behind a build flag - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-4321) Make Kryo serialization work for closures - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-5042) Updated Receiver API to make it easier to write reliable receivers that ack source - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-3163) Separate continuous and categorical features in DecisionTree - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-6148) cachedDataSourceTables may store outdated metadata if the table is updated from another HiveContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-6613) Starting stream from checkpoint causes Streaming tab to throw error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-2581) complete or withdraw visitedStages optimization in DAGSchedulerā€™s stageDependsOn - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:39 UTC, 0 replies.
- [jira] [Updated] (SPARK-6582) Support ssl for this AvroSink in Spark Streaming External - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-6462) UpdateStateByKey should allow inner join of new with old keys - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-4653) DAGScheduler refactoring and cleanup - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-5880) Change log level of batch pruning string in InMemoryColumnarTableScan from Info to Debug - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-2408) RDD.map(func) dependencies issue after checkpoint & count - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-5571) LDA should handle text as well - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-5488) SPARK_LOCAL_IP not read by mesos scheduler - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-6359) Expose IMain binding as part of ILoop Developer API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-4545) If first Spark Streaming batch fails, it waits 10x batch duration before stopping - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-5569) Checkpoints cannot reference classes defined outside of Spark's assembly - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:40 UTC, 0 replies.
- [jira] [Updated] (SPARK-4679) Race condition in querying the Spark UI JSON endpoint when Jetty context handlers are added and removed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:35:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3244) Add fate sharing across related files in Jenkins - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3155) Support DecisionTree pruning - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3137) Use finer grained locking in TorrentBroadcast.readObject - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3735) Sending the factor directly or AtA based on the cost in ALS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-1272) Don't fail job if some local directories are buggy - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3963) Support getting task-scoped properties from TaskContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:21 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3703) Ensemble learning methods - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6398) Improve utility of GaussianMixture for higher dimensional data - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6377) Set the number of map output partitions for Exchange operator automatically based on the size of input tables and the reduce-side operation. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:22 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4440) Enhance the job progress API to expose more information - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-2723) Block Manager should catch exceptions in putValues - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5225) Support coalesed Input Metrics from different sources - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-2296) Refactor util.JsonProtocol for evolvability - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6208) executor-memory does not work when using local cluster - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:23 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5646) Record output metrics for cache - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-765) Test suite should run Spark example programs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4853) Automatically adjust the number of connections between two peers to achieve good performance - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4160) Standalone cluster mode does not upload all needed jars to driver node - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4125) Serializer should work on ManagedBuffers as well - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:24 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5114) Should Evaluator be a PipelineStage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4800) RDD Preview Feature in WebUI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6359) Expose IMain binding as part of ILoop Developer API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5004) PySpark does not handle SOCKS proxy - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:25 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5392) Shuffle spill size is shown as negative - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3220) K-Means clusterer should perform K-Means initialization in parallel - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4555) Add forward compatibility tests to JsonProtocol - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3163) Separate continuous and categorical features in DecisionTree - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4679) Race condition in querying the Spark UI JSON endpoint when Jetty context handlers are added and removed - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6441) Add Deflation/Schur Complement to Power Iteration Clustering for improved resilience to inter-class collisions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3514) Provide a utility function for returning the hosts (and number) of live executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:26 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3210) Flume Polling Receiver must be more tolerant to connection failures. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5506) java.lang.ClassCastException using lambda expressions in combination of spark and Servlet - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4287) Add TTL-based cleanup in external shuffle service - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5042) Updated Receiver API to make it easier to write reliable receivers that ack source - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4886) Support cache control for each partition of a Hive partitioned table - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:27 UTC, 0 replies.
- [jira] [Resolved] (SPARK-1910) Add onBlockComplete API to receiver - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6613) Starting stream from checkpoint causes Streaming tab to throw error - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6183) Skip bad workers when re-launching executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-1747) check for Spark on Yarn ApplicationMaster split brain - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:28 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5748) Improve Vectors.sqdist implementation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-2584) Do not mutate block storage level on the UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5571) LDA should handle text as well - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:29 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5782) Python Worker / Pyspark Daemon Memory Issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3451) spark-submit should support specifying glob wildcards in the --jars CLI option - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5044) Update ReliableKafkaReceiver to use updated Receiver API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4321) Make Kryo serialization work for closures - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4940) Support more evenly distributing cores for Mesos mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3031) Create JsonSerializable and move JSON serialization from JsonProtocol into each class - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:30 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4112) Have a reserved copy of Sorter/SortDataFormat - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6089) Size of task result fetched can't be found in UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4960) Interceptor pattern in receivers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5918) Spark Thrift server reports metadata for VARCHAR column as STRING in result set schema - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:31 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4885) Enable fetched blocks to exceed 2 GB - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5480) GraphX pageRank: java.lang.ArrayIndexOutOfBoundsException: - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5490) KMeans costs can be incorrect if tasks need to be rerun - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6637) Test lambda weighting in implicit ALS - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4524) Add documentation on packaging Python dependencies / installing them on clusters - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6760) Sketch algorithms for SQL/DataFrames - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:32 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4653) DAGScheduler refactoring and cleanup - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:33 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6160) ChiSqSelector should keep test statistic info - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:34 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5372) Change the default storage level of window operators - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3380) DecisionTree: overflow and precision in aggregation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3750) Log ulimit settings at warning if they are too low - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5150) Strange implicit resolution behavior in Spark REPL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6378) srcAttr in graph.triplets don't update when the size of graph is huge - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5104) Distributed Representations of Sentences and Documents - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:35 UTC, 0 replies.
- [jira] [Resolved] (SPARK-1863) Allowing user jars to take precedence over Spark jars does not work as expected - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5575) Artificial neural networks for MLlib deep learning - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:36 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6165) Aggregate and reduce should be able to work with very large number of tasks. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6808) Checkpointing after zipPartitions results in NODE_LOCAL execution - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4540) Improve Executor ID Logging - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:38 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5915) Spillable should check every N bytes rather than every 32 elements - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3134) Update block locations asynchronously in TorrentBroadcast - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-799) Windows versions of the deploy scripts - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5685) Show warning when users open text files compressed with non-splittable algorithms like gzip - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:39 UTC, 0 replies.
- [jira] [Resolved] (SPARK-2610) When spark.serializer is set as org.apache.spark.serializer.KryoSerializer, importing a method causes multiple spark applications creations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-1921) Allow duplicate jar files among the app jar and secondary jars in yarn-cluster mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3631) Add docs for checkpoint usage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6462) UpdateStateByKey should allow inner join of new with old keys - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6148) cachedDataSourceTables may store outdated metadata if the table is updated from another HiveContext - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3153) shuffle will run out of space when disks have different free space - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6415) Spark Streaming fail-fast: Stop scheduling jobs when a batch fails, and kills the app - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3504) KMeans optimization: track distances and unmoved cluster centers across iterations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6312) ChiSqTest should check for too few counts - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5045) Update FlumePollingReceiver to use updated Receiver API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3306) Addition of external resource dependency in executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5674) Spark Job Explain Plan Proof of Concept - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-2581) complete or withdraw visitedStages optimization in DAGSchedulerā€™s stageDependsOn - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5046) Update KinesisReceiver to use updated Receiver API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3717) DecisionTree, RandomForest: Partition by feature - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:40 UTC, 0 replies.
- [jira] [Resolved] (SPARK-2253) [Core] Disable partial aggregation automatically when reduction factor is low - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4716) Avoid shuffle when all-to-all operation has single input and output partition - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4868) Twitter DStream.map() throws "Task not serializable" - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4684) Add a script to run JDBC server on Windows - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4500) Improve exact stratified sampling implementation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5142) Possibly data may be ruined in Spark Streaming's WAL mechanism. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5079) Detect failed jobs / batches in Spark Streaming unit tests - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4488) Add control over map-side aggregation - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3835) Spark applications that are killed should show up as "KILLED" or "CANCELLED" in the Spark UI - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5497) start-all script not working properly on Standalone HA cluster (with Zookeeper) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3916) recognize appended data in textFileStream() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4911) Report the inputs and outputs of Spark jobs so that external systems can track data lineage - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-1823) ExternalAppendOnlyMap can still OOM if one key is very large - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5077) Map output statuses can still exceed spark.akka.frameSize - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:41 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5043) Implement updated Receiver API - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5091) Hooks for PySpark tasks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-636) Add mechanism to run system management/configuration tasks on all workers - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4545) If first Spark Streaming batch fails, it waits 10x batch duration before stopping - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6497) Class is not registered: scala.reflect.ManifestFactory$$anon$9 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3115) Improve task broadcast latency for small tasks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6798) Fix Date serialization in SparkR - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4489) JavaPairRDD.collectAsMap from checkpoint RDD may fail with ClassCastException - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4144) Support incremental model training of Naive Bayes classifier - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6026) Eliminate the bypassMergeThreshold parameter and associated hash-ish shuffle within the Sort shuffle code - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:42 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5431) SparkSubmitSuite and DriverSuite hang indefinitely if Master fails - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-1107) Add shutdown hook on executor stop to stop running tasks - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6815) Support accumulators in R - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-6619) Improve Jar caching on executors - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-3601) Kryo NPE for output operations on Avro complex Objects even after registering. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-2913) Spark's log4j.properties should always appear ahead of Hadoop's on classpath - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4698) Data-locality aware Partitioners - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5488) SPARK_LOCAL_IP not read by mesos scheduler - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5713) Support python serialization for RandomForest - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-2408) RDD.map(func) dependencies issue after checkpoint & count - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-4206) BlockManager warnings in local mode: "Block $blockId already exists on this machine; not re-adding it - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-5272) Refactor NaiveBayes to support discrete and continuous labels,features - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-2545) Add a diagnosis mode for closures to figure out what they're bringing in - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:43 UTC, 0 replies.
- [jira] [Resolved] (SPARK-2280) Java & Scala reference docs should describe function reference behavior. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 05:37:43 UTC, 0 replies.
- [jira] [Commented] (SPARK-18277) na.fill() and friends should work on struct fields - posted by "Nicholas Chammas (JIRA)" <ji...@apache.org> on 2019/05/21 06:03:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27637) If exception occured while fetching blocks by netty block transfer service, check whether the relative executor is alive before retry - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/21 06:06:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-19248) Regex_replace works in 1.6 but not in 2.0 - posted by "Nicholas Chammas (JIRA)" <ji...@apache.org> on 2019/05/21 06:20:00 UTC, 1 replies.
- [jira] [Reopened] (SPARK-19248) Regex_replace works in 1.6 but not in 2.0 - posted by "Nicholas Chammas (JIRA)" <ji...@apache.org> on 2019/05/21 06:21:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27787) Eliminate uncessary job to compute SSreg - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/21 06:39:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27787) Eliminate uncessary job to compute SSreg - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/21 06:42:00 UTC, 2 replies.
- [jira] [Comment Edited] (SPARK-18277) na.fill() and friends should work on struct fields - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 07:06:00 UTC, 0 replies.
- [jira] [Reopened] (SPARK-23626) Spark DAGScheduler scheduling performance hindered on JobSubmitted Event - posted by "Ajith S (JIRA)" <ji...@apache.org> on 2019/05/21 07:13:01 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-15348) Hive ACID - posted by "Gowtham SB (JIRA)" <ji...@apache.org> on 2019/05/21 07:28:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-12045) Use joda's DateTime to replace Calendar - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/21 07:58:01 UTC, 0 replies.
- [jira] [Reopened] (SPARK-12045) Use joda's DateTime to replace Calendar - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/21 07:58:01 UTC, 0 replies.
- [jira] [Commented] (SPARK-13155) add runtime null check when convert catalyst array to external array - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/21 07:59:00 UTC, 0 replies.
- [jira] [Reopened] (SPARK-13155) add runtime null check when convert catalyst array to external array - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/21 07:59:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27788) Session aware event log for Spark thrift server - posted by "Lantao Jin (JIRA)" <ji...@apache.org> on 2019/05/21 08:29:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27788) Session aware event log for Spark thrift server - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/21 08:35:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27789) Use stopEarly in codegen of ColumnarBatchScan - posted by "EdisonWang (JIRA)" <ji...@apache.org> on 2019/05/21 10:20:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27789) Use stopEarly in codegen of ColumnarBatchScan - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/21 10:26:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27599) DataFrameWriter.partitionBy should be optional when writing to a hive table - posted by "Alexander Fedosov (JIRA)" <ji...@apache.org> on 2019/05/21 10:51:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27790) Support SQL INTERVAL types - posted by "Maxim Gekk (JIRA)" <ji...@apache.org> on 2019/05/21 11:47:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27791) Support SQL year-month INTERVAL type - posted by "Maxim Gekk (JIRA)" <ji...@apache.org> on 2019/05/21 11:52:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27792) SkewJoin hint - posted by "Jason Guo (JIRA)" <ji...@apache.org> on 2019/05/21 11:54:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27791) Support SQL year-month INTERVAL type - posted by "Maxim Gekk (JIRA)" <ji...@apache.org> on 2019/05/21 11:56:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27793) Support SQL day-time INTERVAL type - posted by "Maxim Gekk (JIRA)" <ji...@apache.org> on 2019/05/21 12:00:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27790) Support SQL INTERVAL types - posted by "Maxim Gekk (JIRA)" <ji...@apache.org> on 2019/05/21 12:00:01 UTC, 0 replies.
- [jira] [Commented] (SPARK-8526) Provide a way to define custom metrics and custom metric sink in Spark - posted by "Dmitry Goldenberg (JIRA)" <ji...@apache.org> on 2019/05/21 12:01:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27792) SkewJoin hint - posted by "Jason Guo (JIRA)" <ji...@apache.org> on 2019/05/21 12:04:01 UTC, 9 replies.
- [jira] [Updated] (SPARK-27792) SkewJoin--handle only skewed keys with broadcastjoin and other keys with normal join - posted by "Jason Guo (JIRA)" <ji...@apache.org> on 2019/05/21 12:29:00 UTC, 3 replies.
- [jira] [Assigned] (SPARK-27792) SkewJoin--handle only skewed keys with broadcastjoin and other keys with normal join - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/21 12:48:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-12823) Cannot create UDF with StructType input - posted by "Simeon H.K. Fitch (JIRA)" <ji...@apache.org> on 2019/05/21 12:59:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-15015) Log statements lack file name/number - posted by "John-Michael Reed (JIRA)" <ji...@apache.org> on 2019/05/21 13:03:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-23696) StructType.fromString swallows exceptions from DataType.fromJson - posted by "Simeon H.K. Fitch (JIRA)" <ji...@apache.org> on 2019/05/21 13:05:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-18748) UDF multiple evaluations causes very poor performance - posted by "Attila Kelemen (JIRA)" <ji...@apache.org> on 2019/05/21 13:17:00 UTC, 1 replies.
- [jira] [Comment Edited] (SPARK-18748) UDF multiple evaluations causes very poor performance - posted by "Attila Kelemen (JIRA)" <ji...@apache.org> on 2019/05/21 13:18:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27786) SHA1, MD5, and Base64 expression codegen doesn't work when commons-codec is shaded - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/21 13:20:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-26989) Flaky test:DAGSchedulerSuite.Barrier task failures from the same stage attempt don't trigger multiple stage retries - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/21 13:47:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-20827) cannot express HAVING without a GROUP BY clause - posted by "N Campbell (JIRA)" <ji...@apache.org> on 2019/05/21 14:12:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-20856) support statement using nested joins - posted by "N Campbell (JIRA)" <ji...@apache.org> on 2019/05/21 14:13:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-20829) var_samp returns Nan while other vendors return a null value - posted by "N Campbell (JIRA)" <ji...@apache.org> on 2019/05/21 14:14:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27794) Use secure URLs for downloading CRAN artifacts - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/21 14:21:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27794) Use secure URLs for downloading CRAN artifacts - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/21 14:34:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27795) Spark Web UI is broken when running in local mode within WildFly application server - posted by "Alexander Bouriakov (JIRA)" <ji...@apache.org> on 2019/05/21 14:38:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27795) Spark Web UI is broken when running in local mode within WildFly application server - posted by "Alexander Bouriakov (JIRA)" <ji...@apache.org> on 2019/05/21 14:38:00 UTC, 3 replies.
- [jira] [Commented] (SPARK-27795) Spark Web UI is broken when running in local mode within WildFly application server - posted by "Alexander Bouriakov (JIRA)" <ji...@apache.org> on 2019/05/21 14:39:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-21349) Make TASK_SIZE_TO_WARN_KB configurable - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/21 15:56:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27790) Support SQL INTERVAL types - posted by "Maxim Gekk (JIRA)" <ji...@apache.org> on 2019/05/21 16:18:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27796) Remove obsolete spark-mesos Docker image - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/21 16:46:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27796) Remove obsolete spark-mesos Docker image - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/21 16:52:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-23626) Spark DAGScheduler scheduling performance hindered on JobSubmitted Event - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/21 16:57:01 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27726) Performance of InMemoryStore suffers under load - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/21 17:24:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27762) Support user provided avro schema for writing fields with different ordering - posted by "DB Tsai (JIRA)" <ji...@apache.org> on 2019/05/21 17:35:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-10719) SQLImplicits.rddToDataFrameHolder is not thread safe when using Scala 2.10 - posted by "Shixiong Zhu (JIRA)" <ji...@apache.org> on 2019/05/21 17:46:00 UTC, 0 replies.
- [jira] [Reopened] (SPARK-17858) Provide option for Spark SQL to skip corrupt files - posted by "Shixiong Zhu (JIRA)" <ji...@apache.org> on 2019/05/21 18:01:00 UTC, 0 replies.
- [jira] [Reopened] (SPARK-11095) Simplify Netty RPC implementation by using a separate thread pool for each endpoint - posted by "Shixiong Zhu (JIRA)" <ji...@apache.org> on 2019/05/21 18:02:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27728) Address thread-safety of InMemoryStore and ElementTrackingStores. - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/21 18:11:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27727) Asynchronous ElementStore cleanup should have only one pending cleanup per class - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/21 18:11:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27730) Add support for removeAllKeys - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/21 18:12:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27729) Extract deletion of the summaries from the stage deletion loop - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/21 18:12:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27731) Cleanup some non-compile time type checking and exception handling - posted by "David C Navas (JIRA)" <ji...@apache.org> on 2019/05/21 18:13:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27796) Remove obsolete spark-mesos Docker image - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/21 18:25:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27439) Explainging Dataset should show correct resolved plans - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/21 18:28:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27248) REFRESH TABLE should recreate cache with same cache name and storage level - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/21 18:45:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-25139) PythonRunner#WriterThread released block after TaskRunner finally block which invoke BlockManager#releaseAllLocksForTask - posted by "Reza Safi (JIRA)" <ji...@apache.org> on 2019/05/21 18:54:01 UTC, 2 replies.
- [jira] [Commented] (SPARK-16738) Queryable state for Spark State Store - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/21 19:28:00 UTC, 6 replies.
- [jira] [Created] (SPARK-27797) Shuffle service metric "registeredConnections" not tracked correctly - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/21 20:06:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27495) Support Stage level resource configuration and scheduling - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/21 20:14:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27676) InMemoryFileIndex should hard-fail on missing files instead of logging and continuing - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/21 20:42:00 UTC, 2 replies.
- [jira] [Reopened] (SPARK-20547) ExecutorClassLoader's findClass may not work correctly when a task is cancelled. - posted by "Shixiong Zhu (JIRA)" <ji...@apache.org> on 2019/05/21 22:03:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27798) from_avro can modify variables in other rows - posted by "Yosuke Mori (JIRA)" <ji...@apache.org> on 2019/05/21 22:26:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27798) from_avro can modify variables in other rows - posted by "Yosuke Mori (JIRA)" <ji...@apache.org> on 2019/05/21 22:27:00 UTC, 4 replies.
- [jira] [Updated] (SPARK-27798) from_avro can modify variables in other rows in local mode - posted by "Yosuke Mori (JIRA)" <ji...@apache.org> on 2019/05/21 22:37:00 UTC, 3 replies.
- [jira] [Commented] (SPARK-27798) from_avro can modify variables in other rows in local mode - posted by "Yosuke Mori (JIRA)" <ji...@apache.org> on 2019/05/21 22:39:00 UTC, 8 replies.
- [jira] [Commented] (SPARK-23978) Kryo much slower when mllib jar not on classpath - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/21 22:59:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27799) Allow SerializerManager.canUseKryo whitelist to be extended via a configuration - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/22 00:38:00 UTC, 4 replies.
- [jira] [Created] (SPARK-27799) Allow SerializerManager.canUseKryo to be customized via configuration - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/22 00:38:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27737) Upgrade to 2.3.5 for Hive Metastore Client 2.3 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/22 01:27:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27774) Avoid hardcoded configs - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/22 01:47:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27800) Example for xor function has a wrong answer - posted by "Alex Liu (JIRA)" <ji...@apache.org> on 2019/05/22 02:16:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27800) Example for xor function has a wrong answer - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/22 02:20:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27800) Example for xor function has a wrong answer - posted by "Alex Liu (JIRA)" <ji...@apache.org> on 2019/05/22 02:21:02 UTC, 1 replies.
- [jira] [Commented] (SPARK-27800) Example for xor function has a wrong answer - posted by "Alex Liu (JIRA)" <ji...@apache.org> on 2019/05/22 02:24:00 UTC, 0 replies.
- [jira] [Issue Comment Deleted] (SPARK-27800) Example for xor function has a wrong answer - posted by "Alex Liu (JIRA)" <ji...@apache.org> on 2019/05/22 02:25:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27698) Add new method for getting pushed down filters in Parquet file reader - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/22 03:32:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24586) Upcast should not allow casting from string to other types - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/22 03:37:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27778) toPandas with arrow enabled fails for DF with no partitions - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/22 04:24:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27800) Example for xor function has a wrong answer - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/22 04:57:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27797) Shuffle service metric "registeredConnections" not tracked correctly - posted by "Steven Rand (JIRA)" <ji...@apache.org> on 2019/05/22 05:40:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27801) InMemoryFileIndex.listLeafFiles should use listLocatedStatus for DistributedFileSystem - posted by "Rob Russo (JIRA)" <ji...@apache.org> on 2019/05/22 05:41:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27801) InMemoryFileIndex.listLeafFiles should use listLocatedStatus for DistributedFileSystem - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/22 05:46:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-16820) Sparse - Sparse matrix multiplication - posted by "Ohad Raviv (JIRA)" <ji...@apache.org> on 2019/05/22 06:03:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-16820) Sparse - Sparse matrix multiplication - posted by "Ohad Raviv (JIRA)" <ji...@apache.org> on 2019/05/22 06:04:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27773) Add shuffle service metric for number of exceptions caught in ExternalShuffleBlockHandler - posted by "Steven Rand (JIRA)" <ji...@apache.org> on 2019/05/22 06:49:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27802) SparkUI throws NoSuchElementException when inconsistency appears between `ExecutorStageSummaryWrapper`s and `ExecutorSummaryWrapper`s - posted by "liupengcheng (JIRA)" <ji...@apache.org> on 2019/05/22 06:59:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27802) SparkUI throws NoSuchElementException when inconsistency appears between `ExecutorStageSummaryWrapper`s and `ExecutorSummaryWrapper`s - posted by "liupengcheng (JIRA)" <ji...@apache.org> on 2019/05/22 07:00:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-19468) Dataset slow because of unnecessary shuffles - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/22 07:17:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27803) fix column pruning for python UDF - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/22 07:23:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27803) fix column pruning for python UDF - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/22 07:38:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-26045) Error in the spark 2.4 release package with the spark-avro_2.11 depdency - posted by "Renze Post (JIRA)" <ji...@apache.org> on 2019/05/22 11:25:01 UTC, 1 replies.
- [jira] [Created] (SPARK-27804) LiveListenerBus#addToQueue : create multiple AsyncEventQueues under race condition - posted by "duyu (JIRA)" <ji...@apache.org> on 2019/05/22 11:38:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27804) LiveListenerBus#addToQueue : create multiple AsyncEventQueues under race condition - posted by "duyu (JIRA)" <ji...@apache.org> on 2019/05/22 11:43:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27804) LiveListenerBus#addToQueue : create multiple AsyncEventQueues under race condition - posted by "Jungtaek Lim (JIRA)" <ji...@apache.org> on 2019/05/22 12:24:00 UTC, 2 replies.
- [jira] [Assigned] (SPARK-27804) LiveListenerBus#addToQueue : create multiple AsyncEventQueues under race condition - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/22 12:39:00 UTC, 1 replies.
- [jira] [Comment Edited] (SPARK-27804) LiveListenerBus#addToQueue : create multiple AsyncEventQueues under race condition - posted by "duyu (JIRA)" <ji...@apache.org> on 2019/05/22 14:09:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27805) toPandas does not propagate SparkExceptions with arrow enabled - posted by "David Vogelbacher (JIRA)" <ji...@apache.org> on 2019/05/22 15:09:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27805) toPandas does not propagate SparkExceptions with arrow enabled - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/22 15:32:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27806) byName/byPosition should apply to struct fields as well - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/22 15:56:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27806) byName/byPosition should apply to struct fields as well - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/22 16:18:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27807) Parallel resolve leaf statuses in InMemoryFileIndex - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/22 16:38:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27807) Parallel resolve leaf statuses in InMemoryFileIndex - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/22 16:42:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27807) Parallel resolve leaf statuses in InMemoryFileIndex - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/22 16:42:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27808) Ability to ignore existing files for structured streaming - posted by "Vladimir Matveev (JIRA)" <ji...@apache.org> on 2019/05/22 17:26:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27808) Ability to ignore existing files for structured streaming - posted by "Vladimir Matveev (JIRA)" <ji...@apache.org> on 2019/05/22 17:27:00 UTC, 2 replies.
- [jira] [Reopened] (SPARK-16824) Add API docs for VectorUDT - posted by "Nicholas Chammas (JIRA)" <ji...@apache.org> on 2019/05/22 18:26:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27804) LiveListenerBus#addToQueue : create multiple AsyncEventQueues under race condition - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/22 18:40:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-26045) Error in the spark 2.4 release package with the spark-avro_2.11 depdency - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/22 20:27:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27809) Make optional clauses order insensitive for CREATE DATABASE/VIEW SQL statement - posted by "Yesheng Ma (JIRA)" <ji...@apache.org> on 2019/05/22 21:45:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27809) Make optional clauses order insensitive for CREATE DATABASE/VIEW SQL statement - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/22 21:50:00 UTC, 2 replies.
- [jira] [Assigned] (SPARK-20547) ExecutorClassLoader's findClass may not work correctly when a task is cancelled. - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/22 21:55:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-23153) Support application dependencies in submission client's local file system - posted by "Erik Erlandson (JIRA)" <ji...@apache.org> on 2019/05/22 23:26:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23153) Support application dependencies in submission client's local file system - posted by "Erik Erlandson (JIRA)" <ji...@apache.org> on 2019/05/22 23:26:01 UTC, 0 replies.
- [jira] [Created] (SPARK-27810) PySpark breaks Cloudpickle serialization of collections.namedtuple objects - posted by "Travis Addair (JIRA)" <ji...@apache.org> on 2019/05/23 01:14:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27810) PySpark breaks Cloudpickle serialization of collections.namedtuple objects - posted by "Travis Addair (JIRA)" <ji...@apache.org> on 2019/05/23 01:16:00 UTC, 2 replies.
- [jira] [Reopened] (SPARK-15015) Log statements lack file name/number - posted by "John-Michael Reed (JIRA)" <ji...@apache.org> on 2019/05/23 02:48:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27807) Parallel resolve leaf statuses in InMemoryFileIndex - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/23 03:20:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27811) Docs of spark.driver.memoryOverhead and spark.executor.memoryOverhead exists a little ambiguity - posted by "jiaan.geng (JIRA)" <ji...@apache.org> on 2019/05/23 04:24:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27811) Docs of spark.driver.memoryOverhead and spark.executor.memoryOverhead exists a little ambiguity - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/23 04:25:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27811) Docs of spark.driver.memoryOverhead and spark.executor.memoryOverhead exists a little ambiguity - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/23 04:25:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-26388) No support for "alter table .. replace columns" to drop columns - posted by "Mathew Wicks (JIRA)" <ji...@apache.org> on 2019/05/23 06:37:00 UTC, 3 replies.
- [jira] [Created] (SPARK-27812) kubernetes client import non-daemon thread which block jvm exit. - posted by "Henry Yu (JIRA)" <ji...@apache.org> on 2019/05/23 06:54:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27657) ml.util.Instrumentation.logFailure doesn't log error message - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/23 06:59:01 UTC, 2 replies.
- [jira] [Created] (SPARK-27813) DataSourceV2: Add DropTable logical operation - posted by "John Zhuge (JIRA)" <ji...@apache.org> on 2019/05/23 07:48:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27814) The cast operation may push down an correct filter, which is fatal. - posted by "feiwang (JIRA)" <ji...@apache.org> on 2019/05/23 08:42:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27814) The cast operation may push down uncorrect filter, which is fatal. - posted by "feiwang (JIRA)" <ji...@apache.org> on 2019/05/23 08:58:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27814) The cast operation may push down uncorrect filter, which is fatal. - posted by "feiwang (JIRA)" <ji...@apache.org> on 2019/05/23 08:59:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27815) do not support file source v2 in DataFrameWriter - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/23 09:02:01 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27814) The cast operation may push down uncorrect filter, which is fatal. - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/23 09:14:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27813) DataSourceV2: Add DropTable logical operation - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/23 09:16:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27813) DataSourceV2: Add DropTable logical operation - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/23 09:16:00 UTC, 2 replies.
- [jira] [Updated] (SPARK-27814) The cast operation for partitioned column may push down uncorrect filter, which is fatal. - posted by "feiwang (JIRA)" <ji...@apache.org> on 2019/05/23 09:29:00 UTC, 3 replies.
- [jira] [Created] (SPARK-27816) make TreeNode tag type safe - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/23 10:08:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27816) make TreeNode tag type safe - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/23 10:10:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27817) Is it possible to achieve global scheduling optimization by predicting task execution times (e.g., training models with historical data using machine learning)? - posted by "shenghuang (JIRA)" <ji...@apache.org> on 2019/05/23 10:40:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27818) Spark Structured Streaming executors fails with OutOfMemoryError due to KafkaMbeans - posted by "Ruslan Taran (JIRA)" <ji...@apache.org> on 2019/05/23 11:41:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27818) Spark Structured Streaming executors fails with OutOfMemoryError due to KafkaMbeans - posted by "Ruslan Taran (JIRA)" <ji...@apache.org> on 2019/05/23 11:42:00 UTC, 3 replies.
- [jira] [Created] (SPARK-27819) Retry cleanup of disk persisted RDD via external shuffle service when it failed via executor - posted by "Attila Zsolt Piros (JIRA)" <ji...@apache.org> on 2019/05/23 11:59:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27819) Retry cleanup of disk persisted RDD via external shuffle service when it failed via executor - posted by "Attila Zsolt Piros (JIRA)" <ji...@apache.org> on 2019/05/23 12:00:01 UTC, 0 replies.
- [jira] [Commented] (SPARK-27819) Retry cleanup of disk persisted RDD via external shuffle service when it failed via executor - posted by "Attila Zsolt Piros (JIRA)" <ji...@apache.org> on 2019/05/23 12:03:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27820) case insensitive resolver should be used in GetMapValue - posted by "Michel Lemay (JIRA)" <ji...@apache.org> on 2019/05/23 13:34:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27820) case insensitive resolver should be used in GetMapValue - posted by "Michel Lemay (JIRA)" <ji...@apache.org> on 2019/05/23 13:35:00 UTC, 2 replies.
- [jira] [Created] (SPARK-27821) Spark WebUI - show numbers of drivers/apps in waiting/submitted/killed/running state - posted by "t oo (JIRA)" <ji...@apache.org> on 2019/05/23 15:22:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27821) Spark WebUI - show numbers of drivers/apps in waiting/submitted/killed/running state - posted by "t oo (JIRA)" <ji...@apache.org> on 2019/05/23 15:23:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27815) do not leak SaveMode to file source v2 - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/23 16:06:00 UTC, 3 replies.
- [jira] [Created] (SPARK-27822) Spark WebUi - for running applications have a drivername column - posted by "t oo (JIRA)" <ji...@apache.org> on 2019/05/23 16:07:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-25331) Structured Streaming File Sink duplicates records in case of driver failure - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/23 17:15:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-25331) Structured Streaming File Sink duplicates records in case of driver failure - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/23 17:16:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27322) DataSourceV2: Logical relation in multiple catalogs - posted by "John Zhuge (JIRA)" <ji...@apache.org> on 2019/05/23 17:31:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27806) byName/byPosition should apply to struct fields as well - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/23 17:40:01 UTC, 0 replies.
- [jira] [Updated] (SPARK-27539) Fix inaccurate aggregate outputRows estimation with column containing null values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/23 18:22:00 UTC, 0 replies.
- [jira] [Reopened] (SPARK-27539) Fix inaccurate aggregate outputRows estimation with column containing null values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/23 18:22:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27351) Wrong outputRows estimation after AggregateEstimation with only null value column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/23 18:26:00 UTC, 0 replies.
- [jira] [Reopened] (SPARK-27351) Wrong outputRows estimation after AggregateEstimation with only null value column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/23 18:26:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27351) Wrong outputRows estimation after AggregateEstimation with only null value column - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/23 18:35:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27488) Driver interface to support GPU resources - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/23 18:47:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27823) Add an abstraction layer for accelerator resource handling to avoid manipulating raw confs - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/23 18:50:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27816) make TreeNode tag type safe - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/23 18:54:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27539) Fix inaccurate aggregate outputRows estimation with column containing null values - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/23 19:34:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27677) Disk-persisted RDD blocks served by shuffle service, and ignored for Dynamic Allocation - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/23 20:17:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-26045) Error in the spark 2.4 release package with the spark-avro_2.11 depdency - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/23 20:23:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27711) InputFileBlockHolder should be unset at the end of tasks - posted by "Shixiong Zhu (JIRA)" <ji...@apache.org> on 2019/05/23 20:38:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27351) Wrong outputRows estimation after AggregateEstimation with only null value column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/23 20:42:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27539) Fix inaccurate aggregate outputRows estimation with column containing null values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/23 20:43:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27818) Spark Structured Streaming executors fails with OutOfMemoryError due to KafkaMbeans - posted by "Jungtaek Lim (JIRA)" <ji...@apache.org> on 2019/05/23 22:02:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-23710) Upgrade the built-in Hive to 2.3.5 for hadoop-3.2 - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/23 22:42:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27737) Upgrade to Hive 2.3.5 for Hive Metastore Client and Hadoop-3.2 profile - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/23 22:43:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-27818) Spark Structured Streaming executors fails with OutOfMemoryError due to KafkaMbeans - posted by "Jungtaek Lim (JIRA)" <ji...@apache.org> on 2019/05/23 22:46:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27824) Make rule EliminateResolvedHint idempotent - posted by "Maryann Xue (JIRA)" <ji...@apache.org> on 2019/05/23 22:59:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-18673) Dataframes doesn't work on Hadoop 3.x; Hive rejects Hadoop version - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/23 23:04:02 UTC, 0 replies.
- [jira] [Updated] (SPARK-18673) Dataframes doesn't work on Hadoop 3.x; Hive rejects Hadoop version - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/23 23:04:04 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27824) Make rule EliminateResolvedHint idempotent - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/23 23:05:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27738) Upgrade the built-in Hive to 2.3.5 for hadoop-3.2 - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/23 23:08:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27770) Add aggregates.sql - Part1 - posted by "Xingbo Jiang (JIRA)" <ji...@apache.org> on 2019/05/23 23:38:00 UTC, 0 replies.
- [jira] [Issue Comment Deleted] (SPARK-18673) Dataframes doesn't work on Hadoop 3.x; Hive rejects Hadoop version - posted by "KaiXu (JIRA)" <ji...@apache.org> on 2019/05/24 01:42:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27825) spark thriftserver session's first username cause the impersonation issue. - posted by "wangxinxin (JIRA)" <ji...@apache.org> on 2019/05/24 02:12:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27825) spark thriftserver session's first username cause the impersonation issue. - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/24 02:14:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27826) saveAsTable() function case table have "HiveFileFormat" "ParquetFileFormat" format issue - posted by "fengtlyer (JIRA)" <ji...@apache.org> on 2019/05/24 02:31:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27827) File does not exist notice is misleading in FileScanRDD - posted by "zhoukang (JIRA)" <ji...@apache.org> on 2019/05/24 02:43:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-17477) SparkSQL cannot handle schema evolution from Int -> Long when parquet files have Int as its type while hive metastore has Long as its type - posted by "Mathew Wicks (JIRA)" <ji...@apache.org> on 2019/05/24 02:47:01 UTC, 0 replies.
- [jira] [Commented] (SPARK-16544) Support for conversion from compatible schema for Parquet data source when data types are not matched - posted by "Mathew Wicks (JIRA)" <ji...@apache.org> on 2019/05/24 02:47:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27440) Optimize uncorrelated predicate subquery - posted by "Mingcong Han (JIRA)" <ji...@apache.org> on 2019/05/24 02:50:00 UTC, 1 replies.
- [jira] [Issue Comment Deleted] (SPARK-16544) Support for conversion from compatible schema for Parquet data source when data types are not matched - posted by "Mathew Wicks (JIRA)" <ji...@apache.org> on 2019/05/24 02:53:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-17477) SparkSQL cannot handle schema evolution from Int -> Long when parquet files have Int as its type while hive metastore has Long as its type - posted by "Mathew Wicks (JIRA)" <ji...@apache.org> on 2019/05/24 02:53:00 UTC, 0 replies.
- [jira] [Reopened] (SPARK-27440) Optimize uncorrelated predicate subquery - posted by "Mingcong Han (JIRA)" <ji...@apache.org> on 2019/05/24 02:54:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27732) DataSourceV2: Add CreateTable logical operation - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/24 03:15:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27828) spark job hangs when kryo.serializers.FieldSerializer is called under multi-executor-cores settings - posted by "Itsuki Toyota (JIRA)" <ji...@apache.org> on 2019/05/24 03:28:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27828) spark job hangs when kryo.serializers.FieldSerializer is called under multi-executor-cores settings - posted by "Itsuki Toyota (JIRA)" <ji...@apache.org> on 2019/05/24 03:29:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27829) In Dataset.joinWith inner joins, don't nest data before shuffling - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/24 03:32:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27829) In Dataset.joinWith inner joins, don't nest data before shuffling - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/24 03:50:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27828) spark job hangs when kryo.serializers.FieldSerializer is called under multi-executor-cores settings - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/24 03:59:00 UTC, 2 replies.
- [jira] [Updated] (SPARK-27826) saveAsTable() function case table have "HiveFileFormat" "ParquetFileFormat" format issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/24 04:01:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27826) saveAsTable() function case table have "HiveFileFormat" "ParquetFileFormat" format issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/24 04:05:01 UTC, 2 replies.
- [jira] [Updated] (SPARK-27825) spark thriftserver session's first username cause the impersonation issue. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/24 04:06:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27825) spark thriftserver session's first username cause the impersonation issue. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/24 04:06:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27822) Spark WebUi - for running applications have a drivername column - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/24 04:07:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-27820) case insensitive resolver should be used in GetMapValue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/24 04:13:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27817) Is it possible to achieve global scheduling optimization by predicting task execution times (e.g., training models with historical data using machine learning)? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/24 04:14:01 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27818) Spark Structured Streaming executors fails with OutOfMemoryError due to KafkaMbeans - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/24 04:14:01 UTC, 0 replies.
- [jira] [Commented] (SPARK-27817) Is it possible to achieve global scheduling optimization by predicting task execution times (e.g., training models with historical data using machine learning)? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/24 04:14:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27817) Is it possible to achieve global scheduling optimization by predicting task execution times (e.g., training models with historical data using machine learning)? - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/24 04:15:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27812) kubernetes client import non-daemon thread which block jvm exit. - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/24 04:15:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27810) PySpark breaks Cloudpickle serialization of collections.namedtuple objects - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/24 04:19:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-27798) from_avro can modify variables in other rows in local mode - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/24 04:21:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-16544) Support for conversion from compatible schema for Parquet data source when data types are not matched - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/24 04:22:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27830) Show Spark version at app lists of Spark History UI - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/24 06:47:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27830) Show Spark version at app lists of Spark History UI - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/24 06:53:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27822) Spark WebUi - for running applications have a drivername column - posted by "t oo (JIRA)" <ji...@apache.org> on 2019/05/24 08:05:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27831) Move Hive test jars to maven dependency - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/24 08:56:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27831) Move Hive test jars to maven dependency - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/24 09:03:00 UTC, 3 replies.
- [jira] [Created] (SPARK-27832) Don't decompress and create column batch when the task is completed - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/24 10:24:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27832) Don't decompress and create column batch when the task is completed - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/24 10:30:00 UTC, 1 replies.
- [jira] [Reopened] (SPARK-16738) Queryable state for Spark State Store - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/24 10:42:00 UTC, 0 replies.
- [jira] [Issue Comment Deleted] (SPARK-16738) Queryable state for Spark State Store - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/24 10:42:01 UTC, 0 replies.
- [jira] [Commented] (SPARK-27808) Ability to ignore existing files for structured streaming - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/24 10:45:00 UTC, 2 replies.
- [jira] [Comment Edited] (SPARK-27808) Ability to ignore existing files for structured streaming - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/24 10:46:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27833) Structured Streaming Custom Sink -- - posted by "Raviteja (JIRA)" <ji...@apache.org> on 2019/05/24 13:03:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27833) Structured Streaming Custom Sink -- - posted by "Raviteja (JIRA)" <ji...@apache.org> on 2019/05/24 13:04:00 UTC, 3 replies.
- [jira] [Updated] (SPARK-27833) java.lang.AssertionError: assertion failed: No plan for EventTimeWatermark - posted by "Raviteja (JIRA)" <ji...@apache.org> on 2019/05/24 13:06:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27833) Structured Streaming Custom Sink -- java.lang.AssertionError: assertion failed: No plan for EventTimeWatermark - posted by "Raviteja (JIRA)" <ji...@apache.org> on 2019/05/24 13:06:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27833) java.lang.AssertionError: assertion failed: No plan for EventTimeWatermark - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/24 14:20:00 UTC, 4 replies.
- [jira] [Issue Comment Deleted] (SPARK-27833) java.lang.AssertionError: assertion failed: No plan for EventTimeWatermark - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/24 14:22:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27834) Make separate PySpark/SparkR vectorization configurations - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/24 14:45:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27666) Stop python runner threads when task finishes - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/24 14:47:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27835) Resource Scheduling: change driver config from addresses to resourcesFile - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/24 14:54:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27834) Make separate PySpark/SparkR vectorization configurations - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/24 14:59:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27816) make TreeNode tag type safe - posted by "Mark Hamstra (JIRA)" <ji...@apache.org> on 2019/05/24 15:07:01 UTC, 0 replies.
- [jira] [Commented] (SPARK-7898) pyspark merges stderr into stdout - posted by "Kyle Brooks (JIRA)" <ji...@apache.org> on 2019/05/24 15:45:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-7898) pyspark merges stderr into stdout - posted by "Kyle Brooks (JIRA)" <ji...@apache.org> on 2019/05/24 15:45:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27831) Move Hive test jars to maven dependency - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/24 17:35:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-26356) Remove SaveMode from data source v2 API - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/24 17:47:01 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27362) Kubernetes support for GPU-aware scheduling - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/24 18:09:00 UTC, 2 replies.
- [jira] [Comment Edited] (SPARK-27593) CSV Parser returns 2 DataFrame - Valid and Malformed DFs - posted by "Ladislav Jech (JIRA)" <ji...@apache.org> on 2019/05/24 18:16:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27824) Make rule EliminateResolvedHint idempotent - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/24 18:26:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27836) Issue with seeded rand() function in Spark SQL - posted by "Jason Ferrell (JIRA)" <ji...@apache.org> on 2019/05/24 19:22:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27837) Running rand() in SQL with seed of column results in error (rand(col1)) - posted by "Jason Ferrell (JIRA)" <ji...@apache.org> on 2019/05/24 19:24:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-24149) Automatic namespaces discovery in HDFS federation - posted by "Dhruve Ashar (JIRA)" <ji...@apache.org> on 2019/05/24 19:48:01 UTC, 8 replies.
- [jira] [Created] (SPARK-27838) Support user provided non-nullable avro schema for nullable catalyst schema without any null record - posted by "DB Tsai (JIRA)" <ji...@apache.org> on 2019/05/24 20:46:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27838) Support user provided non-nullable avro schema for nullable catalyst schema without any null record - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/24 20:48:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27838) Support user provided non-nullable avro schema for nullable catalyst schema without any null record - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/24 20:48:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27838) Support user provided non-nullable avro schema for nullable catalyst schema without any null record - posted by "DB Tsai (JIRA)" <ji...@apache.org> on 2019/05/24 21:48:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27809) Make optional clauses order insensitive for CREATE DATABASE/VIEW SQL statement - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/24 22:21:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27830) Show Spark version at app lists of Spark History UI - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/25 00:42:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27839) Improve UTF8String.replace() / StringReplace performance - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/25 07:19:01 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27839) Improve UTF8String.replace() / StringReplace performance - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/25 07:20:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27837) Running rand() in SQL with seed of column results in error (rand(col1)) - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/25 10:08:00 UTC, 6 replies.
- [jira] [Commented] (SPARK-27836) Issue with seeded rand() function in Spark SQL - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/25 10:13:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27840) Hadoop attempts to create a temporary folder in root folder - posted by "M. Le Bihan (JIRA)" <ji...@apache.org> on 2019/05/25 17:53:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27840) Hadoop attempts to create a temporary folder in root folder - posted by "M. Le Bihan (JIRA)" <ji...@apache.org> on 2019/05/25 18:23:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27801) InMemoryFileIndex.listLeafFiles should use listLocatedStatus for DistributedFileSystem - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/25 22:52:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27841) Improve UTF8String fromString()/toString()/numChars() performance when strings are ASCII - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/25 23:30:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27711) InputFileBlockHolder should be unset at the end of tasks - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/26 00:27:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27841) Improve UTF8String fromString()/toString()/numChars() performance when strings are ASCII - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/26 01:38:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27147) Create new unit test cases for SortShuffleWriter - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/26 04:05:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27147) Create new unit test cases for SortShuffleWriter - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/26 04:05:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27322) DataSourceV2: Select from multiple catalogs - posted by "John Zhuge (JIRA)" <ji...@apache.org> on 2019/05/26 06:29:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27614) Executor shuffle fetch hang - posted by "feiwang (JIRA)" <ji...@apache.org> on 2019/05/26 10:16:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27842) Inconsistent results of Statistics.corr() and PearsonCorrelation.computeCorrelationMatrix() - posted by "Peter Nijem (JIRA)" <ji...@apache.org> on 2019/05/26 10:26:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27842) Inconsistent results of Statistics.corr() and PearsonCorrelation.computeCorrelationMatrix() - posted by "Peter Nijem (JIRA)" <ji...@apache.org> on 2019/05/26 10:28:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27114) SQL Tab shows duplicate executions for some commands - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/26 10:50:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27787) Eliminate uncessary job to compute SSreg - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/26 13:17:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27787) Eliminate uncessary job to compute SSreg - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/26 13:18:00 UTC, 0 replies.
- [jira] [Issue Comment Deleted] (SPARK-27614) Executor shuffle fetch hang - posted by "feiwang (JIRA)" <ji...@apache.org> on 2019/05/26 14:23:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27843) Remove duplicate logic of calculate table size - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/26 14:46:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27843) Remove duplicate logic of calculate table size - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/26 14:50:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27843) Remove duplicate logic of calculate table size - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/26 15:04:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27074) Hive 3.1 metastore support HiveClientImpl.runHive - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/26 15:27:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27844) Avoid hard-coded config: spark.rdd.parallelListingThreshold in SQL module - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/26 15:57:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27844) Avoid hard-coded config: spark.rdd.parallelListingThreshold in SQL module - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/26 15:59:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27844) Avoid hard-coded config: spark.rdd.parallelListingThreshold in SQL module - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/26 15:59:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27844) Avoid hard-coded config: spark.rdd.parallelListingThreshold in SQL module - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/26 16:03:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27845) DataSourceV2: InsertInto multiple catalogs - posted by "John Zhuge (JIRA)" <ji...@apache.org> on 2019/05/26 17:56:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27846) Eagerly compute Configuration.properties in sc.hadoopConfiguration - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/26 23:46:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27847) One-Pass MultilabelMetrics & MulticlassMetrics - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/27 02:29:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27848) AppVeyor change to latest R version (3.6.0) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/27 02:30:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27848) AppVeyor change to latest R version (3.6.0) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/27 02:30:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-24766) CreateHiveTableAsSelect and InsertIntoHiveDir won't generate decimal column stats in parquet - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/27 03:21:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27847) One-Pass MultilabelMetrics & MulticlassMetrics - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/27 03:38:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27782) Use '#' to mark expression id embedded in the subquery name in the SubqueryExec operator. - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/27 03:49:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-13283) Spark doesn't escape column names when creating table on JDBC - posted by "Maciej Bryński (JIRA)" <ji...@apache.org> on 2019/05/27 06:51:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27322) DataSourceV2: Select from multiple catalogs - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/27 07:30:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27849) Redact treeString of FileTable and DataSourceV2ScanExecBase - posted by "Gengliang Wang (JIRA)" <ji...@apache.org> on 2019/05/27 07:32:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27849) Redact treeString of FileTable and DataSourceV2ScanExecBase - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/27 07:40:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27850) Make SparkPlan#doExecuteBroadcast public - posted by "Marc Arndt (JIRA)" <ji...@apache.org> on 2019/05/27 08:10:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27851) Allow for custom BroadcastMode return values - posted by "Marc Arndt (JIRA)" <ji...@apache.org> on 2019/05/27 08:32:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27851) Allow for custom BroadcastMode return values - posted by "Marc Arndt (JIRA)" <ji...@apache.org> on 2019/05/27 08:43:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27852) One updateBytesWritten operaton may be missed in DiskBlockObjectWriter.scala - posted by "Shuaiqi Ge (JIRA)" <ji...@apache.org> on 2019/05/27 08:51:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27852) One updateBytesWritten operaton may be missed in DiskBlockObjectWriter.scala - posted by "Shuaiqi Ge (JIRA)" <ji...@apache.org> on 2019/05/27 08:52:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27853) Allow for custom Partitioning implementations - posted by "Marc Arndt (JIRA)" <ji...@apache.org> on 2019/05/27 09:04:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27852) One updateBytesWritten operaton may be missed in DiskBlockObjectWriter.scala - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/27 09:04:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27851) Allow for custom BroadcastMode#transform return values - posted by "Marc Arndt (JIRA)" <ji...@apache.org> on 2019/05/27 09:04:01 UTC, 0 replies.
- [jira] [Created] (SPARK-27854) [Spark-SQL] OOM when using unequal join sql - posted by "kai zhao (JIRA)" <ji...@apache.org> on 2019/05/27 09:15:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27854) [Spark-SQL] OOM when using unequal join sql - posted by "kai zhao (JIRA)" <ji...@apache.org> on 2019/05/27 09:16:00 UTC, 3 replies.
- [jira] [Commented] (SPARK-13182) Spark Executor retries infinitely - posted by "Atul Anand (JIRA)" <ji...@apache.org> on 2019/05/27 09:16:00 UTC, 2 replies.
- [jira] [Comment Edited] (SPARK-13182) Spark Executor retries infinitely - posted by "Atul Anand (JIRA)" <ji...@apache.org> on 2019/05/27 09:17:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27803) fix column pruning for python UDF - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/27 12:41:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27855) Union failed between 2 datasets of the same type converted from different dataframes - posted by "Hao Ren (JIRA)" <ji...@apache.org> on 2019/05/27 13:18:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27855) Union failed between 2 datasets of the same type converted from different dataframes - posted by "Hao Ren (JIRA)" <ji...@apache.org> on 2019/05/27 13:19:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27856) do not forcibly add cast when inserting table - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/27 13:51:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27856) do not forcibly add cast when inserting table - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/27 13:57:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27776) Avoid duplicate Java reflection in DataSource - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/27 14:01:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27665) Split fetch shuffle blocks protocol from OpenBlocks - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/27 14:42:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27855) Union failed between 2 datasets of the same type converted from different dataframes - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/27 15:18:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27777) Eliminate uncessary sliding job in AreaUnderCurve - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/27 15:32:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27071) Expose additional metrics in status.api.v1.StageData - posted by "Herman van Hovell (JIRA)" <ji...@apache.org> on 2019/05/27 15:39:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27071) Expose additional metrics in status.api.v1.StageData - posted by "Herman van Hovell (JIRA)" <ji...@apache.org> on 2019/05/27 15:39:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27857) DataSourceV2: Support ALTER TABLE statements - posted by "Ryan Blue (JIRA)" <ji...@apache.org> on 2019/05/27 21:59:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27857) DataSourceV2: Support ALTER TABLE statements - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/27 22:11:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27858) Fix for avro deserialization on union types with multiple non-null types - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/28 00:10:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27858) Fix for avro deserialization on union types with multiple non-null types - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/28 00:14:00 UTC, 2 replies.
- [jira] [Commented] (SPARK-27858) Fix for avro deserialization on union types with multiple non-null types - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/28 00:14:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27858) Fix for avro deserialization on union types with multiple non-null types - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/28 03:24:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27578) Add support for "interval '23:59:59' hour to second" - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/28 03:33:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27578) Support INTERVAL ... HOUR TO SECOND syntax - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/28 04:01:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-23191) Workers registration failes in case of network drop - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/28 04:02:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-27833) java.lang.AssertionError: assertion failed: No plan for EventTimeWatermark - posted by "Raviteja (JIRA)" <ji...@apache.org> on 2019/05/28 04:26:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27859) Use efficient sorting instead of `.sorted.reverse` sequence - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/28 04:52:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27859) Use efficient sorting instead of `.sorted.reverse` sequence - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/28 04:53:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27859) Use efficient sorting instead of `.sorted.reverse` sequence - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/28 04:55:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27860) Use efficient sorting instead of `.sorted.reverse` sequence - posted by "wenxuanguan (JIRA)" <ji...@apache.org> on 2019/05/28 05:28:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-341) Added MapPartitionsWithSplitRDD. - posted by "wenxuanguan (JIRA)" <ji...@apache.org> on 2019/05/28 05:29:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27860) Use efficient sorting instead of `.sorted.reverse` sequence - posted by "wenxuanguan (JIRA)" <ji...@apache.org> on 2019/05/28 05:34:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-27860) Use efficient sorting instead of `.sorted.reverse` sequence - posted by "wenxuanguan (JIRA)" <ji...@apache.org> on 2019/05/28 05:35:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27848) AppVeyor change to latest R version (3.6.0) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/28 05:44:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27848) AppVeyor change to latest R version (3.6.0) - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/28 05:45:00 UTC, 0 replies.
- [jira] [Reopened] (SPARK-27848) AppVeyor change to latest R version (3.6.0) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/28 05:46:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-25944) AppVeyor change to latest R version - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/28 05:46:00 UTC, 3 replies.
- [jira] [Issue Comment Deleted] (SPARK-27848) AppVeyor change to latest R version (3.6.0) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/28 05:47:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-25944) AppVeyor change to latest R version - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/28 05:49:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27861) get_json_object in sql will truncate long value gotten from jsonpath - posted by "jing.yan (JIRA)" <ji...@apache.org> on 2019/05/28 06:43:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27862) Upgrade json4s-jackson to 3.6.5 - posted by "Izek Greenfield (JIRA)" <ji...@apache.org> on 2019/05/28 07:03:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-24009) spark2.3.0 INSERT OVERWRITE LOCAL DIRECTORY '/home/spark/aaaaab' - posted by "chris_j (JIRA)" <ji...@apache.org> on 2019/05/28 07:24:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27863) Metadata files and temporary files should not be counted as data files - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/28 07:59:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27863) Metadata files and temporary files should not be counted as data files - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/28 08:11:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27863) Metadata files and temporary files should not be counted as data files - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/28 08:11:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-21160) Filtering rows with "not equal" operator yields unexpected result with null rows - posted by "Ahmed Maher (JIRA)" <ji...@apache.org> on 2019/05/28 08:15:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-21160) Filtering rows with "not equal" operator yields unexpected result with null rows - posted by "Ahmed Maher (JIRA)" <ji...@apache.org> on 2019/05/28 08:33:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27864) spark-submit 2.4 cannot run apps compiled with Scala 2.12 - posted by "Sergey Torgashov (JIRA)" <ji...@apache.org> on 2019/05/28 08:38:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27864) spark-submit 2.4 cannot run apps compiled with Scala 2.12 - posted by "Sergey Torgashov (JIRA)" <ji...@apache.org> on 2019/05/28 08:39:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27865) Spark SQL support 1:N sort merge bucket join - posted by "Jason Guo (JIRA)" <ji...@apache.org> on 2019/05/28 08:51:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27865) Spark SQL support 1:N sort merge bucket join without shuffle - posted by "Jason Guo (JIRA)" <ji...@apache.org> on 2019/05/28 08:52:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27865) Spark SQL support 1:N sort merge bucket join without shuffle - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/28 09:01:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27866) Cannot connect to hive metastore - posted by "Ricardo Pinto (JIRA)" <ji...@apache.org> on 2019/05/28 09:12:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27867) RegressionEvaluator cache lastest RegressionMetrics to avoid duplicated computation - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/28 09:46:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27866) Cannot connect to hive metastore - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/28 09:48:00 UTC, 6 replies.
- [jira] [Assigned] (SPARK-27867) RegressionEvaluator cache lastest RegressionMetrics to avoid duplicated computation - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/28 09:51:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27554) org.apache.hadoop.security.AccessControlException: Client cannot authenticate via:[TOKEN, KERBEROS] - posted by "Jepson (JIRA)" <ji...@apache.org> on 2019/05/28 10:52:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27862) Upgrade json4s-jackson to 3.6.5 - posted by "Izek Greenfield (JIRA)" <ji...@apache.org> on 2019/05/28 13:12:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27862) Upgrade json4s-jackson to 3.6.5 - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/28 13:13:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-15060) Fix stack overflow when executing long lineage transform without checkpoint - posted by "Michael Wu (JIRA)" <ji...@apache.org> on 2019/05/28 13:41:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27852) One updateBytesWritten operaton may be missed in DiskBlockObjectWriter.scala - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/28 13:57:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27666) Do not release lock while TaskContext already completed - posted by "wuyi (JIRA)" <ji...@apache.org> on 2019/05/28 14:07:00 UTC, 3 replies.
- [jira] [Resolved] (SPARK-27776) Avoid duplicate Java reflection in DataSource - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/28 14:27:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27657) ml.util.Instrumentation.logFailure doesn't log error message - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/28 14:29:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27657) ml.util.Instrumentation.logFailure doesn't log error message - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/28 14:32:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27835) Resource Scheduling: change driver config from addresses to resourcesFile - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/28 14:55:00 UTC, 2 replies.
- [jira] [Comment Edited] (SPARK-27866) Cannot connect to hive metastore - posted by "Ricardo Pinto (JIRA)" <ji...@apache.org> on 2019/05/28 15:00:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27084) Add function alias for Bitwise functions - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/28 15:51:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27434) memory leak in spark driver - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/28 15:51:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27584) Add 'Mean reciprocal rank' to RankingMetrics - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/28 15:52:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-26872) Use a configurable value for final termination in the JobScheduler.stop() method - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/28 15:53:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27785) Introduce .joinWith() overloads for typed inner joins of 3 or more tables - posted by "Swapnil (JIRA)" <ji...@apache.org> on 2019/05/28 16:08:00 UTC, 1 replies.
- [jira] [Comment Edited] (SPARK-27785) Introduce .joinWith() overloads for typed inner joins of 3 or more tables - posted by "Swapnil (JIRA)" <ji...@apache.org> on 2019/05/28 16:10:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27725) GPU Scheduling - add an example discovery Script - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/28 16:16:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-24815) Structured Streaming should support dynamic allocation - posted by "Karthik Palaniappan (JIRA)" <ji...@apache.org> on 2019/05/28 17:33:00 UTC, 6 replies.
- [jira] [Commented] (SPARK-25588) SchemaParseException: Can't redefine: list when reading from Parquet - posted by "Michael Heuer (JIRA)" <ji...@apache.org> on 2019/05/28 18:19:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-24815) Structured Streaming should support dynamic allocation - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/28 20:44:00 UTC, 3 replies.
- [jira] [Created] (SPARK-27868) Better document shuffle / RPC listen backlog - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/28 22:42:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27869) Redact sensitive information in System Properties from UI - posted by "Aaruna Godthi (JIRA)" <ji...@apache.org> on 2019/05/28 22:52:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27868) Better document shuffle / RPC listen backlog - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/28 23:07:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27869) Redact sensitive information in System Properties from UI - posted by "Aaruna Godthi (JIRA)" <ji...@apache.org> on 2019/05/28 23:49:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27869) Redact sensitive information in System Properties from UI - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/28 23:50:00 UTC, 1 replies.
- [jira] [Comment Edited] (SPARK-27772) SQLTestUtils Refactoring - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/29 00:59:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27864) spark-submit 2.4 cannot run apps compiled with Scala 2.12 - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/29 01:19:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27861) get_json_object in sql will truncate long value gotten from jsonpath - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/29 01:20:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27861) get_json_object in sql will truncate long value gotten from jsonpath - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/29 01:22:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27861) get_json_object in sql will truncate long value gotten from jsonpath - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/29 01:22:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27855) Union failed between 2 datasets of the same type converted from different dataframes - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/29 01:25:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27854) [Spark-SQL] OOM when using unequal join sql - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/29 01:26:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27851) Allow for custom BroadcastMode#transform return values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/29 01:30:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27851) Allow for custom BroadcastMode#transform return values - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/29 01:30:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27850) Make SparkPlan#doExecuteBroadcast public - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/29 01:31:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27850) Make SparkPlan#doExecuteBroadcast public - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/29 01:32:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27842) Inconsistent results of Statistics.corr() and PearsonCorrelation.computeCorrelationMatrix() - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/29 01:32:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27837) Running rand() in SQL with seed of column results in error (rand(col1)) - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/29 01:34:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27836) Issue with seeded rand() function in Spark SQL - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/29 01:47:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27828) spark job hangs when kryo.serializers.FieldSerializer is called under multi-executor-cores settings - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/29 02:05:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27826) saveAsTable() function case table have "HiveFileFormat" "ParquetFileFormat" format issue - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/29 02:09:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27870) Flush each batch for pandas UDF - posted by "Weichen Xu (JIRA)" <ji...@apache.org> on 2019/05/29 03:01:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27870) Flush each batch for pandas UDF - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/29 03:05:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27870) Flush each batch for python UDF - posted by "Weichen Xu (JIRA)" <ji...@apache.org> on 2019/05/29 03:34:01 UTC, 1 replies.
- [jira] [Updated] (SPARK-27870) Flush each batch for pandas UDF (for improving pandas UDFs pipeline) - posted by "Weichen Xu (JIRA)" <ji...@apache.org> on 2019/05/29 03:41:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27871) LambdaVariable should use per-query unique IDs instead of globally unique IDs - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/29 04:33:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27871) LambdaVariable should use per-query unique IDs instead of globally unique IDs - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/29 04:40:00 UTC, 1 replies.
- [jira] [Comment Edited] (SPARK-27862) Upgrade json4s-jackson to 3.6.5 - posted by "Izek Greenfield (JIRA)" <ji...@apache.org> on 2019/05/29 05:48:00 UTC, 0 replies.
- [jira] [Issue Comment Deleted] (SPARK-27862) Upgrade json4s-jackson to 3.6.5 - posted by "Izek Greenfield (JIRA)" <ji...@apache.org> on 2019/05/29 05:50:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-19053) Supporting multiple evaluation metrics in DataFrame-based API: discussion - posted by "zhengruifeng (JIRA)" <ji...@apache.org> on 2019/05/29 06:04:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27829) In Dataset.joinWith inner joins, don't nest data before shuffling - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/29 08:15:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-23098) Migrate Kafka batch source to v2 - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/29 09:40:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27832) Don't decompress and create column batch when the task is completed - posted by "Liang-Chi Hsieh (JIRA)" <ji...@apache.org> on 2019/05/29 11:23:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27872) Driver and executors use a different service acount - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/29 12:20:00 UTC, 11 replies.
- [jira] [Created] (SPARK-27872) Driver and executors use a different service acount - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/29 12:20:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27872) Driver and executors use a different service acount - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/29 12:22:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27872) Driver and executors use a different service account breaking pull secrets - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/29 12:41:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-23472) Add config properties for administrator JVM options - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/29 12:51:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-23472) Add config properties for administrator JVM options - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/29 12:53:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27872) Driver and executors use a different service account breaking pull secrets - posted by "Erik Erlandson (JIRA)" <ji...@apache.org> on 2019/05/29 16:20:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27869) Redact sensitive information in System Properties from UI - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/29 17:51:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27869) Redact sensitive information in System Properties from UI - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/29 18:00:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27538) sparksql could not start in jdk11, exception org.datanucleus.exceptions.NucleusException: The java type java.lang.Long (jdbc-type='', sql-type="") cant be mapped for this datastore. No mapping is available. - posted by "Li Jin (JIRA)" <ji...@apache.org> on 2019/05/29 18:36:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-25466) Documentation does not specify how to set Kafka consumer cache capacity for SS - posted by "Gabor Somogyi (JIRA)" <ji...@apache.org> on 2019/05/29 18:45:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27873) Csv reader, adding a corrupt record column causes error if enforceSchema=false - posted by "Marcin Mejran (JIRA)" <ji...@apache.org> on 2019/05/29 20:17:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27849) Redact treeString of FileTable and DataSourceV2ScanExecBase - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/29 20:34:01 UTC, 0 replies.
- [jira] [Updated] (SPARK-27361) YARN support for GPU-aware scheduling - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/29 20:59:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27361) YARN support for GPU-aware scheduling - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/29 21:00:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27376) Design: YARN supports Spark GPU-aware scheduling - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/29 21:10:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27378) spark-submit requests GPUs in YARN mode - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/29 21:58:00 UTC, 2 replies.
- [jira] [Updated] (SPARK-27868) Better document shuffle / RPC listen backlog - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/29 22:00:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27868) Better document shuffle / RPC listen backlog - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/29 22:00:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27874) ShuffleBlockFetchIterator can log information about per-block sizes - posted by "Bjorn Jonsson (JIRA)" <ji...@apache.org> on 2019/05/30 00:41:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27875) Wrap all PrintWrite with Utils.tryWithResource - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/30 01:00:41 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27875) Wrap all PrintWrite with Utils.tryWithResource - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/30 01:07:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27876) [CORE][SHUFFLE] Split large shuffle partition to multi-segments to enable transfer oversize shuffle partition block. - posted by "feiwang (JIRA)" <ji...@apache.org> on 2019/05/30 02:43:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27876) [CORE][SHUFFLE] Split large shuffle partition to multi-segments to enable transfer oversize shuffle partition block. - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/30 02:48:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27876) [CORE][SHUFFLE] Split large shuffle partition to multi-segments to enable transfer oversize shuffle partition block. - posted by "feiwang (JIRA)" <ji...@apache.org> on 2019/05/30 03:00:02 UTC, 1 replies.
- [jira] [Commented] (SPARK-27873) Csv reader, adding a corrupt record column causes error if enforceSchema=false - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/30 05:35:00 UTC, 2 replies.
- [jira] [Resolved] (SPARK-27854) [Spark-SQL] OOM when using unequal join sql - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/30 05:37:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27725) GPU Scheduling - add an example discovery Script - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/30 05:38:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27866) Cannot connect to hive metastore - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/30 05:55:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27866) Cannot connect to hive metastore - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/30 06:12:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27864) spark-submit 2.4 cannot run apps compiled with Scala 2.12 - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/30 06:16:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27863) Metadata files and temporary files should not be counted as data files - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/30 06:18:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27876) Split large shuffle partition to multi-segments to enable transfer oversize shuffle partition block. - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/30 06:35:00 UTC, 1 replies.
- [jira] [Closed] (SPARK-27840) Hadoop attempts to create a temporary folder in root folder - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/30 06:36:00 UTC, 0 replies.
- [jira] [Closed] (SPARK-27832) Don't decompress and create column batch when the task is completed - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/30 06:44:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27827) File does not exist notice is misleading in FileScanRDD - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/30 06:46:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27877) Implement SQL-standard LATERAL subqueries - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/30 08:27:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27878) Support ARRAY(sub-SELECT) expressions - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/30 08:39:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27879) Implement bitwise integer aggregates(BIT_AND and BIT_OR) - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/30 08:56:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27880) Implement boolean aggregates(BOOL_AND, BOOL_OR and EVERY) - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/30 09:00:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27880) Implement boolean aggregates(BOOL_AND, BOOL_OR and EVERY) - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/30 09:01:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27788) Session aware event log for Spark thrift server - posted by "Lantao Jin (JIRA)" <ji...@apache.org> on 2019/05/30 09:09:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27630) Stage retry causes totalRunningTasks calculation to be negative - posted by "dzcxzl (JIRA)" <ji...@apache.org> on 2019/05/30 09:16:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27875) Wrap all PrintWriter with Utils.tryWithResource - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/30 09:45:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27706) Add SQL metrics of numOutputRows for BroadcastExchangeExec - posted by "dzcxzl (JIRA)" <ji...@apache.org> on 2019/05/30 10:29:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27875) Wrap all PrintWriter with Utils.tryWithResource - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/30 10:56:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27875) Wrap all PrintWriter with Utils.tryWithResource - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/30 10:56:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27835) Resource Scheduling: change driver config from addresses to resourcesFile - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/30 12:54:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-27742) Security Support in Sources and Sinks for SS and Batch - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/30 12:58:00 UTC, 10 replies.
- [jira] [Created] (SPARK-27881) Support CAST (... FORMAT ) expression - posted by "Peter Toth (JIRA)" <ji...@apache.org> on 2019/05/30 13:25:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27881) Support CAST (... FORMAT ) expression - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/30 13:31:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27882) Support SQL:2016 compatible datetime patterns - posted by "Peter Toth (JIRA)" <ji...@apache.org> on 2019/05/30 13:59:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27881) Support CAST (... FORMAT ) expression - posted by "Peter Toth (JIRA)" <ji...@apache.org> on 2019/05/30 14:00:02 UTC, 0 replies.
- [jira] [Updated] (SPARK-27882) Support SQL:2016 compatible datetime patterns - posted by "Peter Toth (JIRA)" <ji...@apache.org> on 2019/05/30 14:02:01 UTC, 0 replies.
- [jira] [Created] (SPARK-27883) Add aggregates.sql - Part2 - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/30 14:09:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-25557) ORC predicate pushdown for nested fields - posted by "Ivan Vergiliev (JIRA)" <ji...@apache.org> on 2019/05/30 14:17:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27883) Add aggregates.sql - Part2 - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/30 14:31:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27757) Bump Jackson to 2.9.9 - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/30 14:36:01 UTC, 0 replies.
- [jira] [Updated] (SPARK-27757) Bump Jackson to 2.9.9 - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/30 14:36:01 UTC, 0 replies.
- [jira] [Closed] (SPARK-27706) Add SQL metrics of numOutputRows for BroadcastExchangeExec - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/30 15:00:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27884) Deprecate Python 2 support in Spark 3.0 - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/30 16:13:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27885) Update Spark website and put deprecation warning - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/30 16:14:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27885) Update Spark website and put deprecation warning - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/30 16:15:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27886) Add Apache Spark project to https://python3statement.org/ - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/30 16:15:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27887) Check python version and print deprecation warning if version < 3 - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/30 16:16:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27887) Check python version and print deprecation warning if version < 3 - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/30 16:16:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27888) Python 2->3 migration guide for PySpark users - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/30 16:18:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27885) Announce deprecation of Python 2 support - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/30 16:18:00 UTC, 2 replies.
- [jira] [Updated] (SPARK-27884) Deprecate Python 2 support in Spark 3.0 - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/30 16:19:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27862) Upgrade json4s-jackson to 3.6.6 - posted by "Izek Greenfield (JIRA)" <ji...@apache.org> on 2019/05/30 16:29:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27884) Deprecate Python 2 support in Spark 3.0 - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/30 16:58:00 UTC, 4 replies.
- [jira] [Resolved] (SPARK-27813) DataSourceV2: Add DropTable logical operation - posted by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2019/05/30 16:59:00 UTC, 0 replies.
- [jira] [Reopened] (SPARK-27831) Move Hive test jars to maven dependency - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/30 17:08:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27886) Add Apache Spark project to https://python3statement.org/ - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/30 17:10:00 UTC, 1 replies.
- [jira] [Commented] (SPARK-27886) Add Apache Spark project to https://python3statement.org/ - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/30 17:11:00 UTC, 2 replies.
- [jira] [Assigned] (SPARK-27772) SQLTestUtils Refactoring - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/30 17:42:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27889) Make development scripts under dev/ support Python 3 - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/30 18:01:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27378) spark-submit requests GPUs in YARN mode - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/30 20:25:01 UTC, 0 replies.
- [jira] [Commented] (SPARK-27812) kubernetes client import non-daemon thread which block jvm exit. - posted by "Igor Calabria (JIRA)" <ji...@apache.org> on 2019/05/30 20:48:00 UTC, 3 replies.
- [jira] [Assigned] (SPARK-27773) Add shuffle service metric for number of exceptions caught in ExternalShuffleBlockHandler - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/30 20:58:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27773) Add shuffle service metric for number of exceptions caught in ExternalShuffleBlockHandler - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/30 20:58:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27872) Driver and executors use a different service account breaking pull secrets - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/30 21:11:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27890) Improve SQL parser error message when missing backquotes for identifiers with hyphens - posted by "Yesheng Ma (JIRA)" <ji...@apache.org> on 2019/05/30 21:22:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-22151) PYTHONPATH not picked up from the spark.yarn.appMasterEnv properly - posted by "Ruslan Dautkhanov (JIRA)" <ji...@apache.org> on 2019/05/30 21:37:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27361) YARN support for GPU-aware scheduling - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/30 21:46:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27891) Long running spark jobs fail because of HDFS delegation token expires - posted by "hemshankar sahu (JIRA)" <ji...@apache.org> on 2019/05/30 21:47:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27891) Long running spark jobs fail because of HDFS delegation token expires - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/30 21:52:00 UTC, 0 replies.
- [jira] [Issue Comment Deleted] (SPARK-22151) PYTHONPATH not picked up from the spark.yarn.appMasterEnv properly - posted by "Ruslan Dautkhanov (JIRA)" <ji...@apache.org> on 2019/05/30 21:59:00 UTC, 0 replies.
- [jira] [Comment Edited] (SPARK-27812) kubernetes client import non-daemon thread which block jvm exit. - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/30 22:07:00 UTC, 0 replies.
- [jira] [Reopened] (SPARK-27891) Long running spark jobs fail because of HDFS delegation token expires - posted by "hemshankar sahu (JIRA)" <ji...@apache.org> on 2019/05/30 22:08:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27891) Long running spark jobs fail because of HDFS delegation token expires - posted by "hemshankar sahu (JIRA)" <ji...@apache.org> on 2019/05/30 22:10:00 UTC, 4 replies.
- [jira] [Commented] (SPARK-27891) Long running spark jobs fail because of HDFS delegation token expires - posted by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2019/05/30 22:14:02 UTC, 3 replies.
- [jira] [Issue Comment Deleted] (SPARK-27891) Long running spark jobs fail because of HDFS delegation token expires - posted by "hemshankar sahu (JIRA)" <ji...@apache.org> on 2019/05/30 22:15:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27890) Improve SQL parser error message when missing backquotes for identifiers with hyphens - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/30 23:02:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27684) Reduce ScalaUDF conversion overheads for primitives - posted by "Josh Rosen (JIRA)" <ji...@apache.org> on 2019/05/31 00:11:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27892) Saving/loading stages in PipelineModel should be parallel - posted by "Jason Wang (JIRA)" <ji...@apache.org> on 2019/05/31 00:20:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27892) Saving/loading stages in PipelineModel should be parallel - posted by "Jason Wang (JIRA)" <ji...@apache.org> on 2019/05/31 00:35:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27862) Upgrade json4s-jackson to 3.6.6 - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/31 00:44:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27862) Upgrade json4s-jackson to 3.6.6 - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/31 00:44:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-24815) Structured Streaming should support dynamic allocation - posted by "Karthik Palaniappan (JIRA)" <ji...@apache.org> on 2019/05/31 01:19:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27893) Create an integrated test base for Python, Scalar Pandas, Scala UDF by sql files - posted by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/31 04:03:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27893) Create an integrated test base for Python, Scalar Pandas, Scala UDF by sql files - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/31 05:18:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27894) PySpark streaming transform RDD join not works when checkpoint enabled - posted by "Jeffrey(Xilang) Yan (JIRA)" <ji...@apache.org> on 2019/05/31 06:29:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27894) PySpark streaming transform RDD join not works when checkpoint enabled - posted by "Jeffrey(Xilang) Yan (JIRA)" <ji...@apache.org> on 2019/05/31 06:31:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27895) Spark streaming - RDD filter is always refreshing providing updated filtered items - posted by "Ilias Karalis (JIRA)" <ji...@apache.org> on 2019/05/31 07:17:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27791) Support SQL year-month INTERVAL type - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/31 07:36:00 UTC, 1 replies.
- [jira] [Resolved] (SPARK-27700) SparkSubmit closes with SocketTimeoutException in kubernetes mode. - posted by "Udbhav Agrawal (JIRA)" <ji...@apache.org> on 2019/05/31 09:07:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27896) Fix definition of clustering silhouette coefficient for 1-element clusters - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/31 12:22:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27896) Fix definition of clustering silhouette coefficient for 1-element clusters - posted by "Sean Owen (JIRA)" <ji...@apache.org> on 2019/05/31 12:23:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27897) GPU Scheduling - move example discovery Script to scripts directory - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/31 13:45:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27897) GPU Scheduling - move example discovery Script to scripts directory - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/31 13:51:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27898) Support 4 date operators(date + integer, integer + date, date - integer and date - date) - posted by "Yuming Wang (JIRA)" <ji...@apache.org> on 2019/05/31 13:55:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27898) Support 4 date operators(date + integer, integer + date, date - integer and date - date) - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/31 14:05:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27896) Fix definition of clustering silhouette coefficient for 1-element clusters - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/31 14:09:00 UTC, 1 replies.
- [jira] [Closed] (SPARK-26822) Upgrade the deprecated module 'optparse' - posted by "Neo Chien (JIRA)" <ji...@apache.org> on 2019/05/31 14:30:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27899) Make HiveMetastoreClient.getTableObjectsByName available in ExternalCatalog/SessionCatalog API - posted by "Juliusz Sompolski (JIRA)" <ji...@apache.org> on 2019/05/31 14:42:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27899) Make HiveMetastoreClient.getTableObjectsByName available in ExternalCatalog/SessionCatalog API - posted by "Juliusz Sompolski (JIRA)" <ji...@apache.org> on 2019/05/31 14:43:00 UTC, 1 replies.
- [jira] [Assigned] (SPARK-27873) Csv reader, adding a corrupt record column causes error if enforceSchema=false - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/31 14:45:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27900) Spark on K8s will not report container failure due to oom - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/31 15:08:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27900) Spark on K8s will not report container failure due to oom - posted by "Stavros Kontopoulos (JIRA)" <ji...@apache.org> on 2019/05/31 15:10:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-26192) MesosClusterScheduler reads options from dispatcher conf instead of submission conf - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/31 15:11:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27395) Improve EXPLAIN command - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/31 15:58:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27899) Make HiveMetastoreClient.getTableObjectsByName available in ExternalCatalog/SessionCatalog API - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/31 16:50:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27899) Make HiveMetastoreClient.getTableObjectsByName available in ExternalCatalog/SessionCatalog API - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/31 16:55:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27901) Improve the error messages of SQL parser - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/31 17:03:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27901) Improve the error messages of SQL parser - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/31 17:04:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27890) Improve SQL parser error message when missing backquotes for identifiers with hyphens - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/31 17:05:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27809) Make optional clauses order insensitive for CREATE DATABASE/VIEW SQL statement - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/31 17:06:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-21154) ParseException when Create View from another View in Spark SQL - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/31 17:18:00 UTC, 0 replies.
- [jira] [Reopened] (SPARK-21136) Misleading error message for typo in SQL - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/31 17:19:00 UTC, 0 replies.
- [jira] [Reopened] (SPARK-24077) Why spark SQL not support `CREATE TEMPORARY FUNCTION IF NOT EXISTS`? - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/31 17:21:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-24077) Issue a better error message for `CREATE TEMPORARY FUNCTION IF NOT EXISTS`? - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/31 17:21:01 UTC, 2 replies.
- [jira] [Assigned] (SPARK-21136) Misleading error message for typo in SQL - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/31 17:23:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-27098) Flaky missing file parts when writing to Ceph without error - posted by "Martin Loncaric (JIRA)" <ji...@apache.org> on 2019/05/31 17:24:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27902) Improve error message for DESCRIBE statement - posted by "Yesheng Ma (JIRA)" <ji...@apache.org> on 2019/05/31 17:34:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27903) Improve parser error message for mismatched parentheses in expressions - posted by "Yesheng Ma (JIRA)" <ji...@apache.org> on 2019/05/31 17:46:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-21529) Improve the error message for unsupported Uniontype - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/31 17:50:01 UTC, 2 replies.
- [jira] [Reopened] (SPARK-21529) Improve the error message for unsupported Uniontype - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/31 17:50:01 UTC, 0 replies.
- [jira] [Created] (SPARK-27904) Improve parser error message for SHOW VIEW statement - posted by "Yesheng Ma (JIRA)" <ji...@apache.org> on 2019/05/31 17:58:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27905) Add higher order function`forall` - posted by "Nikolas Vanderhoof (JIRA)" <ji...@apache.org> on 2019/05/31 18:01:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27903) Improve parser error message for mismatched parentheses in expressions - posted by "Yesheng Ma (JIRA)" <ji...@apache.org> on 2019/05/31 18:02:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27906) Improve parser error message for CREATE LOCAL TABLE statement - posted by "Yesheng Ma (JIRA)" <ji...@apache.org> on 2019/05/31 18:04:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27906) Improve parser error message for CREATE LOCAL TEMPORARY TABLE statement - posted by "Yesheng Ma (JIRA)" <ji...@apache.org> on 2019/05/31 18:05:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-27906) Improve parser error message for CREATE LOCAL TABLE statement - posted by "Yesheng Ma (JIRA)" <ji...@apache.org> on 2019/05/31 18:05:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27905) Add higher order function`forall` - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/31 18:12:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27907) HiveUDAF with 0 rows throw NPE when try to serialize - posted by "Ajith S (JIRA)" <ji...@apache.org> on 2019/05/31 19:09:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27907) HiveUDAF with 0 rows throw NPE when try to serialize - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/31 19:13:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27908) Improve parser error message for SELECT TOP statement - posted by "Yesheng Ma (JIRA)" <ji...@apache.org> on 2019/05/31 19:54:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27907) HiveUDAF with 0 rows throw NPE - posted by "Ajith S (JIRA)" <ji...@apache.org> on 2019/05/31 19:56:00 UTC, 1 replies.
- [jira] [Created] (SPARK-27909) Fix CTE substitution dependence on ResolveRelations throwing AnalysisException - posted by "Ryan Blue (JIRA)" <ji...@apache.org> on 2019/05/31 20:28:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27897) GPU Scheduling - move example discovery Script to scripts directory - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/31 20:28:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27373) Design: Kubernetes support for GPU-aware scheduling - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/31 20:29:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27362) Kubernetes support for GPU-aware scheduling - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/31 20:29:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27374) Fetch assigned resources from TaskContext - posted by "Thomas Graves (JIRA)" <ji...@apache.org> on 2019/05/31 20:30:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27909) Fix CTE substitution dependence on ResolveRelations throwing AnalysisException - posted by "Apache Spark (JIRA)" <ji...@apache.org> on 2019/05/31 20:47:00 UTC, 1 replies.
- [jira] [Updated] (SPARK-17164) Query with colon in the table name fails to parse in 2.0 - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/31 20:48:00 UTC, 0 replies.
- [jira] [Commented] (SPARK-17164) Query with colon in the table name fails to parse in 2.0 - posted by "Xiao Li (JIRA)" <ji...@apache.org> on 2019/05/31 20:49:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27910) Improve parser error message for misused numeric identifiers - posted by "Yesheng Ma (JIRA)" <ji...@apache.org> on 2019/05/31 21:07:00 UTC, 0 replies.
- [jira] [Updated] (SPARK-27911) PySpark Packages should automatically choose correct scala version - posted by "Michael Armbrust (JIRA)" <ji...@apache.org> on 2019/05/31 21:16:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27911) PySpark Packages should automatically choose correct scala version - posted by "Michael Armbrust (JIRA)" <ji...@apache.org> on 2019/05/31 21:16:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27912) Improve parser error message for CASE clause - posted by "Yesheng Ma (JIRA)" <ji...@apache.org> on 2019/05/31 21:33:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27913) Spark SQL's native ORC reader implements its own schema evolution - posted by "Owen O'Malley (JIRA)" <ji...@apache.org> on 2019/05/31 21:35:00 UTC, 0 replies.
- [jira] [Created] (SPARK-27914) Improve parser error message for ALTER TABLE ADD COLUMNS statement - posted by "Yesheng Ma (JIRA)" <ji...@apache.org> on 2019/05/31 22:26:00 UTC, 0 replies.
- [jira] [Assigned] (SPARK-27885) Announce deprecation of Python 2 support - posted by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2019/05/31 22:33:00 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27794) Use secure URLs for downloading CRAN artifacts - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/31 22:55:01 UTC, 0 replies.
- [jira] [Resolved] (SPARK-27896) Fix definition of clustering silhouette coefficient for 1-element clusters - posted by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/05/31 23:31:00 UTC, 0 replies.