You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@crunch.apache.org by "Josh Wills (JIRA)" <ji...@apache.org> on 2014/01/23 21:53:38 UTC
[jira] [Created] (CRUNCH-331) Change default settings for
CombineFileInputFormat
Josh Wills created CRUNCH-331:
---------------------------------
Summary: Change default settings for CombineFileInputFormat
Key: CRUNCH-331
URL: https://issues.apache.org/jira/browse/CRUNCH-331
Project: Crunch
Issue Type: Bug
Components: IO
Affects Versions: 0.8.2, 0.9.0
Reporter: Josh Wills
Currently, we default to enabling the CombineFileInputFormat settings for any extensions of FileSourceImpl b/c it tends to improve performance for common file formats like text, sequence files, and Avro files. However, this default has caused problems for formats like Parquet and for custom file formats that have complex split logic.
This JIRA is to track modifying the default combine file settings in at least some contexts, such as with From.formattedFile for custom input formats.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)