You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@gobblin.apache.org by "Kengo Seki (Jira)" <ji...@apache.org> on 2022/02/16 07:06:00 UTC

[jira] [Created] (GOBBLIN-1611) Fix a wrong value for writer.codec.type in the document

Kengo Seki created GOBBLIN-1611:
-----------------------------------

             Summary: Fix a wrong value for writer.codec.type in the document
                 Key: GOBBLIN-1611
                 URL: https://issues.apache.org/jira/browse/GOBBLIN-1611
             Project: Apache Gobblin
          Issue Type: Bug
            Reporter: Kengo Seki


According to https://gobblin.readthedocs.io/en/latest/user-guide/Configuration-Properties-Glossary/#Writer-Properties, possible values for the writer.codec.type property are NOCOMPRESSION, DEFLATE, SNAPPY.
But if NOCOMPRESSION is actually specified, the job fails as follows.

{code}
2022-02-16 14:15:33 JST ERROR [ForkExecutor-0] org.apache.gobblin.runtime.fork.Fork [d_t_1644988532694_0] - Fork 0 of task task_EmbeddedGobblin_1644988532694_0 failed.
org.apache.avro.AvroRuntimeException: Unrecognized codec: nocompression
	at org.apache.avro.file.CodecFactory.fromString(CodecFactory.java:102)
	at org.apache.gobblin.util.WriterUtils.getCodecFactory(WriterUtils.java:263)
	at org.apache.gobblin.writer.AvroHdfsDataWriter.<init>(AvroHdfsDataWriter.java:65)
	at org.apache.gobblin.writer.AvroDataWriterBuilder.build(AvroDataWriterBuilder.java:45)
	at org.apache.gobblin.writer.PartitionedDataWriter$4.get(PartitionedDataWriter.java:220)
	at org.apache.gobblin.writer.PartitionedDataWriter$4.get(PartitionedDataWriter.java:215)
	at org.apache.gobblin.writer.CloseOnFlushWriterWrapper.<init>(CloseOnFlushWriterWrapper.java:73)
	at org.apache.gobblin.writer.PartitionedDataWriter.<init>(PartitionedDataWriter.java:214)
	at org.apache.gobblin.runtime.fork.Fork.buildWriter(Fork.java:571)
	at org.apache.gobblin.runtime.fork.Fork.buildWriterIfNotPresent(Fork.java:579)
	at org.apache.gobblin.runtime.fork.Fork.processRecord(Fork.java:525)
	at org.apache.gobblin.runtime.fork.AsynchronousFork.processRecord(AsynchronousFork.java:103)
	at org.apache.gobblin.runtime.fork.AsynchronousFork.processRecords(AsynchronousFork.java:86)
	at org.apache.gobblin.runtime.fork.Fork.run(Fork.java:257)
	at org.apache.gobblin.util.executors.MDCPropagatingRunnable.run(MDCPropagatingRunnable.java:39)
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
	at java.lang.Thread.run(Thread.java:748)
{code}




--
This message was sent by Atlassian Jira
(v8.20.1#820001)