You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@seatunnel.apache.org by "liushuaihang (via GitHub)" <gi...@apache.org> on 2023/09/14 02:57:07 UTC

[GitHub] [seatunnel] liushuaihang opened a new issue, #5484: seatunnel 抽取 hive 指定分区数据到 MySQL

liushuaihang opened a new issue, #5484:
URL: https://github.com/apache/seatunnel/issues/5484

   ### Search before asking
   
   - [X] I had searched in the [issues](https://github.com/apache/seatunnel/issues?q=is%3Aissue+label%3A%22bug%22) and found no similar issues.
   
   
   ### What happened
   
   配置文件应该没有问题,但是跑不通
   
   ### SeaTunnel Version
   
   seatunnel版本2.3.3
   hive版本2.3.9
   
   ### SeaTunnel Config
   
   ```conf
   source {
       Hive {
   table_name = "default.test3"
   metastore_uri = "thrift://xxx:9083"
   hdfs_site_path = "/home/hadoop/hadoop-3.2.2/etc/hadoop/hdfs-site.xml"
   hive_site_path = "/home/hadoop/hive-2.3.9/conf/hive-site.xml"
   read_partitions = ["dt=2023-09-13"]
       }
   }
   ```
   
   
   ### Running Command
   
   ```shell
   ./bin/seatunnel.sh --config config/hive_partition_test.config -e local
   ```
   
   
   ### Error Exception
   
   ```log
   Exception in thread "main" org.apache.seatunnel.core.starter.exception.CommandExecuteException: SeaTunnel job executed failed
   at org.apache.seatunnel.core.starter.seatunnel.command.ClientExecuteCommand.execute(ClientExecuteCommand.java:191)
   at org.apache.seatunnel.core.starter.SeaTunnel.run(SeaTunnel.java:40)
   at org.apache.seatunnel.core.starter.seatunnel.SeaTunnelClient.main(SeaTunnelClient.java:34)
   Caused by: org.apache.seatunnel.connectors.seatunnel.file.exception.FileConnectorException: ErrorCode:[FILE-04], ErrorDescript
   ion:[File list is empty] - The target file list is empty,SeaTunnel will not be able to sync empty table, please check the conf
   iguration parameters such as: [file_filter_pattern]
   at org.apache.seatunnel.connectors.seatunnel.file.source.reader.AbstractReadStrategy.getFileNamesByPath(AbstractReadSt
   rategy.java:158)
   at org.apache.seatunnel.connectors.seatunnel.file.source.reader.AbstractReadStrategy.getFileNamesByPath(AbstractReadSt
   rategy.java:133)
   at org.apache.seatunnel.connectors.seatunnel.file.hdfs.source.BaseHdfsFileSource.prepare(BaseHdfsFileSource.java:76)
   at org.apache.seatunnel.connectors.seatunnel.hive.source.HiveSource.prepare(HiveSource.java:176)
   at org.apache.seatunnel.engine.core.parse.JobConfigParser.parseSource(JobConfigParser.java:85)
   at org.apache.seatunnel.engine.core.parse.MultipleTableJobConfigParser.parseSource(MultipleTableJobConfigParser.java:3
   17)
   at org.apache.seatunnel.engine.core.parse.MultipleTableJobConfigParser.parse(MultipleTableJobConfigParser.java:179)
   at org.apache.seatunnel.engine.core.job.AbstractJobEnvironment.getLogicalDag(AbstractJobEnvironment.java:109)
   at org.apache.seatunnel.engine.client.job.JobExecutionEnvironment.execute(JobExecutionEnvironment.java:73)
   at org.apache.seatunnel.core.starter.seatunnel.command.ClientExecuteCommand.execute(ClientExecuteCommand.java:143)
   ```
   
   
   ### Zeta or Flink or Spark Version
   
   _No response_
   
   ### Java or Scala Version
   
   _No response_
   
   ### Screenshots
   
   _No response_
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@seatunnel.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [seatunnel] ruanwenjun commented on issue #5484: Cannot read a specific partition data in hive

Posted by "ruanwenjun (via GitHub)" <gi...@apache.org>.
ruanwenjun commented on issue #5484:
URL: https://github.com/apache/seatunnel/issues/5484#issuecomment-1720376418

   I closed this issue if there is no more thing.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@seatunnel.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [seatunnel] liushuaihang commented on issue #5484: Cannot read a specific partition data in hive

Posted by "liushuaihang (via GitHub)" <gi...@apache.org>.
liushuaihang commented on issue #5484:
URL: https://github.com/apache/seatunnel/issues/5484#issuecomment-1723024137

   > @liushuaihang you can use seatunnel 2.3.2.
   
   Hi,关于这个BUG,同样的配置,seatunnel2.3.3不能跑成功,seatunnel2.3.1可以跑成功


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@seatunnel.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [seatunnel] liushuaihang commented on issue #5484: seatunnel 抽取 hive 指定分区数据到 MySQL

Posted by "liushuaihang (via GitHub)" <gi...@apache.org>.
liushuaihang commented on issue #5484:
URL: https://github.com/apache/seatunnel/issues/5484#issuecomment-1718689582

   hive的dt是string类型


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@seatunnel.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [seatunnel] liunaijie commented on issue #5484: Cannot read a specific partition data in hive

Posted by "liunaijie (via GitHub)" <gi...@apache.org>.
liunaijie commented on issue #5484:
URL: https://github.com/apache/seatunnel/issues/5484#issuecomment-1724629581

   > > @liushuaihang you can use seatunnel 2.3.2.
   > 
   > Hi,关于这个BUG,同样的配置,seatunnel2.3.3不能跑成功,seatunnel2.3.1可以跑成功
   
   Yes, as i said, this bug was introduced in 2.3.3, and will be fix in next version. So you need use another version.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@seatunnel.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [seatunnel] ruanwenjun closed issue #5484: Cannot read a specific partition data in hive

Posted by "ruanwenjun (via GitHub)" <gi...@apache.org>.
ruanwenjun closed issue #5484: Cannot read a specific partition data in hive
URL: https://github.com/apache/seatunnel/issues/5484


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@seatunnel.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [seatunnel] liushuaihang commented on issue #5484: Cannot read a specific partition data in hive

Posted by "liushuaihang (via GitHub)" <gi...@apache.org>.
liushuaihang commented on issue #5484:
URL: https://github.com/apache/seatunnel/issues/5484#issuecomment-1723022391

   Hi,关于这个BUG,同样的配置,seatunnel2.3.3不能跑成功,seatunnel2.3.1可以跑成功


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@seatunnel.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [seatunnel] liunaijie commented on issue #5484: Cannot read a specific partition data in hive

Posted by "liunaijie (via GitHub)" <gi...@apache.org>.
liunaijie commented on issue #5484:
URL: https://github.com/apache/seatunnel/issues/5484#issuecomment-1719043230

   this is an issue, if some path,subpath doesn't have any file will get this error.
   fixed at https://github.com/apache/seatunnel/pull/5427


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@seatunnel.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [seatunnel] ruanwenjun commented on issue #5484: seatunnel 抽取 hive 指定分区数据到 MySQL

Posted by "ruanwenjun (via GitHub)" <gi...@apache.org>.
ruanwenjun commented on issue #5484:
URL: https://github.com/apache/seatunnel/issues/5484#issuecomment-1718706373

   Could you please check if your table exist and hive config is correct.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@seatunnel.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [seatunnel] liunaijie commented on issue #5484: Cannot read a specific partition data in hive

Posted by "liunaijie (via GitHub)" <gi...@apache.org>.
liunaijie commented on issue #5484:
URL: https://github.com/apache/seatunnel/issues/5484#issuecomment-1719064401

   @liushuaihang  you can use seatunnel 2.3.2.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@seatunnel.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org