You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Maciej Obuchowski (Jira)" <ji...@apache.org> on 2021/02/24 14:40:00 UTC

[jira] [Created] (FLINK-21488) Jdbc XA sink - XID generation conflicts between jobs

Maciej Obuchowski created FLINK-21488:
-----------------------------------------

             Summary: Jdbc XA sink - XID generation conflicts between jobs
                 Key: FLINK-21488
                 URL: https://issues.apache.org/jira/browse/FLINK-21488
             Project: Flink
          Issue Type: Bug
          Components: Connectors / JDBC
    Affects Versions: 1.13.0
            Reporter: Maciej Obuchowski


I'm using Flink 1.13's JDBC XA sink to write data to oracle DB using exactly once semantics.
I want to have two jobs doing this. One is working right now. When starting second one, I encountered errors:

org.apache.flink.util.FlinkRuntimeException: unable to start XA transaction, xid: 201:0600000000000000:9b1d1b84e8ce79bb, error -3: resource manager error has occurred. [XAErr (-3): A resource manager error has occured in the transaction branch. ORA-2079 SQLErr (0)]

Oracle description:
ORA-02079: cannot join a committing distributed transaction
    Cause: Once a transaction branch is prepared, no more new transaction branches are allowed to start, nor is the prepared transaction branch allowed to be joined.
    Action: Check the application code as this is an XA protocol violation.

I've looked at the implementation of XID generation and noticed following line:

private transient byte[] gtridBuffer; // globalTransactionId = checkpoint id (long)

My hypothesis is that second job generated xid that referred to global transaction id that the first job created. If I'm right, then I'd suppose fix would rely on embedding part of job id inside of gtridBuffer.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)