Posted to issues@nifi.apache.org by "Abdullah Aqeeli (Jira)" <ji...@apache.org> on 2021/12/12 14:32:00 UTC

[jira] [Created] (NIFI-9477) data loss doing Incremental load based on a timestamp from SQL server using QueryDatabaseTableRecord

Abdullah Aqeeli created NIFI-9477:
-------------------------------------

             Summary: data loss doing Incremental load based on a timestamp from SQL server using QueryDatabaseTableRecord 
                 Key: NIFI-9477
                 URL: https://issues.apache.org/jira/browse/NIFI-9477
             Project: Apache NiFi
          Issue Type: Bug
          Components: Core UI
    Affects Versions: 1.13.2
         Environment: ubuntu 20 
2-node cluster using an embedded zk
            Reporter: Abdullah Aqeeli


Hello all,

 

We are using NiFi to do an incremental load from SQL Server with QueryDatabaseTableRecord, where the max-value column is an updated_at column of the [datetime2|https://docs.microsoft.com/en-us/sql/t-sql/data-types/datetime2-transact-sql] data type. The source tables receive updates milliseconds apart. We noticed that we lose a few events with each QueryDatabaseTableRecord run. As a workaround, we created a custom processor that subtracts a few seconds from the max-value stored in the processor state, so that each run fetches overlapping events; we then handle any duplicates later in the pipeline. This seems to solve the issue, but it is not optimal. We'd like to know what is causing the skips. Our assumption so far is that the precision of the timestamp data type is causing them, but we're not sure.
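
For illustration only, the overlap-and-deduplicate workaround can be sketched roughly as follows. This is a minimal Python sketch of the idea, not the custom processor itself and not NiFi code. The function names, the 5-second overlap, the in-memory row list, and the (id, updated_at) deduplication key are all assumptions made for the example.

```python
from datetime import datetime, timedelta

# Assumed overlap; the report only says "a few seconds".
OVERLAP = timedelta(seconds=5)


def overlapped_lower_bound(stored_max, overlap=OVERLAP):
    """Lower bound for the next fetch: stored max-value minus the overlap."""
    return stored_max - overlap


def fetch_since(rows, lower_bound):
    """Stand-in for the incremental query (updated_at > lower_bound).

    'rows' is a hypothetical in-memory list of dicts, not a real table.
    """
    return [r for r in rows if r["updated_at"] > lower_bound]


def drop_duplicates(rows, seen_keys):
    """Keep only rows not emitted before, keyed on (id, updated_at).

    'seen_keys' is a set that is updated in place; in a real pipeline it
    would have to be bounded or persisted, which this sketch ignores.
    """
    fresh = []
    for r in rows:
        key = (r["id"], r["updated_at"])
        if key not in seen_keys:
            seen_keys.add(key)
            fresh.append(r)
    return fresh
```

Each run would call fetch_since with overlapped_lower_bound(stored_max) instead of stored_max itself, then pass the result through drop_duplicates. A row that becomes visible with an updated_at at or below the stored max-value is then still picked up, at the cost of re-reading the rows inside the overlap window.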

 

Thanks!



--
This message was sent by Atlassian Jira
(v8.20.1#820001)