Posted to issues@nifi.apache.org by ijokarumawak <gi...@git.apache.org> on 2018/01/26 01:34:58 UTC

[GitHub] nifi pull request #2435: NIFI-4818: Fix transit URL parsing at Hive2JDBC and...

GitHub user ijokarumawak opened a pull request:

    https://github.com/apache/nifi/pull/2435

    NIFI-4818: Fix transit URL parsing at Hive2JDBC and KafkaTopic for ReportLineageToAtlas
    
    - Hive2JDBC: Handle connection parameters and multiple host entries
    correctly
    - KafkaTopic: Handle multiple host entries correctly
    - Avoid potential "IllegalStateException: Duplicate key" exception
    when NiFiAtlasHook analyzes existing NiFiFlowPath input/output entries
    
    Thank you for submitting a contribution to Apache NiFi.
    
    In order to streamline the review of the contribution we ask you
    to ensure the following steps have been taken:
    
    ### For all changes:
    - [x] Is there a JIRA ticket associated with this PR? Is it referenced 
         in the commit message?
    
    - [x] Does your PR title start with NIFI-XXXX where XXXX is the JIRA number you are trying to resolve? Pay particular attention to the hyphen "-" character.
    
    - [x] Has your PR been rebased against the latest commit within the target branch (typically master)?
    
    - [x] Is your initial contribution a single, squashed commit?
    
    ### For code changes:
    - [ ] Have you ensured that the full suite of tests is executed via mvn -Pcontrib-check clean install at the root nifi folder?
    - [x] Have you written or updated unit tests to verify your changes?
    - [ ] If adding new dependencies to the code, are these dependencies licensed in a way that is compatible for inclusion under [ASF 2.0](http://www.apache.org/legal/resolved.html#category-a)? 
    - [ ] If applicable, have you updated the LICENSE file, including the main LICENSE file under nifi-assembly?
    - [ ] If applicable, have you updated the NOTICE file, including the main NOTICE file found under nifi-assembly?
    - [ ] If adding new Properties, have you added .displayName in addition to .name (programmatic access) for each of the new properties?
    
    ### For documentation related changes:
    - [ ] Have you ensured that format looks appropriate for the output in which it is rendered?
    
    ### Note:
    Please ensure that once the PR is submitted, you check travis-ci for build issues and submit an update to your PR as soon as possible.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/ijokarumawak/nifi nifi-4818

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/nifi/pull/2435.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #2435
    
----
commit 91ee885398128e796918a6fc6b98bdf442c5ebf1
Author: Koji Kawamura <ij...@...>
Date:   2018-01-25T04:57:01Z

    NIFI-4818: Fix transit URL parsing at Hive2JDBC and KafkaTopic for ReportLineageToAtlas
    
    - Hive2JDBC: Handle connection parameters and multiple host entries
    correctly
    - KafkaTopic: Handle multiple host entries correctly
    - Avoid potential "IllegalStateException: Duplicate key" exception
    when NiFiAtlasHook analyzes existing NiFiFlowPath input/output entries

----


---

[GitHub] nifi pull request #2435: NIFI-4818: Fix transit URL parsing at Hive2JDBC and...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/nifi/pull/2435


---

[GitHub] nifi issue #2435: NIFI-4818: Fix transit URL parsing at Hive2JDBC and KafkaT...

Posted by mcgilman <gi...@git.apache.org>.
Github user mcgilman commented on the issue:

    https://github.com/apache/nifi/pull/2435
  
    Thanks @ijokarumawak! This has been merged to master.


---

[GitHub] nifi pull request #2435: NIFI-4818: Fix transit URL parsing at Hive2JDBC and...

Posted by ijokarumawak <gi...@git.apache.org>.
Github user ijokarumawak commented on a diff in the pull request:

    https://github.com/apache/nifi/pull/2435#discussion_r164017503
  
    --- Diff: nifi-nar-bundles/nifi-atlas-bundle/nifi-atlas-reporting-task/src/main/java/org/apache/nifi/atlas/NiFiAtlasHook.java ---
    @@ -255,7 +255,11 @@ public void commitMessages() {
                 }
                 return new Tuple<>(refQualifiedName, typedQualifiedNameToRef.get(toTypedQualifiedName(typeName, refQualifiedName)));
             }).filter(Objects::nonNull).filter(tuple -> tuple.getValue() != null)
    -                .collect(Collectors.toMap(Tuple::getKey, Tuple::getValue));
    +                // If duplication happens, use new value.
    +                .collect(Collectors.toMap(Tuple::getKey, Tuple::getValue, (oldValue, newValue) -> {
    +                    logger.warn("Duplicated qualified name was found, use the new one. oldValue={}, newValue={}", new Object[]{oldValue, newValue});
    +                    return newValue;
    +                }));
    --- End diff --
    
    While I was testing, I got the following exception:
    ```
    2018-01-25 05:06:41,430 ERROR [Timer-Driven Process Thread-1] o.a.n.a.reporting.ReportLineageToAtlas ReportLineageToAtlas[id=057986ae-0161-1000-d0b0-1b890a17f5aa] Error running task ReportLineageToAtlas[id=057986ae-0161-1000-d0b0-1b890a17f5aa] due to java.lang.IllegalStateException: Duplicate key {Id='(type: fs_path, id: 69be7a40-4ff8-4c4e-b714-2d394c14398d)', traits=[], values={}} NiFiAtlasHook.258
    ```
    The exception means that an existing nifi_flow_path entity has more than one entry in its inputs or outputs attribute pointing to entities with the same qualified name. This happened because I was using an old test environment containing data created before the Atlas integration implemented de-duplication logic. Still, it would be safer to handle such duplicates in case they occur for some other reason.

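The behavior discussed in this comment can be reproduced outside NiFi. The following is a minimal sketch, not the NiFiAtlasHook code: the class name, the helper `toMapKeepingNewest`, and the sample qualified names and ids are all hypothetical, and `Map.Entry` stands in for NiFi's `Tuple`. It shows that the two-argument `Collectors.toMap` throws `IllegalStateException` on a duplicate key, while the three-argument overload resolves the collision with a merge function.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

class ToMapMergeSketch {

    // Hypothetical helper: collects (qualifiedName, reference) pairs into a map.
    // When two pairs share a key, the merge function keeps the newer value,
    // mirroring the "use new value" choice made in the diff above.
    static Map<String, String> toMapKeepingNewest(List<Map.Entry<String, String>> refs) {
        return refs.stream()
                .collect(Collectors.toMap(Map.Entry::getKey, Map.Entry::getValue,
                        (oldValue, newValue) -> newValue));
    }

    public static void main(String[] args) {
        // Made-up sample data: the first and third entries share a qualified name.
        List<Map.Entry<String, String>> refs = Arrays.asList(
                new SimpleEntry<>("/tmp/input@cluster", "id-1"),
                new SimpleEntry<>("/tmp/other@cluster", "id-2"),
                new SimpleEntry<>("/tmp/input@cluster", "id-3"));

        try {
            // The two-argument overload has no merge function and rejects duplicates.
            refs.stream().collect(Collectors.toMap(Map.Entry::getKey, Map.Entry::getValue));
        } catch (IllegalStateException e) {
            System.out.println("Two-argument toMap failed: " + e.getMessage());
        }

        // The three-argument overload succeeds and keeps "id-3" for the duplicated key.
        System.out.println(toMapKeepingNewest(refs));
    }
}
```

Keeping the newer value is one possible policy. Keeping the older value (`(oldValue, newValue) -> oldValue`) would avoid the exception equally well, so the choice depends on which entry is considered more trustworthy.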

---