Posted to dev@manifoldcf.apache.org by "Karl Wright (JIRA)" <ji...@apache.org> on 2017/11/21 17:55:00 UTC

[jira] [Created] (CONNECTORS-1472) Confluence connector doesn't call activities.noDocument() properly

Karl Wright created CONNECTORS-1472:
---------------------------------------

             Summary: Confluence connector doesn't call activities.noDocument() properly
                 Key: CONNECTORS-1472
                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1472
             Project: ManifoldCF
          Issue Type: Bug
          Components: Confluence connector
    Affects Versions: ManifoldCF 2.8.1
            Reporter: Karl Wright
            Assignee: Karl Wright
             Fix For: ManifoldCF 2.9


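During crawling, the Confluence connector in one installation is throwing the following exceptions. Each IllegalArgumentException is raised from the connector's call to activities.noDocument(), and the log ends with a fatal NullPointerException in another worker thread: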

{code}
java.lang.IllegalArgumentException: Unrecognized document identifier: 'att44634026'
        at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.computePipelineSpecificationWithVersions(WorkerThread.java:2164)
        at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1627)
        at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1605)
        at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageInternal(ConfluenceRepositoryConnector.java:1078)
        at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageAsAttachment(ConfluenceRepositoryConnector.java:1012)
        at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processDocuments(ConfluenceRepositoryConnector.java:936)
        at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
WARN 2017-11-21 10:00:14,373 (Worker thread '111') - Exception: Unrecognized document identifier: 'att69240163'
java.lang.IllegalArgumentException: Unrecognized document identifier: 'att69240163'
        at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.computePipelineSpecificationWithVersions(WorkerThread.java:2164)
        at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1627)
        at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1605)
        at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageInternal(ConfluenceRepositoryConnector.java:1078)
        at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageAsAttachment(ConfluenceRepositoryConnector.java:1012)
        at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processDocuments(ConfluenceRepositoryConnector.java:936)
        at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
WARN 2017-11-21 10:00:14,379 (Worker thread '82') - Exception: Unrecognized document identifier: 'att56984899'
java.lang.IllegalArgumentException: Unrecognized document identifier: 'att56984899'
        at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.computePipelineSpecificationWithVersions(WorkerThread.java:2164)
        at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1627)
        at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1605)
        at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageInternal(ConfluenceRepositoryConnector.java:1078)
        at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageAsAttachment(ConfluenceRepositoryConnector.java:1012)
        at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processDocuments(ConfluenceRepositoryConnector.java:936)
        at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
WARN 2017-11-21 10:00:14,386 (Worker thread '47') - Exception: Unrecognized document identifier: 'att56986313'
java.lang.IllegalArgumentException: Unrecognized document identifier: 'att56986313'
        at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.computePipelineSpecificationWithVersions(WorkerThread.java:2164)
        at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1627)
        at org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1605)
        at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageInternal(ConfluenceRepositoryConnector.java:1078)
        at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageAsAttachment(ConfluenceRepositoryConnector.java:1012)
        at org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processDocuments(ConfluenceRepositoryConnector.java:936)
        at org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
FATAL 2017-11-21 10:00:14,386 (Worker thread '132') - Error tossed: null
java.lang.NullPointerException
{code}
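
To illustrate the kind of contract the traces above point at, here is a minimal, self-contained sketch. It does not use ManifoldCF code: ToyActivities, accepted() and the identifier "queued-id-for-att44634026" are hypothetical stand-ins, and the assumption that noDocument() only accepts identifiers belonging to the batch currently being processed is a reading of the exception message, not something this report confirms.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

public class NoDocumentSketch {

    /** Hypothetical stand-in for the framework's per-batch activity object. */
    static class ToyActivities {
        // Identifiers handed to the connector for the current batch.
        private final Set<String> batch;

        ToyActivities(String... documentIdentifiers) {
            this.batch = new HashSet<>(Arrays.asList(documentIdentifiers));
        }

        // Assumption: an identifier outside the current batch is rejected.
        void noDocument(String documentIdentifier, String version) {
            if (!batch.contains(documentIdentifier)) {
                throw new IllegalArgumentException(
                    "Unrecognized document identifier: '" + documentIdentifier + "'");
            }
            // A real framework would record here that the document has no content.
        }
    }

    /** Returns true if noDocument() accepts the identifier, false if it rejects it. */
    static boolean accepted(ToyActivities activities, String documentIdentifier) {
        try {
            activities.noDocument(documentIdentifier, "v1");
            return true;
        } catch (IllegalArgumentException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Hypothetical identifier as it was queued for this batch.
        String queued = "queued-id-for-att44634026";
        // Hypothetical derived value, e.g. an ID extracted from the queued identifier.
        String derived = "att44634026";

        ToyActivities activities = new ToyActivities(queued);
        System.out.println("queued identifier accepted:  " + accepted(activities, queued));
        System.out.println("derived identifier accepted: " + accepted(activities, derived));
    }
}
```

Under these assumptions, passing a value derived inside the connector instead of the identifier originally handed to it would produce the same exception message as in the log. Whether that is what happens in processPageInternal() still needs to be checked against the connector source.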

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)