You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@manifoldcf.apache.org by "DK (Jira)" <ji...@apache.org> on 2022/01/24 16:42:00 UTC

[jira] [Created] (CONNECTORS-1695) Sitemap xml not detected in version 2.17

DK created CONNECTORS-1695:
------------------------------

             Summary: Sitemap xml not detected in version 2.17
                 Key: CONNECTORS-1695
                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1695
             Project: ManifoldCF
          Issue Type: Bug
          Components: Web connector
    Affects Versions: ManifoldCF 2.17
            Reporter: DK


Trying to index sitemap xml and web connector index the whole xml into solr.

Please fix in version 2.17.

If it is any special config that needs to be taken care, please add here and add in documentation to make it clear.

 

Sitemap.xml:

<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
<sitemap>
<loc>https://<url>/sitemap_1.xml</loc>
<lastmod>2022-01-21T16:04:45Z</lastmod>
</sitemap>

</sitemapindex>

 

sitemap_1.xml:

<urlset>
<url>
<loc>https://<docurl></loc>
<lastmod>2018-10-31T11:25:27Z</lastmod>
</url>
</urlset>



--
This message was sent by Atlassian Jira
(v8.20.1#820001)