Posted to dev@nutch.apache.org by GitBox <gi...@apache.org> on 2022/01/17 18:58:35 UTC

[GitHub] [nutch] sebastian-nagel commented on pull request #723: NUTCH-2935 DeduplicationJob: failure on URLs with invalid percent encoding

sebastian-nagel commented on pull request #723:
URL: https://github.com/apache/nutch/pull/723#issuecomment-1014816831


   Thanks, @lewismc! For the Common Crawls, it just took some time to hit a bad URL that had also somehow slipped through the URL filters and normalizers during injection.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscribe@nutch.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org