- [GitHub] [nutch] lewismc commented on a change in pull request #541: NUTCH-2809: Upgrade any23 plugin dependency - posted by GitBox <gi...@apache.org> on 2020/08/01 01:27:47 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2809) Upgrade any23 plugin dependency - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/08/01 01:28:00 UTC, 2 replies.
- [GitHub] [nutch] lewismc commented on pull request #541: NUTCH-2809: Upgrade any23 plugin dependency - posted by GitBox <gi...@apache.org> on 2020/08/01 01:34:12 UTC, 0 replies.
- [jira] [Created] (NUTCH-2812) Methods returning array may expose internal representation - posted by "Lewis John McGibbney (Jira)" <ji...@apache.org> on 2020/08/01 04:11:00 UTC, 0 replies.
- [GitHub] [nutch] balashashanka commented on pull request #543: NUTCH-2811 : Setup Github workflows for prs - posted by GitBox <gi...@apache.org> on 2020/08/01 04:45:54 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2811) Setup Github workflows for PR - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/08/01 04:46:00 UTC, 3 replies.
- [GitHub] [nutch] sebastian-nagel merged pull request #536: [NUTCH-2799] Add .asf.yaml file - posted by GitBox <gi...@apache.org> on 2020/08/02 11:09:29 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2799) Add .asf.yaml file - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/02 11:10:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2799) Add .asf.yaml file - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/02 11:10:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2799) Add .asf.yaml file - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/08/02 11:10:00 UTC, 2 replies.
- [jira] [Created] (NUTCH-2813) MoreIndexingFilter - can't parse erroneous date - 2019-07-03T10:28:14 - posted by "Jakob Berlin (Jira)" <ji...@apache.org> on 2020/08/03 13:43:00 UTC, 0 replies.
- [GitHub] [nutch] derhecht opened a new pull request #544: [NUTCH-2813] Update MoreIndexingFilter.java - posted by GitBox <gi...@apache.org> on 2020/08/03 13:44:18 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2813) MoreIndexingFilter - can't parse erroneous date - 2019-07-03T10:28:14 - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/08/03 13:45:00 UTC, 1 reply.
- [GitHub] [nutch] derhecht opened a new pull request #545: [NUTCH-1190] Move data formats used to parse "lastModified" to a config file - posted by GitBox <gi...@apache.org> on 2020/08/03 13:58:14 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1190) MoreIndexingFilter refactor: move data formats used to parse "lastModified" to a config file. - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/08/03 13:59:00 UTC, 12 replies.
- [GitHub] [nutch] sebastian-nagel commented on a change in pull request #545: [NUTCH-1190] Move data formats used to parse "lastModified" to a config file - posted by GitBox <gi...@apache.org> on 2020/08/03 14:52:18 UTC, 2 replies.
- [jira] [Updated] (NUTCH-1190) MoreIndexingFilter refactor: move data formats used to parse "lastModified" to a config file. - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/03 14:53:00 UTC, 0 replies.
- [GitHub] [nutch] balashashanka merged pull request #543: NUTCH-2811 : Setup Github workflows for prs - posted by GitBox <gi...@apache.org> on 2020/08/03 15:10:58 UTC, 0 replies.
- [GitHub] [nutch] derhecht commented on a change in pull request #545: [NUTCH-1190] Move data formats used to parse "lastModified" to a config file - posted by GitBox <gi...@apache.org> on 2020/08/03 15:13:51 UTC, 1 reply.
- [GitHub] [nutch] sebastian-nagel commented on pull request #545: [NUTCH-1190] Move data formats used to parse "lastModified" to a config file - posted by GitBox <gi...@apache.org> on 2020/08/03 15:14:05 UTC, 1 reply.
- [GitHub] [nutch] derhecht commented on pull request #545: [NUTCH-1190] Move data formats used to parse "lastModified" to a config file - posted by GitBox <gi...@apache.org> on 2020/08/03 15:16:41 UTC, 0 replies.
- [GitHub] [nutch] jorgelbg commented on a change in pull request #545: [NUTCH-1190] Move data formats used to parse "lastModified" to a config file - posted by GitBox <gi...@apache.org> on 2020/08/03 16:54:46 UTC, 1 reply.
- [GitHub] [nutch] sebastian-nagel merged pull request #537: [NUTCH-2801] RobotsRulesParser command-line checker to use http.robots.agents as fall-back - posted by GitBox <gi...@apache.org> on 2020/08/03 19:07:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2801) RobotsRulesParser command-line checker to use http.robots.agents as fall-back - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/08/03 19:08:00 UTC, 2 replies.
- [jira] [Resolved] (NUTCH-2801) RobotsRulesParser command-line checker to use http.robots.agents as fall-back - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/03 19:08:00 UTC, 0 replies.
- [GitHub] [nutch] sebastian-nagel merged pull request #542: NUTCH-2810 FreeGenerator to actually apply configured number of fetch lists - posted by GitBox <gi...@apache.org> on 2020/08/03 19:08:34 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2810) FreeGenerator to actually apply configured number of fetch lists - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/03 19:09:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2810) FreeGenerator to actually apply configured number of fetch lists - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/08/03 19:09:00 UTC, 2 replies.
- [jira] [Resolved] (NUTCH-2811) Setup Github workflows for PR - posted by "Shashanka Balakuntala Srinivasa (Jira)" <ji...@apache.org> on 2020/08/03 19:51:00 UTC, 0 replies.
- Your project website - posted by Andrew Wetmore <an...@apache.org> on 2020/08/05 12:39:16 UTC, 1 reply.
- [jira] [Created] (NUTCH-2814) HttpDateFormat's internal time zone may change after parsing a date - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/07 16:06:00 UTC, 0 replies.
- [GitHub] [nutch] sebastian-nagel opened a new pull request #546: NUTCH-2814 HttpDateFormat's internal time zone may change after parsing a date - posted by GitBox <gi...@apache.org> on 2020/08/07 16:29:28 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2814) HttpDateFormat's internal time zone may change after parsing a date - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/08/07 16:30:00 UTC, 2 replies.
- [jira] [Created] (NUTCH-2815) Add Spotbugs target to build and address detected "bugs" - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/08 08:17:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2816) Add Spotbugs target to ant build - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/08 08:21:00 UTC, 0 replies.
- [GitHub] [nutch] sebastian-nagel opened a new pull request #547: [NUTCH-2816] Add Spotbugs target to ant build - posted by GitBox <gi...@apache.org> on 2020/08/08 08:27:10 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2815) Add Spotbugs target to build and address detected "bugs" - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/08 08:28:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2816) Add Spotbugs target to ant build - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/08/08 08:28:00 UTC, 3 replies.
- Re: Setting up automatic tests and check in GIT - posted by Sebastian Nagel <wa...@googlemail.com> on 2020/08/08 08:33:42 UTC, 1 reply.
- [jira] [Assigned] (NUTCH-2815) Add Spotbugs target to build and address detected "bugs" - posted by "Shashanka Balakuntala Srinivasa (Jira)" <ji...@apache.org> on 2020/08/08 08:34:00 UTC, 1 reply.
- [jira] [Created] (NUTCH-2817) Avoid check for equality of URL path and file part using ==/!= - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/08 08:47:00 UTC, 0 replies.
- [GitHub] [nutch] sebastian-nagel opened a new pull request #548: [NUTCH-2817] Avoid check for equality of URL path and file part using == / != - posted by GitBox <gi...@apache.org> on 2020/08/08 08:58:44 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2817) Avoid check for equality of URL path and file part using ==/!= - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/08/08 08:59:00 UTC, 3 replies.
- [GitHub] [nutch] sebastian-nagel merged pull request #547: [NUTCH-2816] Add Spotbugs target to ant build - posted by GitBox <gi...@apache.org> on 2020/08/11 07:37:34 UTC, 0 replies.
- [GitHub] [nutch] sebastian-nagel merged pull request #548: [NUTCH-2817] Avoid check for equality of URL path and file part using == / != - posted by GitBox <gi...@apache.org> on 2020/08/11 07:38:57 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2816) Add Spotbugs target to ant build - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/11 07:39:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2817) Avoid check for equality of URL path and file part using ==/!= - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/11 07:40:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2818) Ant build: upgrade Apache Rat report task - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/11 16:41:00 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2818) Ant build: upgrade Apache Rat report task - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/11 16:41:00 UTC, 0 replies.
- [GitHub] [nutch] sebastian-nagel opened a new pull request #549: NUTCH-2818 Fix Apache Rat task to check sources for license headers - posted by GitBox <gi...@apache.org> on 2020/08/11 16:45:22 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2818) Ant build: upgrade Apache Rat report task - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/08/11 16:46:00 UTC, 2 replies.
- [jira] [Created] (NUTCH-2820) Review sample files used in any23 unit tests - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/11 16:58:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2819) Move spotbugs "installation" directory to avoid that spotbugs is shipped in Nutch runtime - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/11 16:58:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2821) Deduplicate licenses in LICENSE.txt file - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/11 17:00:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2822) Split the LICENSE.txt file into two files for source resp. binary releases - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/11 17:01:00 UTC, 0 replies.
- Build failed in Jenkins: Nutch » Nutch-trunk #1 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2020/08/11 17:10:29 UTC, 1 reply.
- [jira] [Commented] (NUTCH-2819) Move spotbugs "installation" directory to avoid that spotbugs is shipped in Nutch runtime - posted by "Shashanka Balakuntala Srinivasa (Jira)" <ji...@apache.org> on 2020/08/12 05:57:00 UTC, 1 reply.
- [jira] [Updated] (NUTCH-2697) Upgrade Ivy to fix the issue of an unset packaging.type property - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/12 10:53:00 UTC, 0 replies.
- [GitHub] [nutch] sebastian-nagel opened a new pull request #550: NUTCH-2697 Upgrade Ivy to 2.5.0 - posted by GitBox <gi...@apache.org> on 2020/08/12 11:37:20 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2697) Upgrade Ivy to fix the issue of an unset packaging.type property - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/08/12 11:38:00 UTC, 2 replies.
- Jenkins build is back to normal : Nutch » Nutch-trunk #2 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2020/08/12 12:41:55 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2819) Move spotbugs "installation" directory to avoid that spotbugs is shipped in Nutch runtime - posted by "Shashanka Balakuntala Srinivasa (Jira)" <ji...@apache.org> on 2020/08/12 19:27:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2823) IllegalStateException in IndexWriters.describe() when validating url param for SolrIndexer - posted by "Joe Gilvary (Jira)" <ji...@apache.org> on 2020/08/13 11:09:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2823) IllegalStateException in IndexWriters.describe() when validating url param for SolrIndexer - posted by "Joe Gilvary (Jira)" <ji...@apache.org> on 2020/08/13 11:12:00 UTC, 1 reply.
- [GitHub] [nutch] sebastian-nagel closed pull request #545: [NUTCH-1190] Move data formats used to parse "lastModified" to a config file - posted by GitBox <gi...@apache.org> on 2020/08/16 19:04:13 UTC, 0 replies.
- [GitHub] [nutch] sebastian-nagel closed pull request #544: [NUTCH-2813] Update MoreIndexingFilter.java - posted by GitBox <gi...@apache.org> on 2020/08/16 19:04:24 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1190) MoreIndexingFilter refactor: move data formats used to parse "lastModified" to a config file. - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/16 19:08:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2813) MoreIndexingFilter - can't parse erroneous date - 2019-07-03T10:28:14 - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/16 19:09:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2788) ParseData: improve presentation of Metadata in method toString() - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2743) Add list of Nutch properties (nutch-default.xml) to documentation - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2720) ROBOTS metatag ignored when capitalized - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2782) protocol-http / lib-http: support TLSv1.3 - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2787) CrawlDb JSON dump does not export metadata primitive data types correctly - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2785) FreeGenerator: command-line option to define number of generated fetch lists - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2758) Add plugin READMEs to binary release packages - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1945) Test for XLSX parser - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2419) Some URL filters and normalizers do not respect command-line override for rule file - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2002) ParserChecker and IndexingFiltersChecker to check robots.txt - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:02 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2805) Rename plugin urlfilter-domainblacklist - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:02 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2496) Speed up link inversion step in crawling script - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:02 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2789) Documentation: update links to point to cwiki - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:02 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2753) Add -listen option to command-line help of CrawlDbReader and LinkDbReader - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:02 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2434) Add methods to reset parameters HTMLMetaTags - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:02 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2790) CSVIndexWriter does not escape leading quotes properly - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:02 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1194) Generator: CrawlDB lock should be released earlier - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:02 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2791) domainstats, protocolstats and crawlcomplete do not handle GCS URLs - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:02 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2794) Add additional ciphers to HTTP base's default cipher suite - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:02 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2796) Upgrade to crawler-commons 1.1 - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:02 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2730) SitemapProcessor to treat sitemap URLs as Set instead of List - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/08/16 21:08:02 UTC, 0 replies.
- [GitHub] [nutch] sebastian-nagel merged pull request #550: NUTCH-2697 Upgrade Ivy to 2.5.0 - posted by GitBox <gi...@apache.org> on 2020/08/17 13:45:49 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2671) Upgrade ant ivy library - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/17 13:49:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2697) Upgrade Ivy to fix the issue of an unset packaging.type property - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/17 13:49:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2669) Reliable solution for javax.ws packaging.type - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/17 13:50:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2671) Upgrade ant ivy library - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/17 13:51:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2672) Ant build erroneously installs *-test.jar instead of *.jar for target "nightly" - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/17 13:52:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2669) Reliable solution for javax.ws packaging.type - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/17 13:52:00 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2823) IllegalStateException in IndexWriters.describe() when validating url param for SolrIndexer - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/17 14:52:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2823) IllegalStateException in IndexWriters.describe() when validating url param for SolrIndexer - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/17 14:55:00 UTC, 1 reply.
- [GitHub] [nutch] sebastian-nagel opened a new pull request #551: NUTCH-2823 IllegalStateException in IndexWriters.describe() when vali… - posted by GitBox <gi...@apache.org> on 2020/08/17 14:57:22 UTC, 0 replies.
- [GitHub] [nutch] sebastian-nagel commented on pull request #541: NUTCH-2809: Upgrade any23 plugin dependency - posted by GitBox <gi...@apache.org> on 2020/08/18 09:32:22 UTC, 0 replies.
- [GitHub] [nutch] sebastian-nagel commented on a change in pull request #539: NUTCH-2803 Rename property http.robot.rules.whitelist - posted by GitBox <gi...@apache.org> on 2020/08/18 09:35:10 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2803) Rename property http.robot.rules.whitelist - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/08/18 09:36:00 UTC, 0 replies.
- [GitHub] [nutch] sebastian-nagel merged pull request #546: NUTCH-2814 HttpDateFormat's internal time zone may change after parsing a date - posted by GitBox <gi...@apache.org> on 2020/08/18 09:41:59 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2814) HttpDateFormat's internal time zone may change after parsing a date - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/18 09:43:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2814) HttpDateFormat's internal time zone may change after parsing a date - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/18 09:44:00 UTC, 0 replies.
- [GitHub] [nutch] sebastian-nagel merged pull request #549: NUTCH-2818 Fix Apache Rat task to check sources for license headers - posted by GitBox <gi...@apache.org> on 2020/08/18 09:49:09 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2818) Ant build: upgrade Apache Rat report task - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/18 09:50:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2807) SitemapProcessor to warn that ignoring robots.txt affects detection of sitemaps - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/18 09:51:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2805) Rename plugin urlfilter-domainblacklist - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/18 09:53:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2812) Methods returning array may expose internal representation - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/18 09:58:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1617) IndexerMapReduce to consider latest fetchDatum - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/21 14:09:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2824) urlnormalizer-basic to unescape percent-encoded host names - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/21 14:13:00 UTC, 0 replies.
- [GitHub] [nutch] sebastian-nagel opened a new pull request #552: NUTCH-2824 urlnormalizer-basic to unescape percent-encoded host names - posted by GitBox <gi...@apache.org> on 2020/08/21 14:39:50 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2824) urlnormalizer-basic to unescape percent-encoded host names - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/08/21 14:40:00 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1150) http.redirect.max can lead to multiple parses of the same url - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/21 15:10:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1150) http.redirect.max can lead to multiple parses of the same url - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/21 15:10:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2825) lib-selenium: property - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/21 15:31:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2825) lib-selenium: property webdriver.chrome.driver overwritten by selenium.grid.binary - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/08/21 15:34:00 UTC, 0 replies.