You are viewing a plain text version of this content. The canonical link for it is here.
- [Nutch Wiki] Update of "bin/nutch_generate" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/01 00:08:39 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "bin/nutch_generate" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/01 00:09:09 UTC, 3 replies.
- [jira] [Created] (NUTCH-1027) Degrade log level of `can't find rules for scope` - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/01 02:18:28 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1027) Degrade log level of `can't find rules for scope` - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/01 02:20:28 UTC, 3 replies.
- Build failed in Jenkins: Nutch-trunk #1532 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/01 06:03:25 UTC, 0 replies.
- [Nutch Wiki] Update of "FrontPage" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/01 06:04:19 UTC, 4 replies.
- [Nutch Wiki] Trivial Update of "bin/nutch_crawl" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/01 06:06:44 UTC, 2 replies.
- [Nutch Wiki] Trivial Update of "bin/nutch_readdb" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/01 06:07:13 UTC, 2 replies.
- [Nutch Wiki] Trivial Update of "bin/nutch readlinkdb" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/01 06:08:18 UTC, 2 replies.
- [Nutch Wiki] Trivial Update of "bin/nutch mergedb" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/01 06:11:10 UTC, 1 replies.
- [Nutch Wiki] Update of "bin/nutch_freegen" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/01 06:21:32 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "bin/nutch_freegen" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/01 06:21:54 UTC, 1 replies.
- [Nutch Wiki] Trivial Update of "bin/nutch_inject" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/01 06:27:47 UTC, 0 replies.
- [Nutch Wiki] Update of "bin/nutch_fetch" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/01 06:51:10 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1028) Log parser keys - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/01 14:12:28 UTC, 0 replies.
- [jira] [Created] (NUTCH-1028) Log parser keys - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/01 14:12:28 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1028) Log parser keys - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/01 14:14:28 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1027) Degrade log level of `can't find rules for scope` - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/01 14:16:28 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-993) NullPointerException at FetcherOutputFormat.checkOutputSpecs - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/01 16:28:28 UTC, 0 replies.
- [jira] [Commented] (NUTCH-993) NullPointerException at FetcherOutputFormat.checkOutputSpecs - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/01 16:28:28 UTC, 3 replies.
- [jira] [Updated] (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967) - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/01 16:58:28 UTC, 0 replies.
- [jira] [Issue Comment Edited] (NUTCH-993) NullPointerException at FetcherOutputFormat.checkOutputSpecs - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/01 17:02:28 UTC, 0 replies.
- [jira] [Closed] (NUTCH-872) Change the default fetcher.parse to FALSE - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/01 17:32:28 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "bin/nutch_fetch" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/02 00:30:05 UTC, 0 replies.
- [Nutch Wiki] Update of "bin/nutch_parse" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/02 00:45:34 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "bin/nutch_parse" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/02 00:46:04 UTC, 0 replies.
- [Nutch Wiki] Update of "bin/nutch_readseg" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/02 01:15:15 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "bin/nutch_readseg" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/02 01:16:09 UTC, 1 replies.
- Nutch 2.0 roadmap - posted by lewis john mcgibbney <le...@gmail.com> on 2011/07/02 02:19:35 UTC, 1 replies.
- [Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/02 02:44:39 UTC, 11 replies.
- [Nutch Wiki] Trivial Update of "NutchResources" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/02 02:45:43 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1533 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/02 06:02:19 UTC, 0 replies.
- [Nutch Wiki] Update of "bin/nutch_mergesegs" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/02 08:56:01 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "bin/nutch_mergesegs" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/02 08:58:31 UTC, 0 replies.
- [Nutch Wiki] Update of "bin/nutch_updatedb" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/02 09:17:23 UTC, 0 replies.
- [jira] [Commented] (NUTCH-628) Host database to keep track of host-level information - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/02 09:31:28 UTC, 1 replies.
- [jira] [Issue Comment Edited] (NUTCH-628) Host database to keep track of host-level information - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/02 09:31:29 UTC, 0 replies.
- [Nutch Wiki] Update of "bin/nutch_invertlinks" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/02 17:17:57 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "bin/nutch_invertlinks" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/02 17:19:28 UTC, 0 replies.
- [Nutch Wiki] Update of "bin/nutch_mergelinkdb" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/02 18:14:29 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "bin/nutch_mergelinkdb" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/02 18:15:13 UTC, 0 replies.
- Fwd: Reminder: TAC Assistance to ApacheCon NA 2011 closes July 8th - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2011/07/03 02:34:49 UTC, 0 replies.
- [Nutch Wiki] Update of "bin/nutch solrindex" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/03 05:30:44 UTC, 0 replies.
- [Nutch Wiki] Update of "bin/nutch solrdedup" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/03 05:40:53 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "bin/nutch solrdedup" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/03 05:41:46 UTC, 1 replies.
- [Nutch Wiki] Update of "bin/nutch solrclean" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/03 05:53:02 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1534 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/03 06:02:51 UTC, 0 replies.
- [Nutch Wiki] Update of "bin/nutch plugin" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/03 06:07:01 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "CommandLineOptions" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/03 06:11:00 UTC, 3 replies.
- [Nutch Wiki] Update of "NutchTutorial" by sirenfei - posted by Apache Wiki <wi...@apache.org> on 2011/07/03 06:34:06 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "OverviewDeploymentConfigs" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/03 20:18:45 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1535 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/04 06:02:39 UTC, 0 replies.
- [Nutch Wiki] Update of "NutchTutorial" by JulienNioche - posted by Apache Wiki <wi...@apache.org> on 2011/07/04 09:41:34 UTC, 6 replies.
- [Nutch Wiki] Update of "Nutch2Roadmap" by JulienNioche - posted by Apache Wiki <wi...@apache.org> on 2011/07/04 09:46:40 UTC, 0 replies.
- [jira] [Updated] (NUTCH-993) NullPointerException at FetcherOutputFormat.checkOutputSpecs - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/04 13:19:21 UTC, 3 replies.
- [jira] [Commented] (NUTCH-1013) Migrate RegexURLNormalizer from Apache ORO to java.util.regex - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/04 14:57:21 UTC, 6 replies.
- [Nutch Wiki] Update of "CommandLineOptions" by MarkusJelsma - posted by Apache Wiki <wi...@apache.org> on 2011/07/04 16:14:10 UTC, 2 replies.
- [jira] [Resolved] (NUTCH-1013) Migrate RegexURLNormalizer from Apache ORO to java.util.regex - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/04 16:29:22 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1011) Normalize duplicate slashes in URL's - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/04 16:31:22 UTC, 4 replies.
- [Nutch Wiki] Trivial Update of "CommandLineOptions" by MarkusJelsma - posted by Apache Wiki <wi...@apache.org> on 2011/07/04 17:32:49 UTC, 3 replies.
- [jira] [Created] (NUTCH-1029) Readdb throws EOFException - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/04 19:14:21 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1001) bin/nutch fetch/parse handle crawl/segments directory - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/04 19:16:22 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "PluginCentral" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/04 20:13:30 UTC, 13 replies.
- Build failed in Jenkins: Nutch-trunk #1536 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/05 06:38:38 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "FAQ" by EricPugh - posted by Apache Wiki <wi...@apache.org> on 2011/07/05 15:53:33 UTC, 0 replies.
- [jira] [Created] (NUTCH-1030) WebgraphDB program requires manually added directories - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/06 01:44:16 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1537 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/06 06:02:31 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Archive and Legacy" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/06 06:21:07 UTC, 2 replies.
- [Nutch Wiki] Update of "OldFeatures" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/06 06:22:54 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Features" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/06 06:41:26 UTC, 2 replies.
- [jira] [Created] (NUTCH-1032) Delegate parsing of robots.txt to crawler-commons - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/06 15:35:16 UTC, 0 replies.
- [jira] [Created] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/06 15:35:16 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-993) NullPointerException at FetcherOutputFormat.checkOutputSpecs - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/06 15:47:17 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1030) WebgraphDB program requires manually added directories - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/06 16:09:16 UTC, 2 replies.
- [jira] [Created] (NUTCH-1033) Backport FetcherJob should run more reduce tasks than default - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/06 16:13:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1033) Backport FetcherJob should run more reduce tasks than default - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/06 16:13:17 UTC, 2 replies.
- [jira] [Commented] (NUTCH-1030) WebgraphDB program requires manually added directories - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/06 16:15:17 UTC, 4 replies.
- [jira] [Updated] (NUTCH-1011) Normalize duplicate slashes in URL's - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/06 17:25:22 UTC, 0 replies.
- FeedParser test fails - posted by Markus Jelsma <ma...@openindex.io> on 2011/07/06 17:27:43 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1033) Backport FetcherJob should run more reduce tasks than default - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/06 17:29:16 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1011) Normalize duplicate slashes in URL's - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/06 17:37:16 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1033) Backport FetcherJob should run more reduce tasks than default - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/06 17:49:16 UTC, 0 replies.
- [ANN] Release crawler-commons 0.1 - posted by Julien Nioche <li...@gmail.com> on 2011/07/06 22:12:15 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1032) Delegate parsing of robots.txt to crawler-commons - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/06 22:35:16 UTC, 0 replies.
- [jira] [Created] (NUTCH-1034) Create Solr Velocity templates - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/06 22:39:16 UTC, 0 replies.
- [jira] [Created] (NUTCH-1035) Tune Solr config for Nutch users - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/06 22:41:19 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1035) Tune Solr config for Nutch users - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/06 23:03:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1034) Create Solr Velocity templates - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/06 23:03:16 UTC, 3 replies.
- [jira] [Commented] (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967) - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/06 23:27:16 UTC, 3 replies.
- Build failed in Jenkins: Nutch-trunk #1538 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/07 07:35:41 UTC, 0 replies.
- [jira] [Commented] (NUTCH-809) Parse-metatags plugin - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/07 12:05:16 UTC, 3 replies.
- [jira] [Updated] (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/07 12:05:17 UTC, 0 replies.
- [jira] [Updated] (NUTCH-925) plugins stored in weakhashmap lead memory leak - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/07 12:07:16 UTC, 0 replies.
- [jira] [Commented] (NUTCH-783) IndexerChecker Utilty - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/07 13:44:16 UTC, 2 replies.
- [jira] [Issue Comment Edited] (NUTCH-783) IndexerChecker Utilty - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/07 13:44:16 UTC, 0 replies.
- Upgrade libs to support Hadoop 0.20.203 and 0.21 - posted by Markus Jelsma <ma...@openindex.io> on 2011/07/07 14:34:08 UTC, 0 replies.
- [jira] [Commented] (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" - posted by "Robert Hohman (JIRA)" <ji...@apache.org> on 2011/07/07 15:14:17 UTC, 1 replies.
- [jira] [Commented] (NUTCH-925) plugins stored in weakhashmap lead memory leak - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/07 15:28:16 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-925) plugins stored in weakhashmap lead memory leak - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/07 15:38:16 UTC, 0 replies.
- Rebuilding site - posted by lewis john mcgibbney <le...@gmail.com> on 2011/07/07 18:11:11 UTC, 2 replies.
- Build failed in Jenkins: Nutch-trunk #1539 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/08 06:03:17 UTC, 0 replies.
- [Nutch Wiki] Update of "serenitykeningston" by serenitykeningston - posted by Apache Wiki <wi...@apache.org> on 2011/07/08 20:28:23 UTC, 0 replies.
- [Nutch Wiki] Update of "AcademicArticles" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/08 23:31:39 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "AcademicArticles" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/08 23:33:45 UTC, 2 replies.
- [Nutch Wiki] Trivial Update of "GORA_HBase" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/09 00:40:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-717) Make Nutch Solr integration easier - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/09 01:30:16 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1540 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/09 06:02:13 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1029) Readdb throws EOFException - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/09 18:13:51 UTC, 1 replies.
- [jira] [Created] (NUTCH-1036) Solr jobs should increment counters in Reporter - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/09 19:17:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1029) Readdb throws EOFException - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/09 23:18:00 UTC, 2 replies.
- [jira] [Issue Comment Edited] (NUTCH-1029) Readdb throws EOFException - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/09 23:23:59 UTC, 0 replies.
- [jira] [Created] (NUTCH-1037) Deduplicate anchors before indexing - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/10 01:07:59 UTC, 0 replies.
- [jira] [Issue Comment Edited] (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967) - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/10 01:14:00 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1541 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/10 06:03:35 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1037) Deduplicate anchors before indexing - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/10 20:19:59 UTC, 13 replies.
- Build failed in Jenkins: Nutch-trunk #1542 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/11 06:02:02 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1030) WebgraphDB program requires manually added directories - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/11 12:23:59 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1027) Degrade log level of `can't find rules for scope` - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/11 12:25:59 UTC, 1 replies.
- [jira] [Updated] (NUTCH-783) IndexerChecker Utilty - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/11 12:43:59 UTC, 0 replies.
- [jira] [Created] (NUTCH-1038) Port IndexingFiltersChecker to 2.0 - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/11 12:44:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-783) IndexingFiltersChecker Utilty - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/11 12:44:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-783) IndexingFiltersChecker Utility - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/11 12:48:00 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-783) IndexingFiltersChecker Utilty - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/11 12:48:00 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "NewScoring" by MarkusJelsma - posted by Apache Wiki <wi...@apache.org> on 2011/07/11 12:52:38 UTC, 0 replies.
- [jira] [Closed] (NUTCH-783) IndexingFiltersChecker Utility - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/11 12:57:59 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1027) Degrade log level of `can't find rules for scope` - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/11 13:59:59 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1037) Deduplicate anchors before indexing - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/11 14:02:00 UTC, 5 replies.
- [jira] [Created] (NUTCH-1039) Fetcher fails for pages without content-length header - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/11 14:54:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-987) Support HTTP auth for Solr communication - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/11 15:34:00 UTC, 6 replies.
- [jira] [Issue Comment Edited] (NUTCH-987) Support HTTP auth for Solr communication - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/11 15:35:59 UTC, 0 replies.
- [jira] [Created] (NUTCH-1040) Backport REST-API from 2.0 - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/11 16:53:59 UTC, 0 replies.
- [jira] [Issue Comment Edited] (NUTCH-1037) Deduplicate anchors before indexing - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/11 16:53:59 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1017) Exception getting mime type by name - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/11 18:04:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-1041) Not reading mime-type correctly - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/11 18:54:59 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1041) Not reading mime-type correctly - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/11 19:12:59 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #1543 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/12 06:03:24 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "RunningNutchAndSolr" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/12 11:05:25 UTC, 5 replies.
- [Nutch Wiki] Trivial Update of "NutchGotchas" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/12 11:27:05 UTC, 4 replies.
- [Nutch Wiki] Update of "NutchGotchas" by JulienNioche - posted by Apache Wiki <wi...@apache.org> on 2011/07/12 12:10:09 UTC, 0 replies.
- [Nutch Wiki] Update of "FrontPage" by JulienNioche - posted by Apache Wiki <wi...@apache.org> on 2011/07/12 12:16:50 UTC, 0 replies.
- Real-time Solr integration - posted by Matthew Painter <ma...@kusiri.com> on 2011/07/12 14:35:27 UTC, 10 replies.
- Realtime Solr Indexing - posted by Matthew Painter <ma...@kusiri.com> on 2011/07/12 15:40:28 UTC, 0 replies.
- [jira] [Commented] (NUTCH-987) Support HTTP auth for Solr communication - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/12 15:55:00 UTC, 6 replies.
- [jira] [Created] (NUTCH-1042) Fetcher.max.crawl.delay property not taken into account correctly when set to -1 - posted by "Nutch User - 1 (JIRA)" <ji...@apache.org> on 2011/07/12 16:30:59 UTC, 0 replies.
- [jira] [Created] (NUTCH-1043) Add pattern for filtering .js in default url filters - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/12 19:26:59 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1043) Add pattern for filtering .js in default url filters - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/12 19:57:00 UTC, 4 replies.
- [Nutch Wiki] Trivial Update of "Becoming_A_Nutch_Developer" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/12 22:13:29 UTC, 0 replies.
- Nutch benchmark results - posted by lewis john mcgibbney <le...@gmail.com> on 2011/07/12 22:54:45 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Development" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/12 22:56:48 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "TaskList" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/12 22:57:36 UTC, 0 replies.
- [jira] [Commented] (NUTCH-956) solrindex issues - posted by "Alexis (JIRA)" <ji...@apache.org> on 2011/07/12 23:29:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-956) solrindex issues - posted by "Alexis (JIRA)" <ji...@apache.org> on 2011/07/12 23:32:59 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #1544 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/13 06:02:09 UTC, 0 replies.
- [jira] [Created] (NUTCH-1044) Redirected URLs and possibly all of their outlinked URLs have invalid scores. - posted by "Nutch User - 1 (JIRA)" <ji...@apache.org> on 2011/07/13 11:01:01 UTC, 0 replies.
- [Nutch Wiki] Update of "NutchGotchas" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/13 11:06:15 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "OldPluginCentral" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/13 11:26:52 UTC, 4 replies.
- [Nutch Wiki] Trivial Update of "WhyNutchHasAPluginSystem" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/13 11:41:49 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "WhichTechnicalConceptsAreBehindTheNutchPluginSystem" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/13 11:50:50 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "WhatsTheProblemWithPluginsAndClass-loading" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/13 11:52:57 UTC, 1 replies.
- [jira] [Created] (NUTCH-1045) MimeUtil to rely on default config provided by Tika - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/13 12:11:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1045) MimeUtil to rely on default config provided by Tika - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/13 12:12:05 UTC, 2 replies.
- [jira] [Updated] (NUTCH-1043) Add pattern for filtering .js in default url filters - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/13 12:13:59 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1045) MimeUtil to rely on default config provided by Tika - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/13 14:42:59 UTC, 10 replies.
- [jira] [Commented] (NUTCH-1036) Solr jobs should increment counters in Reporter - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/13 16:01:00 UTC, 0 replies.
- [Nutch Wiki] Update of "bin/nutch_invertlinks" by JulienNioche - posted by Apache Wiki <wi...@apache.org> on 2011/07/13 16:09:37 UTC, 0 replies.
- [jira] [Created] (NUTCH-1046) Add tests for indexing to SOLR - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/13 16:21:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-1047) Pluggable indexing backends - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/13 16:26:59 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1046) Add tests for indexing to SOLR - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/13 16:31:00 UTC, 1 replies.
- [Nutch Wiki] Update of "bin/nutch_crawl" by JoeLencioni - posted by Apache Wiki <wi...@apache.org> on 2011/07/13 21:00:55 UTC, 0 replies.
- [jira] [Created] (NUTCH-1048) Busted links on http://nutch.apache.org/mailing_lists.html - posted by "Eric Pugh (JIRA)" <ji...@apache.org> on 2011/07/13 23:23:59 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1545 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/14 06:02:09 UTC, 0 replies.
- [jira] [Created] (NUTCH-1049) Add classes to bin/nutch - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/14 13:37:59 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1049) Add classes to bin/nutch - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/14 13:39:59 UTC, 1 replies.
- HTTPS support - posted by Matthew Painter <ma...@kusiri.com> on 2011/07/14 14:02:11 UTC, 3 replies.
- [jira] [Created] (NUTCH-1050) Add segmentDir option to WebGraph - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/14 17:08:59 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1050) Add segmentDir option to WebGraph - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/14 17:10:59 UTC, 1 replies.
- [jira] [Assigned] (NUTCH-1050) Add segmentDir option to WebGraph - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/14 17:15:38 UTC, 0 replies.
- Normalize and filter hyperlinks during parse - posted by Markus Jelsma <ma...@openindex.io> on 2011/07/14 17:37:20 UTC, 6 replies.
- [jira] [Created] (NUTCH-1051) Export WebGraph node scores for solr.ExternalFileField - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/14 18:08:59 UTC, 0 replies.
- [jira] [Commented] (NUTCH-914) Implement Apache Project Branding Requirements - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/14 18:31:00 UTC, 1 replies.
- [Nutch Wiki] Update of "NutchTutorial" by JoeLencioni - posted by Apache Wiki <wi...@apache.org> on 2011/07/14 22:35:36 UTC, 3 replies.
- [Nutch Wiki] Update of "NutchTutorialPre1.3" by JoeLencioni - posted by Apache Wiki <wi...@apache.org> on 2011/07/14 22:39:06 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1052) Multiple deletes of the same URL using SolrClean - posted by "Tim Pease (JIRA)" <ji...@apache.org> on 2011/07/15 00:00:05 UTC, 1 replies.
- [jira] [Created] (NUTCH-1052) Multiple delete of the same URL using SolrClean - posted by "Tim Pease (JIRA)" <ji...@apache.org> on 2011/07/15 00:00:05 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1546 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/15 06:02:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-1053) Parsing of RSS feeds fails - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/15 11:30:59 UTC, 0 replies.
- [jira] [Created] (NUTCH-1054) Make linkDB optional during indexing - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/15 14:01:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1054) Make linkDB optional during indexing - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/15 17:59:00 UTC, 4 replies.
- [jira] [Commented] (NUTCH-1054) Make linkDB optional during indexing - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/15 18:11:00 UTC, 4 replies.
- JIRA status - posted by lewis john mcgibbney <le...@gmail.com> on 2011/07/15 20:49:28 UTC, 1 replies.
- [Nutch Wiki] Trivial Update of "PluginGotchas" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/15 22:06:32 UTC, 2 replies.
- [jira] [Commented] (NUTCH-1048) Busted links on http://nutch.apache.org/mailing_lists.html - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/15 22:20:00 UTC, 8 replies.
- [jira] [Commented] (NUTCH-1047) Pluggable indexing backends - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/15 22:25:59 UTC, 1 replies.
- [Nutch Wiki] Trivial Update of "Presentations" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/15 22:48:15 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #1547 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/16 06:02:28 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-916) Project Naming And Descriptions - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 13:53:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-917) Website Navigation Links - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 13:55:00 UTC, 0 replies.
- [jira] [Closed] (NUTCH-917) Website Navigation Links - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 13:55:00 UTC, 0 replies.
- [jira] [Closed] (NUTCH-915) project website basics - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 13:55:00 UTC, 0 replies.
- [jira] [Closed] (NUTCH-916) Project Naming And Descriptions - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 13:55:00 UTC, 0 replies.
- [jira] [Closed] (NUTCH-918) Trademark Attributions - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 13:57:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-918) Trademark Attributions - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 13:57:00 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1019) Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 14:18:59 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1019) Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 14:20:59 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1019) Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 14:22:59 UTC, 1 replies.
- [jira] [Assigned] (NUTCH-1023) Trivial error in error message for org.apache.nutch.crawl.LinkDbReader - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 14:27:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1023) Trivial error in error message for org.apache.nutch.crawl.LinkDbReader - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 14:28:59 UTC, 2 replies.
- Does i18n have a purpose anymore - posted by lewis john mcgibbney <le...@gmail.com> on 2011/07/16 14:33:01 UTC, 2 replies.
- [jira] [Assigned] (NUTCH-672) allow unit tests to be run from bin/nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 14:48:59 UTC, 0 replies.
- [jira] [Updated] (NUTCH-672) allow unit tests to be run from bin/nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 14:50:59 UTC, 1 replies.
- [jira] [Commented] (NUTCH-657) Estonian N-gram profile has wrong name - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 15:22:00 UTC, 4 replies.
- [jira] [Created] (NUTCH-1055) upgrade package.html file in language identifier plugin - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 15:41:59 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1055) upgrade package.html file in language identifier plugin - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 15:44:00 UTC, 2 replies.
- [jira] [Commented] (NUTCH-16) boost documents matching a url pattern - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 16:07:59 UTC, 1 replies.
- [jira] [Created] (NUTCH-1056) Write a new plugin example for inclusion on the wiki - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 19:20:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-1057) Make fetcher thread time out configurable - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/16 19:31:59 UTC, 0 replies.
- [jira] [Closed] (NUTCH-16) boost documents matching a url pattern - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 19:53:59 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1047) Pluggable indexing backends - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/16 20:09:59 UTC, 0 replies.
- [jira] [Commented] (NUTCH-62) Add html META tag information into metaData in index-more plugin - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 20:49:59 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1548 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/17 06:03:18 UTC, 0 replies.
- Running individual test classes from nutch script cont'd - posted by lewis john mcgibbney <le...@gmail.com> on 2011/07/17 15:06:26 UTC, 2 replies.
- [jira] [Updated] (NUTCH-1057) Make fetcher thread time out configurable - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/17 15:44:59 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1029) Readdb throws EOFException - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/17 16:03:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-961) Expose Tika's boilerpipe support - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/17 16:09:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-965) Skip parsing for truncated documents - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/17 16:15:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1044) Redirected URLs and possibly all of their outlinked URLs have invalid scores. - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/17 16:21:00 UTC, 1 replies.
- [jira] [Created] (NUTCH-1058) Upgrade Solr schema to version 1.4 - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/17 16:36:59 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1058) Upgrade Solr schema to version 1.4 - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/17 16:37:00 UTC, 0 replies.
- adding details to mvn.template? - posted by lewis john mcgibbney <le...@gmail.com> on 2011/07/17 17:48:08 UTC, 2 replies.
- [jira] [Commented] (NUTCH-648) debian style autocomplete - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/17 18:29:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-1059) Remove convdb command from /bin/nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/17 18:35:00 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1019) Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/17 22:44:59 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1019) Edit comment in org.apache.nutch.crawl.Crawl to reflect removal of legacy - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/17 22:44:59 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1023) Trivial error in error message for org.apache.nutch.crawl.LinkDbReader - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/17 22:50:59 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1023) Trivial error in error message for org.apache.nutch.crawl.LinkDbReader - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/17 22:51:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1023) Trivial error in error message for org.apache.nutch.crawl.LinkDbReader - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/17 22:51:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1059) Remove convdb command from /bin/nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/17 23:13:03 UTC, 0 replies.
- [Nutch Wiki] Update of "RunningNutchAndSolr" by EricPugh - posted by Apache Wiki <wi...@apache.org> on 2011/07/18 03:17:52 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #1549 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/18 06:02:37 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1054) Make linkDB optional during indexing - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/18 11:22:59 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1043) Add pattern for filtering .js in default url filters - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/18 11:28:59 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1059) Remove convdb command from /bin/nutch - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/18 12:12:59 UTC, 0 replies.
- [Nutch Wiki] Update of "FAQ" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/18 13:00:58 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "FAQ" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/18 13:02:20 UTC, 1 replies.
- [jira] [Closed] (NUTCH-1059) Remove convdb command from /bin/nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/18 13:25:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1059) Remove convdb command from /bin/nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/18 13:25:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1055) upgrade package.html file in language identifier plugin - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/18 13:45:00 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1055) upgrade package.html file in language identifier plugin - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/18 13:47:00 UTC, 0 replies.
- changing file and directory names - posted by lewis john mcgibbney <le...@gmail.com> on 2011/07/18 13:50:44 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1049) Add classes to bin/nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/18 13:57:03 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1020) Create or locate class for org.apache.nutch.tools.compat.CrawlDbConverter - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/18 13:59:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1020) Create or locate class for org.apache.nutch.tools.compat.CrawlDbConverter - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/18 13:59:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-881) Good quality documentation for Nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/18 14:05:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-865) Format source code in unique style - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/18 14:09:00 UTC, 7 replies.
- [jira] [Commented] (NUTCH-910) Cached.jsp has a bug with encoding - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/18 14:11:52 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1044) Redirected URLs and possibly all of their outlinked URLs have invalid scores. - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/18 14:22:57 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1044) Redirected URLs and possibly all of their outlinked URLs have invalid scores. - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/18 14:22:58 UTC, 1 replies.
- [jira] [Issue Comment Edited] (NUTCH-1044) Redirected URLs and possibly all of their outlinked URLs have invalid scores. - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/18 14:50:57 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "RunningNutchAndSolr" by EricPugh - posted by Apache Wiki <wi...@apache.org> on 2011/07/18 15:05:12 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1051) Export WebGraph node scores for solr.ExternalFileField - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/18 15:37:57 UTC, 1 replies.
- [Nutch Wiki] Trivial Update of "08CommandLineOptions" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/18 16:54:55 UTC, 1 replies.
- Fwd: Nutch 1.3 in Eclipse - posted by Chris Alexander <ch...@kusiri.com> on 2011/07/18 19:27:55 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1048) Busted links on http://nutch.apache.org/mailing_lists.html - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/18 19:36:57 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1048) Busted links on http://nutch.apache.org/mailing_lists.html - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/18 19:38:57 UTC, 1 replies.
- [jira] [Reopened] (NUTCH-1048) Busted links on http://nutch.apache.org/mailing_lists.html - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/18 19:54:00 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-920) Project Metadata - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/18 21:01:57 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-919) Logos and Graphics - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/18 21:01:57 UTC, 0 replies.
- [jira] [Commented] (NUTCH-920) Project Metadata - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/18 21:01:57 UTC, 5 replies.
- Build failed in Jenkins: Nutch-trunk #1550 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/19 06:04:15 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1055) upgrade package.html file in language identifier plugin - posted by "Hudson (JIRA)" <ji...@apache.org> on 2011/07/19 06:05:57 UTC, 0 replies.
- [jira] [Created] (NUTCH-1060) URL filters to produce regexes to be used by OutlinkExtractor. - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/19 11:53:57 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1021) Migrate OutlinkExtractor from Apache ORO to java.util.regex - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/19 13:27:58 UTC, 0 replies.
- [jira] [Created] (NUTCH-1061) Migrate MoreIndexingFilter from Apache ORO to java.util.regex - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/19 14:01:58 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1061) Migrate MoreIndexingFilter from Apache ORO to java.util.regex - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/19 14:03:57 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1014) Migrate from Apache ORO to java.util.regex - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/19 14:05:57 UTC, 3 replies.
- [jira] [Created] (NUTCH-1062) Migrate BasicURLNormalizer from Apache ORO to java.util.regex - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/19 14:11:57 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1062) Migrate BasicURLNormalizer from Apache ORO to java.util.regex - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/19 14:15:58 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1050) Add segmentDir option to WebGraph - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/19 14:33:58 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1057) Make fetcher thread time out configurable - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/19 14:35:57 UTC, 3 replies.
- [jira] [Resolved] (NUTCH-1050) Add segmentDir option to WebGraph - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/19 14:51:57 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1037) Deduplicate anchors before indexing - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/19 15:13:58 UTC, 0 replies.
- [jira] [Closed] (NUTCH-729) NPE in FieldIndexer when BasicFields url doesn't exist - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/19 15:19:57 UTC, 0 replies.
- [jira] [Updated] (NUTCH-771) Add WebGraph classes to the bin/nutch script - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/19 15:29:57 UTC, 0 replies.
- [jira] [Created] (NUTCH-1063) OutlinkExtractor test generates an exception but does not fail - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/19 16:12:57 UTC, 0 replies.
- [jira] [Created] (NUTCH-1064) o.a.n.util.MimeUtil uses deprecated Tika methods - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/19 17:08:57 UTC, 0 replies.
- [jira] [Issue Comment Edited] (NUTCH-1045) MimeUtil to rely on default config provided by Tika - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/19 19:50:57 UTC, 1 replies.
- [jira] [Commented] (NUTCH-919) Logos and Graphics - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/19 20:30:58 UTC, 4 replies.
- [jira] [Updated] (NUTCH-865) Format source code in unique style - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/19 22:50:57 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "bin/nutch_updatedb" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/19 22:51:11 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1551 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/20 06:11:46 UTC, 0 replies.
- [jira] [Commented] (NUTCH-717) Make Nutch Solr integration easier - posted by "Eric Pugh (JIRA)" <ji...@apache.org> on 2011/07/20 08:39:57 UTC, 3 replies.
- Build failed in Jenkins: Nutch-trunk #1552 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/21 06:05:05 UTC, 0 replies.
- [jira] [Updated] (NUTCH-920) Project Metadata - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/21 16:29:57 UTC, 1 replies.
- [jira] [Updated] (NUTCH-919) Logos and Graphics - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/21 16:39:58 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1553 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/22 06:01:21 UTC, 0 replies.
- [jira] [Created] (NUTCH-1065) New mvn.template - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/22 11:51:57 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1065) New mvn.template - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/22 11:53:02 UTC, 0 replies.
- [jira] [Created] (NUTCH-1066) trivial correction of - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/22 11:58:57 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1066) trivial correction of domain-urlfilter.txt - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/22 12:00:59 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1066) trivial correction of - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/22 12:00:59 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1066) trivial correction of domain-urlfilter.txt - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/22 13:01:06 UTC, 0 replies.
- [jira] [Created] (NUTCH-1067) Configure minimum throughput for fetcher - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/22 16:32:57 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1067) Configure minimum throughput for fetcher - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/22 16:32:58 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1066) trivial correction of domain-urlfilter.txt - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/22 17:51:58 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1066) trivial correction of domain-urlfilter.txt - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/22 17:53:57 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1554 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/23 06:03:24 UTC, 0 replies.
- .BAT file for running nutch in Windows (no cygwin) - posted by Radim Kolar <hs...@sendmail.cz> on 2011/07/23 15:59:51 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1067) Configure minimum throughput for fetcher - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/24 02:05:09 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1555 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/24 06:04:02 UTC, 0 replies.
- Re: Please remove me from the mailing list - posted by Ariana Homan-Cruz <ah...@harding.edu> on 2011/07/25 00:20:56 UTC, 0 replies.
- Automaton improvements - posted by Kirby Bohling <ki...@gmail.com> on 2011/07/25 06:01:36 UTC, 5 replies.
- Build failed in Jenkins: Nutch-trunk #1556 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/25 06:03:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1065) New mvn.template - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/25 14:13:10 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1045) MimeUtil to rely on default config provided by Tika - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/25 14:41:09 UTC, 0 replies.
- [jira] [Created] (NUTCH-1068) Automaton performance improvements based on Lucene code base - posted by "Kirby Bohling (JIRA)" <ji...@apache.org> on 2011/07/25 18:34:09 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1068) Automaton performance improvements based on Lucene code base - posted by "Kirby Bohling (JIRA)" <ji...@apache.org> on 2011/07/25 18:36:09 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1034) Create Solr Velocity templates - posted by "Umar Shah (JIRA)" <ji...@apache.org> on 2011/07/25 21:26:09 UTC, 0 replies.
- [jira] [Created] (NUTCH-1069) readlinkdb throws exception - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/25 22:58:09 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1557 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/26 06:04:22 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1558 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/27 06:02:29 UTC, 0 replies.
- Page deletion and tracking change between crawlings - posted by Julio Garcés Teuber <ju...@xinergia.com> on 2011/07/27 16:44:50 UTC, 1 replies.
- [Nutch Wiki] Update of "NutchTutorial" by MarcBoucher - posted by Apache Wiki <wi...@apache.org> on 2011/07/27 16:50:39 UTC, 0 replies.
- Tracking change between crawlings and page deletion - posted by "julio.xng" <ju...@xng.bz> on 2011/07/27 16:55:29 UTC, 0 replies.
- [jira] [Created] (NUTCH-1070) Run nutch under native windows (no cygwin) - posted by "Radim Kolar (JIRA)" <ji...@apache.org> on 2011/07/27 19:08:09 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1070) Run nutch under native windows (no cygwin) - posted by "Radim Kolar (JIRA)" <ji...@apache.org> on 2011/07/27 19:10:10 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "NonDefaultIntranetCrawlingOptions" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/07/27 23:43:44 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #1559 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/28 06:03:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-1071) Crawldb update to total counts per status - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/28 15:53:09 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1071) Crawldb update to total counts per status - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/28 15:55:09 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1071) Crawldb update to total counts per status - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/28 15:57:09 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1071) Crawldb update to total counts per status - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/28 15:57:09 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1071) Crawldb update to total counts per status - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/28 16:05:09 UTC, 0 replies.
- [jira] [Created] (NUTCH-1072) Display number and size of queues in Fetcher status - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/28 16:54:09 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1072) Display number and size of queues in Fetcher status - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/28 16:56:09 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1072) Display number and size of queues in Fetcher status - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/28 17:02:09 UTC, 0 replies.
- Correct Nutch tutorial - posted by lewis john mcgibbney <le...@gmail.com> on 2011/07/28 18:00:13 UTC, 0 replies.
- [jira] [Closed] (NUTCH-919) Logos and Graphics - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/28 20:22:09 UTC, 0 replies.
- [jira] [Closed] (NUTCH-920) Project Metadata - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/28 20:24:09 UTC, 0 replies.
- [jira] [Commented] (NUTCH-917) Website Navigation Links - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/28 20:28:09 UTC, 0 replies.
- [jira] [Reopened] (NUTCH-917) Website Navigation Links - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/28 20:28:09 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1560 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/29 06:04:01 UTC, 0 replies.
- [jira] [Created] (NUTCH-1073) Rename parameters 'fetcher.threads.per.host.by.ip' and 'fetcher.threads.per.host' - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/07/29 09:50:09 UTC, 0 replies.
- Re: (NUTCH-1071) Crawldb update to total counts per status - posted by Markus Jelsma <ma...@openindex.io> on 2011/07/29 11:23:21 UTC, 5 replies.
- [Nutch Wiki] Update of "NutchHadoopTutorial" by fei33423 - posted by Apache Wiki <wi...@apache.org> on 2011/07/29 14:00:39 UTC, 0 replies.
- Possible use of your bot as a hacking tool - posted by Ardath Rekha <ar...@gmail.com> on 2011/07/29 23:33:46 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #1561 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/30 06:03:28 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1562 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/07/31 06:03:12 UTC, 0 replies.