You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] [Commented] (NUTCH-1200) Resolving Ivy dependencies in several plugins - posted by "Blaise Thomson (Commented) (JIRA)" <ji...@apache.org> on 2011/12/01 11:49:40 UTC, 1 replies.
- [Nutch Wiki] Trivial Update of "RunNutchInEclipse" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/12/01 20:35:13 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #1681 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/02 05:25:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1206) tika parser of nutch 1.3 is failing to prcess pdfs - posted by "dibyendu ghosh (Commented) (JIRA)" <ji...@apache.org> on 2011/12/02 08:11:40 UTC, 7 replies.
- Fwd: [VOTE] Release Apache Accumulo 1.3.5-incubating (rc8) - posted by Lewis John Mcgibbney <le...@gmail.com> on 2011/12/02 13:23:56 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1206) tika parser of nutch 1.3 is failing to prcess pdfs - posted by "Chris A. Mattmann (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/02 18:07:40 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1084) ReadDB url throws exception - posted by "Marek Bachmann (Commented) (JIRA)" <ji...@apache.org> on 2011/12/03 17:30:39 UTC, 1 replies.
- [jira] [Commented] (NUTCH-296) Image Search - posted by "Sanjib Narzary (Commented) (JIRA)" <ji...@apache.org> on 2011/12/03 23:50:40 UTC, 0 replies.
- [Nutch Wiki] Update of "NutchTutorial" by Frungi - posted by Apache Wiki <wi...@apache.org> on 2011/12/04 07:46:17 UTC, 0 replies.
- Re: nutch and openJDK 1.6 for fedora - posted by Alexander Aristov <al...@gmail.com> on 2011/12/04 10:23:23 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "FAQ" by Frungi - posted by Apache Wiki <wi...@apache.org> on 2011/12/04 10:36:37 UTC, 0 replies.
- [Nutch Wiki] Update of "AdminGroup" by GavinMcDonald - posted by Apache Wiki <wi...@apache.org> on 2011/12/04 10:44:45 UTC, 1 replies.
- [Nutch Wiki] Update of "ContributorsGroup" by GavinMcDonald - posted by Apache Wiki <wi...@apache.org> on 2011/12/04 10:47:37 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1047) Pluggable indexing backends - posted by "Julien Nioche (Commented) (JIRA)" <ji...@apache.org> on 2011/12/05 11:07:40 UTC, 4 replies.
- contribution to wiki - posted by Sebastian Nagel <wa...@googlemail.com> on 2011/12/05 21:04:42 UTC, 2 replies.
- how to search the web with nutch - posted by Jihene Ferchichi Jmal <fe...@hotmail.fr> on 2011/12/06 08:20:08 UTC, 1 replies.
- [Nutch Wiki] Trivial Update of "AdminGroup" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/12/06 12:53:35 UTC, 1 replies.
- Update Notice.txt - posted by Lewis John Mcgibbney <le...@gmail.com> on 2011/12/06 18:58:49 UTC, 2 replies.
- [jira] [Created] (NUTCH-1216) Add trivial comment to lib/native/README.txt - posted by "Lewis John McGibbney (Created) (JIRA)" <ji...@apache.org> on 2011/12/06 20:45:40 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1216) Add trivial comment to lib/native/README.txt - posted by "Lewis John McGibbney (Updated) (JIRA)" <ji...@apache.org> on 2011/12/06 20:47:41 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1216) Add trivial comment to lib/native/README.txt - posted by "Lewis John McGibbney (Commented) (JIRA)" <ji...@apache.org> on 2011/12/06 20:51:41 UTC, 3 replies.
- [Nutch Wiki] Trivial Update of "CrawlDatumStates" by SebastianNagel - posted by Apache Wiki <wi...@apache.org> on 2011/12/06 22:38:11 UTC, 0 replies.
- [jira] [Created] (NUTCH-1217) Update NOTICE.txt to drop some copyrights - posted by "Lewis John McGibbney (Created) (JIRA)" <ji...@apache.org> on 2011/12/07 13:14:39 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1217) Update NOTICE.txt to drop some copyrights - posted by "Lewis John McGibbney (Updated) (JIRA)" <ji...@apache.org> on 2011/12/07 13:56:40 UTC, 2 replies.
- Build failed in Jenkins: Nutch-trunk #1687 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/08 05:28:15 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "NutchTutorial" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/12/08 14:23:46 UTC, 1 replies.
- Jenkins build is back to normal : Nutch-trunk #1688 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/09 06:10:13 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1216) Add trivial comment to lib/native/README.txt - posted by "Lewis John McGibbney (Closed) (JIRA)" <ji...@apache.org> on 2011/12/09 15:56:40 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1216) Add trivial comment to lib/native/README.txt - posted by "Lewis John McGibbney (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/09 15:56:40 UTC, 0 replies.
- Nutch-1.2 with Bing API - posted by Luis Taveras <lt...@hotmail.com> on 2011/12/11 05:48:34 UTC, 1 replies.
- Nutch + Solr + Carrot2 Tutorials - posted by Swapnil Kulkarni <sw...@usc.edu> on 2011/12/11 16:45:49 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1094) create comprehensive documentation for Nutchgora branch - posted by "Lewis John McGibbney (Updated) (JIRA)" <ji...@apache.org> on 2011/12/12 19:19:31 UTC, 0 replies.
- [jira] [Created] (NUTCH-1218) Improve trunk API documentation - posted by "Lewis John McGibbney (Created) (JIRA)" <ji...@apache.org> on 2011/12/12 19:21:32 UTC, 0 replies.
- Improving API Java Documentation - posted by Lewis John Mcgibbney <le...@gmail.com> on 2011/12/12 19:28:22 UTC, 1 replies.
- Upgrading to Hadoop 0.22.0+ - posted by Markus Jelsma <ma...@openindex.io> on 2011/12/13 17:41:19 UTC, 8 replies.
- [jira] [Created] (NUTCH-1219) Upgrade all jobs to new MapReduce API - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/13 17:47:30 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1219) Upgrade all jobs to new MapReduce API - posted by "Markus Jelsma (Updated) (JIRA)" <ji...@apache.org> on 2011/12/13 17:53:30 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1218) Improve trunk API documentation - posted by "Lewis John McGibbney (Updated) (JIRA)" <ji...@apache.org> on 2011/12/13 23:31:30 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1218) Improve trunk API documentation - posted by "Markus Jelsma (Commented) (JIRA)" <ji...@apache.org> on 2011/12/13 23:43:30 UTC, 3 replies.
- [jira] [Created] (NUTCH-1220) Upgrade Solr deps - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/14 11:43:36 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1219) Upgrade all jobs to new MapReduce API - posted by "Markus Jelsma (Commented) (JIRA)" <ji...@apache.org> on 2011/12/14 12:29:31 UTC, 1 replies.
- [jira] [Created] (NUTCH-1221) Migrate DomainStatistics to MapReduce API - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/14 12:51:30 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1221) Migrate DomainStatistics to MapReduce API - posted by "Markus Jelsma (Updated) (JIRA)" <ji...@apache.org> on 2011/12/14 13:57:30 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1184) Fetcher to parse and follow Nth degree outlinks - posted by "Markus Jelsma (Commented) (JIRA)" <ji...@apache.org> on 2011/12/14 14:17:31 UTC, 5 replies.
- [jira] [Created] (NUTCH-1222) Upgrade to newer Hadoop versions - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/14 14:39:30 UTC, 0 replies.
- [jira] [Created] (NUTCH-1223) Migrate WebGraph to MapReduce API - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/14 14:41:30 UTC, 0 replies.
- [jira] [Created] (NUTCH-1224) Migrate FreeGenerator to MapReduce API - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/14 15:35:30 UTC, 0 replies.
- [jira] [Created] (NUTCH-1225) Migrate CrawlDBScanner to MapReduce API - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/14 15:35:30 UTC, 11 replies.
- Fwd: check out the iPhone game, that i have developed. - posted by Lewis John Mcgibbney <le...@gmail.com> on 2011/12/15 13:30:17 UTC, 5 replies.
- Modifying Nutch Ivy & Maven settings [WAS] Re: [jira] [Created] (NUTCH-1225) Migrate CrawlDBScanner to MapReduce API - posted by Lewis John Mcgibbney <le...@gmail.com> on 2011/12/15 13:59:09 UTC, 2 replies.
- [jira] [Created] (NUTCH-1226) Migrate CrawlDbReader to MapReduce API - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/15 14:48:30 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1225) Migrate CrawlDBScanner to MapReduce API - posted by "Markus Jelsma (Updated) (JIRA)" <ji...@apache.org> on 2011/12/15 17:06:30 UTC, 2 replies.
- [jira] [Commented] (NUTCH-1225) Migrate CrawlDBScanner to MapReduce API - posted by "Markus Jelsma (Commented) (JIRA)" <ji...@apache.org> on 2011/12/15 17:10:30 UTC, 4 replies.
- [jira] [Updated] (NUTCH-1226) Migrate CrawlDbReader to MapReduce API - posted by "Markus Jelsma (Updated) (JIRA)" <ji...@apache.org> on 2011/12/15 18:52:30 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1221) Migrate DomainStatistics to MapReduce API - posted by "Markus Jelsma (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/16 12:18:30 UTC, 0 replies.
- Hadoop 0.22 is compatible - posted by Markus Jelsma <ma...@openindex.io> on 2011/12/16 13:09:12 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1221) Migrate DomainStatistics to MapReduce API - posted by "Hudson (Commented) (JIRA)" <ji...@apache.org> on 2011/12/16 13:10:30 UTC, 1 replies.
- consulting nutch and hadoop - posted by ut...@sina.com on 2011/12/18 16:58:07 UTC, 3 replies.
- [jira] [Created] (NUTCH-1227) Set mapreduce.map.speculative for Hadoop 0.21 or higher - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/18 17:46:30 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1222) Upgrade to newer Hadoop versions - posted by "Markus Jelsma (Commented) (JIRA)" <ji...@apache.org> on 2011/12/19 12:49:30 UTC, 0 replies.
- [jira] [Created] (NUTCH-1228) Change mapred.task.timeout to mapreduce.task.timeout in fetcher - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/19 15:15:30 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1227) Set mapreduce.map.speculative for Hadoop 0.21 or higher - posted by "Markus Jelsma (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/19 15:15:30 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1222) Upgrade to new Hadoop 0.22.0 - posted by "Markus Jelsma (Updated) (JIRA)" <ji...@apache.org> on 2011/12/19 15:25:30 UTC, 2 replies.
- [jira] [Resolved] (NUTCH-1222) Upgrade to new Hadoop 0.22.0 - posted by "Markus Jelsma (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/19 16:15:30 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1225) Migrate CrawlDBScanner to MapReduce API - posted by "Markus Jelsma (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/19 16:17:30 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven » Apache Nutch #67 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/19 17:01:14 UTC, 1 replies.
- Build failed in Jenkins: nutch-trunk-maven #67 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/19 17:01:28 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1222) Upgrade to new Hadoop 0.22.0 - posted by "Hudson (Commented) (JIRA)" <ji...@apache.org> on 2011/12/19 17:01:31 UTC, 2 replies.
- Build failed in Jenkins: Nutch-trunk #1698 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/20 05:28:39 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven » Apache Nutch #68 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/20 06:01:09 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven #68 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/20 06:01:10 UTC, 0 replies.
- Nutch Developers - posted by liguohong_neu <li...@163.com> on 2011/12/20 09:54:57 UTC, 2 replies.
- [jira] [Resolved] (NUTCH-1184) Fetcher to parse and follow Nth degree outlinks - posted by "Markus Jelsma (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/20 11:13:30 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1184) Fetcher to parse and follow Nth degree outlinks - posted by "Markus Jelsma (Updated) (JIRA)" <ji...@apache.org> on 2011/12/20 11:13:30 UTC, 0 replies.
- [jira] [Created] (NUTCH-1229) Add freegenerator, domainstat and crawldbscanner to log4j - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/20 11:21:30 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1229) Add freegenerator, domainstat and crawldbscanner to log4j - posted by "Markus Jelsma (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/20 11:23:30 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven » Apache Nutch #69 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/20 12:05:21 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven #69 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/20 12:05:38 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1129) Any23 Nutch plugin - posted by "Hudson (Commented) (JIRA)" <ji...@apache.org> on 2011/12/20 12:07:30 UTC, 1 replies.
- [jira] [Closed] (NUTCH-1229) Add freegenerator, domainstat and crawldbscanner to log4j - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:27:30 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1011) Normalize duplicate slashes in URL's - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:31 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1092) overhaul FAQ's and publish to Nutch site - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:31 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1074) topN is ignored with maxNumSegments - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:31 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1028) Log parser keys - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:31 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1067) Configure minimum throughput for fetcher - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:31 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1085) Nutch script does not require HADOOP_HOME - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:31 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1044) Redirected URLs and possibly all of their outlinked URLs have invalid scores. - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:31 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1075) Delegate language identification to Tika - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:32 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1137) LinkDb / invertlinks: command line arguments ignored - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:32 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1152) Upgrade to SolrJ 3.4.0 - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:32 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1016) Strip UTF-8 non-character codepoints - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:32 UTC, 0 replies.
- [jira] [Closed] (NUTCH-993) NullPointerException at FetcherOutputFormat.checkOutputSpecs - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:32 UTC, 0 replies.
- [jira] [Closed] (NUTCH-672) allow unit tests to be run from bin/nutch - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:32 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1045) MimeUtil to rely on default config provided by Tika - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:33 UTC, 0 replies.
- [jira] [Closed] (NUTCH-925) plugins stored in weakhashmap lead memory leak - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:33 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1054) Make linkDB optional during indexing - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:33 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1114) Attr file missing in domain filter - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:33 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1115) Option to disable fixing of embedded params in DomContentUtils - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:33 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1096) Empty (not null) ContentLength results in failure of fetch - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:33 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1052) Multiple deletes of the same URL using SolrClean - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:33 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1057) Make fetcher thread time out configurable - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:33 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1013) Migrate RegexURLNormalizer from Apache ORO to java.util.regex - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:33 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1154) Upgrade to Tika 0.10 - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:34 UTC, 0 replies.
- [jira] [Closed] (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967) - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:34 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1032) Delegate parsing of robots.txt to crawler-commons - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:34 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1043) Add pattern for filtering .js in default url filters - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:34 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1101) Options to purge db_gone records in updatedb - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:34 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1102) Fetcher, rely on fetcher.parse directive only - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:34 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1195) Add Solr 4x (trunk) example schema - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:35 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1073) Rename parameters 'fetcher.threads.per.host.by.ip' and 'fetcher.threads.per.host' - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/20 12:31:35 UTC, 0 replies.
- [jira] [Commented] (NUTCH-925) plugins stored in weakhashmap lead memory leak - posted by "congliu (Commented) (JIRA)" <ji...@apache.org> on 2011/12/20 14:25:32 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1699 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/21 05:25:59 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven » Apache Nutch #70 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/21 06:01:19 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven #70 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/21 06:01:20 UTC, 0 replies.
- [jira] [Created] (NUTCH-1230) MimeType utils broken with Tika 1.1 - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/21 12:27:30 UTC, 0 replies.
- [jira] [Created] (NUTCH-1231) Upgrade to Tika 1.0 - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/21 12:31:30 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1230) MimeType utils broken with Tika 1.1 - posted by "Markus Jelsma (Commented) (JIRA)" <ji...@apache.org> on 2011/12/21 12:33:30 UTC, 1 replies.
- [jira] [Issue Comment Edited] (NUTCH-1230) MimeType utils broken with Tika 1.1 - posted by "Markus Jelsma (Issue Comment Edited) (JIRA)" <ji...@apache.org> on 2011/12/21 12:41:30 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1230) MimeType API deprecated and breaks with Tika 1.0 - posted by "Markus Jelsma (Updated) (JIRA)" <ji...@apache.org> on 2011/12/21 13:17:31 UTC, 2 replies.
- [jira] [Commented] (NUTCH-1230) MimeType API deprecated and breaks with Tika 1.0 - posted by "Markus Jelsma (Commented) (JIRA)" <ji...@apache.org> on 2011/12/21 14:03:30 UTC, 2 replies.
- get rid of outlink code for Tika - posted by Markus Jelsma <ma...@openindex.io> on 2011/12/21 14:51:01 UTC, 2 replies.
- [jira] [Created] (NUTCH-1232) Remove host|site fields from index-basic - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/21 16:21:30 UTC, 0 replies.
- [jira] [Created] (NUTCH-1233) Rely on Tika for outlink extraction - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/21 16:59:31 UTC, 0 replies.
- [jira] [Created] (NUTCH-1234) Upgrade to Tika 1.1 - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/21 17:01:31 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1233) Rely on Tika for outlink extraction - posted by "Markus Jelsma (Updated) (JIRA)" <ji...@apache.org> on 2011/12/21 17:05:30 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven » Apache Nutch #71 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/22 06:03:43 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven #71 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/22 06:03:44 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1700 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/22 06:12:07 UTC, 0 replies.
- multiple slf bindings - posted by Markus Jelsma <ma...@openindex.io> on 2011/12/22 14:17:58 UTC, 2 replies.
- Build failed in Jenkins: Nutch-trunk #1701 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/23 05:20:15 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven » Apache Nutch #72 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/23 06:01:16 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven #72 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/23 06:01:17 UTC, 0 replies.
- Website Document . - posted by ayyappan <ay...@gmail.com> on 2011/12/23 10:54:00 UTC, 1 replies.
- Jenkins build is back to normal : nutch-trunk-maven » Apache Nutch #73 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/23 12:01:36 UTC, 1 replies.
- Jenkins build is back to normal : nutch-trunk-maven #73 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/23 12:01:52 UTC, 0 replies.
- [jira] [Reopened] (NUTCH-1222) Upgrade to new Hadoop 0.22.0 - posted by "Markus Jelsma (Reopened) (JIRA)" <ji...@apache.org> on 2011/12/23 14:56:38 UTC, 0 replies.
- [jira] [Reopened] (NUTCH-1225) Migrate CrawlDBScanner to MapReduce API - posted by "Markus Jelsma (Reopened) (JIRA)" <ji...@apache.org> on 2011/12/23 14:58:30 UTC, 0 replies.
- [jira] [Created] (NUTCH-1235) Upgrade to new Hadoop 0.20.205.0 - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/23 15:42:31 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1702 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/24 08:36:48 UTC, 4 replies.
- Build failed in Jenkins: Nutch-trunk #1703 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/25 05:25:31 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1704 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/26 06:18:22 UTC, 0 replies.
- [jira] [Created] (NUTCH-1236) Add link to site documentation to download older versions of Nutch. - posted by "Lewis John McGibbney (Created) (JIRA)" <ji...@apache.org> on 2011/12/26 15:08:30 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1236) Add link to site documentation to download older versions of Nutch. - posted by "Lewis John McGibbney (Updated) (JIRA)" <ji...@apache.org> on 2011/12/26 16:32:30 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1217) Update NOTICE.txt to drop some copyrights - posted by "Lewis John McGibbney (Commented) (JIRA)" <ji...@apache.org> on 2011/12/26 16:44:31 UTC, 3 replies.
- [jira] [Commented] (NUTCH-1081) ant tests fail - posted by "Lewis John McGibbney (Commented) (JIRA)" <ji...@apache.org> on 2011/12/26 17:08:31 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1217) Update NOTICE.txt to drop some copyrights - posted by "Lewis John McGibbney (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/26 17:18:30 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1217) Update NOTICE.txt to drop some copyrights - posted by "Lewis John McGibbney (Closed) (JIRA)" <ji...@apache.org> on 2011/12/26 17:20:30 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1138) remove LogUtil from trunk and nutch gora - posted by "Lewis John McGibbney (Commented) (JIRA)" <ji...@apache.org> on 2011/12/26 19:54:30 UTC, 6 replies.
- Build failed in Jenkins: Nutch-nutchgora #109 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/27 05:52:39 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1705 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/27 06:12:46 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1235) Upgrade to new Hadoop 0.20.205.0 - posted by "Markus Jelsma (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/27 14:30:30 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1235) Upgrade to new Hadoop 0.20.205.0 - posted by "Hudson (Commented) (JIRA)" <ji...@apache.org> on 2011/12/27 15:02:30 UTC, 3 replies.
- [jira] [Commented] (NUTCH-1125) JUnit test for tld - posted by "Hudson (Commented) (JIRA)" <ji...@apache.org> on 2011/12/27 15:02:30 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1230) MimeType API deprecated and breaks with Tika 1.0 - posted by "Markus Jelsma (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/27 15:38:30 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1231) Upgrade to Tika 1.0 - posted by "Markus Jelsma (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/27 15:38:30 UTC, 0 replies.
- [jira] [Commented] (NUTCH-961) Expose Tika's boilerpipe support - posted by "Markus Jelsma (Commented) (JIRA)" <ji...@apache.org> on 2011/12/27 15:58:30 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1231) Upgrade to Tika 1.0 - posted by "Hudson (Commented) (JIRA)" <ji...@apache.org> on 2011/12/27 16:02:30 UTC, 1 replies.
- Build failed in Jenkins: Nutch-nutchgora #110 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/27 16:13:24 UTC, 0 replies.
- Keeping track of resolved issues - posted by Lewis John Mcgibbney <le...@gmail.com> on 2011/12/27 16:19:06 UTC, 2 replies.
- [jira] [Updated] (NUTCH-1104) Port issues from trunk NutchGora branch - posted by "Markus Jelsma (Updated) (JIRA)" <ji...@apache.org> on 2011/12/27 16:34:30 UTC, 0 replies.
- Dependencies with Ivy - posted by Lewis John Mcgibbney <le...@gmail.com> on 2011/12/27 16:35:36 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #111 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/27 16:44:38 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #112 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/27 16:54:27 UTC, 0 replies.
- [jira] [Created] (NUTCH-1237) Improve javac arguements for more verbose output - posted by "Lewis John McGibbney (Created) (JIRA)" <ji...@apache.org> on 2011/12/27 17:24:30 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1237) Improve javac arguements for more verbose output - posted by "Lewis John McGibbney (Updated) (JIRA)" <ji...@apache.org> on 2011/12/27 17:28:30 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1237) Improve javac arguements for more verbose output - posted by "Lewis John McGibbney (Commented) (JIRA)" <ji...@apache.org> on 2011/12/27 17:28:31 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1205) Upgrade gora modules to 0.2-SNAPSHOT - posted by "Lewis John McGibbney (Updated) (JIRA)" <ji...@apache.org> on 2011/12/27 17:40:30 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1205) Upgrade gora modules to 0.2-incubating in ivy/ivy.xml - posted by "Lewis John McGibbney (Updated) (JIRA)" <ji...@apache.org> on 2011/12/27 17:40:30 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1205) Upgrade gora modules to 0.2-incubating in ivy/ivy.xml - posted by "Lewis John McGibbney (Commented) (JIRA)" <ji...@apache.org> on 2011/12/27 17:40:31 UTC, 0 replies.
- [jira] [Created] (NUTCH-1238) Fetcher throughput threshold must start before feeder finished - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/27 21:26:31 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "bin/nutch solrindex" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2011/12/27 22:00:08 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1706 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/28 05:58:29 UTC, 8 replies.
- Build failed in Jenkins: Nutch-trunk #1707 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/28 12:04:34 UTC, 0 replies.
- Re: svn commit: r1225410 - in /nutch/site/forrest/src/documentation/content/xdocs: index.xml site.xml - posted by Lewis John Mcgibbney <le...@gmail.com> on 2011/12/29 03:07:28 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1708 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/29 05:15:59 UTC, 0 replies.
- [jira] [Created] (NUTCH-1239) Webgraph should remove deleted pages from segment input - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/29 09:49:30 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1238) Fetcher throughput threshold must start before feeder finished - posted by "Markus Jelsma (Updated) (JIRA)" <ji...@apache.org> on 2011/12/29 12:11:30 UTC, 2 replies.
- [jira] [Commented] (NUTCH-1238) Fetcher throughput threshold must start before feeder finished - posted by "Markus Jelsma (Commented) (JIRA)" <ji...@apache.org> on 2011/12/29 12:29:30 UTC, 2 replies.
- [jira] [Resolved] (NUTCH-1238) Fetcher throughput threshold must start before feeder finished - posted by "Markus Jelsma (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/29 15:33:31 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1239) Webgraph should remove deleted pages from segment input - posted by "Markus Jelsma (Updated) (JIRA)" <ji...@apache.org> on 2011/12/29 15:39:30 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1239) Webgraph should remove deleted pages from segment input - posted by "Markus Jelsma (Commented) (JIRA)" <ji...@apache.org> on 2011/12/29 15:41:30 UTC, 0 replies.
- [jira] [Created] (NUTCH-1240) Domain blacklist URL filter - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/29 18:19:30 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1709 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/30 05:12:25 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #116 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/31 05:15:27 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1710 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/31 05:17:16 UTC, 0 replies.