You are viewing a plain text version of this content. The canonical link for it is here.
- Debug Nutch Web Site In Eclipse? - posted by Jason DeMorrow <ja...@gmail.com> on 2010/01/03 09:30:10 UTC, 0 replies.
- [jira] Commented: (NUTCH-658) Add Counter for # of doc fetched in Reporter - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/04 11:39:39 UTC, 0 replies.
- [jira] Commented: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/04 16:57:54 UTC, 0 replies.
- [jira] Resolved: (NUTCH-658) Add Counter for # of doc fetched in Reporter - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/05 11:16:55 UTC, 0 replies.
- [jira] Closed: (NUTCH-658) Add Counter for # of doc fetched in Reporter - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/05 11:16:55 UTC, 0 replies.
- Nutch Developers needed for a Nutch powered search engine - posted by SC Interactive Global Media SRL <va...@interactivegm.com> on 2010/01/05 12:56:44 UTC, 0 replies.
- [jira] Assigned: (NUTCH-655) Injecting Crawl metadata - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/05 20:45:54 UTC, 0 replies.
- [jira] Commented: (NUTCH-655) Injecting Crawl metadata - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/05 20:47:54 UTC, 2 replies.
- [jira] Assigned: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/05 20:47:54 UTC, 0 replies.
- [jira] Assigned: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2 - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/05 20:47:58 UTC, 0 replies.
- [jira] Assigned: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19 - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/05 20:49:54 UTC, 0 replies.
- [jira] Closed: (NUTCH-655) Injecting Crawl metadata - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/06 18:03:54 UTC, 0 replies.
- [jira] Resolved: (NUTCH-655) Injecting Crawl metadata - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/06 18:03:54 UTC, 0 replies.
- [Nutch Wiki] Update of "FAQ" by GodmarBack - posted by Apache Wiki <wi...@apache.org> on 2010/01/07 00:30:46 UTC, 3 replies.
- [jira] Commented: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable - posted by "Godmar Back (JIRA)" <ji...@apache.org> on 2010/01/07 00:48:54 UTC, 0 replies.
- [jira] Commented: (NUTCH-776) Configurable queue depth - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/07 16:35:54 UTC, 1 replies.
- help for hadoop and hbase - posted by wnkdu <em...@gmail.com> on 2010/01/07 18:41:39 UTC, 1 replies.
- [Nutch Wiki] Trivial Update of "PublicServers" by GeoffreyMcCaleb - posted by Apache Wiki <wi...@apache.org> on 2010/01/07 20:22:11 UTC, 0 replies.
- Potential Bug: Index documents with incorrect segment numbers - posted by "igor.k" <ig...@thesearchagency.com> on 2010/01/08 00:06:59 UTC, 0 replies.
- Injecting URLs and define Inlink? - posted by MyD <my...@googlemail.com> on 2010/01/08 04:12:37 UTC, 2 replies.
- Build failed in Hudson: Nutch-trunk #1032 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/01/08 04:51:05 UTC, 0 replies.
- Why rebuild the index for each crawl? - posted by xiao yang <ya...@gmail.com> on 2010/01/08 09:26:12 UTC, 0 replies.
- [jira] Updated: (NUTCH-774) Retry interval in crawl date is set to 0 - posted by "Reinhard Schwab (JIRA)" <ji...@apache.org> on 2010/01/08 10:33:55 UTC, 0 replies.
- Hudson build is back to normal: Nutch-trunk #1033 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/01/08 11:42:00 UTC, 0 replies.
- [jira] Assigned: (NUTCH-269) CrawlDbReducer: OOME because no upper-bound on inlinks count - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/08 12:04:54 UTC, 0 replies.
- [jira] Commented: (NUTCH-269) CrawlDbReducer: OOME because no upper-bound on inlinks count - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/08 13:00:57 UTC, 1 replies.
- [jira] Resolved: (NUTCH-269) CrawlDbReducer: OOME because no upper-bound on inlinks count - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/08 13:02:55 UTC, 0 replies.
- [jira] Created: (NUTCH-778) Running Nutch On linux having whoami exception? - posted by "Prakash Panjwani (JIRA)" <ji...@apache.org> on 2010/01/09 12:17:54 UTC, 0 replies.
- [jira] Closed: (NUTCH-767) Update Tika to v0.5 for the MimeType detection - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/11 11:14:54 UTC, 0 replies.
- Nutch on eclipse ant - posted by dhamu <dh...@gmail.com> on 2010/01/11 14:39:43 UTC, 0 replies.
- [jira] Resolved: (NUTCH-751) Upgrade version of HttpClient - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/11 17:20:54 UTC, 0 replies.
- [Nutch Wiki] Update of "TikaPlugin" by JulienNioche - posted by Apache Wiki <wi...@apache.org> on 2010/01/11 17:34:41 UTC, 1 replies.
- [jira] Commented: (NUTCH-766) Tika parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/01/11 17:48:54 UTC, 10 replies.
- [jira] Commented: (NUTCH-751) Upgrade version of HttpClient - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/01/11 23:24:54 UTC, 0 replies.
- unsubscribe - posted by Ahmad Dahlan <a_...@yahoo.com> on 2010/01/12 01:48:32 UTC, 0 replies.
- [jira] Commented: (NUTCH-767) Update Tika to v0.5 for the MimeType detection - posted by "Hudson (JIRA)" <ji...@apache.org> on 2010/01/12 05:48:54 UTC, 0 replies.
- Re: [jira] Commented: (NUTCH-650) Hbase Integration - posted by xiao yang <ya...@gmail.com> on 2010/01/12 08:43:50 UTC, 2 replies.
- [Nutch Wiki] Update of "RunningNutchAndSolr" by GeoffBentley - posted by Apache Wiki <wi...@apache.org> on 2010/01/18 03:36:44 UTC, 0 replies.
- [jira] Created: (NUTCH-779) Mechanism for passing metadata from parse to crawldb - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/18 17:54:54 UTC, 0 replies.
- [jira] Updated: (NUTCH-779) Mechanism for passing metadata from parse to crawldb - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/18 17:54:54 UTC, 0 replies.
- [jira] Commented: (NUTCH-779) Mechanism for passing metadata from parse to crawldb - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/01/18 18:44:54 UTC, 2 replies.
- Injecting urls and define Inlink - posted by MyD <my...@googlemail.com> on 2010/01/20 02:32:37 UTC, 3 replies.
- Nofollow links on nutch - posted by axi <ax...@gmail.com> on 2010/01/20 16:22:18 UTC, 0 replies.
- Alt text of images as anchor text - posted by axi <ax...@gmail.com> on 2010/01/20 17:16:27 UTC, 4 replies.
- Tried to run Crawl with depth of only 2 and getting IOException - posted by kraman <ki...@gmail.com> on 2010/01/20 20:10:23 UTC, 2 replies.
- Re: [jira] Commented: (NUTCH-779) Mechanism for passing metadata from parse to crawldb - posted by MilleBii <mi...@gmail.com> on 2010/01/20 23:50:25 UTC, 0 replies.
- [jira] Created: (NUTCH-780) Nutch crawler did not read configuration files - posted by "Vu Hoang (JIRA)" <ji...@apache.org> on 2010/01/21 05:02:57 UTC, 0 replies.
- [jira] Updated: (NUTCH-780) Nutch crawler did not read configuration files - posted by "Vu Hoang (JIRA)" <ji...@apache.org> on 2010/01/21 05:02:57 UTC, 4 replies.
- [jira] Commented: (NUTCH-780) Nutch crawler did not read configuration files - posted by "Vu Hoang (JIRA)" <ji...@apache.org> on 2010/01/21 07:30:59 UTC, 1 replies.
- [jira] Issue Comment Edited: (NUTCH-780) Nutch crawler did not read configuration files - posted by "Vu Hoang (JIRA)" <ji...@apache.org> on 2010/01/21 07:33:00 UTC, 2 replies.
- [jira] Updated: (NUTCH-650) Hbase Integration - posted by "Xiao Yang (JIRA)" <ji...@apache.org> on 2010/01/22 08:58:22 UTC, 0 replies.
- [jira] Issue Comment Edited: (NUTCH-650) Hbase Integration - posted by "Xiao Yang (JIRA)" <ji...@apache.org> on 2010/01/22 09:00:22 UTC, 0 replies.
- [jira] Resolved: (NUTCH-778) Running Nutch On linux having whoami exception? - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/22 10:49:21 UTC, 0 replies.
- [jira] Issue Comment Edited: (NUTCH-766) Tika parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/01/22 15:39:21 UTC, 0 replies.
- Re: State of nutchbase - posted by xiao yang <ya...@gmail.com> on 2010/01/23 10:49:55 UTC, 0 replies.
- Java Heap Limit Exceeded - posted by "Withanage, Dulip" <wi...@asia-europe.uni-heidelberg.de> on 2010/01/25 09:32:57 UTC, 0 replies.
- [Nutch Wiki] Update of "FrontPage" by JohnWhelan - posted by Apache Wiki <wi...@apache.org> on 2010/01/26 05:37:33 UTC, 0 replies.
- Page search2.net deleted from Nutch Wiki - posted by Apache Wiki <wi...@apache.org> on 2010/01/27 00:22:31 UTC, 0 replies.
- [jira] Updated: (NUTCH-766) Tika parser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/01/28 13:39:34 UTC, 0 replies.
- [Nutch Wiki] Update of "Support" by OtisGospodnetic - posted by Apache Wiki <wi...@apache.org> on 2010/01/28 19:15:42 UTC, 0 replies.
- [jira] Commented: (NUTCH-775) Enhance Searcher interface - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2010/01/28 19:33:34 UTC, 2 replies.
- Configuration - bad conf file - element not property - posted by kraman <ki...@gmail.com> on 2010/01/29 03:24:44 UTC, 0 replies.
- NativeCodeLoader - unable to load native-hadoop library for your platform - posted by kraman <ki...@gmail.com> on 2010/01/31 18:34:36 UTC, 0 replies.