You are viewing a plain text version of this content. The canonical link for it is here.
- Hadoop Get Together @ Berlin - posted by Isabel Drost <is...@apache.org> on 2009/02/02 07:51:12 UTC, 0 replies.
- Re: Release 1.0? - posted by Marko Bauhardt <mb...@101tec.com> on 2009/02/02 15:25:04 UTC, 11 replies.
- [jira] Assigned: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/02/02 18:08:00 UTC, 0 replies.
- [jira] Work started: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/02/02 18:09:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-656) DeleteDuplicates based on crawlDB only - posted by "julien nioche (JIRA)" <ji...@apache.org> on 2009/02/03 11:39:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/03 14:18:00 UTC, 0 replies.
- [jira] Commented: (NUTCH-518) Fix OpicScoringFilter to respect scoring filter chaining - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/03 14:18:00 UTC, 0 replies.
- [jira] Commented: (NUTCH-353) pages that serverside forwards will be refetched every time - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/03 14:19:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-353) pages that serverside forwards will be refetched every time - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/03 14:19:59 UTC, 0 replies.
- [jira] Updated: (NUTCH-558) Need tool to retrieve domain statistics - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/03 14:25:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-558) Need tool to retrieve domain statistics - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/03 14:25:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-279) Additions for regex-normalize - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/03 16:17:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-279) Additions for regex-normalize - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/03 16:17:59 UTC, 1 replies.
- [jira] Commented: (NUTCH-92) DistributedSearch incorrectly scores results - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/03 16:31:59 UTC, 0 replies.
- [jira] Updated: (NUTCH-92) DistributedSearch incorrectly scores results - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/03 16:31:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-671) JSP errors in Nutch searcher webapp running with Tomcat 6 - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/03 16:46:00 UTC, 1 replies.
- [jira] Closed: (NUTCH-671) JSP errors in Nutch searcher webapp running with Tomcat 6 - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/03 16:46:00 UTC, 0 replies.
- [jira] Created: (NUTCH-685) Content-level redirect status lost in ParseSegment - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 11:11:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:13:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:13:59 UTC, 1 replies.
- [jira] Closed: (NUTCH-636) Http client plug-in https doesn't work on IBM JRE - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:17:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-636) Http client plug-in https doesn't work on IBM JRE - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:20:00 UTC, 1 replies.
- [jira] Updated: (NUTCH-251) Administration GUI - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:21:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-251) Administration GUI - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:21:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-563) Include custom fields in BasicQueryFilter - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:29:59 UTC, 1 replies.
- [jira] Commented: (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9 - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:31:59 UTC, 0 replies.
- [jira] Updated: (NUTCH-673) Upgrade the Carrot2 plug-in to release 3.0 - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:35:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-683) NUTCH-676 broke CrawlDbMerger - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:35:59 UTC, 1 replies.
- [jira] Commented: (NUTCH-673) Upgrade the Carrot2 plug-in to release 3.0 - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:35:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-261) Multi Language Support - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:43:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-261) Multi Language Support - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:43:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-357) crawling simulation - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:45:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-357) crawling simulation - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:45:59 UTC, 0 replies.
- [jira] Updated: (NUTCH-455) dedup on tokenized fields is faulty - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:52:04 UTC, 0 replies.
- [jira] Commented: (NUTCH-455) dedup on tokenized fields is faulty - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 14:52:04 UTC, 0 replies.
- [jira] Closed: (NUTCH-262) Summary excerpts and highlights problems - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 15:13:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-262) Summary excerpts and highlights problems - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 15:13:59 UTC, 0 replies.
- [jira] Updated: (NUTCH-479) Support for OR queries - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 15:14:00 UTC, 0 replies.
- [jira] Commented: (NUTCH-479) Support for OR queries - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 15:14:00 UTC, 0 replies.
- [jira] Commented: (NUTCH-74) French Analyzer Plugin - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 15:17:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-74) French Analyzer Plugin - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/06 15:17:59 UTC, 0 replies.
- Re: writing plugin - posted by Techie <ar...@gmail.com> on 2009/02/08 13:28:04 UTC, 0 replies.
- RPC timeout - posted by 程越强 <st...@gmail.com> on 2009/02/10 03:53:34 UTC, 0 replies.
- [jira] Updated: (NUTCH-686) Russian Analysis Plugin - posted by "OpenTeam.ru (JIRA)" <ji...@apache.org> on 2009/02/10 06:20:59 UTC, 0 replies.
- [jira] Created: (NUTCH-686) Russian Analysis Plugin - posted by "OpenTeam.ru (JIRA)" <ji...@apache.org> on 2009/02/10 06:20:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-686) Russian Analysis Plugin - posted by "OpenTeam.ru (JIRA)" <ji...@apache.org> on 2009/02/10 06:30:59 UTC, 0 replies.
- [jira] Updated: (NUTCH-563) Include custom fields in BasicQueryFilter - posted by "julien nioche (JIRA)" <ji...@apache.org> on 2009/02/10 11:33:05 UTC, 0 replies.
- [Nutch Wiki] Update of "RunNutchInEclipse0.9" by FrankMcCown - posted by Apache Wiki <wi...@apache.org> on 2009/02/10 18:16:49 UTC, 1 replies.
- [jira] Closed: (NUTCH-683) NUTCH-676 broke CrawlDbMerger - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/02/11 10:14:59 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "RunNutchInEclipse0.9" by FrankMcCown - posted by Apache Wiki <wi...@apache.org> on 2009/02/11 19:04:53 UTC, 0 replies.
- [Nutch Wiki] Update of "GettingNutchRunningWithWindows" by FrankMcCown - posted by Apache Wiki <wi...@apache.org> on 2009/02/11 19:25:40 UTC, 0 replies.
- [jira] Commented: (NUTCH-676) MapWritable is written inefficiently and confusingly - posted by "Hudson (JIRA)" <ji...@apache.org> on 2009/02/12 05:13:59 UTC, 0 replies.
- [Nutch Wiki] Update of "IntranetRecrawl" by SAnand - posted by Apache Wiki <wi...@apache.org> on 2009/02/12 13:39:52 UTC, 0 replies.
- [jira] Commented: (NUTCH-668) Domain URL Filter - posted by "julien nioche (JIRA)" <ji...@apache.org> on 2009/02/12 17:23:59 UTC, 0 replies.
- NTCH-635 LinkAnalysis Tool for Nutch - posted by "Eric J. Christeson" <Er...@ndsu.edu> on 2009/02/13 01:05:00 UTC, 1 replies.
- Support for Sitemap Protocol and Canonical URLs - posted by Frank McCown <fm...@harding.edu> on 2009/02/16 18:28:55 UTC, 1 replies.
- Build failed in Hudson: Nutch-trunk #727 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/02/17 05:02:57 UTC, 0 replies.
- [jira] Updated: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/17 14:05:02 UTC, 0 replies.
- [jira] Created: (NUTCH-687) Add RAT - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/17 15:01:00 UTC, 0 replies.
- [jira] Updated: (NUTCH-687) Add RAT - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/17 15:01:01 UTC, 0 replies.
- [jira] Commented: (NUTCH-688) Fix missing/wrong headers in source files - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/17 15:05:00 UTC, 1 replies.
- [jira] Created: (NUTCH-688) Fix missing/wrong headers in source files - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/17 15:05:00 UTC, 0 replies.
- [jira] Commented: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/02/17 15:16:59 UTC, 1 replies.
- [jira] Resolved: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/17 15:31:02 UTC, 0 replies.
- [jira] Commented: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/17 15:37:03 UTC, 0 replies.
- [jira] Resolved: (NUTCH-582) Add missing type parameters - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/17 19:45:00 UTC, 0 replies.
- [jira] Updated: (NUTCH-86) LanguageIdentifier API enhancements - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/17 20:03:00 UTC, 0 replies.
- [jira] Updated: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s) - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/17 20:04:59 UTC, 0 replies.
- [jira] Updated: (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9 - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/17 20:06:59 UTC, 0 replies.
- [jira] Updated: (NUTCH-309) Uses commons logging Code Guards - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/17 20:06:59 UTC, 0 replies.
- [jira] Updated: (NUTCH-249) black- white list url filtering - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/17 20:40:59 UTC, 0 replies.
- [jira] Updated: (NUTCH-310) Review Log Levels - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/17 20:40:59 UTC, 0 replies.
- [jira] Created: (NUTCH-689) Swf parser doesn't seem to handle relative links - posted by "Peter Sparks (JIRA)" <ji...@apache.org> on 2009/02/17 21:54:59 UTC, 0 replies.
- [jira] Updated: (NUTCH-689) Swf parser doesn't seem to handle relative links - posted by "Peter Sparks (JIRA)" <ji...@apache.org> on 2009/02/17 21:58:59 UTC, 2 replies.
- [jira] Created: (NUTCH-690) bug in DomContentUtils.shouldThrowAwayLink? - posted by "Peter Sparks (JIRA)" <ji...@apache.org> on 2009/02/17 22:08:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-689) Swf parser doesn't seem to handle relative links - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/17 22:14:59 UTC, 2 replies.
- Hudson build is back to normal: Nutch-trunk #728 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/02/18 05:14:57 UTC, 0 replies.
- [jira] Created: (NUTCH-691) Update jakarta poi jars to the most relevant version - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/18 05:32:01 UTC, 0 replies.
- [jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/18 05:36:01 UTC, 5 replies.
- [jira] Commented: (NUTCH-691) Update jakarta poi jars to the most relevant version - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/18 06:35:02 UTC, 1 replies.
- [jira] Issue Comment Edited: (NUTCH-691) Update jakarta poi jars to the most relevant version - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/18 06:41:01 UTC, 0 replies.
- [jira] Commented: (NUTCH-591) StringIndexOutOfBoundsException when extracting text from a Word document. - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/18 06:59:01 UTC, 0 replies.
- [jira] Resolved: (NUTCH-687) Add RAT - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/18 09:13:01 UTC, 0 replies.
- [jira] Resolved: (NUTCH-591) StringIndexOutOfBoundsException when extracting text from a Word document. - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/18 09:39:02 UTC, 0 replies.
- [jira] Resolved: (NUTCH-688) Fix missing/wrong headers in source files - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/18 10:17:01 UTC, 0 replies.
- [jira] Created: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19 - posted by "julien nioche (JIRA)" <ji...@apache.org> on 2009/02/18 13:31:04 UTC, 0 replies.
- [jira] Resolved: (NUTCH-691) Update jakarta poi jars to the most relevant version - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/18 13:45:02 UTC, 0 replies.
- [jira] Resolved: (NUTCH-563) Include custom fields in BasicQueryFilter - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/18 13:55:02 UTC, 0 replies.
- [jira] Commented: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19 - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/18 14:07:02 UTC, 2 replies.
- [jira] Updated: (NUTCH-583) FeedParser empty links for items - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/18 14:47:04 UTC, 0 replies.
- dump Fetcher? - posted by Sami Siren <ss...@gmail.com> on 2009/02/18 14:58:56 UTC, 0 replies.
- would someone help confirm a patch (fix incorrect encoding detection in cached.jsp) - posted by Justin Yao <ju...@snooth.com> on 2009/02/18 19:55:53 UTC, 1 replies.
- [jira] Created: (NUTCH-693) Add configurable option for treating nofollow behaviour. - posted by "Andrew McCall (JIRA)" <ji...@apache.org> on 2009/02/18 22:00:02 UTC, 0 replies.
- [jira] Updated: (NUTCH-693) Add configurable option for treating nofollow behaviour. - posted by "Andrew McCall (JIRA)" <ji...@apache.org> on 2009/02/18 22:00:03 UTC, 0 replies.
- [jira] Commented: (NUTCH-687) Add RAT - posted by "Hudson (JIRA)" <ji...@apache.org> on 2009/02/19 05:17:02 UTC, 0 replies.
- [jira] Created: (NUTCH-694) Distributed Search Server fails - posted by "Dr. Nadine Hochstotter (JIRA)" <ji...@apache.org> on 2009/02/19 09:39:01 UTC, 0 replies.
- [jira] Updated: (NUTCH-694) Distributed Search Server fails - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/19 09:45:01 UTC, 2 replies.
- [jira] Created: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/19 11:05:01 UTC, 0 replies.
- [jira] Updated: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/19 11:11:01 UTC, 3 replies.
- [jira] Issue Comment Edited: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/19 11:17:02 UTC, 1 replies.
- [jira] Resolved: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/19 11:28:02 UTC, 0 replies.
- [jira] Commented: (NUTCH-695) incorrect mime type detection by MoreIndexingFilter plugin - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/19 11:30:01 UTC, 1 replies.
- [jira] Commented: (NUTCH-694) Distributed Search Server fails - posted by "Dr. Nadine Hochstotter (JIRA)" <ji...@apache.org> on 2009/02/19 11:52:01 UTC, 5 replies.
- [jira] Commented: (NUTCH-650) Hbase Integration - posted by "Andrew McCall (JIRA)" <ji...@apache.org> on 2009/02/19 14:54:01 UTC, 0 replies.
- [jira] Updated: (NUTCH-650) Hbase Integration - posted by "Andrew McCall (JIRA)" <ji...@apache.org> on 2009/02/19 14:54:01 UTC, 2 replies.
- [jira] Created: (NUTCH-696) Timeout for Parser - posted by "julien nioche (JIRA)" <ji...@apache.org> on 2009/02/19 17:58:01 UTC, 0 replies.
- [jira] Commented: (NUTCH-696) Timeout for Parser - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/02/20 00:22:02 UTC, 1 replies.
- [jira] Commented: (NUTCH-684) Dedup support for Solr - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/20 05:10:03 UTC, 5 replies.
- [jira] Updated: (NUTCH-684) Dedup support for Solr - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/20 05:10:03 UTC, 3 replies.
- [jira] Issue Comment Edited: (NUTCH-684) Dedup support for Solr - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/20 07:41:02 UTC, 1 replies.
- [jira] Updated: (NUTCH-697) Generate log output for solr indexer and dedup - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/20 09:11:06 UTC, 0 replies.
- [jira] Created: (NUTCH-697) Generate log output for solr indexer and dedup - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/20 09:11:06 UTC, 0 replies.
- [jira] Created: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/02/20 09:55:01 UTC, 0 replies.
- [jira] Updated: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/02/20 09:55:02 UTC, 2 replies.
- [Nutch Wiki] Update of "RunningNutchAndSolr" by SamiSiren - posted by Apache Wiki <wi...@apache.org> on 2009/02/20 09:56:52 UTC, 0 replies.
- [Nutch Wiki] Update of "InstallingWeb2" by SamiSiren - posted by Apache Wiki <wi...@apache.org> on 2009/02/20 10:01:45 UTC, 2 replies.
- [jira] Updated: (NUTCH-573) Multiple Domains - Query Search - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/20 10:39:02 UTC, 0 replies.
- [jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/20 10:41:02 UTC, 0 replies.
- [jira] Updated: (NUTCH-247) robot parser to restrict. - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/20 10:43:01 UTC, 0 replies.
- [jira] Updated: (NUTCH-477) Extend URLFilters to support different filtering chains - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/20 10:43:01 UTC, 1 replies.
- [jira] Commented: (NUTCH-477) Extend URLFilters to support different filtering chains - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/20 11:17:02 UTC, 2 replies.
- [jira] Created: (NUTCH-699) Add an "official" solr schema for solr integration - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/02/20 11:33:01 UTC, 0 replies.
- [jira] Commented: (NUTCH-699) Add an "official" solr schema for solr integration - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/02/20 11:35:04 UTC, 4 replies.
- [jira] Created: (NUTCH-700) Neko1.9.11 goes into a loop - posted by "julien nioche (JIRA)" <ji...@apache.org> on 2009/02/20 11:45:01 UTC, 0 replies.
- [jira] Commented: (NUTCH-700) Neko1.9.11 goes into a loop - posted by "julien nioche (JIRA)" <ji...@apache.org> on 2009/02/20 12:35:01 UTC, 0 replies.
- [jira] Resolved: (NUTCH-694) Distributed Search Server fails - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/23 08:05:02 UTC, 0 replies.
- [jira] Resolved: (NUTCH-626) fetcher2 breaks out the domain with db.ignore.external.links set at cross domain redirects - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/24 10:20:02 UTC, 0 replies.
- [jira] Commented: (NUTCH-644) RTF parser doesn't compile anymore - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/24 10:34:03 UTC, 1 replies.
- [jira] Updated: (NUTCH-644) RTF parser doesn't compile anymore - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/24 10:38:01 UTC, 1 replies.
- [jira] Resolved: (NUTCH-247) robot parser to restrict. - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/24 10:56:01 UTC, 0 replies.
- [jira] Created: (NUTCH-701) replace Fetcher with Fetcher2 - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/24 11:08:01 UTC, 0 replies.
- [jira] Updated: (NUTCH-701) Replace Fetcher with Fetcher2 - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/24 11:08:02 UTC, 0 replies.
- [jira] Resolved: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/24 11:12:01 UTC, 0 replies.
- [jira] Commented: (NUTCH-701) Replace Fetcher with Fetcher2 - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/24 11:48:01 UTC, 0 replies.
- [jira] Resolved: (NUTCH-701) Replace Fetcher with Fetcher2 - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/24 11:56:01 UTC, 0 replies.
- [jira] Updated: (NUTCH-669) Consolidate code for Fetcher and Fetcher2 - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/24 12:04:01 UTC, 0 replies.
- NutchAnalysis.java STOP_WORDS not configurable? - posted by Bartosz Gadzimski <ba...@o2.pl> on 2009/02/24 14:28:37 UTC, 1 replies.
- [jira] Commented: (NUTCH-698) CrawlDb is corrupted after a few crawl cycles - posted by "Hudson (JIRA)" <ji...@apache.org> on 2009/02/25 05:17:02 UTC, 0 replies.
- [jira] Commented: (NUTCH-247) robot parser to restrict. - posted by "Hudson (JIRA)" <ji...@apache.org> on 2009/02/25 05:17:02 UTC, 0 replies.
- [jira] Commented: (NUTCH-626) fetcher2 breaks out the domain with db.ignore.external.links set at cross domain redirects - posted by "Hudson (JIRA)" <ji...@apache.org> on 2009/02/25 05:17:02 UTC, 0 replies.
- Is there the functions of "More Like This" and "Spell Checking"? - posted by buddha1021 <bu...@yahoo.cn> on 2009/02/25 08:18:10 UTC, 0 replies.
- [jira] Created: (NUTCH-702) Lazy Instanciation of Metadata in CrawlDatum - posted by "julien nioche (JIRA)" <ji...@apache.org> on 2009/02/25 14:07:02 UTC, 0 replies.
- [jira] Updated: (NUTCH-702) Lazy Instanciation of Metadata in CrawlDatum - posted by "julien nioche (JIRA)" <ji...@apache.org> on 2009/02/25 14:27:02 UTC, 2 replies.
- [jira] Created: (NUTCH-703) Upgrade to Hadoop 0.19.1 - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/25 17:47:01 UTC, 0 replies.
- [jira] Created: (NUTCH-704) ensure that more important pages are crawled first - posted by "kr (JIRA)" <ji...@apache.org> on 2009/02/26 07:51:04 UTC, 0 replies.
- [jira] Closed: (NUTCH-704) ensure that more important pages are crawled first - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/26 09:29:01 UTC, 0 replies.
- [Nutch Wiki] Update of "DownloadingNutch" by BartoszGadzimski - posted by Apache Wiki <wi...@apache.org> on 2009/02/26 18:46:12 UTC, 0 replies.
- [Nutch Wiki] Update of "SimpleMapReduceTutorial" by BartoszGadzimski - posted by Apache Wiki <wi...@apache.org> on 2009/02/26 18:57:29 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "FrontPage" by BartoszGadzimski - posted by Apache Wiki <wi...@apache.org> on 2009/02/26 19:03:25 UTC, 0 replies.
- [jira] Commented: (NUTCH-705) parse-rtf plugin - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/27 05:18:01 UTC, 1 replies.
- [jira] Created: (NUTCH-705) parse-rtf plugin - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/27 05:18:01 UTC, 0 replies.
- [jira] Updated: (NUTCH-705) parse-rtf plugin - posted by "Dmitry Lihachev (JIRA)" <ji...@apache.org> on 2009/02/27 05:30:01 UTC, 1 replies.
- [jira] Commented: (NUTCH-185) XMLParser is configurable xml parser plugin. - posted by "Gopikrishnan (JIRA)" <ji...@apache.org> on 2009/02/27 07:12:02 UTC, 0 replies.
- [jira] Resolved: (NUTCH-699) Add an "official" solr schema for solr integration - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/27 07:22:01 UTC, 0 replies.
- [jira] Commented: (NUTCH-703) Upgrade to Hadoop 0.19.1 - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/27 07:24:02 UTC, 2 replies.
- [jira] Assigned: (NUTCH-669) Consolidate code for Fetcher and Fetcher2 - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/02/27 07:24:02 UTC, 0 replies.
- Url regex normalizer - posted by Meghna Kukreja <om...@gmail.com> on 2009/02/27 17:32:58 UTC, 3 replies.
- [jira] Created: (NUTCH-706) Url regex normalizer - posted by "Meghna Kukreja (JIRA)" <ji...@apache.org> on 2009/02/27 19:47:13 UTC, 0 replies.
- [jira] Commented: (NUTCH-706) Url regex normalizer - posted by "Meghna Kukreja (JIRA)" <ji...@apache.org> on 2009/02/27 19:49:12 UTC, 0 replies.
- [jira] Closed: (NUTCH-703) Upgrade to Hadoop 0.19.1 - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/02/27 19:55:16 UTC, 0 replies.
- planning for nutch-1.0-rc1 - posted by Sami Siren <ss...@gmail.com> on 2009/02/28 09:26:10 UTC, 2 replies.
- [jira] Created: (NUTCH-707) Generation of multiple segments in multiple runs returns only 1 segment - posted by "Michael Chan (JIRA)" <ji...@apache.org> on 2009/02/28 18:42:12 UTC, 0 replies.
- [jira] Updated: (NUTCH-707) Generation of multiple segments in multiple runs returns only 1 segment - posted by "Michael Chan (JIRA)" <ji...@apache.org> on 2009/02/28 18:44:12 UTC, 1 replies.
- [jira] Commented: (NUTCH-419) unavailable robots.txt kills fetch - posted by "Doug Cook (JIRA)" <ji...@apache.org> on 2009/02/28 20:06:12 UTC, 0 replies.
- [jira] Updated: (NUTCH-419) unavailable robots.txt kills fetch - posted by "Doug Cook (JIRA)" <ji...@apache.org> on 2009/02/28 20:20:12 UTC, 0 replies.