You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] Created: (NUTCH-781) Update Tika to v0.6 for the MimeType detection - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/01 10:23:51 UTC, 0 replies.
- [jira] Resolved: (NUTCH-781) Update Tika to v0.6 for the MimeType detection - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/01 11:01:50 UTC, 0 replies.
- [jira] Closed: (NUTCH-781) Update Tika to v0.6 for the MimeType detection - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/01 11:01:51 UTC, 0 replies.
- [jira] Updated: (NUTCH-766) Tika parser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/01 11:33:50 UTC, 4 replies.
- [jira] Updated: (NUTCH-782) Ability to order htmlparsefilters - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/01 12:43:50 UTC, 1 replies.
- [jira] Created: (NUTCH-782) Ability to order htmlparsefilters - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/01 12:43:50 UTC, 0 replies.
- [jira] Created: (NUTCH-783) IndexerChecker Utilty - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/01 13:01:52 UTC, 0 replies.
- [jira] Assigned: (NUTCH-783) IndexerChecker Utilty - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/01 13:03:51 UTC, 0 replies.
- [jira] Updated: (NUTCH-783) IndexerChecker Utilty - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/01 13:03:51 UTC, 0 replies.
- [jira] Assigned: (NUTCH-779) Mechanism for passing metadata from parse to crawldb - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/01 13:13:50 UTC, 0 replies.
- [jira] Updated: (NUTCH-779) Mechanism for passing metadata from parse to crawldb - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/01 13:57:50 UTC, 0 replies.
- [jira] Created: (NUTCH-784) CrawlDBScanner - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/01 15:33:51 UTC, 0 replies.
- [jira] Updated: (NUTCH-784) CrawlDBScanner - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/01 15:33:51 UTC, 0 replies.
- [jira] Created: (NUTCH-785) Fetcher : copy metadata from origin URL when redirecting + call scfilters.initialScore on newly created URL - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/01 17:52:18 UTC, 0 replies.
- [jira] Updated: (NUTCH-785) Fetcher : copy metadata from origin URL when redirecting + call scfilters.initialScore on newly created URL - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/01 17:54:19 UTC, 0 replies.
- [jira] Resolved: (NUTCH-775) Enhance Searcher interface - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2010/02/01 21:50:18 UTC, 0 replies.
- [jira] Commented: (NUTCH-781) Update Tika to v0.6 for the MimeType detection - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2010/02/01 22:28:18 UTC, 4 replies.
- [jira] Commented: (NUTCH-775) Enhance Searcher interface - posted by "Hudson (JIRA)" <ji...@apache.org> on 2010/02/02 06:58:19 UTC, 0 replies.
- Logging to the terminal - posted by Santiago Pérez <el...@gmail.com> on 2010/02/02 18:24:07 UTC, 0 replies.
- [jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again - posted by "Serykh Evgeniy (JIRA)" <ji...@apache.org> on 2010/02/04 06:42:28 UTC, 2 replies.
- [jira] Commented: (NUTCH-673) Upgrade the Carrot2 plug-in to release 3.0 - posted by "Dawid Weiss (JIRA)" <ji...@apache.org> on 2010/02/05 11:53:28 UTC, 3 replies.
- [jira] Created: (NUTCH-786) Better list of suffix domains - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/05 12:49:27 UTC, 0 replies.
- [jira] Updated: (NUTCH-786) Better list of suffix domains - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/05 12:51:28 UTC, 0 replies.
- [jira] Closed: (NUTCH-786) Better list of suffix domains - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/05 12:53:27 UTC, 0 replies.
- [jira] Created: (NUTCH-787) Upgrade Lucene to 3.0.0. - posted by "Dawid Weiss (JIRA)" <ji...@apache.org> on 2010/02/05 13:09:27 UTC, 0 replies.
- [jira] Commented: (NUTCH-787) Upgrade Lucene to 3.0.0. - posted by "Dawid Weiss (JIRA)" <ji...@apache.org> on 2010/02/05 13:37:27 UTC, 3 replies.
- [jira] Commented: (NUTCH-786) Better list of suffix domains - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/02/05 15:06:27 UTC, 0 replies.
- [jira] Updated: (NUTCH-787) Upgrade Lucene to 3.0.0. - posted by "Dawid Weiss (JIRA)" <ji...@apache.org> on 2010/02/06 19:05:27 UTC, 3 replies.
- plugin dev trouble - posted by Sahil Shah <sa...@gmail.com> on 2010/02/07 13:31:06 UTC, 0 replies.
- Hudson build is back to normal : Nutch-trunk #1062 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/02/08 06:33:15 UTC, 0 replies.
- example for crawl a url - posted by Esteve Schouten <es...@ibit.org> on 2010/02/08 13:09:59 UTC, 0 replies.
- Spill failed - posted by Santiago Pérez <el...@gmail.com> on 2010/02/10 09:41:33 UTC, 2 replies.
- [jira] Commented: (NUTCH-766) Tika parser - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/02/10 23:26:31 UTC, 11 replies.
- [jira] Created: (NUTCH-788) search.jsp typo causing fail - posted by "Sammy Yu (JIRA)" <ji...@apache.org> on 2010/02/11 06:40:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-788) search.jsp typo causing fail - posted by "Sammy Yu (JIRA)" <ji...@apache.org> on 2010/02/11 06:42:28 UTC, 0 replies.
- [jira] Updated: (NUTCH-788) search.jsp typo causing searches to fail - posted by "Sammy Yu (JIRA)" <ji...@apache.org> on 2010/02/11 06:42:28 UTC, 1 replies.
- [jira] Issue Comment Edited: (NUTCH-766) Tika parser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/11 18:22:30 UTC, 0 replies.
- Compile and Build individual plugins - posted by Sahil Shah <sa...@gmail.com> on 2010/02/12 04:57:57 UTC, 0 replies.
- [jira] Resolved: (NUTCH-766) Tika parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/02/12 07:53:28 UTC, 0 replies.
- [jira] Created: (NUTCH-789) Improvements to Tika parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/02/12 07:55:27 UTC, 0 replies.
- [jira] Updated: (NUTCH-789) Improvements to Tika parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/02/12 07:57:28 UTC, 0 replies.
- [jira] Commented: (NUTCH-762) Alternative Generator which can generate several segments in one parse of the crawlDB - posted by "Jesse Hires (JIRA)" <ji...@apache.org> on 2010/02/13 07:27:28 UTC, 0 replies.
- [jira] Created: (NUTCH-790) Some external javadoc links are broken - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2010/02/14 17:41:27 UTC, 0 replies.
- [jira] Updated: (NUTCH-790) Some external javadoc links are broken - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2010/02/14 17:45:27 UTC, 0 replies.
- [jira] Created: (NUTCH-791) External links for published javadocs are partially broken - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2010/02/14 17:49:27 UTC, 0 replies.
- [jira] Commented: (NUTCH-790) Some external javadoc links are broken - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/02/14 17:57:27 UTC, 1 replies.
- [jira] Resolved: (NUTCH-790) Some external javadoc links are broken - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2010/02/14 18:03:28 UTC, 0 replies.
- [jira] Updated: (NUTCH-792) Nutch version still contains 1.0 - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2010/02/14 18:11:27 UTC, 0 replies.
- [jira] Created: (NUTCH-792) Nutch version still contains 1.0 - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2010/02/14 18:11:27 UTC, 0 replies.
- [jira] Resolved: (NUTCH-792) Nutch version still contains 1.0 - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2010/02/14 18:15:27 UTC, 0 replies.
- exception in search.jsp - posted by Jesse Hires <jh...@gmail.com> on 2010/02/14 23:18:29 UTC, 1 replies.
- [jira] Commented: (NUTCH-792) Nutch version still contains 1.0 - posted by "Hudson (JIRA)" <ji...@apache.org> on 2010/02/15 06:53:27 UTC, 0 replies.
- [jira] Created: (NUTCH-793) search.jsp compile errors - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2010/02/15 09:09:27 UTC, 0 replies.
- [jira] Resolved: (NUTCH-793) search.jsp compile errors - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2010/02/15 09:11:31 UTC, 0 replies.
- [jira] Resolved: (NUTCH-788) search.jsp typo causing searches to fail - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2010/02/15 09:19:27 UTC, 0 replies.
- [jira] Commented: (NUTCH-789) Improvements to Tika parser - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2010/02/15 09:23:28 UTC, 0 replies.
- [jira] Closed: (NUTCH-766) Tika parser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/15 10:45:27 UTC, 0 replies.
- Trying to Add an new NutchDoc from plugin - posted by UDd <de...@gmail.com> on 2010/02/15 19:40:44 UTC, 2 replies.
- Build failed in Hudson: Nutch-trunk #1070 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/02/16 06:59:50 UTC, 0 replies.
- [jira] Created: (NUTCH-794) Tika parser does not keep attributes on html tag - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/16 10:32:27 UTC, 0 replies.
- [jira] Updated: (NUTCH-794) Tika parser does identify lang attributes on html tag - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/16 10:59:27 UTC, 1 replies.
- [jira] Commented: (NUTCH-794) Tika parser does identify lang attributes on html tag - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/16 11:13:27 UTC, 0 replies.
- [jira] Work started: (NUTCH-794) Language Identification must use check the parse metadata for language values - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/16 11:21:28 UTC, 0 replies.
- [jira] Updated: (NUTCH-794) Language Identification must use check the parse metadata for language values - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/16 11:21:28 UTC, 1 replies.
- [jira] Commented: (NUTCH-794) Language Identification must use check the parse metadata for language values - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/16 11:21:28 UTC, 1 replies.
- [jira] Updated: (NUTCH-750) HtmlParser plugin - page title extraction - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/16 14:03:28 UTC, 0 replies.
- Hudson build is back to normal : Nutch-trunk #1071 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/02/17 07:15:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-793) search.jsp compile errors - posted by "Hudson (JIRA)" <ji...@apache.org> on 2010/02/17 07:16:28 UTC, 0 replies.
- [jira] Resolved: (NUTCH-705) parse-rtf plugin - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/18 11:49:29 UTC, 0 replies.
- [jira] Resolved: (NUTCH-644) RTF parser doesn't compile anymore - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/18 11:51:27 UTC, 0 replies.
- [jira] Commented: (NUTCH-788) search.jsp typo causing searches to fail - posted by "Sammy Yu (JIRA)" <ji...@apache.org> on 2010/02/18 18:50:28 UTC, 0 replies.
- [jira] Created: (NUTCH-795) Add ability to maintain nofollow attribute in linkdb - posted by "Sammy Yu (JIRA)" <ji...@apache.org> on 2010/02/18 20:48:27 UTC, 0 replies.
- [jira] Updated: (NUTCH-795) Add ability to maintain nofollow attribute in linkdb - posted by "Sammy Yu (JIRA)" <ji...@apache.org> on 2010/02/18 20:52:27 UTC, 0 replies.
- need advice trouble shooting zero results problem - posted by Jesse Hires <jh...@gmail.com> on 2010/02/19 04:24:52 UTC, 1 replies.
- [jira] Closed: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2 - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/19 19:52:28 UTC, 0 replies.
- [jira] Resolved: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2 - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/19 19:52:28 UTC, 0 replies.
- [Nutch Wiki] Update of "RunNutchInEclipse1.0" by maqboolzee - posted by Apache Wiki <wi...@apache.org> on 2010/02/20 00:21:55 UTC, 2 replies.
- [jira] Created: (NUTCH-796) Zero results problems difficult to troubleshoot due to lack of logging - posted by "Jesse Hires (JIRA)" <ji...@apache.org> on 2010/02/20 01:36:27 UTC, 0 replies.
- [jira] Commented: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2 - posted by "Hudson (JIRA)" <ji...@apache.org> on 2010/02/20 07:21:28 UTC, 2 replies.
- How to get similar logging output from tomcat6 and bin/nutch? - posted by Hannu Väisänen <Ha...@uef.fi> on 2010/02/22 11:58:18 UTC, 0 replies.
- please provide solution for nutch crawl for rss feeds - posted by Purnima Balu <pu...@gmail.com> on 2010/02/24 02:18:48 UTC, 0 replies.
- [jira] Updated: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" - posted by "Robert Hohman (JIRA)" <ji...@apache.org> on 2010/02/25 21:48:27 UTC, 1 replies.
- [jira] Created: (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a "?" - posted by "Robert Hohman (JIRA)" <ji...@apache.org> on 2010/02/25 21:48:27 UTC, 0 replies.
- New attachment added to page Evaluations on Nutch Wiki - posted by Apache Wiki <wi...@apache.org> on 2010/02/26 15:45:31 UTC, 0 replies.
- [Nutch Wiki] Update of "Evaluations" by IvanKelly - posted by Apache Wiki <wi...@apache.org> on 2010/02/26 15:48:45 UTC, 0 replies.
- [jira] Created: (NUTCH-798) Upgrade to SOLR1.4 - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/26 16:23:28 UTC, 0 replies.
- Hudson build is back to normal : Nutch-trunk #1080 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/02/27 22:15:23 UTC, 0 replies.