- [Nutch Wiki] Update of "首页" by tuanzhang - posted by Apache Wiki <wi...@apache.org> on 2010/07/01 02:24:04 UTC, 0 replies.
- [jira] Commented: (NUTCH-834) Separate the Nutch web site from trunk - posted by "Hudson (JIRA)" <ji...@apache.org> on 2010/07/01 07:45:51 UTC, 0 replies.
- 首页 reverted to revision 5 on Nutch Wiki - posted by Apache Wiki <wi...@apache.org> on 2010/07/01 08:12:21 UTC, 0 replies.
- [jira] Commented: (NUTCH-836) Remove deprecated parse plugins - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/01 13:38:50 UTC, 3 replies.
- [jira] Commented: (NUTCH-835) document deduplication (exact duplicates) failed using MD5Signature - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/01 14:06:50 UTC, 3 replies.
- [jira] Resolved: (NUTCH-835) document deduplication (exact duplicates) failed using MD5Signature - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/01 14:12:51 UTC, 0 replies.
- [jira] Updated: (NUTCH-835) document deduplication (exact duplicates) failed using MD5Signature - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/01 14:22:50 UTC, 0 replies.
- [jira] Assigned: (NUTCH-837) Remove search servers and Lucene dependencies - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/01 15:47:50 UTC, 0 replies.
- [jira] Updated: (NUTCH-837) Remove search servers and Lucene dependencies - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/01 16:12:50 UTC, 2 replies.
- [jira] Commented: (NUTCH-837) Remove search servers and Lucene dependencies - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/01 16:31:52 UTC, 11 replies.
- [jira] Work started: (NUTCH-838) Add timing information to all Tool classes - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/01 16:34:49 UTC, 0 replies.
- [jira] Updated: (NUTCH-831) Allow configuration of how fields crawled by Nutch are stored / indexed / tokenized - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/01 16:34:57 UTC, 0 replies.
- [jira] Commented: (NUTCH-831) Allow configuration of how fields crawled by Nutch are stored / indexed / tokenized - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/01 16:48:52 UTC, 1 reply.
- [jira] Created: (NUTCH-839) nutch doesnt run under 0.20.2+228-1~karmic-cdh3b1 version of hadoop - posted by "Robert Gonzalez (JIRA)" <ji...@apache.org> on 2010/07/01 21:28:49 UTC, 0 replies.
- [Nutchbase] WebPage class is a generated code? - posted by Andrzej Bialecki <ab...@getopt.org> on 2010/07/02 12:18:59 UTC, 5 replies.
- [jira] Created: (NUTCH-840) Port tests from parse-html to parse-tika - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/02 12:27:50 UTC, 0 replies.
- Nutch 2.0 : Design issue - posted by Julien Nioche <li...@gmail.com> on 2010/07/02 12:42:20 UTC, 2 replies.
- [jira] Closed: (NUTCH-836) Remove deprecated parse plugins - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/02 12:55:50 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "PluginCentral" by AlexMc - posted by Apache Wiki <wi...@apache.org> on 2010/07/02 14:23:16 UTC, 2 replies.
- [jira] Updated: (NUTCH-840) Port tests from parse-html to parse-tika - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/02 15:20:49 UTC, 0 replies.
- [jira] Issue Comment Edited: (NUTCH-837) Remove search servers and Lucene dependencies - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/02 17:56:50 UTC, 0 replies.
- [jira] Created: (NUTCH-841) Nutch 2.0 webapp - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/02 17:56:51 UTC, 0 replies.
- [jira] Resolved: (NUTCH-837) Remove search servers and Lucene dependencies - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/02 19:30:50 UTC, 0 replies.
- [Nutch Wiki] Update of "WritingPluginExample-0.9" by Ramprasad Ramachandran - posted by Apache Wiki <wi...@apache.org> on 2010/07/03 02:22:30 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1196 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/03 06:58:42 UTC, 0 replies.
- [jira] Created: (NUTCH-842) AutoGenerate WebPage code - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2010/07/03 09:45:49 UTC, 0 replies.
- Minimizing the number of stored fields for Solr - posted by Doğacan Güney <do...@gmail.com> on 2010/07/03 10:00:46 UTC, 4 replies.
- Nutchbase design doc - posted by Doğacan Güney <do...@gmail.com> on 2010/07/03 12:01:18 UTC, 5 replies.
- [jira] Updated: (NUTCH-838) Add timing information to all Tool classes - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/03 19:28:51 UTC, 0 replies.
- [jira] Resolved: (NUTCH-838) Add timing information to all Tool classes - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/03 20:01:52 UTC, 0 replies.
- YCSB benchmark for KV stores - posted by Andrzej Bialecki <ab...@getopt.org> on 2010/07/03 21:40:57 UTC, 1 reply.
- Hudson build is back to normal : Nutch-trunk #1197 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/04 06:23:21 UTC, 0 replies.
- [jira] Commented: (NUTCH-838) Add timing information to all Tool classes - posted by "Hudson (JIRA)" <ji...@apache.org> on 2010/07/04 06:24:52 UTC, 0 replies.
- [jira] Updated: (NUTCH-821) Use ivy in nutch builds - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/05 12:57:49 UTC, 0 replies.
- [jira] Resolved: (NUTCH-791) External links for published javadocs are partially broken - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/05 13:03:50 UTC, 0 replies.
- [jira] Commented: (NUTCH-821) Use ivy in nutch builds - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/05 13:50:50 UTC, 9 replies.
- [jira] Updated: (NUTCH-696) Timeout for Parser - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/05 15:02:50 UTC, 0 replies.
- [jira] Commented: (NUTCH-696) Timeout for Parser - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2010/07/05 16:55:52 UTC, 10 replies.
- [jira] Reopened: (NUTCH-696) Timeout for Parser - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/05 17:05:50 UTC, 0 replies.
- [jira] Issue Comment Edited: (NUTCH-696) Timeout for Parser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/05 17:13:52 UTC, 0 replies.
- Classifying pages on Nutch: plugins? - posted by Luan Cestari <lu...@gmail.com> on 2010/07/06 13:51:07 UTC, 6 replies.
- [jira] Closed: (NUTCH-821) Use ivy in nutch builds - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/07 11:27:49 UTC, 0 replies.
- Parse-tika ignores too much data... - posted by Andrzej Bialecki <ab...@getopt.org> on 2010/07/07 14:55:36 UTC, 6 replies.
- [jira] Created: (NUTCH-843) Separate the build and runtime environments - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/07 17:38:49 UTC, 0 replies.
- [jira] Commented: (NUTCH-843) Separate the build and runtime environments - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/07 18:00:55 UTC, 11 replies.
- [jira] Issue Comment Edited: (NUTCH-843) Separate the build and runtime environments - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/07 18:32:50 UTC, 1 reply.
- [jira] Updated: (NUTCH-843) Separate the build and runtime environments - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/07 18:34:49 UTC, 1 reply.
- [jira] Resolved: (NUTCH-843) Separate the build and runtime environments - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/07 22:32:49 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1201 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/08 07:55:00 UTC, 0 replies.
- [jira] Created: (NUTCH-844) Improve NutchConfiguration - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/08 14:59:50 UTC, 0 replies.
- [jira] Updated: (NUTCH-844) Improve NutchConfiguration - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/08 15:01:57 UTC, 1 reply.
- Nutch with classification - posted by Luan Cestari <lu...@gmail.com> on 2010/07/08 15:08:26 UTC, 3 replies.
- [jira] Created: (NUTCH-845) Native hadoop libs not available through maven - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/08 15:34:49 UTC, 0 replies.
- [jira] Commented: (NUTCH-845) Native hadoop libs not available through maven - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/08 15:38:50 UTC, 1 reply.
- [jira] Resolved: (NUTCH-845) Native hadoop libs not available through maven - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/08 16:15:49 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1202 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/09 07:50:53 UTC, 2 replies.
- [jira] Created: (NUTCH-846) Remove Hadoop related scripts in /bin - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/09 10:07:49 UTC, 0 replies.
- [jira] Commented: (NUTCH-846) Remove Hadoop related scripts in /bin - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/09 11:34:50 UTC, 1 reply.
- [jira] Closed: (NUTCH-846) Remove Hadoop related scripts in /bin - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/09 11:40:50 UTC, 0 replies.
- [jira] Created: (NUTCH-847) Wrong version of SOLR in Ivy.xml - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/09 14:05:50 UTC, 0 replies.
- [jira] Closed: (NUTCH-847) Wrong version of SOLR in Ivy.xml - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/09 14:09:50 UTC, 0 replies.
- [jira] Commented: (NUTCH-847) Wrong version of SOLR in Ivy.xml - posted by "Pham Tuan Minh (JIRA)" <ji...@apache.org> on 2010/07/09 16:48:49 UTC, 4 replies.
- [jira] Created: (NUTCH-848) Error when calling 'nutch solrindex' in deployed configuration - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/09 17:37:51 UTC, 0 replies.
- [Nutch Wiki] Update of "GettingNutchRunningWithDebian" by AndreRicardo - posted by Apache Wiki <wi...@apache.org> on 2010/07/09 18:49:51 UTC, 0 replies.
- [Nutch Wiki] Update of "AndreRicardo" by AndreRicardo - posted by Apache Wiki <wi...@apache.org> on 2010/07/09 19:25:07 UTC, 0 replies.
- [Nutch Wiki] Update of "FrontPage" by AndreRicardo - posted by Apache Wiki <wi...@apache.org> on 2010/07/10 01:08:20 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1203 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/10 07:12:20 UTC, 0 replies.
- [Nutch Wiki] Update of "NutchTutorial" by AndreRicardo - posted by Apache Wiki <wi...@apache.org> on 2010/07/10 14:39:33 UTC, 0 replies.
- Merging in nutchbase - posted by Doğacan Güney <do...@gmail.com> on 2010/07/10 15:00:16 UTC, 16 replies.
- [jira] Closed: (NUTCH-763) Separate configuration files from resources to be included in the job file - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/12 11:20:52 UTC, 0 replies.
- [jira] Created: (NUTCH-849) different versions of the same library in nutch-2.0-dev.job and local\lib directory - posted by "Pham Tuan Minh (JIRA)" <ji...@apache.org> on 2010/07/12 12:02:50 UTC, 0 replies.
- [jira] Updated: (NUTCH-848) Error when calling 'nutch solrindex' in deployed configuration - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/12 12:12:50 UTC, 7 replies.
- [jira] Created: (NUTCH-850) SolrDeleteDuplicates needs to clone the SolrRecord objects - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/12 17:58:50 UTC, 0 replies.
- [jira] Updated: (NUTCH-850) SolrDeleteDuplicates needs to clone the SolrRecord objects - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/12 17:58:53 UTC, 0 replies.
- [jira] Closed: (NUTCH-850) SolrDeleteDuplicates needs to clone the SolrRecord objects - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/12 18:12:50 UTC, 0 replies.
- [jira] Issue Comment Edited: (NUTCH-848) Error when calling 'nutch solrindex' in deployed configuration - posted by "Pham Tuan Minh (JIRA)" <ji...@apache.org> on 2010/07/13 12:07:50 UTC, 5 replies.
- [jira] Commented: (NUTCH-848) Error when calling 'nutch solrindex' in deployed configuration - posted by "Pham Tuan Minh (JIRA)" <ji...@apache.org> on 2010/07/13 12:07:50 UTC, 7 replies.
- [Nutch Wiki] Trivial Update of "首页" by sunlightcs - posted by Apache Wiki <wi...@apache.org> on 2010/07/13 12:55:47 UTC, 0 replies.
- [jira] Created: (NUTCH-851) Port logging to slf4j - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/13 18:10:50 UTC, 0 replies.
- [jira] Created: (NUTCH-852) parser not found for contentType=application/xhtml+xml - posted by "Pham Tuan Minh (JIRA)" <ji...@apache.org> on 2010/07/13 21:29:53 UTC, 0 replies.
- [jira] Commented: (NUTCH-851) Port logging to slf4j - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/13 21:47:57 UTC, 4 replies.
- Build failed in Hudson: Nutch-trunk #1204 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/14 09:04:12 UTC, 0 replies.
- [jira] Updated: (NUTCH-830) ScoringFilter to restrict the crawl to the hosts/domains listed in the seeds - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/14 10:43:51 UTC, 0 replies.
- [jira] Resolved: (NUTCH-852) parser not found for contentType=application/xhtml+xml - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/14 11:16:50 UTC, 0 replies.
- [jira] Commented: (NUTCH-844) Improve NutchConfiguration - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/14 13:13:50 UTC, 1 reply.
- [jira] Commented: (NUTCH-849) different versions of the same library in nutch-2.0-dev.job and local\lib directory - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/14 13:26:51 UTC, 0 replies.
- [jira] Updated: (NUTCH-851) Port logging to slf4j - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/14 14:18:50 UTC, 0 replies.
- [jira] Closed: (NUTCH-848) Error when calling 'nutch solrindex' in deployed configuration - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/14 15:58:08 UTC, 0 replies.
- [jira] Resolved: (NUTCH-844) Improve NutchConfiguration - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/14 16:43:50 UTC, 0 replies.
- [jira] Created: (NUTCH-853) Remove unused parameter files from conf/ - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/14 17:06:50 UTC, 0 replies.
- [jira] Closed: (NUTCH-853) Remove unused parameter files from conf/ - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/14 17:39:51 UTC, 0 replies.
- [jira] Commented: (NUTCH-853) Remove unused parameter files from conf/ - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 18:02:51 UTC, 2 replies.
- [jira] Issue Comment Edited: (NUTCH-853) Remove unused parameter files from conf/ - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 18:21:50 UTC, 0 replies.
- [jira] Resolved: (NUTCH-86) LanguageIdentifier API enhancements - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 19:40:51 UTC, 0 replies.
- [jira] Commented: (NUTCH-852) parser not found for contentType=application/xhtml+xml - posted by "Pham Tuan Minh (JIRA)" <ji...@apache.org> on 2010/07/14 19:43:50 UTC, 0 replies.
- [jira] Resolved: (NUTCH-309) Uses commons logging Code Guards - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 19:47:52 UTC, 0 replies.
- [jira] Resolved: (NUTCH-823) Download page should not have pointer to nightly builds - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 19:50:50 UTC, 1 reply.
- [jira] Assigned: (NUTCH-825) Publish nutch artifacts to central maven repository - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 19:50:51 UTC, 0 replies.
- [jira] Resolved: (NUTCH-759) Removal of deprecated APIs - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 19:50:52 UTC, 0 replies.
- [jira] Updated: (NUTCH-825) Publish nutch artifacts to central maven repository - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 19:50:53 UTC, 0 replies.
- [jira] Resolved: (NUTCH-454) Review Debug Level Log Guards - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 19:52:50 UTC, 0 replies.
- [jira] Assigned: (NUTCH-697) Generate log output for solr indexer and dedup - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 19:54:50 UTC, 0 replies.
- [jira] Work started: (NUTCH-697) Generate log output for solr indexer and dedup - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 19:54:51 UTC, 0 replies.
- [jira] Created: (NUTCH-854) Define standard attributes with values and explaination to configuration file in conf directory - posted by "Pham Tuan Minh (JIRA)" <ji...@apache.org> on 2010/07/14 20:01:52 UTC, 0 replies.
- [jira] Updated: (NUTCH-854) Define standard attributes with values and explaination to configuration files in conf directory - posted by "Pham Tuan Minh (JIRA)" <ji...@apache.org> on 2010/07/14 20:04:50 UTC, 0 replies.
- [jira] Work started: (NUTCH-825) Publish nutch artifacts to central maven repository - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 20:11:50 UTC, 0 replies.
- [jira] Resolved: (NUTCH-697) Generate log output for solr indexer and dedup - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 20:13:51 UTC, 0 replies.
- [jira] Commented: (NUTCH-697) Generate log output for solr indexer and dedup - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 20:17:51 UTC, 0 replies.
- [jira] Commented: (NUTCH-854) Define standard attributes with values and explaination to configuration files in conf directory - posted by "Pham Tuan Minh (JIRA)" <ji...@apache.org> on 2010/07/14 20:19:50 UTC, 0 replies.
- [jira] Resolved: (NUTCH-780) Nutch crawler did not read configuration files - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 20:31:50 UTC, 0 replies.
- [jira] Assigned: (NUTCH-774) Retry interval in crawl date is set to 0 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 20:42:52 UTC, 0 replies.
- [jira] Work started: (NUTCH-774) Retry interval in crawl date is set to 0 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 20:42:53 UTC, 0 replies.
- I LOVE Ivy! - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/07/14 20:44:18 UTC, 2 replies.
- [jira] Closed: (NUTCH-780) Nutch crawler did not read configuration files - posted by "Vu Hoang (JIRA)" <ji...@apache.org> on 2010/07/14 20:59:50 UTC, 0 replies.
- [jira] Resolved: (NUTCH-774) Retry interval in crawl date is set to 0 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 22:13:51 UTC, 0 replies.
- [jira] Issue Comment Edited: (NUTCH-774) Retry interval in crawl date is set to 0 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 22:15:53 UTC, 0 replies.
- [jira] Assigned: (NUTCH-733) plain text view of cached files ignores HTML encoding - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 22:15:57 UTC, 0 replies.
- [jira] Resolved: (NUTCH-733) plain text view of cached files ignores HTML encoding - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 22:18:54 UTC, 0 replies.
- [jira] Assigned: (NUTCH-677) Segment merge filering based on segment content - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 22:34:52 UTC, 0 replies.
- [jira] Work started: (NUTCH-677) Segment merge filering based on segment content - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 22:34:53 UTC, 0 replies.
- [jira] Commented: (NUTCH-677) Segment merge filering based on segment content - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 22:50:54 UTC, 2 replies.
- [jira] Assigned: (NUTCH-564) External parser supports encoding attribute - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 22:53:51 UTC, 0 replies.
- [jira] Work started: (NUTCH-564) External parser supports encoding attribute - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 22:53:51 UTC, 0 replies.
- [jira] Updated: (NUTCH-677) Segment merge filering based on segment content - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 23:00:51 UTC, 0 replies.
- [jira] Created: (NUTCH-855) ScoringFilter and IndexingFilter: To allow for the propagation of URL Metatags and their subsequent indexing. - posted by "Scott Gonyea (JIRA)" <ji...@apache.org> on 2010/07/15 03:48:52 UTC, 0 replies.
- [jira] Updated: (NUTCH-855) ScoringFilter and IndexingFilter: To allow for the propagation of URL Metatags and their subsequent indexing. - posted by "Scott Gonyea (JIRA)" <ji...@apache.org> on 2010/07/15 03:50:50 UTC, 7 replies.
- Re: [jira] Updated: (NUTCH-855) ScoringFilter and IndexingFilter: To allow for the propagation of URL Metatags and their subsequent indexing. - posted by Scott Gonyea <sc...@aitrus.org> on 2010/07/15 03:55:19 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1205 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/15 10:52:55 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1206 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/16 07:32:28 UTC, 0 replies.
- [jira] Commented: (NUTCH-18) Windows servers include illegal characters in URLs - posted by "David Escuer (JIRA)" <ji...@apache.org> on 2010/07/16 10:31:51 UTC, 3 replies.
- [Nutch Wiki] Trivial Update of "RunningNutchAndSolr" by GustavoZaera - posted by Apache Wiki <wi...@apache.org> on 2010/07/16 11:10:35 UTC, 0 replies.
- [jira] Created: (NUTCH-856) Use Tika for parsing feed - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/16 17:19:50 UTC, 0 replies.
- [jira] Updated: (NUTCH-856) Use Tika for parsing feed - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/16 17:19:51 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1207 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/17 10:14:34 UTC, 0 replies.
- [jira] Work started: (NUTCH-857) DistributedBeans should not close their RPC counterparts - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2010/07/19 23:00:50 UTC, 0 replies.
- [jira] Created: (NUTCH-857) DistributedBeans should not close their RPC counterparts - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2010/07/19 23:00:50 UTC, 2 replies.
- [jira] Updated: (NUTCH-857) DistributedBeans should not close their RPC counterparts - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2010/07/19 23:00:52 UTC, 0 replies.
- Component fetching during parsing. (vertical crawling) - posted by Ferdy <fe...@kalooga.com> on 2010/07/20 14:30:06 UTC, 2 replies.
- Re: svn commit: r965815 - in /nutch/branches/nutchbase/src: java/org/apache/nutch/parse/ParseStatus.java java/org/apache/nutch/parse/ParseText.java test/org/apache/nutch/parse/TestParseText.java - posted by Doğacan Güney <do...@gmail.com> on 2010/07/20 15:14:59 UTC, 4 replies.
- [jira] Commented: (NUTCH-790) Some external javadoc links are broken - posted by "André Ricardo (JIRA)" <ji...@apache.org> on 2010/07/20 15:25:49 UTC, 0 replies.
- [jira] Issue Comment Edited: (NUTCH-790) Some external javadoc links are broken - posted by "André Ricardo (JIRA)" <ji...@apache.org> on 2010/07/20 15:27:50 UTC, 0 replies.
- Re: svn commit: r965815 - in /nutch/branches/nutchbase/src: java/org/apache/nutch/parse/ParseStatus.java java/org/apache/nutch/parse/ParseText.java test/org/apache/nutch/parse/TestParseText.java - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/07/20 19:51:01 UTC, 1 reply.
- [jira] Resolved: (NUTCH-856) Use Tika for parsing feed - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/20 20:35:50 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1208 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/21 20:26:05 UTC, 0 replies.
- Nutchbase merge strategy - posted by Andrzej Bialecki <ab...@getopt.org> on 2010/07/21 20:26:34 UTC, 7 replies.
- [jira] Created: (NUTCH-858) No longer able to set per-field boosts on lucene documents - posted by "Edward Drapkin (JIRA)" <ji...@apache.org> on 2010/07/21 21:43:50 UTC, 0 replies.
- [jira] Commented: (NUTCH-858) No longer able to set per-field boosts on lucene documents - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/21 21:59:50 UTC, 3 replies.
- [jira] Updated: (NUTCH-858) No longer able to set per-field boosts on lucene documents - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/21 21:59:51 UTC, 0 replies.
- [Nutchbase] Multi-value ParseResult missing - posted by Andrzej Bialecki <ab...@getopt.org> on 2010/07/21 23:47:20 UTC, 2 replies.
- Build failed in Hudson: Nutch-trunk #1209 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/22 06:11:28 UTC, 0 replies.
- [jira] Reopened: (NUTCH-823) Download page should not have pointer to nightly builds - posted by "Sebb (JIRA)" <ji...@apache.org> on 2010/07/23 00:30:49 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1210 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/23 06:12:48 UTC, 0 replies.
- [jira] Created: (NUTCH-859) Diff trunk and NutchBase - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/23 15:18:49 UTC, 0 replies.
- [jira] Updated: (NUTCH-859) Diff trunk and NutchBase - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/23 15:20:50 UTC, 0 replies.
- [jira] Created: (NUTCH-860) package task fails - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/23 16:02:10 UTC, 0 replies.
- [jira] Updated: (NUTCH-860) package task fails - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/23 16:03:49 UTC, 0 replies.
- [jira] Created: (NUTCH-861) Rename HTMLParserFilter - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/23 17:44:50 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1211 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/25 06:11:55 UTC, 0 replies.
- [jira] Resolved: (NUTCH-677) Segment merge filering based on segment content - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/25 08:57:50 UTC, 0 replies.
- [jira] Assigned: (NUTCH-855) ScoringFilter and IndexingFilter: To allow for the propagation of URL Metatags and their subsequent indexing. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/25 09:00:54 UTC, 0 replies.
- [jira] Work started: (NUTCH-855) ScoringFilter and IndexingFilter: To allow for the propagation of URL Metatags and their subsequent indexing. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/25 09:00:55 UTC, 0 replies.
- [jira] Resolved: (NUTCH-855) ScoringFilter and IndexingFilter: To allow for the propagation of URL Metatags and their subsequent indexing. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/25 19:51:51 UTC, 2 replies.
- Build failed in Hudson: Nutch-trunk #1212 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/26 06:11:15 UTC, 0 replies.
- [jira] Resolved: (NUTCH-860) package task fails - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/26 12:27:49 UTC, 0 replies.
- [jira] Commented: (NUTCH-861) Rename HTMLParserFilter - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/26 13:45:50 UTC, 0 replies.
- [jira] Resolved: (NUTCH-857) DistributedBeans should not close their RPC counterparts - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2010/07/26 13:57:51 UTC, 0 replies.
- [jira] Closed: (NUTCH-857) DistributedBeans should not close their RPC counterparts - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2010/07/26 13:59:49 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1213 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/27 07:40:33 UTC, 3 replies.
- [jira] Created: (NUTCH-862) HttpClient null pointer exception - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2010/07/27 13:56:16 UTC, 0 replies.
- [jira] Updated: (NUTCH-862) HttpClient null pointer exception - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2010/07/27 14:00:19 UTC, 0 replies.
- [jira] Commented: (NUTCH-629) Detect slow and timeout servers and drop their URLs - posted by "eggs (JIRA)" <ji...@apache.org> on 2010/07/27 17:47:16 UTC, 1 reply.
- [jira] Created: (NUTCH-863) Benchmark and a testbed proxy server - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/29 15:09:16 UTC, 0 replies.
- [jira] Updated: (NUTCH-863) Benchmark and a testbed proxy server - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/29 15:09:20 UTC, 0 replies.
- [jira] Commented: (NUTCH-859) Diff trunk and NutchBase - posted by "Pham Tuan Minh (JIRA)" <ji...@apache.org> on 2010/07/30 15:32:17 UTC, 1 reply.
- [jira] Issue Comment Edited: (NUTCH-859) Diff trunk and NutchBase - posted by "Pham Tuan Minh (JIRA)" <ji...@apache.org> on 2010/07/30 15:32:17 UTC, 0 replies.
- [jira] Created: (NUTCH-864) Fetcher generates entries with status 0 - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/30 16:01:16 UTC, 0 replies.
- [jira] Created: (NUTCH-865) Format source code in unique style - posted by "Pham Tuan Minh (JIRA)" <ji...@apache.org> on 2010/07/30 17:40:16 UTC, 0 replies.
- [jira] Created: (NUTCH-866) STOP Nutch without breaking the crawled data - posted by "Pham Tuan Minh (JIRA)" <ji...@apache.org> on 2010/07/30 18:45:17 UTC, 0 replies.
- [jira] Resolved: (NUTCH-863) Benchmark and a testbed proxy server - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/30 21:53:16 UTC, 0 replies.
- Benchmark of Nutch trunk - posted by Andrzej Bialecki <ab...@getopt.org> on 2010/07/31 00:07:12 UTC, 2 replies.
- [jira] Created: (NUTCH-867) Port Nutch benchmark to Nutchbase - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/31 20:06:19 UTC, 0 replies.