You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] Created: (NUTCH-466) Flexible segment format - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2007/04/01 22:44:32 UTC, 0 replies.
- Nightly API lin kis broken - posted by Lukas Vlcek <lu...@gmail.com> on 2007/04/02 10:28:42 UTC, 1 replies.
- [jira] Commented: (NUTCH-466) Flexible segment format - posted by "Enis Soztutar (JIRA)" <ji...@apache.org> on 2007/04/02 11:19:32 UTC, 3 replies.
- Replace CJK lanaguage analyzer in nutch - posted by zhao xiuwen <re...@gmail.com> on 2007/04/02 18:33:54 UTC, 2 replies.
- Re: [VOTE] Release Apache Nutch 0.9 - posted by Chris Mattmann <ch...@jpl.nasa.gov> on 2007/04/02 18:39:03 UTC, 12 replies.
- [jira] Updated: (NUTCH-333) SegmentMerger and SegmentReader should use NutchJob - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/04/02 22:55:32 UTC, 0 replies.
- Re: svn commit: r524932 - in /lucene/nutch/trunk/src/java/org/apache/nutch/segment: SegmentMerger.java SegmentReader.java - posted by Chris Mattmann <ch...@jpl.nasa.gov> on 2007/04/02 23:49:19 UTC, 2 replies.
- [jira] Closed: (NUTCH-333) SegmentMerger and SegmentReader should use NutchJob - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/04/03 03:20:32 UTC, 0 replies.
- [jira] Resolved: (NUTCH-333) SegmentMerger and SegmentReader should use NutchJob - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/04/03 03:20:32 UTC, 0 replies.
- how to prune unmatched url?? - posted by "Ratnesh,V2Solutions India" <ra...@in.v2solutions.com> on 2007/04/03 15:00:28 UTC, 3 replies.
- How to prevent indexing at the time of crawling??? - posted by "Ratnesh,V2Solutions India" <ra...@in.v2solutions.com> on 2007/04/03 15:04:08 UTC, 0 replies.
- [jira] Created: (NUTCH-467) DeleteDuplicate fails if Segment index directory has 0 documents - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/04/04 16:22:32 UTC, 0 replies.
- [jira] Updated: (NUTCH-467) DeleteDuplicate fails if Segment index directory has 0 documents - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/04/04 16:31:32 UTC, 0 replies.
- Nutch Release 0.9 - Waiting for release to propagate to mirrors - posted by Chris Mattmann <ch...@jpl.nasa.gov> on 2007/04/05 04:21:40 UTC, 1 replies.
- Build failed in Hudson: Nutch-Nightly #45 - posted by hu...@lucene.zones.apache.org on 2007/04/05 09:00:08 UTC, 0 replies.
- Nutch 0.9 officially released! - posted by Chris Mattmann <ch...@jpl.nasa.gov> on 2007/04/06 04:46:41 UTC, 0 replies.
- Hudson build is back to normal: Nutch-Nightly #46 - posted by hu...@lucene.zones.apache.org on 2007/04/06 09:12:35 UTC, 0 replies.
- Nutch HTMLParseFilters - posted by Gaurav Agarwal <ga...@yahoo.com> on 2007/04/08 20:44:04 UTC, 0 replies.
- [jira] Updated: (NUTCH-468) Scoring filter should distribute score to all outlinks at once - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2007/04/09 20:59:32 UTC, 1 replies.
- [jira] Created: (NUTCH-468) Scoring filter should distribute score to all outlinks at once - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2007/04/09 20:59:32 UTC, 0 replies.
- Nutch java.io.exception - posted by "Armel T. Nene" <ar...@idna-solutions.com> on 2007/04/10 12:21:05 UTC, 1 replies.
- Re: Have anybody thought of replacing CrawlDb with any kind of Rational DB? - posted by Nuther <nu...@proservice.ge> on 2007/04/12 09:05:53 UTC, 14 replies.
- DummySSLProtocolSocketFactory problem, please help me!!!! 2 - posted by Gavino Marras <g....@ifc.cnr.it> on 2007/04/12 09:51:29 UTC, 0 replies.
- Runing a nutch crawler on Eclipse - posted by Tanmoy Kumar Mukherjee <mu...@wright.edu> on 2007/04/12 20:40:57 UTC, 2 replies.
- problem parsing HTML - posted by Ian Holsman <li...@holsman.net> on 2007/04/13 02:04:52 UTC, 2 replies.
- "WritingPluginExample-0.8" by RicardoJMendez - posted by Mike Schwartz <mf...@gmail.com> on 2007/04/13 21:46:56 UTC, 0 replies.
- [jira] Updated: (NUTCH-393) Indexer doesn't handle null documents returned by filters - posted by "Eelco Lempsink (JIRA)" <ji...@apache.org> on 2007/04/14 13:38:17 UTC, 0 replies.
- Nutch ERROR parse.OutlinkExtractor - getOutlinks - posted by "Armel T. Nene" <ar...@idna-solutions.com> on 2007/04/17 23:17:56 UTC, 0 replies.
- Build failed in Hudson: Nutch-Nightly #58 - posted by hu...@lucene.zones.apache.org on 2007/04/18 09:00:11 UTC, 0 replies.
- Hudson build is back to normal: Nutch-Nightly #59 - posted by hu...@lucene.zones.apache.org on 2007/04/18 17:25:17 UTC, 0 replies.
- Testing Scoring plugin - posted by Lorenzo <de...@ieee.org> on 2007/04/18 17:56:14 UTC, 2 replies.
- Crawl www.yahoo.com using nutch 0.9 - posted by Meryl Silverburgh <si...@gmail.com> on 2007/04/19 04:48:26 UTC, 3 replies.
- Re: [Nutch-dev] Creating a new scoring filter - posted by Lorenzo <de...@ieee.org> on 2007/04/19 19:55:02 UTC, 8 replies.
- [jira] Commented: (NUTCH-386) Plugin to index categories by url rules - posted by "Andrey (JIRA)" <ji...@apache.org> on 2007/04/20 00:33:15 UTC, 1 replies.
- Re: ApacheCon in Amsterdam - posted by Tom White <to...@gmail.com> on 2007/04/21 09:45:03 UTC, 1 replies.
- Perfomance problems and segmenting - posted by JoostRuiter <jo...@adnexus-recruitment.nl> on 2007/04/23 16:31:57 UTC, 8 replies.
- [jira] Commented: (NUTCH-468) Scoring filter should distribute score to all outlinks at once - posted by "Nicolás Lichtmaier (JIRA)" <ji...@apache.org> on 2007/04/23 22:40:15 UTC, 1 replies.
- [jira] Created: (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9 - posted by "Mike Schwartz (JIRA)" <ji...@apache.org> on 2007/04/24 00:30:15 UTC, 0 replies.
- modifications to geoPosition plugin to get it working on nutch 0.9 - posted by Mike Schwartz <mf...@gmail.com> on 2007/04/24 00:30:39 UTC, 1 replies.
- [jira] Created: (NUTCH-470) Adding optional terms to a query - posted by "Trond Andersen (JIRA)" <ji...@apache.org> on 2007/04/24 09:11:15 UTC, 0 replies.
- [jira] Updated: (NUTCH-470) Adding optional terms to a query - posted by "Trond Andersen (JIRA)" <ji...@apache.org> on 2007/04/24 09:13:15 UTC, 0 replies.
- [jira] Created: (NUTCH-471) Fix synchronization in NutchBean creation - posted by "Enis Soztutar (JIRA)" <ji...@apache.org> on 2007/04/24 10:11:15 UTC, 0 replies.
- [jira] Updated: (NUTCH-471) Fix synchronization in NutchBean creation - posted by "Enis Soztutar (JIRA)" <ji...@apache.org> on 2007/04/24 10:21:15 UTC, 1 replies.
- Fetcher2's delay between successive requests - posted by Doğacan Güney <do...@gmail.com> on 2007/04/24 12:45:55 UTC, 6 replies.
- [jira] Created: (NUTCH-472) NullPointerException in ZipTextExtractor if no MIME type for zipped file - posted by "Antony Bowesman (JIRA)" <ji...@apache.org> on 2007/04/24 13:56:15 UTC, 0 replies.
- [jira] Created: (NUTCH-473) ExcepExtractor performance bad due to String concatenation - posted by "Antony Bowesman (JIRA)" <ji...@apache.org> on 2007/04/24 14:04:15 UTC, 0 replies.
- [jira] Updated: (NUTCH-473) ExcelExtractor performance bad due to String concatenation - posted by "Antony Bowesman (JIRA)" <ji...@apache.org> on 2007/04/24 14:10:15 UTC, 0 replies.
- [jira] Commented: (NUTCH-471) Fix synchronization in NutchBean creation - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2007/04/24 15:50:15 UTC, 2 replies.
- [jira] Created: (NUTCH-474) Fetcher2 sets server-delay and blocking checks incorrectly - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2007/04/24 16:10:15 UTC, 0 replies.
- [jira] Updated: (NUTCH-474) Fetcher2 sets server-delay and blocking checks incorrectly - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2007/04/24 16:12:15 UTC, 0 replies.
- [jira] Resolved: (NUTCH-473) ExcelExtractor performance bad due to String concatenation - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2007/04/24 16:45:15 UTC, 0 replies.
- [jira] Updated: (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9 - posted by "Mike Schwartz (JIRA)" <ji...@apache.org> on 2007/04/24 21:15:15 UTC, 1 replies.
- Re: modifications to geoPosition plugin to get it working on nutch 0.9 - posted by Mike Schwartz <mf...@gmail.com> on 2007/04/24 21:15:18 UTC, 0 replies.
- [jira] Closed: (NUTCH-474) Fetcher2 sets server-delay and blocking checks incorrectly - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2007/04/24 23:34:15 UTC, 0 replies.
- [jira] Created: (NUTCH-475) Adaptive crawl delay - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2007/04/25 14:08:15 UTC, 0 replies.
- [jira] Updated: (NUTCH-475) Adaptive crawl delay - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2007/04/25 14:10:15 UTC, 0 replies.
- retrieving original html from database - posted by Charlie Williams <cw...@gmail.com> on 2007/04/25 16:42:39 UTC, 5 replies.
- [jira] Commented: (NUTCH-475) Adaptive crawl delay - posted by "Enis Soztutar (JIRA)" <ji...@apache.org> on 2007/04/26 07:55:15 UTC, 0 replies.
- [jira] Created: (NUTCH-476) Would like to add a field to the document class for its MD5 signature - posted by "Linh Pham (JIRA)" <ji...@apache.org> on 2007/04/27 23:25:15 UTC, 0 replies.
- How to build and deploy one plugin - posted by Manoharam Reddy <ma...@gmail.com> on 2007/04/30 13:38:19 UTC, 1 replies.