You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] Commented: (NUTCH-240) Scoring API: extension point, scoring filters and an OPIC plugin - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/05/01 01:29:47 UTC, 0 replies.
- Re: CrawlDbReducer and the lone STATUS_SIGNATURE record - posted by Andrzej Bialecki <ab...@getopt.org> on 2006/05/01 01:36:06 UTC, 0 replies.
- how characters encoded in nutch - posted by tank <ta...@gmail.com> on 2006/05/01 03:37:25 UTC, 0 replies.
- Re: Php frontend - posted by Andrew Libby <an...@gmail.com> on 2006/05/01 15:22:26 UTC, 0 replies.
- A Developer's getting started doc? - posted by Andrew Libby <an...@gmail.com> on 2006/05/01 15:31:38 UTC, 7 replies.
- Creating a throttle - posted by "Fankhauser, Alain" <Al...@ipi.ch> on 2006/05/01 16:56:48 UTC, 1 replies.
- JobTrackerInfoServer and nutch*.jar - posted by an...@orbita1.ru on 2006/05/02 07:54:13 UTC, 0 replies.
- mapred question - posted by an...@orbita1.ru on 2006/05/02 13:46:43 UTC, 1 replies.
- Re: Content-Type inconsistency? - posted by Jérôme Charron <je...@gmail.com> on 2006/05/02 16:13:52 UTC, 2 replies.
- [jira] Created: (NUTCH-260) Three new plugins that parse, index and query meta tags defined in the configuration - posted by "Jake Vanderdray (JIRA)" <ji...@apache.org> on 2006/05/02 20:39:46 UTC, 0 replies.
- [jira] Updated: (NUTCH-260) Three new plugins that parse, index and query meta tags defined in the configuration - posted by "Jake Vanderdray (JIRA)" <ji...@apache.org> on 2006/05/02 20:41:48 UTC, 0 replies.
- 0.8 tutorial typos in Whole-web indexing? - posted by Lukas Vlcek <lu...@gmail.com> on 2006/05/03 22:50:31 UTC, 0 replies.
- [jira] Created: (NUTCH-261) Multi Language Support - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/05/04 00:15:18 UTC, 0 replies.
- [jira] Updated: (NUTCH-261) Multi Language Support - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/05/04 00:19:18 UTC, 0 replies.
- [jira] Commented: (NUTCH-134) Summarizer doesn't select the best snippets - posted by "Chris Fellows (JIRA)" <ji...@apache.org> on 2006/05/04 00:23:18 UTC, 7 replies.
- [jira] Created: (NUTCH-262) Summary excerpts and highlights problems - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/05/04 00:27:18 UTC, 0 replies.
- [jira] Created: (NUTCH-263) MapWritable.equals() doesn't work properly - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/05/04 03:37:16 UTC, 0 replies.
- [jira] Updated: (NUTCH-263) MapWritable.equals() doesn't work properly - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/05/04 04:36:17 UTC, 0 replies.
- [jira] Created: (NUTCH-264) Tools for merging and filtering CrawlDb-s and LinkDb-s - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/05/04 04:48:16 UTC, 0 replies.
- [jira] Updated: (NUTCH-264) Tools for merging and filtering CrawlDb-s and LinkDb-s - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/05/04 04:48:17 UTC, 0 replies.
- [jira] Commented: (NUTCH-263) MapWritable.equals() doesn't work properly - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/05/04 10:31:21 UTC, 1 replies.
- Classloader - posted by Christopher Burkey <cb...@openedit.org> on 2006/05/04 17:20:59 UTC, 0 replies.
- plugins in job file. - posted by Stefan Groschupf <sg...@media-style.com> on 2006/05/04 20:42:21 UTC, 2 replies.
- to count the number of pages from each domain - posted by an...@orbita1.ru on 2006/05/05 14:26:02 UTC, 1 replies.
- nutch inject bug(fix) - posted by Jochen Frey <nu...@quontis.com> on 2006/05/05 19:16:10 UTC, 0 replies.
- Re: svn commit: r399515 - /lucene/nutch/trunk/src/java/org/apache/nutch/segment/SegmentReader.java - posted by Doug Cutting <cu...@apache.org> on 2006/05/05 19:46:32 UTC, 0 replies.
- CommerceNet Events » Blog Archive » T3 5/11: Stefan Groschupf on Extending Nutch - posted by Doug Cutting <cu...@apache.org> on 2006/05/06 00:46:06 UTC, 0 replies.
- [jira] Updated: (NUTCH-134) Summarizer doesn't select the best snippets - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/05/06 00:56:34 UTC, 0 replies.
- [jira] Commented: (NUTCH-261) Multi Language Support - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/05/06 01:06:30 UTC, 0 replies.
- Merging segments - posted by Chris Fellows <cc...@sbcglobal.net> on 2006/05/06 01:32:28 UTC, 3 replies.
- Feature idea - Indexing Text Lengths - posted by Douglas Brunner <he...@gmail.com> on 2006/05/07 18:10:22 UTC, 2 replies.
- generate.max.per.host is per reduce task - posted by Chris Schneider <Sc...@TransPac.com> on 2006/05/07 22:13:10 UTC, 1 replies.
- http chunked content - posted by Stefan Groschupf <sg...@media-style.com> on 2006/05/08 04:36:59 UTC, 8 replies.
- nutch is loosing not modified pages - posted by Stefan Groschupf <sg...@media-style.com> on 2006/05/08 09:01:27 UTC, 1 replies.
- [jira] Created: (NUTCH-265) Getting Clustered results in better form. - posted by "Kris K (JIRA)" <ji...@apache.org> on 2006/05/08 14:45:20 UTC, 0 replies.
- [jira] Commented: (NUTCH-265) Getting Clustered results in better form. - posted by "Dawid Weiss (JIRA)" <ji...@apache.org> on 2006/05/08 15:53:22 UTC, 3 replies.
- [jira] Created: (NUTCH-266) hadoop bug when doing updatedb - posted by "Eugen Kochuev (JIRA)" <ji...@apache.org> on 2006/05/08 22:38:20 UTC, 0 replies.
- [jira] Closed: (NUTCH-264) Tools for merging and filtering CrawlDb-s and LinkDb-s - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/05/09 00:17:21 UTC, 0 replies.
- New tools: CrawlDbMerger, LinkDbMerger, SegmentMerger - posted by Andrzej Bialecki <ab...@getopt.org> on 2006/05/09 00:17:51 UTC, 5 replies.
- [jira] Closed: (NUTCH-263) MapWritable.equals() doesn't work properly - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/05/09 00:19:21 UTC, 0 replies.
- [jira] Created: (NUTCH-267) Indexer doesn't consider linkdb when calculating boost value - posted by "Chris Schneider (JIRA)" <ji...@apache.org> on 2006/05/09 03:58:22 UTC, 0 replies.
- [jira] Commented: (NUTCH-267) Indexer doesn't consider linkdb when calculating boost value - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2006/05/09 04:37:21 UTC, 4 replies.
- Creating different binary databases for indexing - posted by Dennis Kubes <nu...@dragonflymc.com> on 2006/05/09 23:35:12 UTC, 4 replies.
- PATCH - Fixes for 0.8 tutorial - posted by Lukas Vlcek <lu...@gmail.com> on 2006/05/09 23:41:48 UTC, 0 replies.
- Re: svn commit: r405565 - in /lucene/nutch/trunk/src: java/org/apache/nutch/searcher/ test/org/apache/nutch/searcher/ web/jsp/ - posted by Doug Cutting <cu...@apache.org> on 2006/05/10 01:42:13 UTC, 16 replies.
- [jira] Resolved: (NUTCH-257) Summary#toString always Entity encodes -- problem for OpenSearchServlet#description field - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/05/10 14:50:06 UTC, 0 replies.
- distance between words - posted by YourSoft <yo...@freemail.hu> on 2006/05/10 19:47:47 UTC, 2 replies.
- dfs -report - posted by Marko Bauhardt <mb...@media-style.com> on 2006/05/10 21:48:52 UTC, 1 replies.
- Issues to work on - posted by Dennis Kubes <nu...@dragonflymc.com> on 2006/05/11 00:26:51 UTC, 1 replies.
- Re: [Nutch-dev] Re: svn commit: r405565 - in /lucene/nutch/trunk/src: java/org/apache/nutch/searcher/ test/org/apache/nutch/searcher/ web/jsp/ - posted by Marvin Humphrey <ma...@rectangular.com> on 2006/05/11 13:21:18 UTC, 1 replies.
- Interleaved (parallel) fetch cycles - posted by Andrzej Bialecki <ab...@getopt.org> on 2006/05/11 14:48:40 UTC, 1 replies.
- Re: [jira] Updated: (NUTCH-251) Administration GUI - posted by TDLN <di...@gmail.com> on 2006/05/11 17:38:06 UTC, 2 replies.
- Preventing overlapped search results. - posted by Brian Hill <hi...@yosemite.cc.ca.us> on 2006/05/11 23:15:55 UTC, 0 replies.
- new location! nutch user meeting San Francisco - posted by Stefan Groschupf <sg...@media-style.com> on 2006/05/12 04:43:04 UTC, 0 replies.
- mozdex - posted by YourSoft <yo...@freemail.hu> on 2006/05/12 22:03:38 UTC, 0 replies.
- summarizer.setConf(conf) should be removed. - posted by Stefan Groschupf <sg...@media-style.com> on 2006/05/13 00:41:55 UTC, 0 replies.
- [jira] Created: (NUTCH-268) Generator and lib-http use different definitions of "unique host" - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/05/13 01:19:08 UTC, 0 replies.
- [jira] Commented: (NUTCH-268) Generator and lib-http use different definitions of "unique host" - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/05/13 01:29:10 UTC, 0 replies.
- Experiment on crawler behaviour - posted by Andrzej Bialecki <ab...@getopt.org> on 2006/05/13 01:57:09 UTC, 0 replies.
- [jira] Closed: (NUTCH-240) Scoring API: extension point, scoring filters and an OPIC plugin - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/05/13 02:58:09 UTC, 0 replies.
- HEADS UP: Config changes related to scoring API - posted by Andrzej Bialecki <ab...@getopt.org> on 2006/05/13 03:03:36 UTC, 0 replies.
- Re: [Nutch-cvs] svn commit: r406044 - /lucene/nutch/trunk/src/plugin/build.xml - posted by Andrzej Bialecki <ab...@getopt.org> on 2006/05/13 10:48:59 UTC, 1 replies.
- [jira] Resolved: (NUTCH-134) Summarizer doesn't select the best snippets - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/05/13 10:55:09 UTC, 0 replies.
- Is there any tutorial for developing nutch in eclipse or netbeans - posted by Jackey Yang <ja...@akomedia.com> on 2006/05/15 08:29:21 UTC, 0 replies.
- [jira] Commented: (NUTCH-251) Administration GUI - posted by "Thomas Delnoij (JIRA)" <ji...@apache.org> on 2006/05/15 13:09:06 UTC, 0 replies.
- [jira] Closed: (NUTCH-268) Generator and lib-http use different definitions of "unique host" - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/05/16 00:22:06 UTC, 0 replies.
- [jira] Created: (NUTCH-269) CrawlDbReducer: OOME because no upper-bound on inlinks count - posted by "stack@archive.org (JIRA)" <ji...@apache.org> on 2006/05/16 00:29:08 UTC, 0 replies.
- [jira] Updated: (NUTCH-269) CrawlDbReducer: OOME because no upper-bound on inlinks count - posted by "stack@archive.org (JIRA)" <ji...@apache.org> on 2006/05/16 00:31:06 UTC, 1 replies.
- RE: refetching interval - posted by Ledio Ago <la...@looksmart.net> on 2006/05/16 21:15:12 UTC, 2 replies.
- Query Boosting - posted by Marko Bauhardt <mb...@media-style.com> on 2006/05/17 11:18:36 UTC, 0 replies.
- Following
tags - posted by Chris Schneider <Sc...@TransPac.com> on 2006/05/17 22:56:45 UTC, 3 replies.
- Fetcher.java reporting incorrect kb/s? - posted by Greg Kim <gr...@gmail.com> on 2006/05/18 20:09:08 UTC, 2 replies.
- Nutch 'Help Wanted' page on wiki - posted by Gordon Mohr <go...@archive.org> on 2006/05/18 21:13:15 UTC, 0 replies.
- [jira] Created: (NUTCH-270) Apply just the applicable portions of the patch to protocol.httpclient.Http.java - posted by "Jeremy Calvert (JIRA)" <ji...@apache.org> on 2006/05/18 21:35:05 UTC, 0 replies.
- [jira] Created: (NUTCH-271) Meta-data per URL/site/section - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/18 23:13:08 UTC, 0 replies.
- [jira] Commented: (NUTCH-271) Meta-data per URL/site/section - posted by "Gal Nitzan (JIRA)" <ji...@apache.org> on 2006/05/19 00:08:06 UTC, 2 replies.
- [jira] Created: (NUTCH-272) Max. pages to crawl/fetch per site (emergency limit) - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/19 17:08:29 UTC, 0 replies.
- [jira] Commented: (NUTCH-173) PerHost Crawling Policy ( crawl.ignore.external.links ) - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/19 17:41:30 UTC, 0 replies.
- [jira] Updated: (NUTCH-270) Apply just the applicable portions of the patch to protocol.httpclient.Http.java - posted by "Jeremy Calvert (JIRA)" <ji...@apache.org> on 2006/05/19 18:52:30 UTC, 0 replies.
- Status of language plugin - posted by Teruhiko Kurosaka <Ku...@basistech.com> on 2006/05/19 21:14:19 UTC, 0 replies.
- [jira] Commented: (NUTCH-272) Max. pages to crawl/fetch per site (emergency limit) - posted by "Matt Kangas (JIRA)" <ji...@apache.org> on 2006/05/20 00:54:30 UTC, 11 replies.
- [jira] Created: (NUTCH-273) When a page is redirected, the original url is NOT updated. - posted by "Lukas Vlcek (JIRA)" <ji...@apache.org> on 2006/05/20 11:24:29 UTC, 0 replies.
- Re: Submitting for Review :: Tutorial on Nuth Implementation and Maintenace - posted by Lukas Vlcek <lu...@gmail.com> on 2006/05/20 11:48:02 UTC, 0 replies.
- [jira] Commented: (NUTCH-175) No input directories specified in: while crawing in nightly build from the 14.1.2006: sh ./nutch crawl urllist.txt -dir tmpdir - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/20 20:24:30 UTC, 0 replies.
- [jira] Updated: (NUTCH-173) PerHost Crawling Policy ( crawl.ignore.external.links ) - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/20 20:48:30 UTC, 0 replies.
- Building nightly 2006-05-20 has errors? - posted by Stefan Neufeind <ap...@stefan-neufeind.de> on 2006/05/20 21:33:06 UTC, 1 replies.
- [jira] Created: (NUTCH-274) Empty row in/at end of URL-list results in error - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/21 02:40:31 UTC, 0 replies.
- [jira] Created: (NUTCH-275) Fetcher not parsing XHTML-pages at all - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/21 03:01:31 UTC, 0 replies.
- [jira] Commented: (NUTCH-275) Fetcher not parsing XHTML-pages at all - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/21 03:09:30 UTC, 1 replies.
- [jira] Updated: (NUTCH-275) Fetcher not parsing XHTML-pages at all - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/21 03:14:30 UTC, 0 replies.
- [jira] Updated: (NUTCH-48) "Did you mean" query enhancement/refignment feature request - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/21 16:05:30 UTC, 0 replies.
- [jira] Created: (NUTCH-276) db.score.link.internal problem - posted by "Eugen Kochuev (JIRA)" <ji...@apache.org> on 2006/05/21 20:03:29 UTC, 0 replies.
- [jira] Commented: (NUTCH-254) Fetcher throws NullPointer if redirect URL is filtered - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/21 21:49:32 UTC, 0 replies.
- [jira] Created: (NUTCH-277) Fetcher dies because of "max. redirects" (avoiding infinite loop) - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/21 21:55:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-277) Fetcher dies because of "max. redirects" (avoiding infinite loop) - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/21 21:57:30 UTC, 0 replies.
- [jira] Updated: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/21 22:03:31 UTC, 0 replies.
- [jira] Commented: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/22 00:42:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-277) Fetcher dies because of "max. redirects" (avoiding infinite loop) - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/22 00:45:30 UTC, 1 replies.
- [jira] Created: (NUTCH-278) Fetcher-status might need clarification: kbit/s instead of kb/s shown - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/22 01:10:29 UTC, 0 replies.
- [jira] Commented: (NUTCH-278) Fetcher-status might need clarification: kbit/s instead of kb/s shown - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/05/22 02:09:30 UTC, 0 replies.
- error - posted by an...@orbita1.ru on 2006/05/22 11:38:58 UTC, 2 replies.
- [jira] Created: (NUTCH-279) Additions for regex-normalize - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/22 15:10:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-279) Additions for regex-normalize - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/22 15:14:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-255) Regular Expression for RegexUrlNormalizer to remove jsessionid - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/22 15:48:30 UTC, 0 replies.
- [jira] Created: (NUTCH-280) url query causes Null - posted by "Grant Glouser (JIRA)" <ji...@apache.org> on 2006/05/23 01:42:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-280) url query causes NullPointerException - posted by "Grant Glouser (JIRA)" <ji...@apache.org> on 2006/05/23 02:03:30 UTC, 0 replies.
- [jira] Updated: (NUTCH-265) Getting Clustered results in better form. - posted by "Kris K (JIRA)" <ji...@apache.org> on 2006/05/23 06:01:30 UTC, 0 replies.
- [jira] Resolved: (NUTCH-280) url query causes NullPointerException - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/05/23 19:29:30 UTC, 0 replies.
- A few questions - posted by Artem <ar...@usc.edu> on 2006/05/24 01:22:01 UTC, 0 replies.
- [jira] Created: (NUTCH-281) cached.jsp: base-href needs to be outside comments - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/24 02:52:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-281) cached.jsp: base-href needs to be outside comments - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/24 02:52:30 UTC, 0 replies.
- [jira] Created: (NUTCH-282) Showing too few results on a page (Paging not correct) - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/24 03:15:29 UTC, 0 replies.
- Fetcher and MapReduce - posted by Hamza Kaya <ha...@gmail.com> on 2006/05/24 11:13:29 UTC, 1 replies.
- Querying a site by extracting doc informations - posted by ro...@fastwebnet.it on 2006/05/24 14:52:15 UTC, 0 replies.
- Extract infos from documents and query external sites - posted by HellSpawn <r....@gmail.com> on 2006/05/24 15:19:54 UTC, 3 replies.
- [jira] Created: (NUTCH-283) If the Fetcher times out and abandons Fetcher Threads, severe errors will occur on those Threads - posted by "Scott Ganyo (JIRA)" <ji...@apache.org> on 2006/05/24 19:11:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-283) If the Fetcher times out and abandons Fetcher Threads, severe errors will occur on those Threads - posted by "Scott Ganyo (JIRA)" <ji...@apache.org> on 2006/05/24 19:11:30 UTC, 1 replies.
- [jira] Commented: (NUTCH-44) too many search results - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/24 19:52:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-70) duplicate pages - virtual hosts in db. - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/24 21:23:30 UTC, 0 replies.
- [jira] Created: (NUTCH-284) NullPointerException during index - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/24 21:31:29 UTC, 0 replies.
- Mailing List nutch-agent Reports of Bots Submitting Forms - posted by Jeremy Bensley <jb...@gmail.com> on 2006/05/24 21:50:15 UTC, 4 replies.
- [jira] Created: (NUTCH-285) LinkDb Fails rename doesn't create parent directories - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2006/05/24 23:40:30 UTC, 0 replies.
- [jira] Updated: (NUTCH-285) LinkDb Fails rename doesn't create parent directories - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2006/05/24 23:40:31 UTC, 0 replies.
- [jira] Created: (NUTCH-286) Handling common error-pages as 404 - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/25 01:41:30 UTC, 0 replies.
- [jira] Closed: (NUTCH-285) LinkDb Fails rename doesn't create parent directories - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/05/25 02:44:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-284) NullPointerException during index - posted by "Marko Bauhardt (JIRA)" <ji...@apache.org> on 2006/05/25 11:07:30 UTC, 2 replies.
- [jira] Created: (NUTCH-287) Exception when searching with sort - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/25 14:12:29 UTC, 0 replies.
- [jira] Created: (NUTCH-288) hitsPerSite-functionality "flawed": problems writing a page-navigation - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/25 17:26:31 UTC, 0 replies.
- [jira] Updated: (NUTCH-110) OpenSearchServlet outputs illegal xml characters - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/25 18:08:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-288) hitsPerSite-functionality "flawed": problems writing a page-navigation - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2006/05/25 18:45:31 UTC, 2 replies.
- [jira] Updated: (NUTCH-288) hitsPerSite-functionality "flawed": problems writing a page-navigation - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/26 01:24:30 UTC, 0 replies.
- Where exactly nutch scoring takes place ? - posted by ahmed ghouzia <gh...@yahoo.com> on 2006/05/26 15:15:54 UTC, 2 replies.
- [jira] Commented: (NUTCH-273) When a page is redirected, the original url is NOT updated. - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2006/05/26 22:28:31 UTC, 1 replies.
- [jira] Created: (NUTCH-289) CrawlDatum should store IP address - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2006/05/26 22:38:29 UTC, 0 replies.
- [jira] Commented: (NUTCH-289) CrawlDatum should store IP address - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/05/27 22:47:30 UTC, 4 replies.
- [jira] Created: (NUTCH-290) parse-pdf: Garbage (?) indexed when text-extraction now allowed - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/28 15:35:35 UTC, 0 replies.
- NPE When using a merged segment - posted by Gal Nitzan <gn...@usa.net> on 2006/05/28 17:50:42 UTC, 6 replies.
- [jira] Created: (NUTCH-291) OpenSearchServlet should return "date" as well as "lastModified" - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/28 18:45:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-291) OpenSearchServlet should return "date" as well as "lastModified" - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/28 18:50:30 UTC, 0 replies.
- [jira] Created: (NUTCH-292) OpenSearchServlet: OutOfMemoryError: Java heap space - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/28 19:38:29 UTC, 0 replies.
- Re: [Nutch-cvs] svn commit: r409869 - in /lucene/nutch/trunk/contrib/web2/plugins/caching-oscache/src/java/org: ./ apache/ apache/nutch/ apache/nutch/webapp/ apache/nutch/webapp/controller/ - posted by og...@yahoo.com on 2006/05/28 19:38:40 UTC, 1 replies.
- [jira] Commented: (NUTCH-290) parse-pdf: Garbage (?) indexed when text-extraction now allowed - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/28 20:31:30 UTC, 0 replies.
- [jira] Updated: (NUTCH-290) parse-pdf: Garbage (?) indexed when text-extraction now allowed - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/28 21:58:30 UTC, 0 replies.
- [jira] Updated: (NUTCH-290) parse-pdf: Garbage indexed when text-extraction not allowed - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/29 00:52:30 UTC, 0 replies.
- [jira] Updated: (NUTCH-292) OpenSearchServlet: OutOfMemoryError: Java heap space - posted by "Marcel Schnippe (JIRA)" <ji...@apache.org> on 2006/05/30 08:20:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-292) OpenSearchServlet: OutOfMemoryError: Java heap space - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/30 09:55:31 UTC, 2 replies.
- [jira] Commented: (NUTCH-290) parse-pdf: Garbage indexed when text-extraction not allowed - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/05/30 10:34:30 UTC, 0 replies.
- JVM error while parsing - posted by Uygar Yüzsüren <uy...@gmail.com> on 2006/05/30 14:14:33 UTC, 1 replies.
- java 1.4 versus 1.5 - posted by Owen O'Malley <ow...@yahoo-inc.com> on 2006/05/31 00:21:00 UTC, 1 replies.
- Do analyzer plugins have acces to the Configuration? - posted by Teruhiko Kurosaka <Ku...@basistech.com> on 2006/05/31 00:26:36 UTC, 0 replies.