- Detecting CJKV / Asian language pages - posted by Andy Liu <an...@gmail.com> on 2005/08/01 18:25:50 UTC, 7 replies.
- nutch prune - posted by Jay Pound <we...@poundwebhosting.com> on 2005/08/01 20:18:00 UTC, 1 replies.
- mapred branch Revision 226742 - posted by Yitao Duan <ol...@gmail.com> on 2005/08/01 22:31:03 UTC, 1 replies.
- Fetcher delays - benchmarks - posted by Christophe Noel <ch...@cetic.be> on 2005/08/02 12:09:45 UTC, 3 replies.
- [jira] Updated: (NUTCH-21) parser plugin for MS PowerPoint slides - posted by "Stephan Strittmatter (JIRA)" <ji...@apache.org> on 2005/08/02 13:54:35 UTC, 0 replies.
- [jira] Updated: (NUTCH-20) Extract urls from plain texts - posted by "Stephan Strittmatter (JIRA)" <ji...@apache.org> on 2005/08/02 14:05:35 UTC, 1 replies.
- [jira] Created: (NUTCH-77) Project URL in JIRA - posted by "Stephan Strittmatter (JIRA)" <ji...@apache.org> on 2005/08/02 14:05:37 UTC, 0 replies.
- Memory usage - posted by Jay Pound <we...@poundwebhosting.com> on 2005/08/02 18:53:36 UTC, 2 replies.
- Re: Memory usage2 - posted by Jay Pound <we...@poundwebhosting.com> on 2005/08/02 21:43:59 UTC, 3 replies.
- My wishlist of 12 out of... - posted by EM <em...@cpuedge.com> on 2005/08/03 05:25:02 UTC, 0 replies.
- Strange search results - posted by Howie Wang <ho...@hotmail.com> on 2005/08/03 07:32:08 UTC, 8 replies.
- dns lookup cache? - posted by Stefan Groschupf <sg...@media-style.com> on 2005/08/03 10:19:51 UTC, 6 replies.
- digest field in Nutch index directory - posted by "Feng (Michael) Ji" <fj...@yahoo.com> on 2005/08/04 05:30:42 UTC, 0 replies.
- Re: IndexOptimizer bug? - posted by Michael Nebel <mi...@nebel.de> on 2005/08/04 13:53:15 UTC, 5 replies.
- near-term plan - posted by Doug Cutting <cu...@nutch.org> on 2005/08/04 19:17:49 UTC, 15 replies.
- Documentation - posted by Nishant Chandra <ni...@gmail.com> on 2005/08/04 19:54:45 UTC, 2 replies.
- Detecting unmodified content patches (Re: near-term plan) - posted by Andrzej Bialecki <ab...@getopt.org> on 2005/08/04 23:33:05 UTC, 0 replies.
- [jira] Closed: (NUTCH-65) index-more plugin can't parse large set of modification-date - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2005/08/04 23:49:39 UTC, 0 replies.
- detect page updating - posted by Michael Ji <fj...@yahoo.com> on 2005/08/05 04:17:44 UTC, 0 replies.
- Ignore external links from crawled domains - posted by Christophe Noel <ch...@gmail.com> on 2005/08/05 10:57:00 UTC, 1 replies.
- [jira] Created: (NUTCH-78) German texts on website - posted by "Matthias Jaekle (JIRA)" <ji...@apache.org> on 2005/08/05 18:48:35 UTC, 0 replies.
- [jira] Updated: (NUTCH-78) German texts on website - posted by "Matthias Jaekle (JIRA)" <ji...@apache.org> on 2005/08/05 18:48:36 UTC, 0 replies.
- Crawling directly from URL and Questions about using the index - posted by Nils Hoeller <ni...@arcor.de> on 2005/08/05 19:59:41 UTC, 0 replies.
- NUTCH-7 bug - posted by Piotr Kosiorowski <pk...@gmail.com> on 2005/08/05 20:40:30 UTC, 0 replies.
- fetching redirect bug? - posted by EM <em...@cpuedge.com> on 2005/08/05 22:20:28 UTC, 0 replies.
- mapred - posted by Jay Pound <we...@poundwebhosting.com> on 2005/08/06 16:09:46 UTC, 4 replies.
- mapred question - posted by Jay Pound <we...@poundwebhosting.com> on 2005/08/06 19:39:59 UTC, 0 replies.
- NDFS benchmark results - posted by Jay Pound <we...@poundwebhosting.com> on 2005/08/07 00:30:22 UTC, 0 replies.
- ndfs problem needs fix - posted by Jay Pound <we...@poundwebhosting.com> on 2005/08/07 05:34:44 UTC, 3 replies.
- luke?? - posted by Jay Pound <we...@poundwebhosting.com> on 2005/08/07 22:19:53 UTC, 2 replies.
- Nutch website deployment - posted by Piotr Kosiorowski <pk...@gmail.com> on 2005/08/07 23:27:08 UTC, 2 replies.
- JIRA access - posted by Piotr Kosiorowski <pk...@gmail.com> on 2005/08/07 23:32:13 UTC, 2 replies.
- Creation of a Graph File with the DB Link Graph Database - posted by Nils Hoeller <ni...@arcor.de> on 2005/08/08 12:35:43 UTC, 0 replies.
- Tutorial - posted by Piotr Kosiorowski <pk...@gmail.com> on 2005/08/08 14:37:59 UTC, 2 replies.
- NUTCH 79 Fault tolerant searching. - posted by Piotr Kosiorowski <pk...@gmail.com> on 2005/08/08 19:03:38 UTC, 0 replies.
- regex-url filter - posted by Jay Pound <we...@poundwebhosting.com> on 2005/08/08 20:37:25 UTC, 2 replies.
- Re: svn commit: r230867 - /lucene/nutch/trunk/conf/crawl-urlfilter.txt.template - posted by Doug Cutting <cu...@nutch.org> on 2005/08/08 22:01:08 UTC, 1 replies.
- User agent string - posted by Piotr Kosiorowski <pk...@gmail.com> on 2005/08/08 22:28:39 UTC, 1 replies.
- Re: svn commit: r230887 - /lucene/nutch/trunk/conf/nutch-default.xml - posted by Doug Cutting <cu...@nutch.org> on 2005/08/08 22:51:17 UTC, 3 replies.
- Writable vs Externalizable - posted by Stefan Groschupf <sg...@media-style.com> on 2005/08/08 23:02:55 UTC, 3 replies.
- clucene-java bindings - posted by Ben van Klinken <bv...@gmail.com> on 2005/08/09 11:37:32 UTC, 2 replies.
- Re: [Nutch-cvs] svn commit: r230887 - /lucene/nutch/trunk/conf/nutch-default.xml - posted by Andrzej Bialecki <ab...@getopt.org> on 2005/08/09 20:44:57 UTC, 3 replies.
- Re: [Nutch-dev] Re: regex-url filter - posted by Hasan Diwan <ha...@gmail.com> on 2005/08/09 23:05:39 UTC, 2 replies.
- strange url counting in the fetcher - posted by EM <em...@cpuedge.com> on 2005/08/10 00:17:57 UTC, 0 replies.
- How to extend Nutch? - posted by Fuad Efendi <fu...@efendi.ca> on 2005/08/10 20:56:00 UTC, 1 replies.
- Nutch versions - Was: [Nutch-cvs] svn commit: r230887 - /lucene/nutch/trunk/conf/nutch-default.xml - posted by Piotr Kosiorowski <pk...@gmail.com> on 2005/08/10 22:12:24 UTC, 1 replies.
- Amin GH's invitation - posted by Am...@invitation.sms.ac on 2005/08/11 00:11:06 UTC, 0 replies.
- RE: extend java.net.URL? - posted by Nick Lothian <nl...@educationau.edu.au> on 2005/08/11 08:17:37 UTC, 3 replies.
- CrawlTool - fetching only first page - posted by Fuad Efendi <fu...@efendi.ca> on 2005/08/11 17:16:13 UTC, 7 replies.
- Release HOWTO - posted by Piotr Kosiorowski <pk...@gmail.com> on 2005/08/11 21:52:15 UTC, 2 replies.
- ant setup for Cgywin - posted by Michael Ji <fj...@yahoo.com> on 2005/08/11 22:41:41 UTC, 2 replies.
- page ranking weights - posted by Jay Pound <we...@poundwebhosting.com> on 2005/08/11 22:49:29 UTC, 3 replies.
- Field.Text vs Field.UnStored - posted by EM <em...@cpuedge.com> on 2005/08/12 08:14:11 UTC, 1 replies.
- Different Number of Doc in Index and WebDB - posted by Nils Hoeller <ni...@arcor.de> on 2005/08/12 15:26:53 UTC, 2 replies.
- [jira] Closed: (NUTCH-30) rss feed parser - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2005/08/12 17:21:58 UTC, 0 replies.
- Site Content not indexed ? Nutch 0.7 - posted by Nils Hoeller <ni...@arcor.de> on 2005/08/12 18:47:23 UTC, 1 replies.
- Injecting documents manually. - posted by Dawid Weiss <da...@cs.put.poznan.pl> on 2005/08/12 19:15:28 UTC, 3 replies.
- Clustering plugin upgrade - posted by Dawid Weiss <da...@cs.put.poznan.pl> on 2005/08/12 19:18:11 UTC, 1 replies.
- Language detection - posted by Ken Krugler <kk...@transpac.com> on 2005/08/12 23:47:48 UTC, 0 replies.
- Re: [Nutch-dev] Field.Text vs Field.UnStored - posted by praveen pathiyil <pa...@gmail.com> on 2005/08/14 03:28:48 UTC, 0 replies.
- ParseData Object - posted by Nils Hoeller <ni...@arcor.de> on 2005/08/14 16:57:36 UTC, 2 replies.
- Indexing the whole WebDB or get Pages out of WebDB that are Indexed - posted by Nils Hoeller <ni...@arcor.de> on 2005/08/14 17:00:07 UTC, 1 replies.
- turn on Log - posted by Michael Ji <fj...@yahoo.com> on 2005/08/15 00:54:29 UTC, 0 replies.
- [jira] Created: (NUTCH-80) Web UI only works when project deployed in root - posted by "AJ Banck (JIRA)" <ji...@apache.org> on 2005/08/15 13:27:53 UTC, 0 replies.
- [jira] Created: (NUTCH-81) Webapp only works when deployed in root - posted by "AJ Banck (JIRA)" <ji...@apache.org> on 2005/08/15 13:31:54 UTC, 0 replies.
- [jira] Updated: (NUTCH-81) Webapp only works when deployed in root - posted by "AJ Banck (JIRA)" <ji...@apache.org> on 2005/08/15 13:38:54 UTC, 0 replies.
- [jira] Commented: (NUTCH-80) Web UI only works when project deployed in root - posted by "AJ Banck (JIRA)" <ji...@apache.org> on 2005/08/15 13:40:54 UTC, 0 replies.
- [jira] Commented: (NUTCH-81) Webapp only works when deployed in root - posted by "AJ Banck (JIRA)" <ji...@apache.org> on 2005/08/15 13:54:55 UTC, 0 replies.
- Fetcher, ParseText, ParseData - need to modify - posted by Fuad Efendi <fu...@efendi.ca> on 2005/08/15 19:20:03 UTC, 2 replies.
- VOTE: clustering plugin update for Rel 0.7 - posted by Andrzej Bialecki <ab...@getopt.org> on 2005/08/15 20:11:06 UTC, 5 replies.
- [jira] Created: (NUTCH-82) Nutch Commands should run on Windows without external tools - posted by "AJ Banck (JIRA)" <ji...@apache.org> on 2005/08/15 20:22:54 UTC, 0 replies.
- [jira] Updated: (NUTCH-82) Nutch Commands should run on Windows without external tools - posted by "AJ Banck (JIRA)" <ji...@apache.org> on 2005/08/15 20:27:56 UTC, 1 replies.
- Re: [Nutch-dev] turn on Log - posted by Hasan Diwan <ha...@gmail.com> on 2005/08/15 20:41:37 UTC, 0 replies.
- [jira] Closed: (NUTCH-80) Web UI only works when project deployed in root - posted by "Piotr Kosiorowski (JIRA)" <ji...@apache.org> on 2005/08/15 21:00:54 UTC, 0 replies.
- [jira] Created: (NUTCH-83) Release deliverable as zip - posted by "AJ Banck (JIRA)" <ji...@apache.org> on 2005/08/15 21:11:53 UTC, 0 replies.
- [jira] Updated: (NUTCH-83) Release deliverable as zip - posted by "AJ Banck (JIRA)" <ji...@apache.org> on 2005/08/15 21:33:54 UTC, 0 replies.
- MapRed - Injector - urlDir - Format? - posted by Fuad Efendi <fu...@efendi.ca> on 2005/08/15 21:41:57 UTC, 4 replies.
- Release 0.7 - posted by Piotr Kosiorowski <pk...@gmail.com> on 2005/08/16 19:18:04 UTC, 1 replies.
- Difference Between 0.6 and 0.7 - posted by Paul Harrison <pr...@swbell.net> on 2005/08/16 20:28:22 UTC, 1 replies.
- Slow Results - posted by Paul Harrison <pr...@swbell.net> on 2005/08/16 20:35:04 UTC, 4 replies.
- (mapred branch) Job.xml as a directory instead of a file, other issues. - posted by Jeremy Bensley <jb...@gmail.com> on 2005/08/16 20:56:52 UTC, 3 replies.
- Release 0.7 problem - posted by Piotr Kosiorowski <pk...@gmail.com> on 2005/08/16 23:11:15 UTC, 3 replies.
- 128-bit and 64-bit MD5 Hash Value - posted by Michael Ji <fj...@yahoo.com> on 2005/08/17 03:21:39 UTC, 0 replies.
- result tuning - posted by webmaster <we...@www.poundwebhosting.com> on 2005/08/17 04:16:39 UTC, 0 replies.
- Nutch 0.7 released - posted by Piotr Kosiorowski <pk...@gmail.com> on 2005/08/17 14:02:23 UTC, 0 replies.
- Re: Language Identifier in Nutch - posted by Jérôme Charron <je...@gmail.com> on 2005/08/17 15:29:02 UTC, 0 replies.
- Typo in plugin/build.xml - posted by Fuad Efendi <fu...@efendi.ca> on 2005/08/17 18:54:19 UTC, 1 replies.
- Merge Lucene to Nutch - posted by Michael Ji <fj...@yahoo.com> on 2005/08/18 02:29:34 UTC, 2 replies.
- [jira] Commented: (NUTCH-71) Search web page doesn't not focus on query input - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2005/08/18 14:21:55 UTC, 0 replies.
- Localized docs files - posted by Jérôme Charron <je...@gmail.com> on 2005/08/18 18:46:22 UTC, 1 replies.
- Search Java JSP error after configuration and set up. Please help. - posted by Diane Palla <pa...@shu.edu> on 2005/08/18 20:42:19 UTC, 0 replies.
- Outlink metadata? - posted by Erik Hatcher <er...@ehatchersolutions.com> on 2005/08/18 20:44:50 UTC, 0 replies.
- [jira] Closed: (NUTCH-71) Search web page doesn't not focus on query input - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2005/08/18 22:25:59 UTC, 0 replies.
- [jira] Assigned: (NUTCH-74) French Analyzer Plugin - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2005/08/18 22:33:54 UTC, 0 replies.
- [jira] Updated: (NUTCH-74) French Analyzer Plugin - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2005/08/18 22:35:54 UTC, 0 replies.
- RE: [Nutch-dev] Outlink metadata? - posted by Jeremy Calvert <Je...@vulcan.com> on 2005/08/19 00:54:24 UTC, 0 replies.
- Parse-html should be enhanced! - posted by Jack Tang <hi...@gmail.com> on 2005/08/19 04:14:33 UTC, 7 replies.
- NullWritable / reading the webdb MapFiles - posted by "Mr. Udatny" <ru...@rosa.com> on 2005/08/19 10:38:03 UTC, 0 replies.
- Bug in net.nutch.searcher.FetchedSegments.java - posted by Alan Wang <sf...@gmail.com> on 2005/08/19 11:36:03 UTC, 1 replies.
- Indexed Segment? - posted by Paul Harrison <pa...@personifi.com> on 2005/08/19 13:50:54 UTC, 1 replies.
- Fw: Crawl produced no search results. - posted by Diane Palla <pa...@shu.edu> on 2005/08/19 15:55:51 UTC, 0 replies.
- Q: How to setup eclipse projects to acccess nutch? - posted by Michael Scharf <nu...@scharf-software.com> on 2005/08/19 16:28:52 UTC, 1 replies.
- svn.apache.org down? - posted by Jérôme Charron <je...@gmail.com> on 2005/08/19 17:25:53 UTC, 1 replies.
- [jira] Closed: (NUTCH-10) extension points are defined multiple times - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2005/08/19 17:59:55 UTC, 0 replies.
- Re: Q: How to setup eclipse projects to access Nutch? - posted by Ken Krugler <kk...@transpac.com> on 2005/08/19 20:05:06 UTC, 0 replies.
- Failing JUnit test - posted by Piotr Kosiorowski <pk...@gmail.com> on 2005/08/19 20:30:01 UTC, 9 replies.
- [jira] Closed: (NUTCH-20) Extract urls from plain texts - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2005/08/19 23:22:56 UTC, 0 replies.
- MD5 in fetchlist / fetcher - posted by Michael Ji <fj...@yahoo.com> on 2005/08/20 05:07:29 UTC, 0 replies.
- Redirect requested but followRedirects is disabled - posted by Fuad Efendi <fu...@efendi.ca> on 2005/08/21 08:36:02 UTC, 1 replies.
- dump nutch index - posted by Michael Ji <fj...@yahoo.com> on 2005/08/22 02:22:55 UTC, 5 replies.
- crawl-urlfilter.txt mechanics - posted by Michael Ji <fj...@yahoo.com> on 2005/08/22 05:54:04 UTC, 1 replies.
- Extracted Data Manipulation - org.apache.nutch.io, MapRed? - posted by Fuad Efendi <fu...@efendi.ca> on 2005/08/22 18:10:05 UTC, 0 replies.
- Mapred/0.7 - posted by Zaheed Haque <za...@gmail.com> on 2005/08/22 19:01:27 UTC, 3 replies.
- Re: Searchable mailing lists on nutch.org? - posted by "Will (sent by Nabble.com)" <li...@nabble.com> on 2005/08/23 01:02:42 UTC, 0 replies.
- Fetcher for constrained crawls - posted by Kelvin Tan <ke...@relevanz.com> on 2005/08/23 06:02:26 UTC, 7 replies.
- Searching NDFS with Tomcat - posted by lucene_nutch_ lucene_nutch_2005 <lu...@yahoo.com> on 2005/08/23 18:19:14 UTC, 4 replies.
- 0.7 branch - posted by Piotr Kosiorowski <pk...@gmail.com> on 2005/08/23 20:40:14 UTC, 4 replies.
- small bug - posted by John Maraist <fr...@maraist.org> on 2005/08/24 18:25:21 UTC, 0 replies.
- Language identifier plugin questions - posted by Tom White <to...@gmail.com> on 2005/08/25 00:58:37 UTC, 4 replies.
- [jira] Created: (NUTCH-84) Fetcher for constrained crawls - posted by "Kelvin Tan (JIRA)" <ji...@apache.org> on 2005/08/25 01:02:08 UTC, 0 replies.
- [jira] Updated: (NUTCH-84) Fetcher for constrained crawls - posted by "Kelvin Tan (JIRA)" <ji...@apache.org> on 2005/08/25 01:06:09 UTC, 2 replies.
- Re: (NUTCH-84) Fetcher for constrained crawls - posted by Kelvin Tan <ke...@relevanz.com> on 2005/08/25 01:16:44 UTC, 0 replies.
- [jira] Created: (NUTCH-85) pdf parser caused fetcher hangs. - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2005/08/25 11:13:10 UTC, 0 replies.
- [jira] Commented: (NUTCH-85) pdf parser caused fetcher hangs. - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2005/08/25 12:28:08 UTC, 2 replies.
- [mapred] Possible bug, static primatives holding config values? - posted by "Jeremy Bensley (sent by Nabble.com)" <li...@nabble.com> on 2005/08/25 18:15:22 UTC, 1 replies.
- Re: svn commit: r240097 - /lucene/nutch/branches/Release-0.7/ - posted by Piotr Kosiorowski <pk...@gmail.com> on 2005/08/25 18:45:57 UTC, 2 replies.
- Re: Implementation of (NUTCH-84) Fetcher for constrained crawls - posted by Kelvin Tan <ke...@relevanz.com> on 2005/08/26 03:28:36 UTC, 3 replies.
- Nutch Website - i18n - posted by Michael Weber <m....@olserv.de> on 2005/08/26 11:18:51 UTC, 3 replies.
- Re: svn commit: r240254 - in /lucene/nutch/tags/Release-0.7/src/plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang: HTMLLanguageParser.java LanguageIdentifier.java LanguageIndexingFilter.java LanguageQueryFilter.java NGramProfile.java - posted by Piotr Kosiorowski <pk...@gmail.com> on 2005/08/26 18:06:11 UTC, 2 replies.
- [jira] Closed: (NUTCH-37) Javadoc Warnings - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2005/08/26 23:06:26 UTC, 0 replies.
- Analysis plugins and lucene-analyzers - posted by Jérôme Charron <je...@gmail.com> on 2005/08/27 10:16:29 UTC, 6 replies.
- indexing and refetching by using NUTCH-84) Fetcher for constrained crawls - posted by Michael Ji <fj...@yahoo.com> on 2005/08/27 18:01:28 UTC, 1 replies.
- [jira] Commented: (NUTCH-65) index-more plugin can't parse large set of modification-date - posted by "Michael Nebel (JIRA)" <ji...@apache.org> on 2005/08/27 20:45:04 UTC, 2 replies.
- Limiting crawl depth using NUTCH-84) Fetcher for constrained crawls - posted by Kelvin Tan <ke...@relevanz.com> on 2005/08/27 22:12:58 UTC, 4 replies.
- Re: [Nutch-cvs] svn commit: r240359 - in /lucene/nutch/trunk/src: java/org/apache/nutch/analysis/ java/org/apache/nutch/indexer/ plugin/nutch-extensionpoints/ - posted by og...@yahoo.com on 2005/08/28 00:05:46 UTC, 2 replies.
- bot-traps and refetching - posted by Michael Ji <fj...@yahoo.com> on 2005/08/28 16:31:06 UTC, 0 replies.
- crawling ability of NUTCH-84 - posted by Michael Ji <fj...@yahoo.com> on 2005/08/28 17:31:29 UTC, 1 replies.
- Launch Nutch Search Engine successfully based on Nutch-84 data - posted by Michael Ji <fj...@yahoo.com> on 2005/08/28 21:51:54 UTC, 0 replies.
- Re: bot-traps and refetching - posted by Kelvin Tan <ke...@relevanz.com> on 2005/08/29 00:56:00 UTC, 1 replies.
- junit test failed - posted by AJ Chen <an...@sbcglobal.net> on 2005/08/29 03:01:12 UTC, 8 replies.
- a couple ant problems - posted by Earl Cahill <ca...@yahoo.com> on 2005/08/29 04:08:13 UTC, 3 replies.
- Re-Crawl? - posted by Fuad Efendi <fu...@efendi.ca> on 2005/08/29 04:09:35 UTC, 0 replies.
- controlled depth crawling - posted by Michael Ji <fj...@yahoo.com> on 2005/08/29 04:37:16 UTC, 4 replies.
- Need to reconstruct URLs from segment - posted by Fuad Efendi <fu...@efendi.ca> on 2005/08/29 04:42:29 UTC, 0 replies.
- Automating workflow using ndfs - posted by Jay Lorenzo <ja...@gmail.com> on 2005/08/29 07:24:17 UTC, 1 replies.
- UpdateSegmentsFromDb - posted by Fuad Efendi <fu...@efendi.ca> on 2005/08/29 08:53:55 UTC, 0 replies.
- a further concern of refetching scenario about depth controlled crawling - posted by Michael Ji <fj...@yahoo.com> on 2005/08/30 02:35:16 UTC, 0 replies.
- HttpAuthentication in protocol-httpclient plugin - posted by Jack Tang <hi...@gmail.com> on 2005/08/30 04:13:30 UTC, 0 replies.
- NDFS question - posted by Egor Chernodarov <eg...@zarinsk.dem.ru> on 2005/08/30 13:38:17 UTC, 1 replies.
- Another NDFS question - posted by "Ian C. Blenke" <ic...@nks.net> on 2005/08/30 16:22:26 UTC, 3 replies.
- Incremental crawling available? - posted by Diane Palla <pa...@shu.edu> on 2005/08/30 20:26:03 UTC, 0 replies.
- Re: [Nutch-dev] Re: Another NDFS question - posted by Erik Hatcher <er...@ehatchersolutions.com> on 2005/08/30 22:20:37 UTC, 0 replies.
- Out of Memory?! 1300Mb!!! - posted by Fuad Efendi <fu...@efendi.ca> on 2005/08/31 07:47:27 UTC, 0 replies.
- null lang bug? and patch? - posted by Earl Cahill <ca...@yahoo.com> on 2005/08/31 08:47:04 UTC, 4 replies.
- [jira] Created: (NUTCH-86) LanguageIdentifier API enhancements - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2005/08/31 12:32:16 UTC, 0 replies.
- Re[2]: NDFS question - posted by Egor Chernodarov <eg...@zarinsk.dem.ru> on 2005/08/31 13:36:11 UTC, 0 replies.
- merge mapred to trunk - posted by Doug Cutting <cu...@nutch.org> on 2005/08/31 18:34:39 UTC, 6 replies.
- Re: [Nutch Wiki] Update of "Committer's Rules" by AndrzejBialecki - posted by Doug Cutting <cu...@nutch.org> on 2005/08/31 18:45:30 UTC, 3 replies.
- Fw: PDF support? Does crawl parse pdf files? How do I get it work? - posted by Diane Palla <pa...@shu.edu> on 2005/08/31 21:39:47 UTC, 0 replies.
- [jira] Commented: (NUTCH-21) parser plugin for MS PowerPoint slides - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2005/08/31 21:53:09 UTC, 0 replies.
- Re: [jira] Commented: (NUTCH-65) index-more plugin can't parse large set of modification-date - posted by Jérôme Charron <je...@gmail.com> on 2005/08/31 23:19:40 UTC, 0 replies.
- [jira] Reopened: (NUTCH-65) index-more plugin can't parse large set of modification-date - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2005/08/31 23:33:13 UTC, 0 replies.
- [jira] Updated: (NUTCH-65) index-more plugin can't parse large set of modification-date - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2005/08/31 23:33:14 UTC, 1 replies.