You are viewing a plain text version of this content. The canonical link for it is here.
- Re: some question about development - posted by lv david <da...@gmail.com> on 2007/12/01 12:20:54 UTC, 0 replies.
- Re: Image Search Engine Input - posted by Trey Spiva <tr...@sun.com> on 2007/12/02 03:30:35 UTC, 0 replies.
- [jira] Commented: (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed - posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org> on 2007/12/02 18:26:43 UTC, 2 replies.
- [jira] Commented: (NUTCH-442) Integrate Solr/Nutch - posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org> on 2007/12/02 22:36:43 UTC, 2 replies.
- [jira] Commented: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/12/03 21:20:43 UTC, 4 replies.
- [jira] Updated: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/12/03 21:26:43 UTC, 1 replies.
- Task process exit with nonzero status of 65 - posted by Ned Rockson <ne...@discoveryengine.com> on 2007/12/03 22:36:37 UTC, 0 replies.
- [jira] Created: (NUTCH-587) Upgrade Nutch to use Hadoop 0.15.1 release - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/12/04 00:35:43 UTC, 0 replies.
- [jira] Updated: (NUTCH-587) Upgrade Nutch to use Hadoop 0.15.1 release - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/12/04 00:39:43 UTC, 0 replies.
- [jira] Commented: (NUTCH-586) Add option to run compiled classes w/o job file - posted by "Enis Soztutar (JIRA)" <ji...@apache.org> on 2007/12/04 10:59:43 UTC, 2 replies.
- [jira] Updated: (NUTCH-586) Add option to run compiled classes w/o job file - posted by "Enis Soztutar (JIRA)" <ji...@apache.org> on 2007/12/04 14:21:43 UTC, 0 replies.
- [jira] Created: (NUTCH-588) Help Need - posted by "Teccon Ingenieros (JIRA)" <ji...@apache.org> on 2007/12/04 17:40:43 UTC, 0 replies.
- [jira] Resolved: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/12/04 20:15:43 UTC, 0 replies.
- [jira] Created: (NUTCH-589) Hierarchical Classloaders - posted by "Ryan Levering (JIRA)" <ji...@apache.org> on 2007/12/05 01:04:43 UTC, 0 replies.
- Nutch\nutch-0.9\build.xml:61: Specify at least one source--a file or resource collection. - posted by quxy <qu...@act.buaa.edu.cn> on 2007/12/05 05:06:29 UTC, 0 replies.
- [jira] Created: (NUTCH-590) Index multiple docs per call using IndexingFilter extension point - posted by "Nathaniel Powell (JIRA)" <ji...@apache.org> on 2007/12/06 01:59:43 UTC, 0 replies.
- Filter spam URLs - posted by Ned Rockson <ne...@discoveryengine.com> on 2007/12/07 02:14:11 UTC, 1 replies.
- [jira] Resolved: (NUTCH-588) Help Need - posted by "Enis Soztutar (JIRA)" <ji...@apache.org> on 2007/12/07 11:46:43 UTC, 0 replies.
- [jira] Commented: (NUTCH-587) Upgrade Nutch to use Hadoop 0.15.1 release - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/12/11 00:46:43 UTC, 1 replies.
- fnm frq like files are not creating while crwaling some site - posted by patil <sb...@yahoo.co.in> on 2007/12/12 10:59:06 UTC, 0 replies.
- [jira] Commented: (NUTCH-559) NTLM, Basic and Digest Authentication schemes for web/proxy server - posted by "Les Cheong (JIRA)" <ji...@apache.org> on 2007/12/12 20:34:43 UTC, 1 replies.
- cached.jsp for the new dev-version - posted by "Neumann, Vladimir" <Vl...@sbb.spk-berlin.de> on 2007/12/13 11:24:21 UTC, 1 replies.
- [jira] Created: (NUTCH-591) StringIndexOutOfBoundsException when extracting text from a Word document. - posted by "frank ling (JIRA)" <ji...@apache.org> on 2007/12/14 01:47:43 UTC, 0 replies.
- files are not generated in index folder by indexer for the site http://www.traguiden.se(for other sites its working good) while crwaling - posted by patil <sb...@yahoo.co.in> on 2007/12/14 07:25:03 UTC, 0 replies.
- [jira] Created: (NUTCH-592) Fetcher2 : NPE for page with status ProtocolStatus.TEMP_MOVED - posted by "Emmanuel Joke (JIRA)" <ji...@apache.org> on 2007/12/16 16:23:43 UTC, 0 replies.
- [jira] Updated: (NUTCH-592) Fetcher2 : NPE for page with status ProtocolStatus.TEMP_MOVED - posted by "Emmanuel Joke (JIRA)" <ji...@apache.org> on 2007/12/16 16:23:43 UTC, 0 replies.
- [jira] Resolved: (NUTCH-586) Add option to run compiled classes w/o job file - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2007/12/17 19:24:43 UTC, 0 replies.
- [jira] Commented: (NUTCH-579) Feed plugin only indexes one post per feed due to identical digest - posted by "Joseph Chen (JIRA)" <ji...@apache.org> on 2007/12/19 00:34:43 UTC, 0 replies.
- [jira] Created: (NUTCH-593) Nutch crawl problem - posted by "sudarat (JIRA)" <ji...@apache.org> on 2007/12/19 03:49:43 UTC, 0 replies.
- Hudson Upgrade Dec 19 - posted by Nigel Daley <nd...@yahoo-inc.com> on 2007/12/19 07:59:09 UTC, 1 replies.
- errors compiling index-extra - posted by Peter Boot <pe...@gmail.com> on 2007/12/21 05:25:11 UTC, 0 replies.
- [jira] Created: (NUTCH-594) Serve Nutch search results in XML and JSON - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/12/21 18:10:43 UTC, 0 replies.
- [jira] Updated: (NUTCH-594) Serve Nutch search results in XML and JSON - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/12/21 18:18:43 UTC, 0 replies.
- [jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic - posted by "Peter Boot (JIRA)" <ji...@apache.org> on 2007/12/21 22:17:43 UTC, 0 replies.
- scoring algorithm - posted by Lirida Kercelli <li...@gmail.com> on 2007/12/23 15:00:09 UTC, 0 replies.
- Enable Nutch to search for local file system - posted by Torontoer <be...@yahoo.com> on 2007/12/24 04:33:13 UTC, 0 replies.
- [jira] Commented: (NUTCH-575) NPE in OpenSearchServlet when summary is null - posted by "Hudson (JIRA)" <ji...@apache.org> on 2007/12/25 05:19:43 UTC, 0 replies.
- [jira] Commented: (NUTCH-528) CrawlDbReader: add some new stats + dump into a csv format - posted by "Emmanuel Joke (JIRA)" <ji...@apache.org> on 2007/12/27 11:30:43 UTC, 1 replies.
- [jira] Created: (NUTCH-595) "Target file:/.... already exists" - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2007/12/27 14:08:44 UTC, 0 replies.
- [jira] Commented: (NUTCH-595) "Target file:/.... already exists" - posted by "Emmanuel Joke (JIRA)" <ji...@apache.org> on 2007/12/27 14:21:43 UTC, 0 replies.
- [jira] Commented: (NUTCH-534) SegmentMerger: add -normalize option - posted by "Emmanuel Joke (JIRA)" <ji...@apache.org> on 2007/12/27 14:26:43 UTC, 1 replies.
- [jira] Updated: (NUTCH-528) CrawlDbReader: add some new stats + dump into a csv format - posted by "Emmanuel Joke (JIRA)" <ji...@apache.org> on 2007/12/28 03:59:43 UTC, 0 replies.
- Build failed in Hudson: Nutch-Nightly #307 - posted by hu...@lucene.zones.apache.org on 2007/12/28 06:08:19 UTC, 0 replies.
- nutch internet crawling help - posted by NIDHI MALIK <mm...@cse.iitb.ac.in> on 2007/12/28 12:28:04 UTC, 0 replies.
- Build failed in Hudson: Nutch-Nightly #308 - posted by hu...@lucene.zones.apache.org on 2007/12/29 05:10:45 UTC, 0 replies.
- Hudson build is back to normal: Nutch-Nightly #309 - posted by hu...@lucene.zones.apache.org on 2007/12/29 06:33:27 UTC, 0 replies.
- [jira] Created: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS - posted by "Emmanuel Joke (JIRA)" <ji...@apache.org> on 2007/12/30 10:52:43 UTC, 0 replies.
- [jira] Commented: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2007/12/30 12:16:43 UTC, 0 replies.
- [jira] Created: (NUTCH-597) Fetcher2 - java.lang.NullPointerException when host does not exist and fetcher.threads.per.host.by.ip is set to true causes threads to finish. - posted by "Remco Verhoef (JIRA)" <ji...@apache.org> on 2007/12/30 17:29:43 UTC, 0 replies.
- [jira] Updated: (NUTCH-597) Fetcher2 - java.lang.NullPointerException when host does not exist and fetcher.threads.per.host.by.ip is set to true causes threads to finish. - posted by "Remco Verhoef (JIRA)" <ji...@apache.org> on 2007/12/30 17:31:43 UTC, 0 replies.
- Build failed in Hudson: Nutch-Nightly #311 - posted by hu...@lucene.zones.apache.org on 2007/12/31 05:34:39 UTC, 0 replies.