You are viewing a plain text version of this content. The canonical link for it is here.
- Build failed in Jenkins: Nutch-trunk #1473 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/01 06:02:50 UTC, 0 replies.
- [jira] [Commented] (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed - posted by "Wim Mostrey (JIRA)" <ji...@apache.org> on 2011/05/01 15:39:03 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #1474 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/02 06:04:00 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-989) index-basic plugin doesn't use Solr date fieldType - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/02 10:33:03 UTC, 0 replies.
- [jira] [Updated] (NUTCH-989) index-basic plugin doesn't use Solr date fieldType - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/02 10:33:03 UTC, 0 replies.
- [jira] [Commented] (NUTCH-983) Upgrade SolrJ - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/02 11:02:03 UTC, 6 replies.
- [jira] [Updated] (NUTCH-710) Support for rel="canonical" attribute - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/02 11:12:03 UTC, 0 replies.
- [jira] [Updated] (NUTCH-717) Make Nutch Solr integration easier - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/02 11:14:03 UTC, 0 replies.
- [jira] [Commented] (NUTCH-783) IndexerChecker Utilty - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/02 11:16:03 UTC, 2 replies.
- [jira] [Updated] (NUTCH-783) IndexerChecker Utilty - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/02 14:38:03 UTC, 0 replies.
- [jira] [Issue Comment Edited] (NUTCH-983) Upgrade SolrJ - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/02 14:56:03 UTC, 0 replies.
- [jira] [Updated] (NUTCH-987) Support HTTP auth for Solr communication - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/02 15:55:03 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1475 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/03 06:04:01 UTC, 0 replies.
- Re: Nutch Web Interface - not anymore in 1.3 - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2011/05/03 06:08:07 UTC, 1 replies.
- [jira] [Created] (NUTCH-993) NullPointerException at FetcherOutputFormat.checkOutputSpecs - posted by "Christian Guegi (JIRA)" <ji...@apache.org> on 2011/05/03 12:11:03 UTC, 0 replies.
- [jira] [Updated] (NUTCH-993) NullPointerException at FetcherOutputFormat.checkOutputSpecs - posted by "Christian Guegi (JIRA)" <ji...@apache.org> on 2011/05/03 12:13:03 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #1476 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/04 06:02:47 UTC, 0 replies.
- [jira] [Updated] (NUTCH-888) Remove parse-rss - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/04 17:00:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-888) Remove parse-rss - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/04 17:22:03 UTC, 0 replies.
- Re: svn commit: r1099483 - in /nutch/branches/branch-1.3: ./ conf/ src/plugin/ src/plugin/parse-rss/ src/plugin/parse-tika/ src/plugin/parse-tika/sample/ src/plugin/parse-tika/src/test/org/apache/nutch/tika/ - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2011/05/04 17:26:59 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-888) Remove parse-rss - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/04 22:18:03 UTC, 0 replies.
- [jira] [Closed] (NUTCH-888) Remove parse-rss - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/04 22:20:03 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1477 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/05 06:03:10 UTC, 0 replies.
- Re: Update schema to get solrdedup working again - posted by Julien Nioche <li...@gmail.com> on 2011/05/05 15:34:56 UTC, 2 replies.
- [jira] [Resolved] (NUTCH-989) index-basic plugin doesn't use Solr date fieldType - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/05 15:51:03 UTC, 0 replies.
- [jira] [Closed] (NUTCH-989) index-basic plugin doesn't use Solr date fieldType - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/05 15:53:03 UTC, 0 replies.
- [jira] [Created] (NUTCH-994) Fine tune Solr schema - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/05 15:55:03 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-983) Upgrade SolrJ - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/05 19:58:03 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1478 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/06 06:04:39 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1479 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/07 06:03:13 UTC, 0 replies.
- [jira] [Created] (NUTCH-995) Generate POM file using the Ivy makepom task - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/07 07:59:03 UTC, 0 replies.
- [jira] [Updated] (NUTCH-995) Generate POM file using the Ivy makepom task - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/07 08:57:03 UTC, 1 replies.
- [jira] [Assigned] (NUTCH-995) Generate POM file using the Ivy makepom task - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/07 09:05:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-996) Indexer adds solr.commit.size+1 docs - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/08 03:44:03 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1480 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/08 06:03:09 UTC, 0 replies.
- Usefulness of site and host fields? - posted by Markus Jelsma <ma...@openindex.io> on 2011/05/08 12:20:27 UTC, 1 replies.
- Usefulness of cache field - posted by Markus Jelsma <ma...@openindex.io> on 2011/05/08 12:26:01 UTC, 3 replies.
- Return value of jobs - posted by Markus Jelsma <ma...@openindex.io> on 2011/05/09 00:18:01 UTC, 1 replies.
- [jira] [Commented] (NUTCH-887) Delegate parsing of feeds to Tika - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/09 00:30:03 UTC, 1 replies.
- [jira] [Closed] (NUTCH-912) MoreIndexingFilter does not parse docx and xlsx date formats - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/09 00:36:03 UTC, 0 replies.
- [jira] [Closed] (NUTCH-963) Add support for deleting Solr documents with STATUS_DB_GONE in CrawlDB (404 urls) - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/09 00:36:03 UTC, 0 replies.
- [jira] [Closed] (NUTCH-980) Fix IllegalAccessError with slf4j used in Solrj. - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/09 00:36:03 UTC, 0 replies.
- [jira] [Closed] (NUTCH-991) SolrDedup must issue a commit - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/09 00:36:03 UTC, 0 replies.
- [jira] [Closed] (NUTCH-977) SolrMappingReader uses hardcoded configuration parameter name for mapping file - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/09 00:36:03 UTC, 0 replies.
- [jira] [Closed] (NUTCH-976) Rename properties solrindex.* to solr.* - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/09 00:36:04 UTC, 0 replies.
- [jira] [Closed] (NUTCH-986) Dedup fails due to date format (long) - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/09 00:36:04 UTC, 0 replies.
- [jira] [Closed] (NUTCH-935) remove unnecessary /./ in basic urlnormalizer - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/09 00:36:04 UTC, 0 replies.
- [jira] [Closed] (NUTCH-964) ERROR conf.Configuration - Failed to set setXIncludeAware(true) - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/09 00:36:04 UTC, 0 replies.
- [jira] [Closed] (NUTCH-897) Subcollection requires blacklist element - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/09 00:36:04 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1481 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/09 06:03:19 UTC, 0 replies.
- found a nutch bug - posted by ldk_5370 <ld...@163.com> on 2011/05/09 12:08:01 UTC, 1 replies.
- [jira] [Commented] (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967) - posted by "Viksit Gaur (JIRA)" <ji...@apache.org> on 2011/05/10 01:14:03 UTC, 3 replies.
- [jira] [Assigned] (NUTCH-994) Fine tune Solr schema - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/10 01:43:03 UTC, 0 replies.
- [jira] [Updated] (NUTCH-994) Fine tune Solr schema - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/10 02:34:03 UTC, 1 replies.
- [jira] [Issue Comment Edited] (NUTCH-994) Fine tune Solr schema - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/10 02:36:03 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-996) Indexer adds solr.commit.size+1 docs - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/10 02:48:03 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1482 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/10 06:04:24 UTC, 0 replies.
- Re: 1.3 RC2? - posted by Markus Jelsma <ma...@openindex.io> on 2011/05/10 22:55:52 UTC, 5 replies.
- Build failed in Jenkins: Nutch-trunk #1483 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/11 06:02:02 UTC, 0 replies.
- [jira] [Commented] (NUTCH-985) MoreIndexingFilter doesn't use properly formatted date fields for Solr - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/11 15:15:47 UTC, 3 replies.
- [jira] [Updated] (NUTCH-961) Expose Tika's boilerpipe support - posted by "Gabriele Kahlout (JIRA)" <ji...@apache.org> on 2011/05/12 03:25:47 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #1484 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/12 06:12:10 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1485 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/13 06:02:20 UTC, 0 replies.
- [jira] [Commented] (NUTCH-993) NullPointerException at FetcherOutputFormat.checkOutputSpecs - posted by "Viksit Gaur (JIRA)" <ji...@apache.org> on 2011/05/13 20:43:47 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1486 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/14 06:03:18 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1487 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/15 09:03:09 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1488 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/16 06:03:06 UTC, 0 replies.
- How can I show all the hit lists? - posted by seso <su...@gmail.com> on 2011/05/16 14:38:59 UTC, 2 replies.
- Collecting Nutch use cases for talk @BerlinBuzzwords - posted by Julien Nioche <li...@gmail.com> on 2011/05/16 17:53:56 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1489 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/17 06:04:09 UTC, 0 replies.
- [jira] [Issue Comment Edited] (NUTCH-995) Generate POM file using the Ivy makepom task - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/17 09:58:47 UTC, 1 replies.
- [jira] [Commented] (NUTCH-995) Generate POM file using the Ivy makepom task - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/17 09:58:47 UTC, 9 replies.
- [jira] [Created] (NUTCH-997) IndexingFitlers to store Date objects instead of Strings - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/17 15:40:47 UTC, 0 replies.
- [jira] [Updated] (NUTCH-997) IndexingFitlers to store Date objects instead of Strings - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/17 18:25:47 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1490 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/18 06:11:21 UTC, 0 replies.
- [jira] [Commented] (NUTCH-997) IndexingFitlers to store Date objects instead of Strings - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/18 13:27:48 UTC, 2 replies.
- [jira] [Created] (NUTCH-998) index-basic should use filename if title is empty - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/18 13:31:47 UTC, 0 replies.
- [jira] [Commented] (NUTCH-998) index-basic should use filename if title is empty - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/18 13:41:47 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-997) IndexingFitlers to store Date objects instead of Strings - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/18 13:57:47 UTC, 0 replies.
- [jira] [Created] (NUTCH-999) IndexingFitlers to store Date objects instead of Strings - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/18 14:01:51 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-985) MoreIndexingFilter doesn't use properly formatted date fields for Solr - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/18 14:05:47 UTC, 0 replies.
- [jira] [Updated] (NUTCH-999) Normalise String representation for Dates in IndexingFilters - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/18 14:15:47 UTC, 1 replies.
- [jira] [Commented] (NUTCH-994) Fine tune Solr schema - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/18 14:27:47 UTC, 4 replies.
- Build failed in Jenkins: Nutch-trunk #1491 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/19 06:11:35 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1492 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/20 06:18:48 UTC, 0 replies.
- [jira] [Created] (NUTCH-1000) Add option not to commit to Solr - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/20 13:42:47 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #1493 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/21 06:02:09 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1494 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/22 06:01:11 UTC, 0 replies.
- [jira] [Commented] (NUTCH-892) nutch maven build support - posted by "Gabriele Kahlout (JIRA)" <ji...@apache.org> on 2011/05/22 10:45:47 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #1495 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/23 06:02:54 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-999) Normalise String representation for Dates in IndexingFilters - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/23 12:34:47 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-994) Fine tune Solr schema - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/23 12:50:47 UTC, 0 replies.
- [jira] [Closed] (NUTCH-988) index-feed plugin also doesn't use proper date fields - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/05/23 13:22:47 UTC, 0 replies.
- [jira] [Closed] (NUTCH-990) protocol-httpclient fails with short pages - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/23 18:50:47 UTC, 0 replies.
- [jira] [Closed] (NUTCH-996) Indexer adds solr.commit.size+1 docs - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/23 18:50:47 UTC, 0 replies.
- [jira] [Closed] (NUTCH-985) MoreIndexingFilter doesn't use properly formatted date fields for Solr - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/05/23 18:50:47 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1496 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/24 06:12:23 UTC, 0 replies.
- Nutch crawler problem - posted by ian paulo ilagan <ip...@yahoo.com> on 2011/05/24 14:18:03 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1497 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/25 06:00:51 UTC, 0 replies.
- Nutch bug - assumption of HDFS in CrawlDb.java even if using other file systems like S3 - posted by Viksit Gaur <vi...@gmail.com> on 2011/05/25 20:02:21 UTC, 2 replies.
- Build failed in Jenkins: Nutch-trunk #1498 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/26 06:12:13 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1499 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/27 06:03:06 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1500 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/28 06:13:22 UTC, 0 replies.
- [jira] [Commented] (NUTCH-828) Fetch Filter - posted by "Zain Us Sami Ahmed (JIRA)" <ji...@apache.org> on 2011/05/28 10:18:47 UTC, 0 replies.
- Filtering Based on Content - posted by zainussami <za...@gmail.com> on 2011/05/28 10:23:38 UTC, 0 replies.
- Re: Skipping certain URLs - posted by mmartinek <mi...@gmail.com> on 2011/05/29 02:14:23 UTC, 0 replies.
- RegEx Domain/URL matching - posted by mmartinek <mi...@gmail.com> on 2011/05/29 02:23:37 UTC, 0 replies.
- Re: Nutch: Connection Exception - posted by mmartinek <mi...@gmail.com> on 2011/05/29 02:34:27 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1501 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/29 06:06:42 UTC, 0 replies.
- Keysigning @ Berlin Buzzwords - posted by Thomas Koch <th...@koch.ro> on 2011/05/29 13:46:43 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1502 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/30 06:13:24 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1503 - posted by Apache Jenkins Server <hu...@hudson.apache.org> on 2011/05/31 06:03:03 UTC, 0 replies.
- Nutch Fetch failure on Elastic Mapreduce - posted by Viksit Gaur <vi...@gmail.com> on 2011/05/31 08:56:09 UTC, 0 replies.
- Questions about Jena in Nutch - posted by lfs <fa...@hotmail.com> on 2011/05/31 16:51:52 UTC, 1 replies.