- [jira] [Updated] (NUTCH-1294) IndexClean job with solr implementation. - posted by "Claudiu Chis (JIRA)" <ji...@apache.org> on 2013/08/01 02:31:48 UTC, 0 replies.
- [jira] [Issue Comment Deleted] (NUTCH-1406) index-metadata plugin: conversion to Solr date format - posted by "Antoinette (JIRA)" <ji...@apache.org> on 2013/08/01 21:01:50 UTC, 0 replies.
- [jira] [Created] (NUTCH-1619) Writes Dmoz Description and Title information to db with snippet argument - posted by "Yasin Kılınç (JIRA)" <ji...@apache.org> on 2013/08/02 14:51:49 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1619) Writes Dmoz Description and Title information to db with snippet argument - posted by "Yasin Kılınç (JIRA)" <ji...@apache.org> on 2013/08/02 14:53:48 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1486) Upgrade to Solr 4.3.0 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/08/06 04:06:50 UTC, 0 replies.
- [jira] [Created] (NUTCH-1620) log how many URLs are generated and contained within a particular batchId - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/08/07 00:25:49 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1377) Add option to index via CloudSolrServer instead - posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org> on 2013/08/07 02:28:52 UTC, 1 replies.
- [jira] [Commented] (NUTCH-945) Indexing to multiple SOLR Servers - posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org> on 2013/08/07 02:30:48 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-945) Indexing to multiple SOLR Servers - posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org> on 2013/08/07 02:30:49 UTC, 0 replies.
- [jira] [Closed] (NUTCH-945) Indexing to multiple SOLR Servers - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2013/08/07 09:35:56 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1480) SolrIndexer to write to multiple servers. - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2013/08/07 09:37:49 UTC, 0 replies.
- subscribe - posted by Richard Bergmann <RB...@colsa.com> on 2013/08/07 18:44:49 UTC, 0 replies.
- Feed Plugin Crawl Links - posted by Richard Bergmann <RB...@colsa.com> on 2013/08/07 18:55:22 UTC, 4 replies.
- [jira] [Updated] (NUTCH-1483) Can't crawl filesystem with protocol-file plugin - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/08/07 19:26:49 UTC, 0 replies.
- [jira] [Commented] (NUTCH-911) recrawls file protocol causes Errors/Exceptions when actually not modified or gone - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/08/07 22:39:47 UTC, 2 replies.
- [jira] [Updated] (NUTCH-911) recrawls file protocol causes Errors/Exceptions when actually not modified or gone - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/08/07 22:55:50 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-911) recrawls file protocol causes Errors/Exceptions when actually not modified or gone - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/08/07 23:11:47 UTC, 0 replies.
- [jira] [Created] (NUTCH-1621) Deprecated class o.a.n.crawl.Crawler is still in code base - posted by "Rui Gao (JIRA)" <ji...@apache.org> on 2013/08/08 15:24:48 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1621) Deprecated class o.a.n.crawl.Crawler is still in code base - posted by "Rui Gao (JIRA)" <ji...@apache.org> on 2013/08/08 15:54:48 UTC, 1 replies.
- [jira] [Created] (NUTCH-1622) Create Outlinks with metadata - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2013/08/08 17:04:47 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1622) Create Outlinks with metadata - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2013/08/08 17:07:00 UTC, 5 replies.
- [jira] [Commented] (NUTCH-1294) IndexClean job with solr implementation. - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/08/11 03:18:47 UTC, 7 replies.
- [jira] [Commented] (NUTCH-1621) Deprecated class o.a.n.crawl.Crawler is still in code base - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/08/11 03:22:47 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1325) HostDB for Nutch - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/08/12 01:04:47 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2316 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/12 06:04:09 UTC, 0 replies.
- Converting HTML text in org.apache.nutch.protocol.Content to String - posted by byte array <by...@gmail.com> on 2013/08/12 18:50:47 UTC, 2 replies.
- [jira] [Created] (NUTCH-1623) Implement file.content.ignored function - posted by "Osy (JIRA)" <ji...@apache.org> on 2013/08/13 00:23:47 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2317 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/13 06:10:45 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1294) IndexClean job with solr implementation. - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/08/13 17:29:50 UTC, 0 replies.
- [jira] [Created] (NUTCH-1624) Type in WebTableReader line 486 - posted by "kaveh minooie (JIRA)" <ji...@apache.org> on 2013/08/14 00:40:48 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1624) Typo in WebTableReader line 486 - posted by "kaveh minooie (JIRA)" <ji...@apache.org> on 2013/08/14 00:42:47 UTC, 1 replies.
- Jenkins build is back to normal : Nutch-trunk #2318 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/14 06:10:39 UTC, 0 replies.
- Reading additional metadata field: mtdt:_hr_ - posted by Ahmet Emre Aladağ <em...@agmlab.com> on 2013/08/14 22:23:47 UTC, 1 replies.
- minor typo in "What is Apache Nutch?" section - posted by Andrew Pennebaker <ap...@42six.com> on 2013/08/15 17:13:24 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1598) ElasticSearchIndexer to read ImmutableSettings from config - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/08/15 18:34:56 UTC, 0 replies.
- crawl.gen.delay - posted by kaveh minooie <ka...@plutoz.com> on 2013/08/15 21:17:29 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1321) IDNNormalizer - posted by "İlhami KALKAN (JIRA)" <ji...@apache.org> on 2013/08/16 11:05:47 UTC, 2 replies.
- [jira] [Commented] (NUTCH-1321) IDNNormalizer - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2013/08/16 13:41:48 UTC, 0 replies.
- [jira] [Created] (NUTCH-1625) IndexerMapReduce skips FETCH_NOTMODIFIED - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2013/08/16 16:00:48 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1625) IndexerMapReduce skips FETCH_NOTMODIFIED - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2013/08/16 16:00:48 UTC, 0 replies.
- problem with running 2.x in eclipse - posted by kaveh minooie <ka...@plutoz.com> on 2013/08/17 01:11:02 UTC, 1 replies.
- Jenkins build is back to normal : Nutch-trunk #2321 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/17 06:09:13 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #722 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/18 06:03:21 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1413) Fetcher to record response time - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/08/19 00:43:47 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1413) Fetcher to record response time - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/08/19 00:43:48 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1619) Writes Dmoz Description and Title information to db with snippet argument - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/08/19 00:45:48 UTC, 9 replies.
- [jira] [Resolved] (NUTCH-1624) Typo in WebTableReader line 486 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/08/19 01:04:47 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1623) Implement file.content.ignored function - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/08/19 01:06:48 UTC, 1 replies.
- Jenkins build is back to normal : Nutch-nutchgora #723 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/19 01:42:11 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1624) Typo in WebTableReader line 486 - posted by "Hudson (JIRA)" <ji...@apache.org> on 2013/08/19 01:42:47 UTC, 0 replies.
- nofollow behaviour [#NUTCH-693] - posted by "Santiago M. Mola" <co...@gmail.com> on 2013/08/19 18:58:56 UTC, 0 replies.
- [jira] [Created] (NUTCH-1626) Homebrew formula for installing Nutch in Mac OS X - posted by "Andrew Pennebaker (JIRA)" <ji...@apache.org> on 2013/08/19 22:21:53 UTC, 0 replies.
- [jira] [Created] (NUTCH-1627) Debian package for installing nutch - posted by "Andrew Pennebaker (JIRA)" <ji...@apache.org> on 2013/08/19 22:25:47 UTC, 0 replies.
- [jira] [Created] (NUTCH-1628) Chocolatey package for Windows users - posted by "Andrew Pennebaker (JIRA)" <ji...@apache.org> on 2013/08/19 22:25:53 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1477) NPE when injecting with DataFileAvroStore - posted by "Alex McLintock (JIRA)" <ji...@apache.org> on 2013/08/20 19:50:52 UTC, 4 replies.
- [jira] [Created] (NUTCH-1629) there is no need to fail on empty lines in seed file when injecting. - posted by "kaveh minooie (JIRA)" <ji...@apache.org> on 2013/08/21 01:13:51 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1629) there is no need to fail on empty lines in seed file when injecting. - posted by "kaveh minooie (JIRA)" <ji...@apache.org> on 2013/08/21 01:13:53 UTC, 6 replies.
- Nutch2.2 FeedParser and FeedIndexingFilter can not find ParseResult - posted by "Jonathan.Wei" <25...@qq.com> on 2013/08/21 05:10:57 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1629) there is no need to fail on empty lines in seed file when injecting. - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2013/08/21 10:28:52 UTC, 7 replies.
- [jira] [Created] (NUTCH-1630) How to achieve finishing fetch approximately at the same time for each queue (a.k.a adaptive queue size) - posted by "Talat UYARER (JIRA)" <ji...@apache.org> on 2013/08/21 12:45:21 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1630) How to achieve finishing fetch approximately at the same time for each queue (a.k.a adaptive queue size) - posted by "Talat UYARER (JIRA)" <ji...@apache.org> on 2013/08/21 12:51:01 UTC, 0 replies.
- [jira] [Created] (NUTCH-1631) Display Document Count Added To Solr Server - posted by "Furkan KAMACI (JIRA)" <ji...@apache.org> on 2013/08/21 18:52:52 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1631) Display Document Count Added To Solr Server - posted by "Furkan KAMACI (JIRA)" <ji...@apache.org> on 2013/08/21 19:02:51 UTC, 1 replies.
- Re: Wiki User - posted by Lewis John Mcgibbney <le...@gmail.com> on 2013/08/21 21:44:45 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1517) CloudSearch indexer - posted by "Daniel Ciborowski (JIRA)" <ji...@apache.org> on 2013/08/22 23:13:51 UTC, 0 replies.
- [jira] [Issue Comment Deleted] (NUTCH-1517) CloudSearch indexer - posted by "Daniel Ciborowski (JIRA)" <ji...@apache.org> on 2013/08/22 23:21:52 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1629) there is no need to fail on empty lines in seed file when injecting. - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2013/08/23 10:53:52 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1631) Display Document Count Added To Solr Server - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/08/23 16:50:51 UTC, 2 replies.
- [jira] [Resolved] (NUTCH-1631) Display Document Count Added To Solr Server - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/08/23 21:49:53 UTC, 0 replies.
- [jira] [Commented] (NUTCH-693) Add configurable option for treating nofollow behaviour. - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/08/23 22:21:52 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1619) Writes Dmoz Description and Title information to db with snippet argument - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/08/24 17:23:53 UTC, 1 replies.
- Build failed in Jenkins: Nutch-nutchgora #732 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/24 17:39:42 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #733 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/24 18:44:12 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2330 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/25 06:01:46 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #734 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/25 06:01:55 UTC, 0 replies.
- [jira] [Reopened] (NUTCH-1619) Writes Dmoz Description and Title information to db with snippet argument - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2013/08/25 10:53:53 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1375) extract main content of a html file - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2013/08/25 17:52:51 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2331 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/26 06:01:40 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #735 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/26 06:04:05 UTC, 0 replies.
- http.content.limit - posted by cihat güzel <c....@gmail.com> on 2013/08/26 10:00:43 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1317) Max content length by MIME-type - posted by "cihad güzel (JIRA)" <ji...@apache.org> on 2013/08/26 12:59:51 UTC, 2 replies.
- [jira] [Comment Edited] (NUTCH-1317) Max content length by MIME-type - posted by "cihad güzel (JIRA)" <ji...@apache.org> on 2013/08/26 13:01:52 UTC, 0 replies.
- NUTCH-1317 patch - posted by cihat güzel <c....@gmail.com> on 2013/08/26 15:04:16 UTC, 1 replies.
- [jira] [Created] (NUTCH-1632) add batchId argument for DbUpdaterJob - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/08/26 16:59:52 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1632) add batchId argument for DbUpdaterJob - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/08/26 17:03:53 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1632) add batchId argument for DbUpdaterJob - posted by "kaveh minooie (JIRA)" <ji...@apache.org> on 2013/08/26 19:21:53 UTC, 1 replies.
- [jira] [Created] (NUTCH-1633) slf4j is provided by hadoop and should not be included in the job file. - posted by "kaveh minooie (JIRA)" <ji...@apache.org> on 2013/08/26 20:43:51 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1633) slf4j is provided by hadoop and should not be included in the job file. - posted by "kaveh minooie (JIRA)" <ji...@apache.org> on 2013/08/26 20:46:07 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1556) enabling updatedb to accept batchId - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/08/26 20:49:51 UTC, 2 replies.
- [jira] [Closed] (NUTCH-1632) add batchId argument for DbUpdaterJob - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/08/27 02:23:53 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #2332 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/27 06:10:56 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1556) enabling updatedb to accept batchId - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/08/27 15:22:51 UTC, 1 replies.
- Build failed in Jenkins: Nutch-nutchgora #737 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/28 06:04:57 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1622) Create Outlinks with metadata - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2013/08/28 17:16:52 UTC, 3 replies.
- [jira] [Updated] (NUTCH-1562) Order of execution for scoring filters - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2013/08/28 17:49:03 UTC, 0 replies.
- [jira] [Created] (NUTCH-1634) readdb -stats show the result twice - posted by "kaveh minooie (JIRA)" <ji...@apache.org> on 2013/08/28 20:40:54 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1634) readdb -stats show the result twice - posted by "kaveh minooie (JIRA)" <ji...@apache.org> on 2013/08/28 20:48:53 UTC, 1 replies.
- Jenkins build is back to normal : Nutch-nutchgora #738 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/29 06:05:59 UTC, 0 replies.
- Wiki entry: Tutorials on Nutch - posted by Carmen Klaussner <di...@gmail.com> on 2013/08/29 11:15:59 UTC, 1 replies.
- [Nutch Wiki] Update of "ContributorsGroup" by SebastianNagel - posted by Apache Wiki <wi...@apache.org> on 2013/08/29 22:25:19 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #739 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/30 06:04:42 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2336 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/30 06:10:06 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1610) Can't run individual unit tests for plugins in nutch 2.x - posted by "Brian (JIRA)" <ji...@apache.org> on 2013/08/30 21:19:53 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #740 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/31 06:06:54 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2337 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/08/31 06:09:23 UTC, 0 replies.