- Build failed in Jenkins: Nutch-nutchgora #1023 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/01 06:02:16 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1024 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/02 06:09:02 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1025 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/03 06:54:13 UTC, 0 replies.
- [jira] [Created] (NUTCH-1791) Null pointer exceptions with gora-cassandra-0.4 - posted by "Koen Smets (JIRA)" <ji...@apache.org> on 2014/06/03 12:22:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1791) Null pointer exceptions with gora-cassandra-0.4 - posted by "Koen Smets (JIRA)" <ji...@apache.org> on 2014/06/03 22:54:01 UTC, 3 replies.
- [jira] [Commented] (NUTCH-1622) Create Outlinks with metadata - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/04 01:26:01 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1789) Migrate Nutch site to Apache CMS - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/04 03:14:02 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1026 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/04 06:07:17 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1714) Nutch 2.x upgrade to Gora 0.4 - posted by "Amitabh Ranjan (JIRA)" <ji...@apache.org> on 2014/06/04 12:31:03 UTC, 1 replies.
- [jira] [Issue Comment Deleted] (NUTCH-1714) Nutch 2.x upgrade to Gora 0.4 - posted by "Amitabh Ranjan (JIRA)" <ji...@apache.org> on 2014/06/04 13:20:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1785) Ability to index raw content - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2014/06/04 15:01:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1781) Update gora-*-mapping.xml and gora.proeprties to reflect Gora 0.4 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/05 02:20:01 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1781) Update gora-*-mapping.xml and gora.proeprties to reflect Gora 0.4 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/05 02:20:01 UTC, 1 replies.
- [jira] [Work started] (NUTCH-1781) Update gora-*-mapping.xml and gora.proeprties to reflect Gora 0.4 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/05 02:20:01 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1781) Update gora-*-mapping.xml and gora.proeprties to reflect Gora 0.4 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/05 02:20:01 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1027 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/05 03:22:09 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1781) Update gora-*-mapping.xml and gora.proeprties to reflect Gora 0.4 - posted by "Hudson (JIRA)" <ji...@apache.org> on 2014/06/05 03:23:02 UTC, 2 replies.
- Build failed in Jenkins: Nutch-nutchgora #1028 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/05 04:33:35 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1788) Tika may return multiple values for Title on PDF's - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/05 05:01:06 UTC, 0 replies.
- [jira] [Work started] (NUTCH-1788) Tika may return multiple values for Title on PDF's - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/05 05:01:06 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1788) Tika may return multiple values for Title on PDF's - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/05 05:01:06 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1788) Tika may return multiple values for Title on PDF's - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/05 05:01:07 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1788) Tika may return multiple values for Title on PDF's - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/05 05:01:07 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1029 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/05 05:42:32 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1788) Tika may return multiple values for Title on PDF's - posted by "Hudson (JIRA)" <ji...@apache.org> on 2014/06/05 05:43:01 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1030 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/05 06:07:39 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1782) NodeWalker to return current node - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2014/06/05 10:36:02 UTC, 1 replies.
- Build failed in Jenkins: Nutch-nutchgora #1031 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/05 18:05:51 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1782) NodeWalker to return current node - posted by "Hudson (JIRA)" <ji...@apache.org> on 2014/06/05 18:20:01 UTC, 2 replies.
- [jira] [Reopened] (NUTCH-1782) NodeWalker to return current node - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2014/06/05 23:06:02 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1782) NodeWalker to return current node - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2014/06/05 23:06:02 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1032 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/06 06:07:25 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1684) ParseMeta to be added before fetch schedulers are run - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2014/06/06 12:07:01 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1033 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/06 12:44:56 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1034 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/07 06:05:28 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1035 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/08 06:05:25 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2648 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/08 06:05:25 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1708) use same id when indexing and deleting redirects - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2014/06/08 22:19:01 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1036 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/09 06:07:34 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2649 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/09 06:13:40 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1769) API refactoring - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/09 17:24:03 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1769) API refactoring - posted by "Fjodor Vershinin (JIRA)" <ji...@apache.org> on 2014/06/09 21:03:05 UTC, 1 replies.
- Build failed in Jenkins: Nutch-nutchgora #1037 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/10 06:05:54 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #2650 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/10 06:12:13 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1745) Upgrade to ElasticSearch 1.1.0 - posted by "Dafydd James (JIRA)" <ji...@apache.org> on 2014/06/10 10:12:01 UTC, 1 replies.
- Sending parse data from one generate-fetch-update cycle to another one - posted by Ali Nazemian <al...@gmail.com> on 2014/06/10 12:55:42 UTC, 0 replies.
- ApacheCon CFP closes June 25 - posted by Julien Nioche <li...@gmail.com> on 2014/06/10 23:22:35 UTC, 0 replies.
- [jira] [Work started] (NUTCH-1789) Migrate Nutch site to Apache CMS - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/11 06:09:01 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1789) Migrate Nutch site to Apache CMS - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/11 06:09:02 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1789) Migrate Nutch site to Apache CMS - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/11 06:11:01 UTC, 0 replies.
- New Apache Nutch Site - posted by Lewis John Mcgibbney <le...@gmail.com> on 2014/06/11 06:13:37 UTC, 2 replies.
- Build failed in Jenkins: Nutch-nutchgora #1038 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/11 06:23:52 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1084) ReadDB url throws exception - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/11 17:24:01 UTC, 2 replies.
- [jira] [Resolved] (NUTCH-1458) Support for raw HTML field added to Solr - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/11 17:24:02 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1647) protocol-http throws unzipBestEffort returned null for some pages - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/11 17:42:03 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1270) some of Deflate encoded pages not fetched - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/11 17:44:02 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1736) Can't fetch page if http response header contains Transfer-Encoding:chunked - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/11 17:52:02 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1736) Can't fetch page if http response header contains Transfer-Encoding:chunked - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/11 17:52:04 UTC, 3 replies.
- [jira] [Resolved] (NUTCH-1736) Can't fetch page if http response header contains Transfer-Encoding:chunked - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/11 17:58:03 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1039 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/11 18:43:21 UTC, 0 replies.
- Travel assistance for ApacheCon EU, Budapest November 17-21 2014 - posted by Julien Nioche <li...@gmail.com> on 2014/06/11 21:23:39 UTC, 0 replies.
- Fwd: svn commit: r1602053 - /infrastructure/site-tools/trunk/projects/files.xml - posted by sebb AT ASF <se...@apache.org> on 2014/06/12 02:35:43 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1040 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/12 06:20:57 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1614) Plugin to exclude URLs matching regex list from indexing - to enable crawl but do not index - posted by "Riyaz Shaik (JIRA)" <ji...@apache.org> on 2014/06/12 09:58:01 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1614) Plugin to exclude URLs matching regex list from indexing - to enable crawl but do not index - posted by "Riyaz Shaik (JIRA)" <ji...@apache.org> on 2014/06/12 11:08:03 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-1614) Plugin to exclude URLs matching regex list from indexing - to enable crawl but do not index - posted by "Riyaz Shaik (JIRA)" <ji...@apache.org> on 2014/06/12 11:10:01 UTC, 2 replies.
- [jira] [Comment Edited] (NUTCH-1791) Null pointer exceptions with gora-cassandra-0.4 - posted by "Koen Smets (JIRA)" <ji...@apache.org> on 2014/06/12 13:17:02 UTC, 1 replies.
- [jira] [Created] (NUTCH-1792) Refactor resource loading in plugin tests - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/12 16:01:26 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1792) Refactor resource loading in plugin tests - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/12 16:07:02 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1041 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/13 06:33:58 UTC, 0 replies.
- [jira] [Reopened] (NUTCH-1647) protocol-http throws unzipBestEffort returned null for some pages - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2014/06/13 10:00:16 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-1647) protocol-http throws unzipBestEffort returned null for some pages - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2014/06/13 10:00:20 UTC, 2 replies.
- [jira] [Commented] (NUTCH-1647) protocol-http throws unzipBestEffort returned null for some pages - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/13 11:37:02 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1647) protocol-http throws unzipBestEffort returned null for some pages - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/13 13:01:03 UTC, 2 replies.
- [jira] [Updated] (NUTCH-1647) protocol-http throws 'unzipBestEffort returned null' for redirected pages - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/13 13:17:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1647) protocol-http throws 'unzipBestEffort returned null' for redirected pages - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/13 13:19:02 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1647) protocol-http throws 'unzipBestEffort returned null' for redirected pages - posted by "Hudson (JIRA)" <ji...@apache.org> on 2014/06/13 13:55:02 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1042 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/14 06:05:40 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1043 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/15 06:07:13 UTC, 0 replies.
- nutch elpais.com - posted by Yann Levreau <ya...@gmail.com> on 2014/06/15 11:20:31 UTC, 3 replies.
- Build failed in Jenkins: Nutch-nutchgora #1044 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/16 06:07:22 UTC, 0 replies.
- [jira] [Created] (NUTCH-1793) HttpRobotRulesParser not configured properly => "http.robots.403.allow" property is not read - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/16 17:30:03 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1045 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/17 06:07:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1793) HttpRobotRulesParser not configured properly => "http.robots.403.allow" property is not read - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/17 10:00:20 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1793) HttpRobotRulesParser not configured properly => "http.robots.403.allow" property is not read - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2014/06/17 10:23:03 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1793) HttpRobotRulesParser not configured properly => "http.robots.403.allow" property is not read - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/17 10:43:01 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1269) Generate main problems - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/17 10:53:02 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1269) Improve distribution of URLS with multi-segment generation - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/17 10:55:01 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1492) Support gora-dynamodb in Nutch 2.x - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/17 13:12:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1633) slf4j is provided by hadoop and should not be included in the job file. - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/17 13:16:03 UTC, 2 replies.
- [jira] [Updated] (NUTCH-1220) Upgrade Solr deps - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/17 13:18:03 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1285) Debian Packaging for Nutch - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/17 13:22:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1590) [SECURITY] Frame injection vulnerability in published Javadoc - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/17 13:26:01 UTC, 4 replies.
- [jira] [Created] (NUTCH-1794) IndexingFilterChecker to optionally dumpText - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2014/06/17 14:07:01 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1794) IndexingFilterChecker to optionally dumpText - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2014/06/17 14:11:02 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1794) IndexingFilterChecker to optionally dumpText - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2014/06/17 14:13:02 UTC, 2 replies.
- [jira] [Updated] (NUTCH-1590) [SECURITY] Frame injection vulnerability in published Javadoc - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/17 15:41:04 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1590) [SECURITY] Frame injection vulnerability in published Javadoc - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/17 16:18:02 UTC, 0 replies.
- Version of Java in Jenkins - posted by Julien Nioche <li...@gmail.com> on 2014/06/17 16:20:38 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1794) IndexingFilterChecker to optionally dumpText - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2014/06/17 16:26:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1422) reset signature for redirects - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2014/06/17 16:30:03 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1776) Log incorrect plugin.folder file path - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2014/06/17 16:36:02 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1046 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/17 16:49:04 UTC, 0 replies.
- Nutch Extension for realtime processing - posted by Jake Dodd <ja...@ontopic.io> on 2014/06/17 19:30:07 UTC, 7 replies.
- Build failed in Jenkins: Nutch-nutchgora #1047 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/18 06:05:25 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1692) SegmentReader broken in distributed mode - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/18 12:51:02 UTC, 0 replies.
- [Nutch Wiki] Update of "bin/nutch inject" by JulienNioche - posted by Apache Wiki <wi...@apache.org> on 2014/06/18 13:15:27 UTC, 0 replies.
- #nutch on IRC - posted by Lewis John Mcgibbney <le...@gmail.com> on 2014/06/18 16:34:13 UTC, 2 replies.
- [jira] [Comment Edited] (NUTCH-1084) ReadDB url throws exception - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/18 17:20:28 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1084) ReadDB url throws exception - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/18 17:28:24 UTC, 0 replies.
- Fixing Nutch 2.x Build on Jenkins - posted by Lewis John Mcgibbney <le...@gmail.com> on 2014/06/18 18:07:49 UTC, 4 replies.
- [jira] [Closed] (NUTCH-1590) [SECURITY] Frame injection vulnerability in published Javadoc - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/19 02:09:25 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1048 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/19 06:13:45 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1698) crawl script should not specify solrUrl to accommodate pluggable indexing architecture - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/19 17:16:24 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1698) crawl script should not specify solrUrl to accommodate pluggable indexing architecture - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/19 17:16:25 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1269) Improve distribution of URLS with multi-segment generation - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/19 17:28:24 UTC, 0 replies.
- [jira] [Created] (NUTCH-1795) Please create a DOAP file for your TLP - posted by "Sebb (JIRA)" <ji...@apache.org> on 2014/06/19 19:44:25 UTC, 0 replies.
- Re: dev Digest 19 Jun 2014 17:46:49 -0000 Issue 1829 - posted by Lewis John Mcgibbney <le...@gmail.com> on 2014/06/19 20:17:55 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1795) Please create a DOAP file for your TLP - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/19 20:20:24 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1795) Please create a DOAP file for your TLP - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/19 20:24:25 UTC, 1 replies.
- [jira] [Created] (NUTCH-1796) Ensure Gora object builders are used as oppose to empty constructors. - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/20 04:05:24 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1796) Ensure Gora object builders are used as oppose to empty constructors. - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/20 04:07:24 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1796) Ensure Gora object builders are used as oppose to empty constructors. - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/20 04:14:24 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1796) Ensure Gora object builders are used as oppose to empty constructors. - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/20 04:14:24 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2014/06/20 04:44:22 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "GoogleSummerOfCode" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2014/06/20 04:59:23 UTC, 4 replies.
- [Nutch Wiki] New attachment added to page GoogleSummerOfCode - posted by Apache Wiki <wi...@apache.org> on 2014/06/20 05:00:56 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #1049 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/20 05:12:13 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-841) Create a Wicket-based Web Application for Nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/20 05:12:25 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1796) Ensure Gora object builders are used as oppose to empty constructors. - posted by "Hudson (JIRA)" <ji...@apache.org> on 2014/06/20 05:12:25 UTC, 1 replies.
- [jira] [Commented] (NUTCH-841) Create a Wicket-based Web Application for Nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/20 05:14:24 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "FirstReport" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2014/06/20 05:22:56 UTC, 3 replies.
- [jira] [Updated] (NUTCH-1718) redefine http.robots.agent as "additional agent names" - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2014/06/21 00:18:24 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1718) redefine http.robots.agent as "additional agent names" - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2014/06/21 00:18:25 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1718) redefine http.robots.agent as "additional agent names" - posted by "Hudson (JIRA)" <ji...@apache.org> on 2014/06/21 00:48:25 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1767) remove special treatment of "params" in relative links - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2014/06/21 00:58:27 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1767) remove special treatment of "params" in relative links - posted by "Hudson (JIRA)" <ji...@apache.org> on 2014/06/21 01:42:27 UTC, 1 replies.
- [Nutch Wiki] Update of "FirstReport" by FjodorVershinin - posted by Apache Wiki <wi...@apache.org> on 2014/06/22 23:40:03 UTC, 0 replies.
- [jira] [Updated] (NUTCH-717) Make Nutch Solr integration easier - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/23 11:54:24 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1633) slf4j is provided by hadoop and should not be included in the job file. - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/23 11:58:25 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1633) slf4j is provided by hadoop and should not be included in the job file. - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/23 11:58:25 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1220) Upgrade Solr deps - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/23 11:58:26 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1785) Ability to index raw content - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/23 12:02:24 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1787) update and complete API doc overview page - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/23 12:05:25 UTC, 6 replies.
- [jira] [Assigned] (NUTCH-1746) OutOfMemoryError in Mappers - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/23 12:07:24 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1765) SolrClean to remove redirected URLs from Solr - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/23 12:07:25 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1561) improve usability of parse-metatags and index-metadata - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/23 12:09:24 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1561) improve usability of parse-metatags and index-metadata - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/23 12:13:24 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1648) Sentence Detection plugin - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/23 12:17:24 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1687) Pick queue in Round Robin - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/23 12:17:24 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1561) improve usability of parse-metatags and index-metadata - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2014/06/23 22:06:25 UTC, 0 replies.
- [jira] [Created] (NUTCH-1797) remove unused package o.a.n.html - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2014/06/23 22:10:25 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "NutchRESTAPI" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2014/06/23 22:50:39 UTC, 3 replies.
- [Nutch Wiki] New attachment added to page NutchRESTAPI - posted by Apache Wiki <wi...@apache.org> on 2014/06/23 22:56:10 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1787) update and complete API doc overview page - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2014/06/23 23:16:27 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1797) remove unused package o.a.n.html - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2014/06/24 12:07:24 UTC, 0 replies.
- [jira] [Created] (NUTCH-1798) Unable to get any documents to index in elastic search - posted by "Aaron Bedward (JIRA)" <ji...@apache.org> on 2014/06/24 12:49:24 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1798) Unable to get any documents to index in elastic search - posted by "Fjodor Vershinin (JIRA)" <ji...@apache.org> on 2014/06/24 14:18:25 UTC, 11 replies.
- [jira] [Comment Edited] (NUTCH-1798) Unable to get any documents to index in elastic search - posted by "Fjodor Vershinin (JIRA)" <ji...@apache.org> on 2014/06/24 14:18:25 UTC, 4 replies.
- [jira] [Resolved] (NUTCH-1787) update and complete API doc overview page - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2014/06/24 23:44:29 UTC, 0 replies.
- [Nutch Wiki] Update of "CMS_Website_Update_HOWTO" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2014/06/25 07:10:04 UTC, 0 replies.
- [Nutch Wiki] Update of "FrontPage" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2014/06/25 07:15:21 UTC, 0 replies.
- Documentation on Updating new Nutch Site - posted by Lewis John Mcgibbney <le...@gmail.com> on 2014/06/25 07:28:00 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1798) Unable to get any documents to index in elastic search - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/25 11:56:26 UTC, 4 replies.
- [jira] [Reopened] (NUTCH-1220) Upgrade Solr deps - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/25 12:36:24 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1633) slf4j is provided by hadoop and should not be included in the job file. - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/25 13:07:24 UTC, 0 replies.
- [jira] [Created] (NUTCH-1799) ANT Eclipse task discovers all plugin jars automatically - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/25 13:52:24 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1799) ANT Eclipse task discovers all plugin jars automatically - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/25 13:54:24 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #2673 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/25 13:57:52 UTC, 0 replies.
- [Nutch Wiki] Update of "NutchRESTAPI" by FjodorVershinin - posted by Apache Wiki <wi...@apache.org> on 2014/06/25 18:57:59 UTC, 13 replies.
- Feedback on Nutch clip in Solr training - posted by Xavier Morera <xa...@familiamorera.com> on 2014/06/25 22:02:06 UTC, 0 replies.
- GSoC Nutch REST API Documentation - posted by Lewis John Mcgibbney <le...@gmail.com> on 2014/06/25 22:19:42 UTC, 0 replies.
- [Nutch Wiki] Update of "bin/nutch generate" by SebastianNagel - posted by Apache Wiki <wi...@apache.org> on 2014/06/25 23:32:54 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1769) REST API refactoring - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/26 00:38:24 UTC, 1 replies.
- [jira] [Created] (NUTCH-1800) Substantiate Javadoc for Nutch 2.X REST API - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/26 02:34:24 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1769) REST API refactoring - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/26 02:47:24 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1769) REST API refactoring - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/26 02:47:26 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1800) Substantiate Javadoc for Nutch 2.X REST API - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/26 02:51:25 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1679) UpdateDb using batchId, link may override crawled page. - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/26 03:03:36 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1769) REST API refactoring - posted by "Hudson (JIRA)" <ji...@apache.org> on 2014/06/26 03:55:24 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #2674 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/26 07:17:43 UTC, 0 replies.
- [jira] [Created] (NUTCH-1801) Fix chain of dependencies between ANT tasks - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/26 10:27:26 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1801) Fix chain of dependencies between ANT tasks - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/26 10:33:24 UTC, 0 replies.
- [jira] [Created] (NUTCH-1802) Move TestbedProxy to test environment - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/26 11:33:25 UTC, 0 replies.
- [jira] [Created] (NUTCH-1803) Put test dependencies in a separate lib dir - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/26 11:39:24 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1803) Put test dependencies in a separate lib dir - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/26 11:45:25 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1801) Improve handling of test dependencies in ANT+Ivy - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/26 11:49:24 UTC, 1 replies.
- [jira] [Created] (NUTCH-1804) Move JUnit dependency to test scope - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/26 11:49:24 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1802) Move TestbedProxy to test environment - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/26 12:19:24 UTC, 2 replies.
- [jira] [Created] (NUTCH-1805) Remove unnecessary transitive dependencies from Hadoop core - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/26 12:34:24 UTC, 0 replies.
- [jira] [Created] (NUTCH-1806) Delegate processing of URL domains to crawler commons - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/26 14:45:26 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-385) Server delay feature conflicts with maxThreadsPerHost - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/26 16:28:25 UTC, 0 replies.
- [jira] [Updated] (NUTCH-385) Server delay feature conflicts with maxThreadsPerHost - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/26 16:44:25 UTC, 1 replies.
- [jira] [Commented] (NUTCH-385) Server delay feature conflicts with maxThreadsPerHost - posted by "Chris Schneider (JIRA)" <ji...@apache.org> on 2014/06/26 17:17:24 UTC, 1 replies.
- [jira] [Updated] (NUTCH-385) Improve description of thread related configuration for Fetcher - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/26 17:37:25 UTC, 1 replies.
- [jira] [Commented] (NUTCH-385) Improve description of thread related configuration for Fetcher - posted by "lufeng (JIRA)" <ji...@apache.org> on 2014/06/27 07:06:24 UTC, 2 replies.
- [jira] [Updated] (NUTCH-1798) Crawl script not calling index command correctly - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/27 09:29:24 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1798) Crawl script not calling index command correctly - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/27 09:33:24 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1798) Crawl script not calling index command correctly - posted by "Hudson (JIRA)" <ji...@apache.org> on 2014/06/27 09:43:25 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-385) Improve description of thread related configuration for Fetcher - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/27 09:49:25 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2676 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/27 09:52:46 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-817) parse-(html)does follow links of full html page, parse-(tika) does follow any links and stops at level 1 - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/27 11:39:25 UTC, 0 replies.
- [jira] [Commented] (NUTCH-578) URL fetched with 403 is generated over and over again - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/27 11:55:25 UTC, 1 replies.
- [Nutch Wiki] Trivial Update of "CMS_Website_Update_HOWTO" by MarkusJelsma - posted by Apache Wiki <wi...@apache.org> on 2014/06/27 12:08:55 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #2677 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/28 06:20:43 UTC, 0 replies.
- Fwd: Google Summer of Code 2014 - Midterm Evaluation results processed for Apache Nutch web GUI - posted by Lewis John Mcgibbney <le...@gmail.com> on 2014/06/28 10:17:06 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1285) Debian Packaging for Nutch - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/29 11:03:24 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1285) Debian Packaging for Nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/06/29 11:11:24 UTC, 0 replies.
- [jira] [Updated] (NUTCH-578) URL fetched with 403 is generated over and over again - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/29 11:20:25 UTC, 0 replies.
- Nearing a 1.9 release? - posted by Julien Nioche <li...@gmail.com> on 2014/06/29 11:20:32 UTC, 1 replies.
- [jira] [Created] (NUTCH-1807) avoid methods relying on system-specific default locale / charset - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2014/06/29 22:28:24 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1693) TextMD5Signatue compute on textual content - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2014/06/29 22:55:24 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1801) Improve handling of test dependencies in ANT+Ivy - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2014/06/29 23:03:24 UTC, 0 replies.
- site missing titles - posted by Markus Jelsma <ma...@openindex.io> on 2014/06/30 14:07:07 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1776) Log incorrect plugin.folder file path - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/30 14:15:25 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1803) Put test dependencies in a separate lib dir - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/30 14:40:25 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2680 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/30 14:45:31 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1802) Move TestbedProxy to test environment - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/30 14:50:24 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1802) Move TestbedProxy to test environment - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/30 14:50:24 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1799) ANT Eclipse task discovers all plugin jars automatically - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/30 14:50:25 UTC, 0 replies.
- [jira] [Issue Comment Deleted] (NUTCH-1802) Move TestbedProxy to test environment - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/30 14:50:25 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1802) Move TestbedProxy to test environment - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/30 15:41:24 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2681 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2014/06/30 15:45:49 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1804) Move JUnit dependency to test scope - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/06/30 16:51:24 UTC, 0 replies.
- [FEEDBACK] Improving Content on the Nutch WebSite - posted by Lewis John Mcgibbney <le...@gmail.com> on 2014/06/30 21:07:50 UTC, 1 replies.
- [jira] [Created] (NUTCH-1808) JMX/JMS artifacts cannot be resolved from fresh repository - posted by "Valerio Schiavoni (JIRA)" <ji...@apache.org> on 2014/06/30 23:17:24 UTC, 0 replies.
- [jira] [Created] (NUTCH-1809) Duplicate jdom dependency - posted by "Valerio Schiavoni (JIRA)" <ji...@apache.org> on 2014/06/30 23:19:28 UTC, 0 replies.
- [jira] [Created] (NUTCH-1810) Duplicate jdom dependency - posted by "Valerio Schiavoni (JIRA)" <ji...@apache.org> on 2014/06/30 23:21:24 UTC, 0 replies.