You are viewing a plain text version of this content. The canonical link for it is here.
- Re: [VOTE] Release Apache Nutch 2.3.1 - posted by Lewis John Mcgibbney <le...@gmail.com> on 2015/10/01 02:35:09 UTC, 4 replies.
- [jira] [Created] (NUTCH-2129) Track Protocol Status in Crawl Datum - posted by "Michael Joyce (JIRA)" <ji...@apache.org> on 2015/10/01 02:49:04 UTC, 0 replies.
- [GitHub] nutch pull request: NUTCH-2129 - Add protocol status tracking to c... - posted by MJJoyce <gi...@git.apache.org> on 2015/10/01 02:56:08 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2129) Track Protocol Status in Crawl Datum - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/10/01 02:57:04 UTC, 8 replies.
- [jira] [Created] (NUTCH-2130) copyField rawcontent creates error within schema.xml - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/01 05:08:04 UTC, 0 replies.
- [GitHub] nutch pull request: Fix for NUTCH-2086 Contributed by Sujen Shah - posted by asfgit <gi...@git.apache.org> on 2015/10/01 09:01:43 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2086) Nutch 1.X Webui - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/10/01 09:02:04 UTC, 2 replies.
- [jira] [Created] (NUTCH-2131) Problem running nutch(crawl) with selenium - posted by "Ashwini (JIRA)" <ji...@apache.org> on 2015/10/01 12:32:26 UTC, 0 replies.
- Re: [MASSMAIL]Re: Fetch failed : java.lang.NullPointerException - posted by Roannel Fern�ndez Hern�ndez <ro...@uci.cu> on 2015/10/01 15:16:25 UTC, 1 replies.
- Atomic update and optimistic concurrency in Solr - posted by Roannel Fernández Hernández <ro...@uci.cu> on 2015/10/01 15:52:51 UTC, 0 replies.
- Re: Team 18: Selenium handler question - posted by "Joyce, Michael J (398M)" <Mi...@jpl.nasa.gov> on 2015/10/01 17:26:46 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2108) Add a function to the selenium interactive plugin interface to do multiple manipulation of driver and then return the data - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/10/01 18:15:26 UTC, 4 replies.
- [jira] [Assigned] (NUTCH-2128) Refactor configuration end point - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/01 18:52:26 UTC, 0 replies.
- [GitHub] nutch pull request: fix for NUTCH-2128 Refactor config endpoint by... - posted by sujen1412 <gi...@git.apache.org> on 2015/10/01 18:55:26 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2128) Refactor configuration end point - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/10/01 18:55:27 UTC, 1 replies.
- [jira] [Updated] (NUTCH-2123) Seed List REST API returns Text but headers indicate/require JSON - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/01 19:47:26 UTC, 0 replies.
- [Nutch Wiki] Update of "Nutch_1.X_RESTAPI" by SujenShah - posted by Apache Wiki <wi...@apache.org> on 2015/10/01 20:05:13 UTC, 2 replies.
- Re: Request for inclusion in the Nutch email list - posted by Sujen Shah <su...@gmail.com> on 2015/10/02 08:35:15 UTC, 0 replies.
- [jira] [Created] (NUTCH-2132) Publisher/Subscriber model for Nutch to emit events - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/02 10:53:26 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2132) Publisher/Subscriber model for Nutch to emit events - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/02 10:56:26 UTC, 4 replies.
- [jira] [Commented] (NUTCH-2011) Endpoint to support realtime JSON output from the fetcher - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/02 10:58:27 UTC, 2 replies.
- [jira] [Commented] (NUTCH-2132) Publisher/Subscriber model for Nutch to emit events - posted by "Roannel Fernández Hernández (JIRA)" <ji...@apache.org> on 2015/10/02 15:16:27 UTC, 21 replies.
- [Nutch Wiki] Update of "ContributorsGroup" by ChrisMattmann - posted by Apache Wiki <wi...@apache.org> on 2015/10/02 18:46:58 UTC, 0 replies.
- How to make sure rotate agent works - posted by Huachao Zhang <ch...@gmail.com> on 2015/10/02 22:34:25 UTC, 1 replies.
- Redundant requests when interactive selenium is enabled - posted by ThammeGowda N <tg...@gmail.com> on 2015/10/03 00:13:15 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "NutchFileFormats" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2015/10/03 02:41:02 UTC, 6 replies.
- Trying to work Rotating agent id plugin - posted by Manali Shah <ma...@usc.edu> on 2015/10/03 04:50:55 UTC, 0 replies.
- Integrating Selenium with Nutch - posted by Taichi Ho <he...@gmail.com> on 2015/10/03 05:53:41 UTC, 1 replies.
- Redirection in nutch - posted by Taichi Ho <he...@gmail.com> on 2015/10/03 09:22:29 UTC, 3 replies.
- Nutch Interactive selenium crawling query - posted by Girish Rao <gr...@usc.edu> on 2015/10/03 11:39:15 UTC, 0 replies.
- Tika parsing - posted by Taichi Ho <he...@gmail.com> on 2015/10/03 19:47:33 UTC, 1 replies.
- Unable to fetch content after integrating selenium - posted by Charan Shampur <ch...@gmail.com> on 2015/10/03 20:25:04 UTC, 2 replies.
- Nutch not recognizing html pages/images retrieved via php - posted by Girish Rao <gr...@usc.edu> on 2015/10/04 04:01:07 UTC, 1 replies.
- Fwd: WELCOME to dev@nutch.apache.org - posted by Pavan Lingambudhi Seshadri Vasan <li...@usc.edu> on 2015/10/04 04:06:02 UTC, 1 replies.
- selenium - posted by Pavan Lingambudhi Seshadri Vasan <li...@usc.edu> on 2015/10/04 04:09:27 UTC, 0 replies.
- Can't retrieve Tika parser for mime-type text/aspdotnet - posted by Manali Shah <ma...@usc.edu> on 2015/10/05 06:43:09 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2124) redirect following same link again and again , max redirect exceed and went db_gone - posted by "Yogendra Kumar Soni (JIRA)" <ji...@apache.org> on 2015/10/05 15:55:27 UTC, 3 replies.
- [jira] [Comment Edited] (NUTCH-2132) Publisher/Subscriber model for Nutch to emit events - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/05 18:42:27 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2123) Seed List REST API returns Text but headers indicate/require JSON - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/06 01:52:27 UTC, 0 replies.
- [jira] [Work started] (NUTCH-2123) Seed List REST API returns Text but headers indicate/require JSON - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/06 01:52:27 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2123) Seed List REST API returns Text but headers indicate/require JSON - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/06 01:54:27 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2123) Seed List REST API returns Text but headers indicate/require JSON - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/10/06 03:00:31 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-2124) redirect following same link again and again , max redirect exceed and went db_gone - posted by "Yogendra Kumar Soni (JIRA)" <ji...@apache.org> on 2015/10/06 12:53:26 UTC, 0 replies.
- [jira] [Created] (NUTCH-2133) Transfer Selenium Documentation to WIki - posted by "Michael Joyce (JIRA)" <ji...@apache.org> on 2015/10/06 17:02:26 UTC, 0 replies.
- Set up protocol-selenium - posted by Huachao Zhang <ch...@gmail.com> on 2015/10/07 09:10:21 UTC, 0 replies.
- Re: Team 18 : Similarity scoring: goldstandard.txt, stopwords.txt contents - posted by Christian Alan Mattmann <ma...@usc.edu> on 2015/10/07 15:52:21 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-2124) redirect following same link again and again , max redirect exceed and went db_gone - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/10/07 21:04:26 UTC, 0 replies.
- Re: dbunfetched URLs - team #32 - posted by Michael Joyce <jo...@apache.org> on 2015/10/07 23:23:52 UTC, 0 replies.
- [jira] [Created] (NUTCH-2134) Redirection and cookie handling using protocol plugins - posted by "Yogendra Kumar Soni (JIRA)" <ji...@apache.org> on 2015/10/08 09:00:37 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2134) Redirection and cookie handling using protocol plugins - posted by "Yogendra Kumar Soni (JIRA)" <ji...@apache.org> on 2015/10/08 09:02:27 UTC, 1 replies.
- Form authentication issue - posted by Huachao Zhang <ch...@gmail.com> on 2015/10/08 10:22:58 UTC, 0 replies.
- [GitHub] nutch pull request: NUTCH-2108 - posted by asfgit <gi...@git.apache.org> on 2015/10/08 20:55:41 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2108) Add a function to the selenium interactive plugin interface to do multiple manipulation of driver and then return the data - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/10/08 20:58:27 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2109) Create a brute force click-all-ajax-links utility fucntion for selenium interactive plugin - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/10/08 20:59:26 UTC, 0 replies.
- Interactive selenium plugin issue - posted by Junpeng Luo <ju...@usc.edu> on 2015/10/08 22:54:59 UTC, 4 replies.
- Normalize before inject - posted by Roannel Fernández Hernández <ro...@uci.cu> on 2015/10/09 16:38:28 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2110) Create the capability to provide seeds in the form of "url+xpath(including option to enter seach terms).selenium" - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/10 01:27:05 UTC, 3 replies.
- [jira] [Comment Edited] (NUTCH-2110) Create the capability to provide seeds in the form of "url+xpath(including option to enter seach terms).selenium" - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/10/10 01:48:05 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2110) Create the capability to provide seeds in the form of "url+xpath(including option to enter seach terms).selenium" - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/10/10 01:51:06 UTC, 0 replies.
- [jira] [Created] (NUTCH-2135) Ant Eclipse build does not include protocol-interactiveselenium - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/10 04:05:07 UTC, 0 replies.
- [GitHub] nutch pull request: Fix for NUTCH-2135 by Sujen Shah - posted by sujen1412 <gi...@git.apache.org> on 2015/10/10 04:06:16 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2135) Ant Eclipse build does not include protocol-interactiveselenium - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/10/10 04:07:05 UTC, 1 replies.
- Issue with selenium - posted by Charan Shampur <ch...@gmail.com> on 2015/10/11 08:33:22 UTC, 0 replies.
- gora upgrade - posted by Cihad Guzel <cg...@gmail.com> on 2015/10/11 17:20:09 UTC, 0 replies.
- [Nutch Wiki] Update of "SimilarityScoringFilter" by SujenShah - posted by Apache Wiki <wi...@apache.org> on 2015/10/12 03:19:45 UTC, 0 replies.
- [jira] [Created] (NUTCH-2136) Implement a different version of Naive Bayes Parse Filter - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/10/12 05:25:05 UTC, 0 replies.
- [GitHub] nutch pull request: NUTCH-2136 - posted by asitang <gi...@git.apache.org> on 2015/10/12 11:34:44 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2136) Implement a different version of Naive Bayes Parse Filter - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/10/12 11:35:05 UTC, 4 replies.
- [GitHub] nutch pull request: Branch 2.3.1 - posted by dyzsasd <gi...@git.apache.org> on 2015/10/12 17:53:25 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-2136) Implement a different version of Naive Bayes Parse Filter - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/10/12 18:30:06 UTC, 1 replies.
- [jira] [Reopened] (NUTCH-2136) Implement a different version of Naive Bayes Parse Filter - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/10/12 19:29:06 UTC, 0 replies.
- [GitHub] nutch pull request: Made changes to changes.txt and added AVL2 hea... - posted by asitang <gi...@git.apache.org> on 2015/10/12 19:46:25 UTC, 1 replies.
- [jira] [Updated] (NUTCH-2137) add changes.txt and ALV2 headers to the Naive Bayes Parse Filter - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/10/12 20:06:05 UTC, 1 replies.
- [jira] [Created] (NUTCH-2137) add changes.txt and ALV2 headers to the Naive Bayes Parse Filter - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/10/12 20:06:05 UTC, 0 replies.
- [GitHub] nutch pull request: NUTCH 2137 - posted by asitang <gi...@git.apache.org> on 2015/10/12 20:15:12 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-2137) add changes.txt and ALV2 headers to the Naive Bayes Parse Filter - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/10/12 20:18:05 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2137) add changes.txt and ALV2 headers to the Naive Bayes Parse Filter - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/10/12 20:54:05 UTC, 0 replies.
- [GitHub] nutch pull request: Trunk - posted by roberttjahjadi <gi...@git.apache.org> on 2015/10/13 09:47:24 UTC, 1 replies.
- [jira] [Created] (NUTCH-2138) Tika cannot OCR embedded images from PDF - posted by "jean blue (JIRA)" <ji...@apache.org> on 2015/10/13 16:25:06 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2138) Tika cannot OCR embedded images from PDF - posted by "jean blue (JIRA)" <ji...@apache.org> on 2015/10/13 16:55:05 UTC, 1 replies.
- [jira] [Created] (NUTCH-2139) Basic plugin to index inlinks and outlinks - posted by "Jorge Luis Betancourt Gonzalez (JIRA)" <ji...@apache.org> on 2015/10/14 01:11:06 UTC, 0 replies.
- [GitHub] nutch pull request: Fixed FileNotFoundException (Invalid Argument)... - posted by karanjeets <gi...@git.apache.org> on 2015/10/14 01:42:48 UTC, 1 replies.
- [jira] [Created] (NUTCH-2140) Atomic update and optimistic concurrency update using Solr - posted by "Roannel Fernández Hernández (JIRA)" <ji...@apache.org> on 2015/10/14 15:07:05 UTC, 0 replies.
- [jira] [Created] (NUTCH-2141) Change the InteractiveSelenium plugin handler Interface to return page content - posted by "Balaji Gurumurthy (JIRA)" <ji...@apache.org> on 2015/10/15 02:56:05 UTC, 0 replies.
- [GitHub] nutch pull request: fix for NUTCH-2141 contributed by Balaji Gurum... - posted by balajig17 <gi...@git.apache.org> on 2015/10/15 05:12:15 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2141) Change the InteractiveSelenium plugin handler Interface to return page content - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/10/15 05:13:05 UTC, 6 replies.
- [Nutch Wiki] Update of "Presentations" by ChrisMattmann - posted by Apache Wiki <wi...@apache.org> on 2015/10/15 06:36:41 UTC, 0 replies.
- [GitHub] nutch pull request: Fix for NUTCH-2139 contributed by jorgelbg - posted by jorgelbg <gi...@git.apache.org> on 2015/10/15 18:37:50 UTC, 10 replies.
- [jira] [Commented] (NUTCH-2139) Basic plugin to index inlinks and outlinks - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/10/15 18:38:05 UTC, 11 replies.
- [jira] [Updated] (NUTCH-2139) Basic plugin to index inlinks and outlinks - posted by "Jorge Luis Betancourt Gonzalez (JIRA)" <ji...@apache.org> on 2015/10/15 19:12:05 UTC, 1 replies.
- [jira] [Created] (NUTCH-2142) Nutch File Dump - FileNotFoundException (Invalid Argument) Error - posted by "Karanjeet Singh (JIRA)" <ji...@apache.org> on 2015/10/15 23:47:06 UTC, 0 replies.
- [jira] [Created] (NUTCH-2143) GeneratorJob ignores batch id passed as argument - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/10/16 00:07:05 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2143) GeneratorJob ignores batch id passed as argument - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/10/16 00:08:05 UTC, 0 replies.
- nutch-python - posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov> on 2015/10/16 18:59:48 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2142) Nutch File Dump - FileNotFoundException (Invalid Argument) Error - posted by "Karanjeet Singh (JIRA)" <ji...@apache.org> on 2015/10/18 13:11:05 UTC, 1 replies.
- [jira] [Assigned] (NUTCH-2142) Nutch File Dump - FileNotFoundException (Invalid Argument) Error - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:28:05 UTC, 0 replies.
- [jira] [Work started] (NUTCH-2142) Nutch File Dump - FileNotFoundException (Invalid Argument) Error - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:28:05 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2129) Track Protocol Status in Crawl Datum - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:31:05 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2142) Nutch File Dump - FileNotFoundException (Invalid Argument) Error - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:31:05 UTC, 0 replies.
- [jira] [Work started] (NUTCH-2129) Track Protocol Status in Crawl Datum - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:31:05 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2141) Change the InteractiveSelenium plugin handler Interface to return page content - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:35:05 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2129) Track Protocol Status in Crawl Datum - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:35:05 UTC, 0 replies.
- [jira] [Work started] (NUTCH-2141) Change the InteractiveSelenium plugin handler Interface to return page content - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:36:05 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2133) Transfer Selenium Documentation to WIki - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:41:05 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1943) Form authentication should not be global and ignore - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:41:05 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2064) URLNormalizer basic to properly encode non-ASCII characters - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:41:05 UTC, 1 replies.
- [jira] [Updated] (NUTCH-2030) ParseZip plugin is not able to extract language from zip document,this could solve that problem. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:41:05 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2140) Atomic update and optimistic concurrency update using Solr - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:41:05 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2135) Ant Eclipse build does not include protocol-interactiveselenium - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:41:05 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2141) Change the InteractiveSelenium plugin handler Interface to return page content - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:41:05 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2086) Nutch 1.X Webui - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:41:05 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2122) Implement Javadoc package.html for service packages - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:41:05 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2128) Refactor configuration end point - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:41:05 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2120) Remove MapWritable from trunk codebase - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:41:05 UTC, 0 replies.
- [DISCUSS] Release 1.11 RC #1 (70 issues fixed) - posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov> on 2015/10/18 21:42:08 UTC, 4 replies.
- [jira] [Created] (NUTCH-2144) Plugin to override db.ignore.external to exempt interesting external domain URLs - posted by "Thamme Gowda N (JIRA)" <ji...@apache.org> on 2015/10/19 08:54:05 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2144) Plugin to override db.ignore.external to exempt interesting external domain URLs - posted by "Thamme Gowda N (JIRA)" <ji...@apache.org> on 2015/10/19 08:57:05 UTC, 3 replies.
- [jira] [Commented] (NUTCH-1932) Automatically remove orphaned pages - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2015/10/19 11:01:05 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1932) Automatically remove orphaned pages - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2015/10/19 11:44:05 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2144) Plugin to override db.ignore.external to exempt interesting external domain URLs - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2015/10/19 11:51:05 UTC, 2 replies.
- [jira] [Created] (NUTCH-2145) parse/index checker fail to fetch valid percent-encoded URLs - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/10/19 13:36:05 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2145) parse/index checker fail to fetch valid percent-encoded URLs - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/10/19 13:58:05 UTC, 1 replies.
- [jira] [Updated] (NUTCH-2145) parse/index checker fail to fetch valid percent-encoded URLs - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/10/19 13:58:05 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-2144) Plugin to override db.ignore.external to exempt interesting external domain URLs - posted by "Thamme Gowda N (JIRA)" <ji...@apache.org> on 2015/10/19 16:55:05 UTC, 1 replies.
- [jira] [Updated] (NUTCH-2064) URLNormalizer basic to encode reserved chars and decode non-reserved chars - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2015/10/20 00:07:27 UTC, 0 replies.
- [jira] [Created] (NUTCH-2146) hashCode on the Outlink class - posted by "Jorge Luis Betancourt Gonzalez (JIRA)" <ji...@apache.org> on 2015/10/21 00:59:27 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2146) hashCode on the Outlink class - posted by "Jorge Luis Betancourt Gonzalez (JIRA)" <ji...@apache.org> on 2015/10/21 01:03:27 UTC, 3 replies.
- [jira] [Created] (NUTCH-2147) LanguagePreferenceScoringFilter for Nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/21 01:26:27 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2147) LanguagePreferenceScoringFilter for Nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/21 01:26:27 UTC, 0 replies.
- [jira] [Created] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/21 03:04:27 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/21 05:36:27 UTC, 2 replies.
- [jira] [Commented] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/21 05:38:27 UTC, 2 replies.
- [jira] [Resolved] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/22 05:48:27 UTC, 0 replies.
- [jira] [Closed] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/22 05:48:27 UTC, 0 replies.
- [GitHub] nutch pull request: Adding a hashCode method to the Outlink class ... - posted by jorgelbg <gi...@git.apache.org> on 2015/10/22 18:21:55 UTC, 0 replies.
- [jira] [Created] (NUTCH-2149) REST endpoint to read Nutch sequence files - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/23 09:04:27 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2149) REST endpoint to read Nutch sequence files - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/23 09:25:27 UTC, 0 replies.
- [GitHub] nutch pull request: NUTCH-2149 REST endpoint to read Nutch sequenc... - posted by sujen1412 <gi...@git.apache.org> on 2015/10/23 09:27:41 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2149) REST endpoint to read Nutch sequence files - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/10/23 09:28:27 UTC, 6 replies.
- [GitHub] nutch pull request: NUTCH 2128 - Refactor config endpoint - posted by sujen1412 <gi...@git.apache.org> on 2015/10/24 02:58:55 UTC, 1 replies.
- [jira] [Assigned] (NUTCH-2149) REST endpoint to read Nutch sequence files - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/25 19:21:27 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2149) REST endpoint to read Nutch sequence files - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/25 19:22:27 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2147) LanguagePreferenceScoringFilter for Nutch - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/26 05:48:27 UTC, 0 replies.
- [VOTE] Apache Nutch 1.11 Release Candidate #1 - posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov> on 2015/10/26 06:53:11 UTC, 3 replies.
- [Nutch Wiki] Update of "Release_HOWTO" by ChrisMattmann - posted by Apache Wiki <wi...@apache.org> on 2015/10/26 07:01:12 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2147) LanguagePreferenceScoringFilter for Nutch - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2015/10/26 10:34:27 UTC, 1 replies.
- [jira] [Updated] (NUTCH-2147) MetadataScoringFilter for Nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/26 18:13:27 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2147) MetadataScoringFilter for Nutch - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2015/10/26 19:00:30 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-2147) MetadataScoringFilter for Nutch - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2015/10/26 19:00:30 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2131) Problem running nutch(crawl) with selenium - posted by "Kim Whitehall (JIRA)" <ji...@apache.org> on 2015/10/26 22:01:27 UTC, 1 replies.
- [jira] [Created] (NUTCH-2150) Add ProtocolStatus Utility - posted by "Michael Joyce (JIRA)" <ji...@apache.org> on 2015/10/27 20:36:27 UTC, 0 replies.
- [GitHub] nutch pull request: NUTCH-2150 - Add protocolstats utility - posted by MJJoyce <gi...@git.apache.org> on 2015/10/27 20:47:34 UTC, 3 replies.
- [jira] [Commented] (NUTCH-2150) Add ProtocolStatus Utility - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/10/27 20:48:27 UTC, 5 replies.
- [jira] [Resolved] (NUTCH-2128) Refactor configuration end point - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/27 21:39:27 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2070) Parameterize Fetch REST Endpoint - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/27 21:40:27 UTC, 0 replies.
- [jira] [Closed] (NUTCH-2070) Parameterize Fetch REST Endpoint - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/27 21:42:27 UTC, 0 replies.
- [jira] [Created] (NUTCH-2151) Service endpoint for REST API - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/27 21:45:27 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2151) Service endpoint for REST API - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/27 21:49:27 UTC, 0 replies.
- [jira] [Created] (NUTCH-2152) CommonCrawl dump via Service endpoint - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/27 21:50:28 UTC, 0 replies.
- [jira] [Work started] (NUTCH-2152) CommonCrawl dump via Service endpoint - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/27 21:52:27 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2152) CommonCrawl dump via Service endpoint - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/27 21:52:27 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2152) CommonCrawl dump via Service endpoint - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/28 04:30:27 UTC, 0 replies.
- [jira] [Created] (NUTCH-2153) Nutch REST API (DB) uses POST instead of GET to request - posted by "Aron Ahmadia (JIRA)" <ji...@apache.org> on 2015/10/28 17:55:27 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2153) Nutch REST API (DB) uses POST instead of GET to request - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/28 17:58:27 UTC, 5 replies.
- [jira] [Created] (NUTCH-2154) Nutch REST API (DB) suffering NullPointerException - posted by "Aron Ahmadia (JIRA)" <ji...@apache.org> on 2015/10/28 18:21:28 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2153) Nutch REST API (DB) uses POST instead of GET to request - posted by "Aron Ahmadia (JIRA)" <ji...@apache.org> on 2015/10/28 18:22:27 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2154) Nutch REST API (DB) suffering NullPointerException - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/28 18:24:27 UTC, 1 replies.
- [jira] [Assigned] (NUTCH-2154) Nutch REST API (DB) suffering NullPointerException - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/28 18:24:27 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2154) Nutch REST API (DB) suffering NullPointerException - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/28 18:25:27 UTC, 3 replies.
- [Nutch Wiki] Trivial Update of "NewScoringIndexingExample" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2015/10/28 20:57:08 UTC, 2 replies.
- [Nutch Wiki] New attachment added to page NewScoringIndexingExample - posted by Apache Wiki <wi...@apache.org> on 2015/10/28 21:18:12 UTC, 0 replies.
- [jira] [Created] (NUTCH-2155) Create a "crawl completeness" utility - posted by "Michael Joyce (JIRA)" <ji...@apache.org> on 2015/10/28 21:44:27 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2155) Create a "crawl completeness" utility - posted by "Michael Joyce (JIRA)" <ji...@apache.org> on 2015/10/28 21:44:27 UTC, 7 replies.
- [GitHub] nutch pull request: NUTCH-2155 - Add crawl completion utility - posted by MJJoyce <gi...@git.apache.org> on 2015/10/28 22:21:24 UTC, 5 replies.
- [jira] [Created] (NUTCH-2156) Dump via Services end point - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/10/29 01:09:27 UTC, 0 replies.
- Re: MireDot user activation - posted by lewis john mcgibbney <le...@apache.org> on 2015/10/29 06:03:49 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/29 06:06:27 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/29 06:06:27 UTC, 3 replies.
- [jira] [Commented] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/29 06:11:27 UTC, 8 replies.
- [jira] [Commented] (NUTCH-2152) CommonCrawl dump via Service endpoint - posted by "Aron Ahmadia (JIRA)" <ji...@apache.org> on 2015/10/29 20:31:27 UTC, 6 replies.
- [jira] [Created] (NUTCH-2157) Parent Issue for Addressing Miredot REST API Warnings - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/29 21:56:27 UTC, 0 replies.
- [jira] [Reopened] (NUTCH-1988) Make nested output directory dump optional - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/29 22:21:27 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1988) Make nested output directory dump optional - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/29 22:29:27 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1988) Make nested output directory dump optional - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/29 22:31:27 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1988) Make nested output directory dump optional - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/29 22:31:27 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1988) Make nested output directory dump optional - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/10/29 23:37:27 UTC, 0 replies.
- [RESULT] WAS Re: [VOTE] Release Apache Nutch 2.3.1 - posted by Lewis John Mcgibbney <le...@gmail.com> on 2015/10/30 06:14:05 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2143) GeneratorJob ignores batch id passed as argument - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/30 06:15:27 UTC, 0 replies.
- [jira] [Work started] (NUTCH-2155) Create a "crawl completeness" utility - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/30 22:46:27 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2155) Create a "crawl completeness" utility - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/30 22:46:27 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2155) Create a "crawl completeness" utility - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/30 22:46:27 UTC, 1 replies.
- [jira] [Updated] (NUTCH-2146) hashCode on the Outlink class - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/30 22:52:27 UTC, 0 replies.
- [jira] [Work started] (NUTCH-2146) hashCode on the Outlink class - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/30 22:52:27 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2155) Create a "crawl completeness" utility - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/30 22:52:27 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2146) hashCode on the Outlink class - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/30 22:52:27 UTC, 0 replies.
- [GitHub] nutch pull request: Fix for NUTCH-2146 contributed jorgelbg - posted by asfgit <gi...@git.apache.org> on 2015/10/30 22:55:57 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2146) hashCode on the Outlink class - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/30 22:56:27 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2150) Add ProtocolStatus Utility - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/30 23:03:27 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2150) Add ProtocolStatus Utility - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/30 23:03:27 UTC, 0 replies.
- [jira] [Work started] (NUTCH-2150) Add ProtocolStatus Utility - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/30 23:03:27 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/30 23:04:28 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2150) Add ProtocolStatus Utility - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/30 23:04:28 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2154) Nutch REST API (DB) suffering NullPointerException - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/31 00:56:27 UTC, 0 replies.