You are viewing a plain text version of this content. The canonical link for it is here.
- [Nutch Wiki] Trivial Update of "HaismyflH" by HaismyflH - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 00:53:29 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "TheodoreP" by TheodoreP - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 00:55:08 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "MilagrosN" by MilagrosN - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 02:10:46 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "pylaqigrumo502" by pylaqigrumo502 - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 02:32:48 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1538) tuning of loaded fields during fetcherJob start-up - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/04/01 03:06:14 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "CharissaB" by CharissaB - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 03:19:46 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "StellaHTR" by StellaHTR - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 03:37:03 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "KathieRec" by KathieRec - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 04:29:54 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "VerlaVigi" by VerlaVigi - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 04:43:23 UTC, 0 replies.
- Nutch2.x Null Pointer Exception in IndexerJob.Java for a fresh crawl with One Seed. - posted by Binoy d <bi...@gmail.com> on 2013/04/01 05:25:33 UTC, 6 replies.
- [Nutch Wiki] Trivial Update of "AntwanDob" by AntwanDob - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 06:08:13 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "JeanetteG" by JeanetteG - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 07:04:20 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "ipadkoyfaa" by ipadkoyfaa - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 07:38:26 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Guadalupe" by Guadalupe - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 07:38:52 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "AugustGla" by AugustGla - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 08:09:33 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "ColinNeff" by ColinNeff - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 09:11:17 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "MarionF54" by MarionF54 - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 10:11:08 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "ERZAngelo" by ERZAngelo - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 10:22:03 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "BlancheLi" by BlancheLi - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 10:27:52 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "IdaxvswfT" by IdaxvswfT - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 10:55:46 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "themarketingbusinessrandom-10-100" by themarketingbusinessrandom-10-100 - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 11:05:43 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "AmieLinco" by AmieLinco - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 11:16:07 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "ElmaPeter" by ElmaPeter - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 11:46:43 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "RachelPel" by RachelPel - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 12:35:51 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Nola2902" by Nola2902 - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 13:30:08 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "IvoryCart" by IvoryCart - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 14:37:12 UTC, 0 replies.
- Re: Important : Bunch of Spam Created under Nutch Wiki!! - posted by kiran chitturi <ch...@gmail.com> on 2013/04/01 15:52:57 UTC, 7 replies.
- [Nutch Wiki] Trivial Update of "ElizaMcca" by ElizaMcca - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 17:23:49 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "MelodeeBr" by MelodeeBr - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 18:53:43 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "SophieArt" by SophieArt - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 19:13:56 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "DannyIZW" by DannyIZW - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 19:15:26 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "LisaYance" by LisaYance - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 21:42:43 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "JeffreyDW" by JeffreyDW - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 22:26:24 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "BurtonCav" by BurtonCav - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 23:34:05 UTC, 0 replies.
- Re: error using generate in 2.x - posted by kaveh minooie <ka...@plutoz.com> on 2013/04/01 23:45:22 UTC, 2 replies.
- [Nutch Wiki] Trivial Update of "FranklynE" by FranklynE - posted by Apache Wiki <wi...@apache.org> on 2013/04/01 23:52:39 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "HelenNull" by HelenNull - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 01:09:07 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "AngieCowe" by AngieCowe - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 02:35:05 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "LeilaGayl" by LeilaGayl - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 02:47:08 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "KarolPear" by KarolPear - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 04:19:57 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Jason_Morgan_TV_New_Production_Show" by KarolPear - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 04:28:57 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "JessieCut" by JessieCut - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 04:29:36 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Mohammad7" by Mohammad7 - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 05:34:14 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "AlilgxcaC" by AlilgxcaC - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 06:24:33 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "TerrenceM" by TerrenceM - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 06:25:38 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Joann77Y" by Joann77Y - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 06:46:30 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "CaitlinFa" by CaitlinFa - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 08:26:41 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "LucyPruet" by LucyPruet - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 08:30:00 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "MargieCru" by MargieCru - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 08:39:21 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "FVGDaisy" by FVGDaisy - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 09:20:53 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Celia04X" by Celia04X - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 10:53:41 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Virgil717" by Virgil717 - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 10:59:05 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "RamonHaCa" by RamonHaCa - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 11:57:06 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "CNYFoster" by CNYFoster - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 12:04:33 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "SherrieEC" by SherrieEC - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 12:07:01 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "A_Background_In_Fast_Advice_For_Electronic_Cigarette" by SherrieEC - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 12:07:14 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "HarryTalb" by HarryTalb - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 12:19:48 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Roma61I" by Roma61I - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 12:47:28 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "TeraWhitl" by TeraWhitl - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 13:23:22 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Louisa87A" by Louisa87A - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 14:44:22 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "KamiCarre" by KamiCarre - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 14:47:25 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "DewayneMo" by DewayneMo - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 15:15:04 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "ScottyT91" by ScottyT91 - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 15:17:04 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "JolieChil" by JolieChil - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 16:18:29 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "MellissaF" by MellissaF - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 16:27:39 UTC, 0 replies.
- Re: IOException during #Crawl.run -> #JobClient.runJob() - posted by feng lu <am...@gmail.com> on 2013/04/02 16:40:12 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "EvieSherr" by EvieSherr - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 16:48:27 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "DianaSeel" by DianaSeel - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 17:49:20 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "DanaRyk" by DanaRyk - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 17:50:43 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "BillEPZ" by BillEPZ - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 18:40:08 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "An_Analysis_Of_Realistic_Systems_Of_flooring_toronto" by BillEPZ - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 18:40:23 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "TaraNatha" by TaraNatha - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 18:59:04 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "DorcasUII" by DorcasUII - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 19:04:07 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "MuhammadS" by MuhammadS - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 19:32:09 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "ConnorRei" by ConnorRei - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 19:44:24 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Margherit" by Margherit - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 19:52:33 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "RandolphG" by RandolphG - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 20:41:53 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "A_Spotlight_On_Clear-Cut_Plans_In_web_design_toronto" by RandolphG - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 20:42:11 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "CeciliaDo" by CeciliaDo - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 21:20:27 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "TomokoSay" by TomokoSay - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 21:34:17 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "ElishaPSC" by ElishaPSC - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 22:52:31 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "JaneenKel" by JaneenKel - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 22:53:53 UTC, 0 replies.
- Re: dev Digest 2 Apr 2013 18:42:33 -0000 Issue 1587 - posted by Lewis John Mcgibbney <le...@gmail.com> on 2013/04/02 22:55:08 UTC, 1 replies.
- [Nutch Wiki] Trivial Update of "BetseyCar" by BetseyCar - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 23:09:37 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "MatildaSZ" by MatildaSZ - posted by Apache Wiki <wi...@apache.org> on 2013/04/02 23:09:52 UTC, 0 replies.
- [jira] [Created] (NUTCH-1552) possibility of a NPE in index-more plugin - posted by "kaveh minooie (JIRA)" <ji...@apache.org> on 2013/04/03 00:01:15 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1552) possibility of a NPE in index-more plugin - posted by "kaveh minooie (JIRA)" <ji...@apache.org> on 2013/04/03 00:05:16 UTC, 1 replies.
- [Nutch Wiki] Trivial Update of "Marcia806" by Marcia806 - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 00:39:44 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "ReggieGus" by ReggieGus - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 01:19:10 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "TRSKasha" by TRSKasha - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 02:40:35 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "GreggLemo" by GreggLemo - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 03:18:27 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "TreyMacon" by TreyMacon - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 06:26:06 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "JonoZ97" by JonoZ97 - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 06:40:08 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Straightforward_Programs_In_Aura_83_-_The_Options" by JonoZ97 - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 06:40:17 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "ArturoDee" by ArturoDee - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 09:42:03 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "LFQLinnie" by LFQLinnie - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 11:22:28 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "ErikYHNW" by ErikYHNW - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 15:20:57 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "RickieBrr" by RickieBrr - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 16:28:13 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Sasha4767" by Sasha4767 - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 17:00:43 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "AlvinButt" by AlvinButt - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 19:30:37 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "BrandyHin" by BrandyHin - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 19:40:16 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "LashondaE" by LashondaE - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 20:04:43 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "EdwinBurk" by EdwinBurk - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 20:04:46 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "ErickLind" by ErickLind - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 20:29:59 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "ValenciaD" by ValenciaD - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 21:28:23 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "MercedesP" by MercedesP - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 21:51:28 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "SalvadorC" by SalvadorC - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 22:29:49 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "LeoraHerr" by LeoraHerr - posted by Apache Wiki <wi...@apache.org> on 2013/04/03 22:49:02 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1552) possibility of a NPE in index-more plugin - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/03 23:17:15 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1552) possibility of a NPE in index-more plugin - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/03 23:17:15 UTC, 4 replies.
- [jira] [Reopened] (NUTCH-1532) Replace 'segment' mapping field with batchId - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/03 23:27:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1532) Replace 'segment' mapping field with batchId - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/03 23:29:16 UTC, 0 replies.
- Re: Nutch2.x Null Pointer Exception in IndexerJob.Java for a fresh crawl with One Seed - posted by Lewis John Mcgibbney <le...@gmail.com> on 2013/04/03 23:32:26 UTC, 1 replies.
- [Nutch Wiki] Trivial Update of "EulaFQY" by EulaFQY - posted by Apache Wiki <wi...@apache.org> on 2013/04/04 00:06:50 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "AileenQWW" by AileenQWW - posted by Apache Wiki <wi...@apache.org> on 2013/04/04 00:27:04 UTC, 0 replies.
- Wiki locked down, spam pages deleted - posted by kiran chitturi <ch...@gmail.com> on 2013/04/04 02:16:02 UTC, 2 replies.
- [jira] [Created] (NUTCH-1553) Property 'indexer.delete.robots.noindex' not working if using parser-html. - posted by "Alfonso Presa (JIRA)" <ji...@apache.org> on 2013/04/04 16:11:15 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1434) Indexer to delete robots noIndex - posted by "Alfonso Presa (JIRA)" <ji...@apache.org> on 2013/04/04 16:11:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1553) Property 'indexer.delete.robots.noindex' not working if using parser-html. - posted by "Alfonso Presa (JIRA)" <ji...@apache.org> on 2013/04/04 16:15:18 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1553) Property 'indexer.delete.robots.noindex' not working when using parser-html. - posted by "Alfonso Presa (JIRA)" <ji...@apache.org> on 2013/04/04 16:29:15 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1553) Property 'indexer.delete.robots.noindex' not working when using parser-html. - posted by "Alfonso Presa (JIRA)" <ji...@apache.org> on 2013/04/05 09:22:15 UTC, 0 replies.
- Re: FetchSchedule and Metadata - posted by Canan GİRGİN <ca...@gmail.com> on 2013/04/05 12:24:50 UTC, 9 replies.
- Re: dev Digest 5 Apr 2013 07:22:18 -0000 Issue 1589 - posted by lewis john mcgibbney <le...@apache.org> on 2013/04/05 22:19:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/06 01:51:16 UTC, 4 replies.
- [jira] [Updated] (NUTCH-1545) capture batchId and remove references to segments in 2.x crawl script. - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/04/06 18:05:15 UTC, 2 replies.
- Nutch - posted by Parin Jogani <pp...@usc.edu> on 2013/04/06 18:58:12 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #2158 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/07 06:09:45 UTC, 0 replies.
- [jira] [Created] (NUTCH-1554) org.apache.nutch.net.protocols.HttpDateFormat should NOT be Locale.US aware - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/07 23:27:15 UTC, 0 replies.
- [jira] [Created] (NUTCH-1555) bug in 2.x ParserJob command line parsing - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/08 01:39:15 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1486) Upgrade to Solr 4.2.1 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/08 01:45:16 UTC, 2 replies.
- [jira] [Comment Edited] (NUTCH-1486) Upgrade to Solr 4.2.1 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/08 02:07:16 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1551) Improve WebTableReader field order and display batchId - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/08 02:35:16 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1532) Replace 'segment' mapping field with batchId - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/08 02:39:16 UTC, 0 replies.
- Next Release Cycle - posted by Lewis John Mcgibbney <le...@gmail.com> on 2013/04/08 02:42:19 UTC, 3 replies.
- [jira] [Commented] (NUTCH-1486) Upgrade to Solr 4.2.1 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/08 02:51:16 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1551) Improve WebTableReader field order and display batchId - posted by "Hudson (JIRA)" <ji...@apache.org> on 2013/04/08 03:33:16 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1532) Replace 'segment' mapping field with batchId - posted by "Hudson (JIRA)" <ji...@apache.org> on 2013/04/08 03:33:16 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #2159 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/08 06:20:48 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1555) bug in 2.x ParserJob command line parsing - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/04/08 16:55:16 UTC, 4 replies.
- [jira] [Updated] (NUTCH-1554) org.apache.nutch.net.protocols.HttpDateFormat should NOT be Locale.US aware - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/08 21:41:16 UTC, 1 replies.
- [jira] [Assigned] (NUTCH-1554) org.apache.nutch.net.protocols.HttpDateFormat should NOT be Locale.US aware - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/08 21:43:16 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1554) org.apache.nutch.net.protocols.HttpDateFormat should NOT be Locale.US aware - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/08 21:47:15 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1554) org.apache.nutch.net.protocols.HttpDateFormat should NOT be Locale.US aware - posted by "Hudson (JIRA)" <ji...@apache.org> on 2013/04/08 22:21:17 UTC, 5 replies.
- [jira] [Reopened] (NUTCH-1554) org.apache.nutch.net.protocols.HttpDateFormat should NOT be Locale.US aware - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/08 23:48:16 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2162 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/09 06:33:38 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #2163 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/10 06:18:38 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1556) enabling updatedb to accept batchId - posted by "kaveh minooie (JIRA)" <ji...@apache.org> on 2013/04/10 10:14:15 UTC, 2 replies.
- [jira] [Created] (NUTCH-1556) enabling updatedb to accept batchId - posted by "kaveh minooie (JIRA)" <ji...@apache.org> on 2013/04/10 10:14:15 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1555) Move to commons-cli for command line parsing - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/10 20:30:19 UTC, 3 replies.
- so why does solrindex-mapping.xml get ignored? - posted by kaveh minooie <ka...@plutoz.com> on 2013/04/11 19:43:37 UTC, 2 replies.
- [jira] [Created] (NUTCH-1557) File extraction and classification for any MIME types from segments - posted by "Chao Yan (JIRA)" <ji...@apache.org> on 2013/04/12 08:53:16 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1556) enabling updatedb to accept batchId - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/12 23:36:17 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1557) File extraction and classification for any MIME types from segments - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/12 23:58:16 UTC, 1 replies.
- [jira] [Comment Edited] (NUTCH-1557) File extraction and classification for any MIME types from segments - posted by "Chao Yan (JIRA)" <ji...@apache.org> on 2013/04/13 03:23:13 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #567 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/13 06:13:03 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2166 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/13 06:13:03 UTC, 0 replies.
- Nutch/Solr expert - posted by Andrea Lanzoni <an...@tin.it> on 2013/04/13 11:03:28 UTC, 1 replies.
- Jenkins build is back to normal : Nutch-nutchgora #568 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/14 07:07:31 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #2167 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/14 07:16:38 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2168 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/15 06:26:02 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2169 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/16 06:20:29 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #570 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/16 06:20:29 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1555) Move to commons-cli for command line parsing - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/04/16 16:29:16 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #571 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/17 06:46:26 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #2170 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/17 06:55:53 UTC, 0 replies.
- nutch pull request: Patch for fixing coding bug - posted by ysc <gi...@git.apache.org> on 2013/04/17 16:03:09 UTC, 0 replies.
- [jira] [Created] (NUTCH-1558) CharEncodingForConversion in ParseData's ParseMeta, not in ParseData's ContentMeta - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2013/04/17 16:17:15 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1558) CharEncodingForConversion in ParseData's ParseMeta, not in ParseData's ContentMeta - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2013/04/17 16:17:15 UTC, 3 replies.
- [jira] [Work started] (NUTCH-1558) CharEncodingForConversion in ParseData's ParseMeta, not in ParseData's ContentMeta - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2013/04/17 16:17:15 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1467) nutch 1.5.1 not able to parse mutliValued metatags - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/17 23:01:16 UTC, 0 replies.
- [jira] [Created] (NUTCH-1559) parse-metatags duplicates extracted metatags in combination with parse-tika - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/17 23:13:15 UTC, 0 replies.
- [jira] [Created] (NUTCH-1560) index-metadata to add all values of multivalued metadata - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/17 23:39:15 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1560) index-metadata to add all values of multivalued metadata - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/17 23:43:15 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1559) parse-metatags duplicates extracted metatags in combination with parse-tika - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/17 23:45:16 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1560) index-metadata to add all values of multivalued metadata - posted by "kiran (JIRA)" <ji...@apache.org> on 2013/04/17 23:53:16 UTC, 1 replies.
- [jira] [Created] (NUTCH-1561) improve usability of parse-metatags and index-metadata - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/17 23:55:16 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1501) Harmonize behavior of parsechecker and indexchecker - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/17 23:57:16 UTC, 2 replies.
- [jira] [Resolved] (NUTCH-1501) Harmonize behavior of parsechecker and indexchecker - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/18 01:23:17 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-1501) Harmonize behavior of parsechecker and indexchecker - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/18 01:25:17 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2171 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/18 02:36:45 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #572 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/18 02:37:42 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2172 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/18 06:18:25 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #573 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/18 06:23:41 UTC, 0 replies.
- [jira] [Created] (NUTCH-1562) Order of execution for scoring filters - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2013/04/18 12:01:18 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1562) Order of execution for scoring filters - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2013/04/18 12:03:16 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #2173 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/18 12:37:41 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #574 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/18 12:38:46 UTC, 0 replies.
- [jira] [Created] (NUTCH-1563) FetchSchedule#getFields is never used by GeneraterJob - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/04/18 17:25:15 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1563) FetchSchedule#getFields is never used by GeneraterJob - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/04/18 17:39:16 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2174 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/19 06:11:15 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #575 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/19 06:16:11 UTC, 0 replies.
- Re: [DISCUSS] Google Summer of Code - posted by nisrina <ni...@gmail.com> on 2013/04/19 13:06:57 UTC, 2 replies.
- [jira] [Created] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediat refetch for documents not modified - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/19 14:11:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediate refetch for documents not modified - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/19 14:13:16 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1562) Order of execution for scoring filters - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/20 02:53:15 UTC, 2 replies.
- Build failed in Jenkins: Nutch-nutchgora #576 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/20 06:33:35 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2175 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/20 06:34:48 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1563) FetchSchedule#getFields is never used by GeneraterJob - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/04/20 18:09:15 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #577 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/21 06:11:43 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #2176 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/21 06:19:13 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #578 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/22 06:08:30 UTC, 0 replies.
- Regarding URL Filtering and Politeness - posted by naveen shukla <na...@gmail.com> on 2013/04/22 14:06:28 UTC, 1 replies.
- [jira] [Reopened] (NUTCH-1552) possibility of a NPE in index-more plugin - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/22 18:25:16 UTC, 0 replies.
- java.lang.RuntimeException: Filter org.apache.nutch.urlfilter.prefix.PrefixURLFilter not found. - posted by naveen shukla <na...@gmail.com> on 2013/04/22 18:28:26 UTC, 1 replies.
- Build failed in Jenkins: Nutch-nutchgora #579 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/23 06:03:02 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2178 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/23 06:04:16 UTC, 0 replies.
- Error when running Nutch, please help - posted by Maohua Liu <ca...@gmail.com> on 2013/04/23 16:17:59 UTC, 2 replies.
- [jira] [Comment Edited] (NUTCH-1555) Move to commons-cli for command line parsing - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/04/23 16:59:16 UTC, 1 replies.
- Build failed in Jenkins: Nutch-nutchgora #580 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/24 06:38:52 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2179 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/24 06:45:56 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1555) Move to commons-cli for command line parsing - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/24 22:37:17 UTC, 2 replies.
- [Nutch Wiki] Update of "Becoming_A_Nutch_Developer" by SebastianNagel - posted by Apache Wiki <wi...@apache.org> on 2013/04/24 23:49:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1447) Nutch 2.x with Cloudera CDH 4 get Error: Found interface org.apache.hadoop.mapreduce.TaskAttemptContext, but class was expected - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/25 02:35:15 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1250) parse-html does not parse links with empty anchor - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/25 02:41:13 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1447) Nutch 2.x with Cloudera CDH 4 get Error: Found interface org.apache.hadoop.mapreduce.TaskAttemptContext, but class was expected - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/25 02:47:13 UTC, 0 replies.
- [jira] [Updated] (NUTCH-566) Sun's URL class has bug in creation of relative query URLs - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/25 02:49:13 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1543) Display consistent usage of DBUpdaterJob with 1.X - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/25 02:49:14 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1369) Improve ParserChecker in Nutchgora - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/25 02:51:13 UTC, 0 replies.
- [jira] [Updated] (NUTCH-829) duplicate hadoop temp files - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/25 03:02:17 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1523) Upgrade solr-solr4j dependency to 4.1.0 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/25 03:06:16 UTC, 0 replies.
- Issues in 2.x with Patches for review - posted by Lewis John Mcgibbney <le...@gmail.com> on 2013/04/25 03:13:42 UTC, 0 replies.
- [jira] [Created] (NUTCH-1565) Proper downloads page for Nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/25 04:30:15 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1565) Proper downloads page for Nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/25 04:30:16 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1565) Proper downloads page for Nutch - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/25 05:26:16 UTC, 3 replies.
- Build failed in Jenkins: Nutch-nutchgora #581 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/25 06:11:21 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2180 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/25 06:24:51 UTC, 0 replies.
- GSoC Student Application Window Now Open - posted by Lewis John Mcgibbney <le...@gmail.com> on 2013/04/25 06:46:48 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1565) Proper downloads page for Nutch - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/25 07:10:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1371) Replace Ivy with Maven Ant tasks - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/26 02:58:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1314) Impose a limit on the length of outlink target urls - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/26 03:21:13 UTC, 0 replies.
- Partial Updates in Solr 4.1 - posted by Jay Springbernate <js...@gmail.com> on 2013/04/26 03:56:49 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #2181 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/26 06:14:42 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #582 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/26 06:15:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1015) MoreIndexingFilter: can't parse erroneous date: 2006-05-24T20:03:42 - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:06:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-670) feed plugin does not parse RSS2 enclosures - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:08:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-841) Create a Wicket-based Web Application for Nutch - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:08:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1546) Parsechecker and redirection - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:08:20 UTC, 0 replies.
- [jira] [Updated] (NUTCH-966) Behavior of NOINDEX,FOLLOW is not intuitive - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:16:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1308) Unnecessary truncate content configuration, and logging in parse-zip/ZipParser - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:18:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1513) Support Robots.txt for Ftp urls - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:20:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1537) Legacy metadata package needs to take advantage of Apache Tika metadata package more. - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:20:18 UTC, 0 replies.
- Re: dev Digest 26 Apr 2013 19:08:20 -0000 Issue 1603 - posted by Lewis John Mcgibbney <le...@gmail.com> on 2013/04/26 21:26:12 UTC, 1 replies.
- [jira] [Updated] (NUTCH-685) Content-level redirect status lost in ParseSegment - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:28:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1548) Move all Utils classes into Utils packages & dedup Utils generally - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:32:15 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1525) Generator to record external links even when db.ignore.external.links set to true - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:32:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1549) Fix deprecated use of Tika MimeType API in o.a.n.util.MimeUtil - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:32:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-366) Merge URLFilters and URLNormalizers - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:34:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1281) tika parser not work properly with unwanted file types that passed from filters in nutch - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:34:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1257) Support for the x-robots-tag HTTP Header - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:34:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1502) Test for CrawlDatum state transitions - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:34:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-583) FeedParser empty links for items - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:34:17 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1329) parser not extract outlinks to external web sites - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:34:17 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1522) Upgrade to Tika 1.3 - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:34:17 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1527) Port nutch-elasticsearch-indexer to Nutch - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/26 21:34:18 UTC, 0 replies.
- [jira] [Updated] (NUTCH-992) SolrDedup is broken in 2.x - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/27 01:46:15 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "ErrorMessagesInNutch2" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2013/04/27 02:09:22 UTC, 5 replies.
- Build failed in Jenkins: Nutch-trunk #2182 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/27 06:03:45 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #583 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/27 06:07:32 UTC, 0 replies.
- tickets for nutch beginners - posted by Michael Aro <m....@gmail.com> on 2013/04/27 16:27:09 UTC, 1 replies.
- version for apache nutch giraph integration and irc - posted by Michael Aro <m....@gmail.com> on 2013/04/27 17:01:42 UTC, 1 replies.
- [Nutch Wiki] Update of "AdminGroup" by kiranchitturi - posted by Apache Wiki <wi...@apache.org> on 2013/04/27 22:52:08 UTC, 0 replies.
- [Nutch Wiki] Update of "ContributorsGroup" by kiranchitturi - posted by Apache Wiki <wi...@apache.org> on 2013/04/27 22:53:03 UTC, 0 replies.
- [jira] [Updated] (NUTCH-969) FTP erro encoding - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/27 23:08:15 UTC, 1 replies.
- [jira] [Commented] (NUTCH-969) FTP erro encoding - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/27 23:08:15 UTC, 0 replies.
- [jira] [Updated] (NUTCH-969) protocol-ftp with configurable encoding - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/27 23:08:16 UTC, 0 replies.
- [Nutch Wiki] Update of "bin/nutch generate" by TejasPatil - posted by Apache Wiki <wi...@apache.org> on 2013/04/27 23:19:25 UTC, 0 replies.
- [jira] [Commented] (NUTCH-829) duplicate hadoop temp files - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/28 02:38:15 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-829) duplicate hadoop temp files - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/28 03:15:14 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1528) Port nutch-mongodb-indexer to Nutch - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/28 03:29:13 UTC, 5 replies.
- [jira] [Commented] (NUTCH-346) Improve readability of logs/hadoop.log - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/28 03:38:16 UTC, 3 replies.
- Jenkins build is back to normal : Nutch-trunk #2183 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/28 06:07:03 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #584 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/28 07:07:56 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2184 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/28 07:11:05 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1528) Port nutch-mongodb-indexer to Nutch - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/28 20:36:18 UTC, 0 replies.
- [jira] [Created] (NUTCH-1566) bin/nutch to allow whitespace in paths - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/28 23:20:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1566) bin/nutch to allow whitespace in paths - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/04/28 23:24:15 UTC, 0 replies.
- Re: dev Digest 28 Apr 2013 11:04:22 -0000 Issue 1605 - posted by Lewis John Mcgibbney <le...@gmail.com> on 2013/04/29 01:14:01 UTC, 0 replies.
- [jira] [Closed] (NUTCH-346) Improve readability of logs/hadoop.log - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/29 01:36:16 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #585 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/29 02:17:50 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2185 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/29 02:26:18 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "CommandLineOptions" by TejasPatil - posted by Apache Wiki <wi...@apache.org> on 2013/04/29 02:49:39 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #586 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/29 06:02:14 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/29 06:04:17 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #2186 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/29 06:14:46 UTC, 0 replies.
- [jira] [Updated] (NUTCH-649) Log list of files found but not crawled. - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/29 06:18:16 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1314) Impose a limit on the length of outlink target urls - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/29 07:10:16 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1329) parser not extract outlinks to external web sites - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/29 07:12:19 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-891) Nutch build should not depend on unversioned local deps - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/29 21:08:16 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/29 22:30:17 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1455) RobotRulesParser to match multi-word user-agent names - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/29 22:32:15 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #587 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/29 23:14:18 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #588 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/30 06:12:50 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2187 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/30 06:19:48 UTC, 0 replies.
- [jira] [Closed] (NUTCH-342) Nutch commands log to nutch/logs/hadoop.logs by default - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 10:54:16 UTC, 0 replies.
- [jira] [Commented] (NUTCH-213) checkstyle - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 10:58:16 UTC, 1 replies.
- [jira] [Commented] (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implmentation. - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 11:04:16 UTC, 0 replies.
- [jira] [Closed] (NUTCH-449) Format of junit output should be configurable - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 11:08:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1514) Phase out the deprecated configuration properties (if possible) - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 11:20:17 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-802) Problems managing outlinks with large url length - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 11:28:16 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1273) Fix [deprecation] javac warnings - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 12:18:16 UTC, 4 replies.
- [jira] [Comment Edited] (NUTCH-1273) Fix [deprecation] javac warnings - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 13:20:17 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1273) Fix [deprecation] javac warnings - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 13:20:17 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1053) Parsing of RSS feeds fails - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 13:48:16 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1529) Port nutch-mongdb-parser to trunk - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 13:56:16 UTC, 0 replies.
- [jira] [Commented] (NUTCH-649) Log list of files found but not crawled. - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/30 18:30:16 UTC, 2 replies.
- [jira] [Created] (NUTCH-1567) More useful logging for batch id (null) scenario - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/04/30 18:46:16 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1273) Fix [deprecation] javac warnings - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 21:38:15 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1549) Fix deprecated use of Tika MimeType API in o.a.n.util.MimeUtil - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 21:42:16 UTC, 2 replies.
- Build failed in Jenkins: Nutch-nutchgora #589 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/30 22:11:50 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-213) checkstyle - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 22:48:16 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1549) Fix deprecated use of Tika MimeType API in o.a.n.util.MimeUtil - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 22:54:16 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1329) parser not extract outlinks to external web sites - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 22:56:16 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2188 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/04/30 23:09:17 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1566) bin/nutch to allow whitespace in paths - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 23:10:16 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1334) NPE in FetcherOutputFormat - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/04/30 23:48:16 UTC, 0 replies.