You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Savannah Beckett <sa...@yahoo.com> on 2010/08/06 04:41:23 UTC

bug? nutch cannot parse urls in tbody

Can Nutch 1.0 parse urls inside a table?  somehow none of the urls inside 
<tbody> were parsed in this link:

http://seeker.dice.com/jobsearch/servlet/JobSearch?METRO_AREA=33.78715899%2C-84.39164034&EXTRA_STUFF=0&op=300&Hf=0&SORTSPEC=0&Ntk=JobSearchRanking&DAYSBACK=30&LOCATION_OPTION=4&No=0&Ntx=mode+matchall&NUM_PER_PAGE=30&N=301826+3122&FRMT=0&caller=3


Is it a bug?