Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2015/02/05 00:19:26 UTC

[Nutch Wiki] Trivial Update of "AdvancedAjaxInteraction" by LewisJohnMcgibbney

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "AdvancedAjaxInteraction" page has been changed by LewisJohnMcgibbney:
https://wiki.apache.org/nutch/AdvancedAjaxInteraction

New page:
= AdvancedAjaxInteraction =

This page collects commentary and thoughts on adapting Nutch not only to fetch dynamic HTML content driven by AJAX/JavaScript, but also to interact with that content (potentially several times) during a single fetch.

<<TableOfContents(4)>>

== Let's Begin with a Scenario ==

xyz
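
As a rough illustration of the "fetch, then interact a number of times" idea described at the top of this page, the sketch below captures an HTML snapshot after the initial load and after each interaction. Everything in it is an assumption made for illustration: `RenderedPage`, `FakePage`, `fetchWithInteractions` and the `click-load-more` action name are hypothetical and are not part of Nutch or of the plugins listed below. A real implementation would presumably wrap a Selenium-driven browser session behind an interface like this.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class InteractiveFetchSketch {

    /** Hypothetical stand-in for a browser session (not a Nutch or Selenium type). */
    interface RenderedPage {
        /** Returns the HTML of the page in its current state. */
        String html();

        /** Performs one named interaction and returns the resulting page state. */
        RenderedPage interact(String action);
    }

    /** Collects one HTML snapshot after the initial load and one after each interaction. */
    static List<String> fetchWithInteractions(RenderedPage page, List<String> actions) {
        List<String> snapshots = new ArrayList<>();
        snapshots.add(page.html());
        RenderedPage current = page;
        for (String action : actions) {
            current = current.interact(action);
            snapshots.add(current.html());
        }
        return snapshots;
    }

    /** Toy in-memory page: each "click-load-more" adds one list item; other actions do nothing. */
    static final class FakePage implements RenderedPage {
        private final int items;

        FakePage(int items) {
            this.items = items;
        }

        @Override
        public String html() {
            StringBuilder sb = new StringBuilder("<ul>");
            for (int i = 1; i <= items; i++) {
                sb.append("<li>item ").append(i).append("</li>");
            }
            return sb.append("</ul>").toString();
        }

        @Override
        public RenderedPage interact(String action) {
            return "click-load-more".equals(action) ? new FakePage(items + 1) : this;
        }
    }

    public static void main(String[] args) {
        List<String> snapshots = fetchWithInteractions(
                new FakePage(1), Arrays.asList("click-load-more", "click-load-more"));
        // Print each captured state of the page, one per line.
        snapshots.forEach(System.out::println);
    }
}
```

Keeping every intermediate snapshot, rather than only the final one, is one possible design choice: it would let later parsing see content that a subsequent interaction replaces or hides.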

== Related Development Issues ==

 * [[https://issues.apache.org/jira/browse/NUTCH-1933|NUTCH-1933]]
 * [[https://github.com/momer/nutch-selenium|momer/nutch-selenium]] - This plugin allows you to fetch JavaScript pages using Selenium, while relying on the rest of the awesome Nutch stack! (ported to issue NUTCH-1933)
 * [[https://github.com/momer/nutch-selenium-grid-plugin|momer/nutch-selenium-grid-plugin]] - This plugin allows you to fetch JavaScript pages using an existing Selenium Hub/Node setup, while relying on the rest of the awesome Nutch stack!


== Related Articles ==
 * [[http://soryy.com/blog/2014/ajax-javascript-enabled-parsing-apache-nutch-selenium/|AJAX/JavaScript Enabled Parsing with Apache Nutch and Selenium]]