You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2015/05/20 10:42:23 UTC
[Nutch Wiki] Update of "GoogleSummerOfCode/Giving HTML5 support for Apache Nutch 2.x" by HalilSimsek
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "GoogleSummerOfCode/Giving HTML5 support for Apache Nutch 2.x" page has been changed by HalilSimsek:
https://wiki.apache.org/nutch/GoogleSummerOfCode/Giving%20HTML5%20support%20for%20Apache%20Nutch%202.x
Comment:
Detail Page for Halil Ibrahim Simsek's GSoC 2015 project created.
New page:
==== Giving HTML5 support for Apache Nutch 2.x ====
===== Description =====
The project is aimed at giving Html5 support to Apache Nutch 2.x with using a java library. With this project two goals is aimed. First one is implementation of a new parser which has to follow WHATWG HTML5 specification. Second one is implementation of a new plugin which uses newly implemented parser and extracts new elements of HTML5.
===== Reports =====
Reports will be added here.
===== Documentation =====
Documents will be added here.
===== Jira Issues =====
Issues will be added here.