You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2015/05/20 10:42:23 UTC

[Nutch Wiki] Update of "GoogleSummerOfCode/Giving HTML5 support for Apache Nutch 2.x" by HalilSimsek

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "GoogleSummerOfCode/Giving HTML5 support for Apache Nutch 2.x" page has been changed by HalilSimsek:
https://wiki.apache.org/nutch/GoogleSummerOfCode/Giving%20HTML5%20support%20for%20Apache%20Nutch%202.x

Comment:
Detail Page for Halil Ibrahim Simsek's GSoC 2015 project created.

New page:
==== Giving HTML5 support for Apache Nutch 2.x ====
===== Description =====
The project is aimed at giving Html5 support to Apache Nutch 2.x with using a java library. With this project two goals is aimed. First one is implementation of a new parser which has to follow WHATWG HTML5 specification. Second one is implementation of a new plugin which uses newly implemented parser and extracts new elements of HTML5.

===== Reports =====
Reports will be added here.

===== Documentation =====
Documents will be added here.

===== Jira Issues =====

Issues will be added here.