Posted to user@nutch.apache.org by tahere ganjiyar <ta...@gmail.com> on 2012/01/08 18:20:41 UTC

crawl-javascript

I use Nutch 1.4 and want to crawl sites together with their JavaScript files. How
should I configure Nutch?

Re: crawl-javascript

Posted by mina <ta...@gmail.com>.
I want to crawl .js files because I put links to some sites in my .js files. How
can I configure Nutch to crawl .js files?


--
View this message in context: http://lucene.472066.n3.nabble.com/crawl-javascript-tp3642437p3642593.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Re: crawl-javascript

Posted by Markus Jelsma <ma...@openindex.io>.
Check your regex URL filters: .js files are filtered out by default, if I'm not
mistaken.
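
To illustrate the suggestion above, here is a minimal Python sketch of how a
first-match regex URL filter of this kind behaves. It is not Nutch code. The
rule lists are assumptions that only resemble the suffix rule usually found in
conf/regex-urlfilter.txt (shortened here), and the accepts() helper and the
example.com URLs are hypothetical. Check your own file for the exact line.

```python
import re

# Rules in the style of conf/regex-urlfilter.txt: a leading '-' rejects,
# a leading '+' accepts, and the first rule whose regex is found in the URL wins.
# ASSUMPTION: this suffix rule is a shortened stand-in for the stock one.
DEFAULT_RULES = [
    r"-\.(gif|GIF|jpg|JPG|png|PNG|ico|ICO|css|CSS|zip|ZIP|exe|EXE|js|JS)$",
    r"+.",
]

# The same rules with js|JS dropped from the suffix list, so .js URLs
# fall through to the catch-all "+." rule.
JS_ALLOWED_RULES = [
    r"-\.(gif|GIF|jpg|JPG|png|PNG|ico|ICO|css|CSS|zip|ZIP|exe|EXE)$",
    r"+.",
]


def accepts(rules, url):
    """Return True if the first rule matching the URL is an accept ('+') rule."""
    for rule in rules:
        sign, pattern = rule[0], rule[1:]
        if re.search(pattern, url):
            return sign == "+"
    return False  # no rule matched: the URL is dropped


if __name__ == "__main__":
    url = "http://example.com/app.js"
    print(accepts(DEFAULT_RULES, url))
    print(accepts(JS_ALLOWED_RULES, url))
```

In a real setup the equivalent change would be removing js|JS from the
suffix-exclusion line in the filter file, so that .js URLs are no longer
rejected before they are fetched.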

> I use Nutch 1.4 and want to crawl sites together with their JavaScript files. How
> should I configure Nutch?