You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Jack Tang <hi...@gmail.com> on 2005/06/06 03:41:36 UTC
Index more...
Hi Guys
I wanna more fields in html header to be indexed or stored.
Take below as example, 'breadcrumb' should be stored without indexing
while 'keywords' should indexed.
<html>
<head>
.....
<meta name="breadcrumb" content="home >> introduction">
<meta name="keywords" content="">
</head>
......
</html>
And I read the index-more plugin, it seems I cannot not get <meta ...>
at all. Those info. is stored in "content". Are there something
missing in the 'metaData' in this plugin?
/Jack