You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Jack Tang <hi...@gmail.com> on 2005/06/06 03:41:36 UTC

Index more...

Hi Guys

I wanna more fields in html header to be indexed or stored. 
Take below as example, 'breadcrumb' should be stored without indexing
while 'keywords' should indexed.
<html>
<head>
.....
<meta name="breadcrumb" content="home >> introduction">
<meta name="keywords"  content="">
</head>
......

</html>

And I read the index-more plugin, it seems I cannot not get <meta ...>
at all. Those info. is stored in "content". Are there something
missing in the 'metaData' in this plugin?

/Jack