Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2010/08/11 17:19:33 UTC

[Nutch Wiki] Trivial Update of "TikaPlugin" by AndreRicardo

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "TikaPlugin" page has been changed by AndreRicardo.
http://wiki.apache.org/nutch/TikaPlugin?action=diff&rev1=7&rev2=8

--------------------------------------------------

  
  '''mp3''': Nutch identifies several fields (Title, Album, Artist) whereas Tika knows only about Titles, the rest is stored as paragraphs.
  
- Tika-app can also identify in an mp3 id3v1 and id3v2 tags like: album, artist, audioSampleRate, composer, genre, logcomment, releaseDate, trackNumber.
+ Tika-app can also identify in an mp3 id3v1 and id3v2 tags like: album, artist, audioSampleRate, composer, genre, logcomment, releaseDate, trackNumber using the [[http://tika.apache.org/0.7/api/org/apache/tika/metadata/XMPDM.html|XMPDM interface]]
  
  '''msexcel''': comparable (+ Tika able to represent content in structured way as XHTML tables which can be useful for HTML parser plugins)