You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2006/03/06 11:05:48 UTC

[Nutch Wiki] Trivial Update of "nutch-0.8-dev/bin/nutch segread" by AndrzejBialecki

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The following page has been changed by AndrzejBialecki:
http://wiki.apache.org/nutch/nutch-0%2e8-dev/bin/nutch_segread

------------------------------------------------------------------------------
   None.
  
  === Caveats and Notes ===
-  Creates a directory in <segment> called segdump.  Within that directory a number of files are created.  A dump file called ''dump'' and several other files prefixed ''part-''.  The dump file contains some readable information about the pages fetched and their parsed information.  The part files are consolidated together to form the dump file and can be deleted.  Do not 'cat' these files if in a term as it does contain some binary data that will corrupt your terminal.
+  Creates a directory in <segment> called segdump.  Within that directory a number of files are created.  A dump file called ''dump'' and several other files prefixed ''part-''.  The dump file contains some readable information about the pages fetched and their parsed information.  The part files are consolidated together to form the dump file and can be deleted.  Do not 'cat' these files if in a term as it does contain some binary data that will corrupt your terminal (however, if you end up in such state, you can reset your terminal with 'stty sane' or if this fails with 'reset').
  
  DevelopmentCommandLineOptions