You are viewing a plain text version of this content. The canonical link for it is here.

Posted to commits@pig.apache.org by Apache Wiki <wi...@apache.org> on 2007/12/11 16:32:10 UTC

[Pig Wiki] Trivial Update of "PigOverview" by ChrisOlston

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Pig Wiki" for change notification.

The following page has been changed by ChrisOlston:
http://wiki.apache.org/pig/PigOverview

------------------------------------------------------------------------------
  
  Pig has two parts:
   * A language for processing data, called ''Pig Latin''.
-  * A set of ''evaluation mechanisms'' for evaluating a Pig Latin program. Current evaluation mechanisms include (a) local evaluation in a single JVM, (b) evaluation by translation into one or more Map-Reduce jobs, executed using Hadoop.
+  * A set of ''evaluation mechanisms'' for evaluating a Pig Latin program. Current evaluation mechanisms include (a) local evaluation in a single JVM, (b) evaluation by translation into one or more Map-Reduce jobs, executed using [lucene.apache.org/hadoop Hadoop].
  
  == Pig Latin programs: ==
  
@@ -26, +26 @@

  == Data formats: ==
  
   * Pig can process data of any format. (Pigs eat anything! .. or is that goats?) A few common formats such as tab delimited text files, are supported via built-in capabilities. A user can add support for a file format by writing a function that parses the bytes of a file into objects in Pig's data model, and vice versa.
-  * Pig's data model is similar to the relational data model, except that tuples can be nested. For example, you can have a table of tuples, where the third field of each tuple contains a table. In Pig, tables are called bags.
+  * Pig's data model is similar to the relational data model, except that tuples (a.k.a. records or rows) can be nested. For example, you can have a table of tuples, where the third field of each tuple contains a table. In Pig, tables are called bags.
  
  == Other capabilities: ==