Posted to common-user@hadoop.apache.org by Rob Stewart <ro...@googlemail.com> on 2010/01/13 16:30:27 UTC
Java programming Conditional select
Hi folks,
I am putting together a comparative study of the various Hadoop languages.
I have all but completed the implementations in JAQL, Pig and Hive for the
tasks, leaving Java MapReduce for last.
The applications are simple, and they include:
1. Conditional Select (A and B would be read from files A.dat and B.dat)
--------------
select A.filename from A, B where A.filename = B.filename
--------------
2. Aggregate Select (input would be read from a delimited file webLog.dat)
--------------
select userID, AVG(timeOnSite) as averages, COUNT(pageID)
from webLog
group by userID
order by averages DESC;
--------------
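To make the intended semantics of the two tasks concrete, here is a minimal plain-Java sketch with no Hadoop dependencies. It only mimics the map, shuffle and reduce steps in memory and is not a Hadoop job. Everything named in it is an assumption made for illustration: the class SelectSketch, the holder type UserStats, the comma delimiter, the field order userID,pageID,timeOnSite in webLog.dat, and the idea that A.dat and B.dat hold one filename per line with no repeats inside a file.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

class SelectSketch {

    // Task 1, in the style of a reduce-side join: the "map" step tags each
    // filename with its source, the "shuffle" groups tags by filename, and
    // the "reduce" step keeps filenames that were seen in both A and B.
    // Assumes filenames are unique within each input.
    static List<String> conditionalSelect(List<String> a, List<String> b) {
        Map<String, Set<Character>> shuffled = new TreeMap<>();
        for (String f : a) shuffled.computeIfAbsent(f, k -> new TreeSet<>()).add('A');
        for (String f : b) shuffled.computeIfAbsent(f, k -> new TreeSet<>()).add('B');
        List<String> out = new ArrayList<>();
        for (Map.Entry<String, Set<Character>> e : shuffled.entrySet()) {
            if (e.getValue().size() == 2) out.add(e.getKey());
        }
        return out;
    }

    // One output row of task 2 (hypothetical holder type).
    static final class UserStats {
        final String userID;
        final double average;
        final long count;
        UserStats(String userID, double average, long count) {
            this.userID = userID;
            this.average = average;
            this.count = count;
        }
    }

    // Task 2: group by userID, compute AVG(timeOnSite) and COUNT(pageID),
    // then order by the average, descending.
    // Assumed record layout: userID,pageID,timeOnSite (pageID never missing).
    static List<UserStats> aggregateSelect(List<String> lines) {
        Map<String, long[]> acc = new TreeMap<>(); // userID -> {sum, count}
        for (String line : lines) {
            String[] p = line.split(",");
            long[] sc = acc.computeIfAbsent(p[0].trim(), k -> new long[2]);
            sc[0] += Long.parseLong(p[2].trim());
            sc[1]++;
        }
        List<UserStats> out = new ArrayList<>();
        for (Map.Entry<String, long[]> e : acc.entrySet()) {
            long[] sc = e.getValue();
            out.add(new UserStats(e.getKey(), (double) sc[0] / sc[1], sc[1]));
        }
        out.sort(Comparator.comparingDouble((UserStats s) -> s.average).reversed());
        return out;
    }

    public static void main(String[] args) {
        // Made-up sample data, for illustration only.
        System.out.println(conditionalSelect(
                Arrays.asList("x.txt", "y.txt", "z.txt"),
                Arrays.asList("y.txt", "z.txt", "w.txt")));
        for (UserStats s : aggregateSelect(
                Arrays.asList("u1,p1,10", "u1,p2,20", "u2,p1,40"))) {
            System.out.println(s.userID + "\t" + s.average + "\t" + s.count);
        }
    }
}
```

In a real MapReduce version I would expect the grouping in each task to fall out of the shuffle on the key, while the final "order by averages DESC" would probably need a second pass (or a single reducer), since the average is only known after the reduce step.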
My question is: can these tasks
A) be found in the packaged hadoop-examples.jar (I know wordcount is)?
B) be implemented in a straightforward way in Java MapReduce?
C) be implemented with some obvious API method calls within Hadoop that I
have not yet found?