You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-user@hadoop.apache.org by Jimmy Lin <ji...@umd.edu> on 2009/09/29 04:29:51 UTC

Final Reminder: NSF, Google, IBM CLuE PI Meeting: October 5, 2009

Hi everyone,

Just a final reminder for this NSF/Google/IBM event next Monday (10/5). 
  We've put together an exciting program with talks by Luiz André 
Barroso (Google), Hamid Pirahesh (IBM), and many academics describing 
their latest research with Hadoop.  Hope you can join us next week!

Best,
Jimmy


==CLuE PI Meeting 2009==

Monday, October 5, 2009
Computer History Museum
Mountain View, California

Sponsored by the National Science Foundation, Google, IBM
Organized by the University of Maryland Cloud Computing Center

Website: https://wiki.umiacs.umd.edu/ccc/index.php/CLuE_PI_Meeting_2009
Registration: http://clue2009.eventbrite.com/

= What's this event about?

In October 2007, Google and IBM announced the first pilot phase of the 
Academic Cloud Computing Initiative (ACCI), which granted several 
prominent U.S. universities access to a large computer cluster running 
Hadoop, an open source distributed computing platform inspired by 
Google’s file system and MapReduce programming model. In February 2008, 
the ACCI partnered with the National Science Foundation to provide grant 
funding to academic researchers interested in exploring large-data 
applications that could take advantage of this infrastructure. This 
resulted in the creation of the Cluster Exploratory (CLuE) program led 
by Dr. Jim French, which currently funds 14 projects. See this NSF Press 
Release for a short description of all the projects funded under the 
CLuE program.

Nearing the two year anniversary of this collaboration, the National 
Science Foundation, Google, and IBM will be jointly sponsoring a meeting 
for the CLuE project principal investigators (PIs). This will event will 
be open to the public—in fact, the explicit goal of this event is to 
showcase the exciting research currently underway in academia and 
promote closer ties with the broader "cloud computing" community in the 
bay area.

= Schedule at a Glance

See below for an overview of talks scheduled for the day. We are pleased 
to welcome two keynotes, by Hamid Pirahesh from IBM (in the morning) and 
Luiz Barroso from Google (in the afternoon). The meeting will be capped 
off with a poster reception in the early evening, where representatives 
of all CLuE projects will present their work in a more informal setting.

Morning Session

(07:30 - 08:00) Registration and breakfast

(08:00 - 08:30) Introductions

(08:30 - 09:15) IBM keynote: Impact of Cloud Computing on Research in 
Extreme Scale Analytics. Hamid Pirahesh

(09:15 - 09:40) Topic-Partitioned Search Engine Indexes. Jamie Callan, 
Jaime Arguello, Anagha Kulkarni (CMU)

(09:40 - 10:05) Indexing Geospatial Data with MapReduce. Naphtali Rishe, 
Vagelis Hristidis, Raju Rangaswami, Ouri Wolfson, Howard Ho, Ariel Cary, 
Zhengguo Sun, Lester Melendes (Florida International University)

(10:05 - 10:30) Morning coffee break

(10:30 - 10:55) Scalable Graph Processing in Data Center Environments. 
Ben Zhao, Xifeng Yan, Divyakant Agrawal, Amr El Abbadi (University of 
California, Santa Barbara)

(10:55 - 11:20) Large-Scale Data Cleaning Using Hadoop. Chen Li, Michael 
Carey, Alexander Behm, Shengyue Ji, Rares Vernica (University of 
California, Irvine)

(11:20 - 11:45) Cluster Computing for Statistical Machine Translation. 
Stephan Vogel, Qin Gao, Noah Smith, Kevin Gimpel, Alok Parlikar, Andreas 
Zollmann (CMU)

(11:45 - 12:10) Research and Education with MapReduce/Hadoop: 
Data-Intensive Text Processing and Beyond. Jimmy Lin, Tamer Elsayed, 
Chris Dyer, Philip Resnik, Doug Oard (University of Maryland)

Afternoon Session

(1:00 - 1:45) Google keynote: Datacenter-Scale Computing. Luiz André Barroso

(1:45 - 2:10) A Performance and Usability Comparison of Hadoop and 
Relational Database Systems. Sam Madden, Andrew Pavlo, Erik Paulson, 
Alexander Rasin, Daniel Abadi, David DeWitt, Michael Stonebraker (MIT, 
Brown, University of Wisconsin, Microsoft, Yale)

(2:10 - 2:35) HadoopDB An Architectural Hybrid of MapReduce and DBMS 
Technologies for Analytical Workloads. Daniel Abadi, Azza Abouzeid, 
Kamil Bajda-Pawlikowski (Yale University)

(2:35 - 3:00) Towards Interactive Visualization in the Cloud. Bill Howe, 
Huy Vo, Claudio Silva, Juliana Friere, YingYi Bu (University of 
Washington, University of Utah)

(3:00 - 3:30) Afternoon coffee break

(3:30 - 3:55) Scaling the Sky with MapReduce/Hadoop. Andrew Connolly, 
Jeff Gardner, Simon Krughoff (University of Washington)

(3:55 - 4:20) Commodity Computing in Genomics Research. Mihai Pop, Mike 
Schatz (University of Maryland)

(4:20 - 4:45) Relaxed Synchronization and Eager Scheduling in MapReduce. 
Ananth Grama, Suresh Jagannathan (Purdue University)

(4:45 - 5:10) Dynamic Provisioning of Data Intensive Applications. 
Chaitanya Baru, Sriram Krishnan (San Diego Supercomputer 
Center/University of California, San Diego

Re: Final Reminder: NSF, Google, IBM CLuE PI Meeting: October 5, 2009

Posted by Jason Venner <ja...@gmail.com>.
You can also publish them on www.prohadoop.com, as well as announce your
events ;)

On Tue, Sep 29, 2009 at 7:58 AM, Oliver Senn <se...@student.ethz.ch> wrote:

> +1
>
>
> Steve Lihn wrote:
>
>> Can the group make these speeches available online (such as youtube)
>> for the global community?
>>
>> Thx, steve
>>
>> On 9/28/09, Jimmy Lin <ji...@umd.edu> wrote:
>>
>>> Hi everyone,
>>>
>>> Just a final reminder for this NSF/Google/IBM event next Monday (10/5).
>>>  We've put together an exciting program with talks by Luiz André
>>> Barroso (Google), Hamid Pirahesh (IBM), and many academics describing
>>> their latest research with Hadoop.  Hope you can join us next week!
>>>
>>> Best,
>>> Jimmy
>>>
>>


-- 
Pro Hadoop, a book to guide you from beginner to hadoop mastery,
http://www.amazon.com/dp/1430219424?tag=jewlerymall
www.prohadoopbook.com a community for Hadoop Professionals

Re: Final Reminder: NSF, Google, IBM CLuE PI Meeting: October 5, 2009

Posted by Jimmy Lin <ji...@umd.edu>.
Thanks for the feedback.  I'll look into it... but at the very least 
slides will be posted online.

-Jimmy

Oliver Senn wrote:
> +1
> 
> Steve Lihn wrote:
>> Can the group make these speeches available online (such as youtube)
>> for the global community?
>>
>> Thx, steve
>>
>> On 9/28/09, Jimmy Lin <ji...@umd.edu> wrote:
>>> Hi everyone,
>>>
>>> Just a final reminder for this NSF/Google/IBM event next Monday (10/5).
>>>   We've put together an exciting program with talks by Luiz André
>>> Barroso (Google), Hamid Pirahesh (IBM), and many academics describing
>>> their latest research with Hadoop.  Hope you can join us next week!
>>>
>>> Best,
>>> Jimmy
> 

Re: Final Reminder: NSF, Google, IBM CLuE PI Meeting: October 5, 2009

Posted by Oliver Senn <se...@student.ethz.ch>.
+1

Steve Lihn wrote:
> Can the group make these speeches available online (such as youtube)
> for the global community?
> 
> Thx, steve
> 
> On 9/28/09, Jimmy Lin <ji...@umd.edu> wrote:
>> Hi everyone,
>>
>> Just a final reminder for this NSF/Google/IBM event next Monday (10/5).
>>   We've put together an exciting program with talks by Luiz André
>> Barroso (Google), Hamid Pirahesh (IBM), and many academics describing
>> their latest research with Hadoop.  Hope you can join us next week!
>>
>> Best,
>> Jimmy

Re: Final Reminder: NSF, Google, IBM CLuE PI Meeting: October 5, 2009

Posted by Steve Lihn <st...@gmail.com>.
Can the group make these speeches available online (such as youtube)
for the global community?

Thx, steve

On 9/28/09, Jimmy Lin <ji...@umd.edu> wrote:
> Hi everyone,
>
> Just a final reminder for this NSF/Google/IBM event next Monday (10/5).
>   We've put together an exciting program with talks by Luiz André
> Barroso (Google), Hamid Pirahesh (IBM), and many academics describing
> their latest research with Hadoop.  Hope you can join us next week!
>
> Best,
> Jimmy
>
>
> ==CLuE PI Meeting 2009==
>
> Monday, October 5, 2009
> Computer History Museum
> Mountain View, California
>
> Sponsored by the National Science Foundation, Google, IBM
> Organized by the University of Maryland Cloud Computing Center
>
> Website: https://wiki.umiacs.umd.edu/ccc/index.php/CLuE_PI_Meeting_2009
> Registration: http://clue2009.eventbrite.com/
>
> = What's this event about?
>
> In October 2007, Google and IBM announced the first pilot phase of the
> Academic Cloud Computing Initiative (ACCI), which granted several
> prominent U.S. universities access to a large computer cluster running
> Hadoop, an open source distributed computing platform inspired by
> Google’s file system and MapReduce programming model. In February 2008,
> the ACCI partnered with the National Science Foundation to provide grant
> funding to academic researchers interested in exploring large-data
> applications that could take advantage of this infrastructure. This
> resulted in the creation of the Cluster Exploratory (CLuE) program led
> by Dr. Jim French, which currently funds 14 projects. See this NSF Press
> Release for a short description of all the projects funded under the
> CLuE program.
>
> Nearing the two year anniversary of this collaboration, the National
> Science Foundation, Google, and IBM will be jointly sponsoring a meeting
> for the CLuE project principal investigators (PIs). This will event will
> be open to the public—in fact, the explicit goal of this event is to
> showcase the exciting research currently underway in academia and
> promote closer ties with the broader "cloud computing" community in the
> bay area.
>
> = Schedule at a Glance
>
> See below for an overview of talks scheduled for the day. We are pleased
> to welcome two keynotes, by Hamid Pirahesh from IBM (in the morning) and
> Luiz Barroso from Google (in the afternoon). The meeting will be capped
> off with a poster reception in the early evening, where representatives
> of all CLuE projects will present their work in a more informal setting.
>
> Morning Session
>
> (07:30 - 08:00) Registration and breakfast
>
> (08:00 - 08:30) Introductions
>
> (08:30 - 09:15) IBM keynote: Impact of Cloud Computing on Research in
> Extreme Scale Analytics. Hamid Pirahesh
>
> (09:15 - 09:40) Topic-Partitioned Search Engine Indexes. Jamie Callan,
> Jaime Arguello, Anagha Kulkarni (CMU)
>
> (09:40 - 10:05) Indexing Geospatial Data with MapReduce. Naphtali Rishe,
> Vagelis Hristidis, Raju Rangaswami, Ouri Wolfson, Howard Ho, Ariel Cary,
> Zhengguo Sun, Lester Melendes (Florida International University)
>
> (10:05 - 10:30) Morning coffee break
>
> (10:30 - 10:55) Scalable Graph Processing in Data Center Environments.
> Ben Zhao, Xifeng Yan, Divyakant Agrawal, Amr El Abbadi (University of
> California, Santa Barbara)
>
> (10:55 - 11:20) Large-Scale Data Cleaning Using Hadoop. Chen Li, Michael
> Carey, Alexander Behm, Shengyue Ji, Rares Vernica (University of
> California, Irvine)
>
> (11:20 - 11:45) Cluster Computing for Statistical Machine Translation.
> Stephan Vogel, Qin Gao, Noah Smith, Kevin Gimpel, Alok Parlikar, Andreas
> Zollmann (CMU)
>
> (11:45 - 12:10) Research and Education with MapReduce/Hadoop:
> Data-Intensive Text Processing and Beyond. Jimmy Lin, Tamer Elsayed,
> Chris Dyer, Philip Resnik, Doug Oard (University of Maryland)
>
> Afternoon Session
>
> (1:00 - 1:45) Google keynote: Datacenter-Scale Computing. Luiz André Barroso
>
> (1:45 - 2:10) A Performance and Usability Comparison of Hadoop and
> Relational Database Systems. Sam Madden, Andrew Pavlo, Erik Paulson,
> Alexander Rasin, Daniel Abadi, David DeWitt, Michael Stonebraker (MIT,
> Brown, University of Wisconsin, Microsoft, Yale)
>
> (2:10 - 2:35) HadoopDB An Architectural Hybrid of MapReduce and DBMS
> Technologies for Analytical Workloads. Daniel Abadi, Azza Abouzeid,
> Kamil Bajda-Pawlikowski (Yale University)
>
> (2:35 - 3:00) Towards Interactive Visualization in the Cloud. Bill Howe,
> Huy Vo, Claudio Silva, Juliana Friere, YingYi Bu (University of
> Washington, University of Utah)
>
> (3:00 - 3:30) Afternoon coffee break
>
> (3:30 - 3:55) Scaling the Sky with MapReduce/Hadoop. Andrew Connolly,
> Jeff Gardner, Simon Krughoff (University of Washington)
>
> (3:55 - 4:20) Commodity Computing in Genomics Research. Mihai Pop, Mike
> Schatz (University of Maryland)
>
> (4:20 - 4:45) Relaxed Synchronization and Eager Scheduling in MapReduce.
> Ananth Grama, Suresh Jagannathan (Purdue University)
>
> (4:45 - 5:10) Dynamic Provisioning of Data Intensive Applications.
> Chaitanya Baru, Sriram Krishnan (San Diego Supercomputer
> Center/University of California, San Diego
>