CLuE PI Meeting 2009

From University of Maryland Cloud Computing Center

Monday, October 5, 2009
Computer History Museum
Mountain View, California (directions)

Register Here!


In October 2007, Google and IBM announced the first pilot phase of the Academic Cloud Computing Initiative (ACCI), which granted several prominent U.S. universities access to a large computer cluster running Hadoop, an open-source distributed computing platform inspired by Google’s file system and MapReduce programming model. In February 2008, the ACCI partnered with the National Science Foundation to provide grant funding to academic researchers interested in exploring large-data applications that could take advantage of this infrastructure. This partnership led to the creation of the Cluster Exploratory (CLuE) program, led by Dr. Jim French, which currently funds 14 projects. See this NSF Press Release for a short description of all the projects funded under the CLuE program.

As this collaboration nears its two-year anniversary, the National Science Foundation, Google, and IBM will jointly sponsor a meeting for the CLuE project principal investigators (PIs). This event will be open to the public; in fact, its explicit goal is to showcase the exciting research currently underway in academia and to promote closer ties with the broader "cloud computing" community in the Bay Area.

Schedule at a Glance

See below for an overview of the talks scheduled for the day. We are pleased to welcome two keynote speakers: Hamid Pirahesh from IBM (in the morning) and Luiz Barroso from Google (in the afternoon). The meeting will be capped off with a poster reception in the early evening, where representatives of all CLuE projects will present their work in a more informal setting. See CLuE PI Meeting 2009: Program for abstracts and additional details.

Morning Session

(07:30 - 08:00) Registration and breakfast

(08:00 - 08:30) Introductions

(08:30 - 09:15) IBM keynote: Impact of Cloud Computing on Research in Extreme Scale Analytics

Hamid Pirahesh

(09:15 - 09:40) Topic-Partitioned Search Engine Indexes

Jamie Callan, Jaime Arguello, Anagha Kulkarni (CMU)

(09:40 - 10:05) Indexing Geospatial Data with MapReduce

Naphtali Rishe, Vagelis Hristidis, Raju Rangaswami, Ouri Wolfson, Howard Ho, Ariel Cary, Zhengguo Sun, Lester Melendes (Florida International University)

(10:05 - 10:30) Morning coffee break

(10:30 - 10:55) Scalable Graph Processing in Data Center Environments

Ben Zhao, Xifeng Yan, Divyakant Agrawal, Amr El Abbadi (University of California, Santa Barbara)

(10:55 - 11:20) Large-Scale Data Cleaning Using Hadoop

Chen Li, Michael Carey, Alexander Behm, Shengyue Ji, Rares Vernica (University of California, Irvine)

(11:20 - 11:45) Cluster Computing for Statistical Machine Translation

Stephan Vogel, Qin Gao, Noah Smith, Kevin Gimpel, Alok Parlikar, Andreas Zollmann (CMU)

(11:45 - 12:10) Research and Education with MapReduce/Hadoop: Data-Intensive Text Processing and Beyond

Jimmy Lin, Tamer Elsayed, Chris Dyer, Philip Resnik, Doug Oard (University of Maryland)

Afternoon Session

(1:00 - 1:45) Google keynote: Datacenter-Scale Computing

Luiz André Barroso

(1:45 - 2:10) A Performance and Usability Comparison of Hadoop and Relational Database Systems

Sam Madden, Andrew Pavlo, Erik Paulson, Alexander Rasin, Daniel Abadi, David DeWitt, Michael Stonebraker (MIT, Brown, University of Wisconsin, Microsoft, Yale)

(2:10 - 2:35) HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads

Daniel Abadi, Azza Abouzeid, Kamil Bajda-Pawlikowski (Yale University)

(2:35 - 3:00) Towards Interactive Visualization in the Cloud

Bill Howe, Huy Vo, Claudio Silva, Juliana Freire, YingYi Bu (University of Washington, University of Utah)

(3:00 - 3:30) Afternoon coffee break

(3:30 - 3:55) Scaling the Sky with MapReduce/Hadoop

Andrew Connolly, Jeff Gardner, Simon Krughoff (University of Washington)

(3:55 - 4:20) Commodity Computing in Genomics Research

Mihai Pop, Mike Schatz (University of Maryland)

(4:20 - 4:45) Relaxed Synchronization and Eager Scheduling in MapReduce

Ananth Grama, Suresh Jagannathan (Purdue University)

(4:45 - 5:10) Dynamic Provisioning of Data Intensive Applications

Chaitanya Baru, Sriram Krishnan (San Diego Supercomputer Center/University of California, San Diego)

Poster Reception

(5:30 - 7:30) Hors d'oeuvres and drinks hosted by Google and IBM. Come mingle informally with researchers and attendees!