- Submission Deadline:
June 10, 2013
May 31, 2013
- Notification of Acceptance:
July 10, 2013
July 8, 2013
- Camera Ready Manuscript:
October 3, 2013
- Workshop Date:
August 26, 2013
- 24/7/2013: Workshop program online
- 8/7/2013: Author notification extended to July 10!
- 23/5/2013: Paper submission deadline extended to June 10!
- 15/4/2013: EasyChair submission page online
2nd Workshop on Big Data Management in Clouds
The second edition of the Workshop on Big Data Management in Clouds will be held in Aachen, Germany. BigDataCloud 2013 follows the joint BDMC / CoreGRID-ERCIM Workshop on Grids, Clouds and P2P Computing held in conjunction with EuroPar 2012. Its goal is to aggregate the data management and Clouds / Grids / P2P communities built around the previous editions of these workshops in order to complement the data handling issues with a comprehensive system / infrastructure perspective.
Workshop Program [top]
Session WC4, Room: Main Building - Lecture Hall IV, Chair: Frédéric Desprez (INRIA / LIP ENS Lyon)
- 14:30 Bogdan Nicolae. Understanding Vertical Scalability of I/O Virtualization for MapReduce Workloads: Challenges and Opportunities
- 15:00 Spiros Koulouzis, Dmitry Vasyunin, Reginald Cushing, Marian Bubak. Cloud Data Storage Federation for Scientific Applications
- 15:30 Noah Watkins, Carlos Maltzahn, Scott Brandt, Ian Pye, Adam Manzanares. In-Vivo Storage System Development
Session WD4, Room: Main Building - Lecture Hall IVChair: Alexandru Costan (INRIA / INSA Rennes)
- 16:30 Dr. Götz Brasche, Microsoft Research, Advanced Technology Labs Europe. Invited Talk: Advancing Science with Big Data in the Cloud
- 17:10 Nikolaus Jeremic, Helge Parzyjegla, Gero Mühl, Jan Richling. Towards Workload-Driven Automated Adaptation of Data Organization in Heterogeneous Storage Systems
- 17:40 Tiberiu Rotaru, Mirko Rahn. MapReduce in GPI-Space
Invited Talk [top]
Dr. Götz Brasche, Microsoft Research, Advanced Technology Labs Europe.
Advancing Science with Big Data in the Cloud
Dr. Götz Philip Brasche, head of Cloud Computing Initiative for Europe, Middle East, and Africa (EMEA) at Microsoft, co-founded the European Microsoft Innovation Center (EMIC) in 2003. In his current role at EMIC, he is responsible for Microsoft’s cloud computing research engagements in EMEA. As a member of the executive committee, technical lead, and Microsoft’s representative in the Project Management Board, he drives Microsoft’s contribution to the FP7 (7th Framework Programme for Research and Technological Development ) project, VENUS-C. He is also member of the Roadmap Editorial Board of the FP7 support action SIENA, the Standards and Interoperability for e-Infrastructure Implementation Initiative, and actively contributes to the future e-infrastructure roadmap. Dr. Brasche holds a master’s degree in Computer Science with a minor in Business Administration and a Ph.D. in Electrical Engineering.
The talk will address the evolution and benefits of big data in the cloud. It will provide examples and illustrates experiences with science in the cloud.
What are the challenges? Where are the opportunities? What is needed to make big data in the cloud valuable for scientists?
Workshop Description [top]
As data volumes increase at exponential speed in more and more application fields of science, the challenges posed by handling Big Data gain an increasing importance. Large scientific experiments, such as climate modeling, genome mapping, and high-energy physics simulations generate data volumes reaching petabytes per year, futher used for real-time or offline processing. Initially designed for powerful and expensive supercomputers, such applications have seen an increasing adoption on clouds, exploiting their elasticity and economical model.
However, running such applications in an efficient fashion on clouds is challenging. One such open challenge is how to handle this “data deluge”. Sharing, disseminating and analyzing large data sets has become a critical issue despite the deployment of petascale computing systems, and optical networking speeds reaching up to 100 Gbps. While Map/Reduce covers a large fraction of the development space, there are still many applications that are better served by other models and systems. In such a context, we need to embrace new programming models, scheduling schemes, hybrid infrastructures and scale out of single datacenters to geographically distributed deployments in order to cope with these new challenges effectively.
The BigDataCloud workshop provides a platform for the dissemination of recent research efforts that explicitly aim at addressing these challenges. It supports the presentation of advanced solutions for the efficient management of Big Data in the context of Cloud computing, new development and deployment efforts in running data-intensive computing workloads. In particular, we are interested in how the use of Cloud-based technologies can meet the data intensive scientific challenges of HPC applications that are not well served by the current supercomputers or grids, and are being ported to Cloud platforms. The goal of the workshop is to support the assessment of the current state, introduce future directions, and present architectures and services for future Clouds supporting data intensive computing.
Call for Papers [top]
Workshop Topics [top]
The BigDataCloud workshop calls for contributions that address fundamental research and system issues in Cloud data management including but not limited to the following:
- Cloud storage architectures for Big Data
- Reliability of data intensive applications and services running on the Cloud
- Query processing and indexing in Cloud computing systems
- Data privacy and security in Clouds
- Data-intensive computing on hybrid infrastructures (Grids/Clouds/P2P)
- Cloud storage resource management
- Data-intensive Cloud-based applications
- Content delivery networks using storage Clouds
- Data intensive scalable computing on Clouds
- Data management within and across multiple geographically distributed data centers
- Data handling in MapReduce based computations
- Data management in HPC Clouds
- Programming models for data-intensive Cloud computing
- Elasticity for Cloud data management systems
- Self-* and adaptive mechanisms.
- Many-Task Computing in the Cloud
- Performance evaluation of Cloud environments and technologies
- Data streaming and dynamic applications on Clouds
Organizing Comitee [top]
Alexandru Costan, Inria Rennes - Bretagne Atlantique, France
Frédéric Desprez, Inria / ENS Lyon, France
Gabriel Antoniu, Inria, France
Luc Bougé, ENS, France
Toni Cortes, Universitat Politècnica de Catalunya, Spain
Kate Keahey, University of Chicago / Argonne National Laboratory, USA
Bogdan Nicolae, IBM Research, Ireland
Maria S. Pérez, Universidad Politecnica De Madrid, Spain
Guillaume Pierre, IRISA / Université de Rennes 1, France
Robert Ross, Argonne National Laboratory, USA
Osamu Tatebe, University of Tsukuba, Japan
Cristian Zamfir, EPFL, Switzerland
Submission Guidelines [top]
Authors are invited to submit research and application papers not exceeding 10 pages following the Springer LNCS format. You can download LNCS Latex style here
We solicit the submission of academic workshop papers representing original, previously unpublished work. Submitted papers will be carefully evaluated based on originality, significance, technical soundness and clarity of exposition. Papers should be prepared as the .pdf files and submitted electronically to the BigDataCloud 2013 online submission system. Submission of the paper implies that should the paper be accepted, at least one of the authors must register and present the paper at the workshop.
Accepted papers that are presented at the workshop, will be published in a revised form in a special Euro-Par Workshop Volume in the Lecture Notes in Computer Science (LNCS) series after the Euro-Par conference.