Bof: Infrastructure issues in Big Data Science Save to my Google calendar

03/06/2013 18:00-19:30
C Brussels
Target audience:

* NREN and campus specialists dealing with new technologies and high speed networks
* e-Science community requiring advanced supporting infrastructure for data intensive projects
* TERENA community planning use of emerging Big Data technologies

BoF Description

Big Data technologies are becoming a current focus and a new “buzz-word” both in science and in industry. Big Data are becoming related to almost all aspects of human activity from just recording events to research, design, production and digital services or products delivery, to the final consumer. Current technologies such as Cloud Computing and ubiquitous network connectivity provide a platform for automation of all processes in data collection, storing, processing and visualization. However, Big Data due to their volume, velocity and variety impose significant challenges to current/traditional campus and enterprise infrastructure and network infrastructure.

Modern e-Science typically produces a huge amount of data that need to be supported by a new type of e-Infrastructure capable to store, distribute, process, visualise, preserve and curate these data which can be defined as the Scientific Data e-Infrastructure (SDI). In e-Science, the scientific data are complex multifaceted objects with the complex internal relations, they are becoming an infrastructure of their own and need to be supported by corresponding physical or logical infrastructures to store, access and manage these data.

The emerging SDI should allow different groups of researchers to work on the same data sets, build their own (virtual) research and collaborative environments, safely store intermediate results, and later share the discovered results. New data provenance, security and access control mechanisms and tools should allow researchers to link their scientific results with the initial data (sets) and intermediate data to allow future re-use/re-purpose of data, e.g. with the improved research technique and tools.

This BoF will be primary focused on challenges related to network and services infrastructure at NRENs to support modern and future Scientific Data e-Infrastructure and Big Data infrastructure in general. The BoF will include both overview presentations on the current state of Big Data infrastructure and open discussion between BoF attendees.

Draft Agenda

1. BoF goal (Peter Hinrich, SURFnet)
2. Round of introductions
3. Big Data and Data Intensive Science (Y.Demchenko)
4. Big data facilities at SURFsara (Freek Dijkstra)
5. Open floor discussion and presentations from audience
6. Conclusion




Social Events