Enabling the big data pipeline lifecycle on the computing continuum.
Project Name and Acronym
DATACLOUD - Enabling The Big Data Pipeline Lifecycle on the Computing Continuum
Bosch | Ceramica Catalano | iExec | JOT | Tell.u | UBITECH | Universitat Klagenfurt | Sapienza Università di Roma | KTH Royal Institute of Technology
Total Eligible Costs (€)
EU Grant Amount (€)
To develop a software toolbox comprising new languages, methods, infrastructures, and software prototypes for discovering, simulating, deploying, and adapting Big Data pipelines on heterogeneous and untrusted resources, in a manner that makes the execution of Big Data pipelines traceable, trustable, manageable, analyzable, and optimizable. DataCloud separates the design of Big Data pipelines from the run-time aspects of their deployment, thus empowering domain experts to take an active part in defining them. The toolbox aims to lower the technological entry barriers to incorporating Big Data pipelines into organizations' business processes and to make them accessible to a wider set of stakeholders (such as start-ups and SMEs), regardless of the underlying hardware infrastructure.
Objective 1: Big Data pipelines discovery: To develop techniques for discovering Big Data pipelines from various data sources, applying AI-based and process-mining algorithms in a data-driven discovery approach to learn their structure.
Objective 2: Big Data pipelines definition: To develop a domain-specific language (DSL) for Big Data pipelines featuring an abstraction level suitable for pure data processing, which realizes pipeline specifications using instances of a predefined set of scalable and composable container templates (corresponding to step types in pipelines).
Objective 3: Big Data pipelines simulation: To develop a novel Big Data pipeline simulation framework for determining the “best” deployment scenario by evaluating the performance of individual steps in a sandboxed environment and varying different aspects of input data and step parameters.
Objective 4: Blockchain-based resource provisioning for Big Data pipelines: To develop a blockchain-based resource marketplace for securely provisioning, for any given Big Data pipeline, a set of (trusted and untrusted) resources (Cloud, Edge, Fog), ensuring the privacy and security of data and pipeline executions.
Objective 5: Flexible and automated deployment of Big Data pipelines: To develop a deployment framework for data pipeline specifications, featuring secure, adaptable, elastic, scalable, and resilient resource deployment, taking into account Quality of Service (QoS) requirements and Key Performance Indicators (KPIs) for pipelines and resources.
Objective 6: Adaptive, interoperable Fog/Cloud/Edge resource provisioning for execution of Big Data pipelines: To develop algorithms for optimized runtime provisioning of resources made available on the marketplace on the Computing Continuum (Cloud, Edge, Fog), facilitating omnidirectional data drifts among the data pipelines.
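To make the pipeline model behind Objectives 2 and 3 concrete, the following is a minimal, hypothetical sketch of a pipeline assembled from composable steps, each conceptually backed by a container template, with a toy per-step trace of the kind a simulation framework could evaluate. All names, the container image strings, and the metric are illustrative assumptions, not the actual DataCloud DSL or toolbox API.

```python
from dataclasses import dataclass, field
from typing import Any, Callable

@dataclass
class Step:
    """One pipeline step; 'template' stands in for a container template (hypothetical)."""
    name: str
    template: str                # e.g. a container image reference (illustrative only)
    run: Callable[[Any], Any]    # the step's data transformation
    params: dict = field(default_factory=dict)

@dataclass
class Pipeline:
    name: str
    steps: list

    def execute(self, data: Any):
        """Run steps in order, recording each step's output size as a toy metric."""
        trace = []
        for step in self.steps:
            data = step.run(data)
            size = len(data) if hasattr(data, "__len__") else None
            trace.append((step.name, size))
        return data, trace

# Toy three-step pipeline: ingest -> filter -> aggregate
pipeline = Pipeline(
    name="demo",
    steps=[
        Step("ingest", "registry/ingest:latest", lambda _: list(range(10))),
        Step("filter", "registry/filter:latest", lambda xs: [x for x in xs if x % 2 == 0]),
        Step("aggregate", "registry/agg:latest", lambda xs: [sum(xs)]),
    ],
)
result, trace = pipeline.execute(None)
print(result)  # [20]
print(trace)   # [('ingest', 10), ('filter', 5), ('aggregate', 1)]
```

The separation the project describes (design vs. run-time) shows up here as the split between the pipeline specification (the list of named, templated steps) and the engine that executes or simulates it; a simulator could vary input data and step parameters and compare the resulting traces.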
The work in DataCloud is organized into eight main work packages:
Work Package 1: Requirements analysis and architecture design.
Work Package 2: Big Data pipeline discovery.
Work Package 3: Big Data pipelines definition and simulation.
Work Package 4: Blockchain-based decentralized resource marketplace.
Work Package 5: Adaptive resource provisioning and orchestration.
Work Package 6: Deployment, testing, integration, and validation.
Work Package 7: Exploitation, dissemination, and communication.
Work Package 8: Project management.