Big data chronicles: Talks and community activity

Here is part of my community activity within the ASF, in particular the talks I gave at the ApacheCon conference. They relate to Big Data topics or, more generally, to Open Source Software:

Articles published on other blogs

Apache Software Foundation blog

Here is an article published in the "Success at Apache" series of the Apache Software Foundation blog:
My experience with the Apache Way: a perfect society?

Apache Flink blog

The articles I published here were also published on the official Flink blog to reach the widest possible audience:

ApacheCon 2020

This talk is the continuation of the 2019 talk and gives updates on the new Apache Beam runner based on the Spark Structured Streaming framework.

ApacheCon 2019

This talk is about building the new translation layer from Apache Beam to Apache Spark, which is called the Spark runner. This new runner leverages the Spark Structured Streaming framework.

Apache Software Foundation blog, August 2019

Glad to be in the top 2 Apache contributors, according to the ASF blog in August 2019. Thanks to the ASF!

ApacheCon 2018

This talk is about how to extract metrics from a Big Data pipeline with Apache Beam and how these metrics are universal across Big Data execution engines.

Unfortunately, this session was not recorded on video; only an audio recording is available.

The slides

The audio

ApacheCon 2017

This talk is about benchmarking Big Data engines with Apache Beam and Nexmark.

Unfortunately, this session was not recorded on video; only an audio recording is available.