Big data chronicles
Technical blog on big data technologies
Pages
(Move to ...)
Articles
Talks and community activity
▼
Monday, April 3, 2023
Flink: Howto test a batch source with the new Source framework
›
🕥 6 min. Introducion The Flink community has designed a new Source framework based on FLIP-27 lately. This article is the continuatio...
Thursday, March 30, 2023
Flink: Howto create a batch source with the new Source framework
›
🕥 10 min. Introducion The Flink community has designed a new Source framework based on FLIP-27 lately. Some connectors have migrated t...
Tuesday, March 21, 2023
Cassandra: evaluate table size without reading the data
›
🕥 3 min. Introduction While developing the Cassandra source connector for Flink I needed a way to ensure that the data I was reading fit...
Monday, November 7, 2022
Flink: Howto migrate a real-life batch pipeline from the DataSet API to the DataStream API
›
🕥 5 min. Introducion Flink has been deprecating the DataSet API since version 1.12 as part of the work on FLIP-131: Consolidate the user-...
Tuesday, January 5, 2021
Tricky use cases of Apache Beam 3/3: Custom Combine
›
🕥 5 min. Introduction This is the third article of a serie of blog posts about tricky use cases of Apache Beam that enlight some of the...
Wednesday, December 2, 2020
Tricky use cases of Apache Beam 2/3: Custom windows
›
🕥 9 min. Introduction This is the second article of a serie of blog posts about tricky use cases of Apache Beam that enlight some of the...
Tuesday, November 10, 2020
Tricky use cases of Apache Beam 1/3: incremental join
›
🕥 7 min. Introduction This is the first article of a serie of blog posts about tricky use cases of Apache Beam that enlight some of the...
›
Home
View web version