22. Gulaschprogrammiernacht

Big Data Analysis with SQL
02.06, 10:00–11:00 (Europe/Berlin), ZKM Kubus
Sprache: English

This talk explains how you can build your own scalable data processing system with just a few open source tools: DBT, Trino, Iceberg and MinIO. And also why SQL is still the best language for data analysis!


Have you ever used PostgreSQL to store massive amounts of data? Did your queries take minutes or even hours to compute?

The field of data analysis is rather complex and a ton of solutions are available: therefore I will show how to compare systems with each other. You will learn why databases like PostgreSQL or MongoDB are not suited to compute analytics queries on huge amounts of data. Then we will look at data analysis architectures that are capable of scaling to terabytes of data and I will explain why they are better in those particular situations.

At the end of the talk you will know which solution is best suited for your next large-scale data project!


Content Notes

n/a

  • Background: Electrical Engineering
  • I like to play with Microcontrollers, Databases, Networks, and Hypervisors
  • I also like to share my expertise:
  • I co-host a group where we teach kids to program self build robots
  • Open Source Enthusiast