- InsuranceReal-Time Data IntegrationBANKEN & VERSICHERUNGENDezember 2022 - Heute (2 Jahre und 1 Monat)Data from various source systems is ingested from Kafka, then enriched and transformed for the requirements of target systems in Kafka Streams Java microservices. Kafka Connect is used to transfer the data to the target systems, e.g. via JDBC.- Apache Kafka (Kafka Streams, Kafka Connect)- Java, Quarkus- Red Hat OpenShift- Splunk, SignalFX
- Media IndustryData Engineer (Kafka / real-time processing and integration)DIGITALAGENTUREN & IT-CONSULTINGFebruar 2021 - November 2022 (1 Jahr und 10 Monate)Customer data (e.g. sales data, web site click stream) was ingested as a data stream from various source systems and integrated in real-time into a marketing automation platform. User characteristics such as customers content preferences were extracted in real-time, enabling the customer relations team to achieve precise customer segmentation and provide customers with highly personalized communication.- Apache Kafka (Kafka Streams)- Migration of legacy AWS Kinesis data- Apache Spark (DStreams API, Structured Streaming)- Kubernetes, kustomize- CI/CD (Gitlab CI)
- Media IndustryData Engineer (Spark / ETL)DIGITALAGENTUREN & IT-CONSULTINGMai 2020 - Februar 2021 (9 Monate)Customer data from various source systems was integrated with target systems of company-internal stakeholders in the commercial department to enable reporting and analysis.- Created reliable data pipeline applications to ingest the data in a data lake, process and combine the data and load it into database systems- Comprehensive concept for complying with GDPR and company privacy policies for public clouds using encryption- Kubernetes / spark-on-k8s, kustomize- Airflow for workload scheduling- AWS cloud technologies (S3, RDS), terraform- Pyspark- CI/CD (Gitlab CI)