Services

In today’s data-driven world, partnering with the right expert can make all the difference. Here’s why I’m the ideal choice to help you unlock the full potential of your data.

Development of High-Quality Software Solutions for Big Data Applications

I specialize in developing high-performance, scalable software solutions tailored to handle big data applications, ensuring your systems are equipped to manage vast amounts of data efficiently and effectively.

Development of GenAI Applications with RAG

I create advanced Generative AI applications, integrating Retrieval-Augmented Generation (RAG) capabilities to enhance automation, decision-making, and knowledge retrieval, providing your business with cutting-edge tools to stay ahead in a rapidly evolving landscape.

End-to-End Project Execution

From initial requirements specification and technical design to final implementation, I provide comprehensive end-to-end project execution, ensuring every aspect of your project is expertly managed.

Strategic Consulting in Data & Software Architectures

Drawing on my vast cross-industry experience, I offer expert support in technical decisions and consulting on data and software architectures, helping you make informed, strategic choices that align with your business goals and drive success.

Developer Education through Tailored Workshops

I offer customized workshops designed to upskill your development team, ensuring they are equipped with the latest knowledge and tools to maintain and grow your systems effectively.

Comprehensive Quality Assurance

I provide thorough code reviews, actionable feedback, and performance optimizations, alongside detailed analysis and mitigation of scaling or bottleneck issues, ensuring your solutions are robust, scalable, and optimized for long-term success.

Previous Gigs

Don’t just take my word for it—explore my portfolio to see how I’ve successfully delivered for clients of all sizes.

AI Application Engineer
LLM Agent for Semi-Automated Task Execution
PBI Hausverwaltung Jul 2024 - Nov 2024
  • Designed and implemented an AI agent using LLM to semi-automate recurring tasks with integrated tool access
  • Developed and integrated custom tools, including CRM and email access, to enhance automation capabilities
  • Deployed a self-hosted, open-source LLM model on AWS for scalable and secure operation
Python LLM langchain langgraph RAG RDBMS AWS PyTorch Ollama
Graph Data Scientist
Heterogeneous Graph Embeddings for Patient Similarity Classification
Amida Technologies Inc. Mar 2024 - Jul 2024
  • Researched heterogeneous graph embedding algorithms, focusing on their implications and real-world applications
  • Presented and explained graph embedding algorithms to a technical audience, aiding in informed decision-making
  • Designed and implemented a hyper-parameter tuning component using a Neo4j graph to evaluate and compare multiple embedding approaches
Graph Embeddings Scientific Research Neo4j Python Java
AI Application Engineer
Implement LLM-Based Tax-Automation Tool
Self-owned Sep 2023 - Dec 2023
  • Developed a command line-based tool for tax automation and PDF processing
  • Fine-tuned Llama2 LLM using custom-generated training data to enable tailored information retrieval from PDF documents
  • Employed a student-teacher approach, leveraging ChatGPT results, to generate high-quality training data
Python PyTorch Pandas Google Colab Llama ChatGPT langchain
Big Data Application Engineer
Re-Design of a Large-Scale Streaming Application
Deutsche Börse Sep 2022 - Sep 2023
  • Conceptualized functional and non-functional requirements in collaboration with business requirements and technical specification teams; conducted technical reviews of specifications
  • Developed technical design for a real-time streaming application capable of processing multi-billion messages daily
  • Implemented multiple Spark Structured Streaming applications, incorporating custom outer-join operations, HBase access, and complex data structures
  • Designed and implemented data migration routines from traditional data warehouses to event stream-based architectures
Spark (SQL, Structured Streaming) Kafka (Avro) Hbase Oracle DB Informatica HDFS Zeppelin YARN Apache Hive
Graph Data Engineer
Advanced Analytics in a Large-Scale Knowledge Graph
DB Systel Dec 2021 - Jul 2022
  • Assisted client in addressing stability and performance challenges with a large-scale, single-node Neo4j database
  • Implemented a concurrent ETL process to efficiently load a timely-sorted, two-dimensional grid of linked nodes in Neo4j
  • Set up a comprehensive monitoring system for Neo4j to ensure optimal performance and stability
  • Developed an automated nightly backup process to securely archive the Neo4j database
Neo4j Python AWS Grafana Graphite InfluxDB PostgreSQL HDFS Kafka
Big Data Application Engineer
Advanced Graph Analytics on a Multi-Billion Node Knowledge Graph
Ippen Digital GmbH Sep 2019 - Apr 2021
  • Designed, implemented, and operated a large-scale knowledge graph platform with multi-billion nodes, utilizing stream-fed ingestion, multi-stage environments, automatic deployment, and high availability (TigerGraph)
  • Conducted requirement analysis, design, implementation, and operation of a graph-based real-time recommendation engine for news articles
  • Performed A/B testing to evaluate and optimize recommendation results
  • Implemented continuous integration and deployment (CI/CD) workflows for streamlined development and deployment processes
  • Developed low-latency caching and fallback mechanisms to enhance system performance and reliability
  • Modeled and validated schemas for a knowledge graph incorporating multiple data streams
TigerGraph Kafka AWS Docker Kubernetes Java Scala Go Spring Grafana Prometheus Terraform Helm GitHub Actions Jenkins
Data Engineer for Streaming Data
Design & Implementation of Real-Time Analytics Data Platform
Telefonica Germany GmbH Mar 2019 - Jul 2019
  • Specified requirements and designed a real-time data platform sourced from a data warehouse
  • Designed and implemented real-time streaming applications for continuous data ingestion and analysis
  • Developed and deployed continuous integration and deployment (CI/CD) workflows for the data platform
  • Conducted an internal workshop on Apache Kafka to educate and upskill team members
Spark Kafka Kafka Streams Scala Docker Kubernetes AWS Akka Streams
Data Engineer for Data Integration and Streaming
Design & Implementation of an Advanced Analytics Streaming Application
Allianz SE Sep 2018 - Jan 2019
  • Designed and implemented a proof-of-concept for an event-driven data analytics application, integrating warehouse-sourced data for real-time analytics and comprehensive exploration through a front-end interface
  • Provided consulting on data modeling and supported decision-making processes for the project
  • Developed an ingestion pipeline to transform and feed relational data into a graph database
  • Deployed application components on in-house infrastructure
  • Designed and implemented continuous integration and deployment (CI/CD) workflows to streamline development and deployment processes
Spark Kafka Java Scala Docker Neo4j git Jenkins Kubernetes
Spark Performance Engineer (NLP)
Feature Implementation & Performance Optimizations of Document Mining Application
Ayfie GmbH Jan 2018 - Jul 2018
  • Optimized a large-scale Spark-based text analytics pipeline to enhance scalability and performance
  • Implemented knowledge discovery use cases using state-of-the-art NLP and ML techniques
  • Researched and applied efficient algorithms for analyzing large data sets
  • Refined multi-platform CI/CD workflows for improved development and deployment efficiency
  • Implemented performance monitoring for Spark applications to ensure optimal operation
  • Educated in-house development team on big data software development and data mining techniques
Spark Java Scala Docker AWS CI Spring Elasticsearch SQL Sonar git Grafana Graphite
Machine Learning Application Engineer
Machine Learning for Knowledge Discovery in Food Recipes
EatSmarter GmbH Jun 2017 - Sep 2017
  • Provided consultation on project planning and solution approach for the automatic calculation of nutritional values for food recipes
  • Designed a proof-of-concept (POC) for a calculation pipeline leveraging multiple heterogeneous data sources and various machine learning techniques
  • Implemented a data processing pipeline to accurately calculate nutritional values for food recipes
  • Deployed the nutritional value calculation pipeline on client infrastructure using AWS
Spark Scala AWS Docker Python git MySQL CouchDB
Machine Learning Application Engineer
Machine Learning for Automatic Classification of Food Recipes
EatSmarter GmbH Jun 2016 - Sep 2016
  • Consulted on management and design thinking to identify innovative use cases for evaluating the healthiness of food recipes based on expert ratings
  • Supervised the generation of a training dataset and conducted data quality analysis for the proof of concept
  • Implemented a supervised classification system using selected features and various machine learning approaches
  • Developed and validated the final solution to provide a healthiness indicator for food recipes
Spark Scala AWS SQL git Docker Python
Machine Learning Application Engineer
Master's Thesis: Knowledge Discovery in unstructured Data
Technical University Munich (TUM) Nov 2016 - May 2017
  • Title: Knowledge Discovery in Textual Databases for Enhancing the Automatic Calculation of Nutritional Values for Online-Based Food Recipes
  • Utilized various machine learning and NLP techniques to extract information from unstructured text and determine the nutritional content of food recipes
  • Applied theoretical approaches including tokenization, part-of-speech tagging, stemming, neural networks, logistic regression, and word embeddings
Spark Scala Docker Python Stanford-NLP
Embedding Software Team Lead & Engineer
Team Lead of Software Development at TUM Student Group "Roboy"
Technical University Munich (TUM) Nov 2016 - May 2017
  • Collaborated with an interdisciplinary team of TUM students to develop a child-sized humanoid robot, showcasing our work at various events worldwide
  • Led the software development group, overseeing design, implementation, and software engineering processes to ensure successful project execution
C++ ROS (Robot Operating System) CMake Unix
Embedded Software Engineer
Research & Development: Unmanned Aerial Vehicle
Elektroniksystem- und Logistik-GmbH (ESG) Oct 2014 - Mar 2015
  • Bachelor's Thesis: Design and Prototypical Implementation of a Dynamic Mission Planner for Unmanned Aerial Vehicle (UAV) Mission Planning Software in C++
  • Researched various route planning approaches for an unmanned helicopter (UMAT) tasked with exploring predefined areas for hazardous gas detection
  • Developed and implemented a routing algorithm in C++ for integration into the mission planning software developed by ESG
  • Contributed to enhancing the mission planning capabilities of UAVs through dynamic and efficient route planning techniques
C++ QT CMake Unix
Embedded Software Engineer
Software Engineering & Development: Aerosystems Avionics
Elektroniksystem- und Logistik-GmbH (ESG) May 2013 - Sep 2014
  • Contributed to the software development team responsible for creating a time- and safety-critical distributed middleware in C
  • Deployed the middleware on a highly modular avionics platform developed by ESG, ensuring robust and reliable performance
  • Worked with a platform composed of multiple Unix-like modules, designed to be interchangeable on the fly for enhanced flexibility and functionality
C CMake Unix

Skills

Technologies and Skills: Proven Expertise from Real-World Projects

Big Data Technologies
Apache Spark 5
Apache Kafka 5
Neo4j 5
TigerGraph 5
Big Data Architectures 5
Kafka Streams 4
Apache Airflow 2
Apache Iceberg 2
Delta Lake 4
Databricks 3
Snowflake 2
Hadoop HDFS 3
MapReduce 3
Apache Hive 3
Apache HBase 3
SQL 5
MySQL 5
Oracle 4
PostgreSQL 4
Informatica 3
YARN 3
Elasticsearch 3
Apache Cassandra 3
Software Development
Scala 5
Python 5
Java 5
C/C++ 4
Clean Code 5
Software Engineering 5
Software Architectures 5
Test-driven development 5
Cloud & Deployment
Docker 4
Kubernetes 4
AWS 4
Jenkins 3
git 5
Artificial Intelligence
PyTorch 2
LLMs (Transformers) 3
TensorFlow 2
scikit-learn 2
NLP 2
Machine Learning Algorithms 4
Statistics 3
Communication
Requirements Analysis 4
Non-technical Consulting 4
Training for Developers 5
Content Creation
Visualization for Learning 4
Video Creation 4
Blog Posts 4