Data Engineer · Computer Scientist · Builder

Connecting The Dots

I build the infrastructure that turns raw data into insights.
From data platform engineering and machine learning to industrial computer vision systems.

Simon Schröder, Data Engineer and Computer Scientist

About Me

I'm a Data Engineer and Computer Scientist from the Ruhr Area, Germany. I started programming early — building websites for friends and family, tinkering with stock market data — and never stopped.

I hold a B.Sc. and M.Sc. in Computer Science from Ruhr University Bochum, with a focus on software engineering and machine learning. After graduating, I spent time in the chemical industry applying computer vision to real-world challenges and handling various kinds of chemical data. Then my journey led me into the transport & logistics industry as a Platform Data Engineer — building on Databricks and Azure to enable entire teams to get maximum value from their data.

In parallel, I co-founded Validatix — a platform that makes industrial data trustworthy and interpretable for both humans and machines.

What I Do

My work spans the full data stack — from cloud infrastructure to production ML systems.

Data Platform Engineering

Designing and operating scalable, cloud-native data platforms on Databricks and Azure. Enabling teams to build reliable pipelines and get maximum value from their data.

Machine Learning & Computer Vision

Building and deploying production-grade ML models with PyTorch and TensorFlow. Applied computer vision experience in industrial quality control and environmental monitoring.

Data Engineering & ETL

Ingesting, transforming, and modeling data at any scale — from simple pipelines to complex multi-source architectures — so researchers and business teams can trust what they see.

Infrastructure & Automation

Automating infrastructure deployments, securing data assets, and keeping platforms stable, reproducible, and production-ready.

Professional Journey

B.Sc. & M.Sc. Applied Computer Sciences

Ruhr University Bochum

Specialized in software engineering and machine learning with a strong emphasis on computer vision. Covered the full ML spectrum from classical statistical methods to deep learning architectures.

Data Scientist & Data Engineer

Chemical Industry

Deployed computer vision models on real-world industrial use cases — ensuring process quality, product consistency, and environmental compliance. Built data engineering pipelines using on-premises and cloud technologies, handling process and analytical data to deliver actionable insights to chemists and process experts.

Computer Vision Data Engineering On-Premises Cloud Azure Databricks

Platform Data Engineer

Current
Transport & Logistics

Building and operating a Databricks platform on Azure that empowers developer and business teams to derive value from data at scale. Driving platform innovation, automating infrastructure deployments, securing data assets, and ensuring teams always have access to the latest, responsibly-deployed tooling. Presented at a Databricks-hosted user group event, sharing best practices with the broader community.

Databricks Azure Infrastructure Automation CI/CD

Projects & Ventures

Validatix

Co-founded a platform that creates trustworthy data for humans and machines. Validatix specializes in analyzing and contextualizing thousands of sensor data streams for industrial plants and machinery.

Learn more

Events & Speaking

December 2025

Databricks User Group Speaker

Presented on orchestration of data pipelines on Azure Databricks within multi workspace data platform for large-scale logistics operations. Shared insights with the community on our approach and best practices.

View Post on LinkedIn

Let's Connect

Interested in data engineering, machine learning, or building trustworthy systems? Reach out through any of these platforms.