DevOps/Site Reliability Engineer for SLA-Based Systems
I specialize in building, operating, and supporting resilient, SLA-compliant infrastructure platforms with Rancher, OpenShift, and Kubernetes at scale.
About Me

I’m a hands-on DevOps/Site Reliability Engineer with over a decade of experience supporting mission-critical, SLA-bound platforms in enterprise and hybrid cloud environments.
Currently, I work full-time at cloudWerkstatt, managing infrastructure platforms built on Rancher and OpenShift across VMware and IBM Cloud. I ensure 24/7 uptime, lead upgrades, deploy clusters, automate infrastructure, and support internal tools such as GitLab, Keycloak, Taiga, and more.
I’ve previously worked for large enterprises such as T-Systems / Deutsche Telekom Solutions, supporting SAP landscapes, and Visma, where I managed private cloud deployments on OpenStack.
When I'm not on call, I contribute to open-source projects like the Grafana Ansible Collection, build monitoring dashboards, and run a home lab to test new tech.
How I Can Help

Kubernetes & Platform Engineering
➡️ Rancher & OpenShift cluster deployment and operations
➡️ Platform upgrades & lifecycle management
➡️ GitOps integration
➡️ Resilience design and performance tuning

Infrastructure Automation
➡️ Infrastructure as Code using Ansible (incl. AWX/Semaphore)
➡️ CI/CD automation with GitLab Pipelines & GitHub Actions
➡️ Automated reporting (email, HTML, XLSX)
➡️ Python and Bash scripting

Observability & Monitoring
➡️ Prometheus stack, Alertmanager, Grafana dashboards
➡️ Custom exporters in Go for monitoring integrations
➡️ Log monitoring with Grafana Loki

Security & Access Control
➡️ Cloudflare services, UniFi network devices
➡️ pfSense firewall experience
➡️ Identity & access via Keycloak and Red Hat Identity Management

Toolchain & Infra Management
➡️ Managing GitLab, Mattermost, Red Hat Satellite, Taiga, Keycloak, IDM ...
➡️ Linux server hardening & Podman-based container isolation
➡️ PostgreSQL, MariaDB experience
Open Source & Community
I’m an active open-source contributor, especially within the Grafana ecosystem. I maintain Ansible roles for Loki, Promtail, and Alloy, used by community members and companies alike.
I have also authored several Grafana dashboards, which can be found here.
For a deeper dive into my projects and contributions, visit my GitHub page.
Training and Certifications
Let's be real: certifications and training are a foundation, but the magic 🪄 happens when curiosity ignites 🔥.
Let’s Work Together
Looking for a reliable DevOps or Site Reliability Engineer for your infrastructure?