Site Reliability Engineer – Devops

The ideal candidate will have proven experience in designing and building Site Reliability Engineering solutions with best practices and modern technologies. OAT is actively converting it’s standalone TAO software into a world class Software as a Service (SaaS) offering. You will be expected to design, create and maintain the parts required for this effort. Our aim is to build a self-healing, hybrid SaaS product with excellent Durability, Performance and Availability. The ideal candidate will bring knowledge in designing, building and operating platforms to achieve this aim.

The candidate’s duties and responsibilities will include:

  • Design, build, manage and observe cloud infrastructure to support high peak-load applications exceeding our SLA of 99.8%
  • Work within cross functional teams and contribute from the very start to ensure successful harmonious development and deployment of new code and infrastructure
  • Provide valuable metrics to internal/external customers and management
  • Proactively ensure the highest level of platform availability
  • Performance test & tune applications, identify the root cause of issues and work as part of a cross functional team to fix them
  • Work with highly scaled Databases and Compute Environments
  • Maintain hosted environments and keep them healthy and up-to-date
  • Build and maintain the tools to roll out updates and ensure the platform health
  • Perform maintenance work on live environments


This role requires a variety of strengths and skills, including:

  • Thinking “as an engineer”
  • Hands-on experience with Kubernetes
  • Understanding the concept of Service Meshes (Istio)
  • Proficiency in at least one Scripting and Programming Language (Golang would be a plus)
  • Observability and Alerting practice
  • Understanding the GitOps workflow / configuration management
  • CI/CD Systems
  • Confidence working in cloud environments (GCP, AWS)
  • Knowledge of Agile & ITIL is a plus

Perks working for OAT

  • Flexible Arbeitszeiten
  • Social benefits
  • Arbeiten für eine Open-Source-Lernplattform
  • Access to conferences, training, certifications, etc.
  • The possibility to work 100% from home
  • Firmen- und Teamevents
  • Internationales und multikulturelles Arbeitsumfeld
Jetzt bewerben