About the project
To work with the Software Engineering teams, we are looking for a Senior Cloud Site Reliability Engineer to implement SRE from scratch and to improve and mature development cycles in Integration/Continuous Deployment as part of a project in the aviation industry.
Your responsibilities
Develop code, scripts, systems or tools that reduce operational burden by automating complex and repetitive tasks, enables engineering teams to increase the velocity at which they can safely deploy changes to production, and monitors the effects of changes across systems, services, or products
Analyze telemetry data to develop capacity planning models, identify patterns, and trends that drive continuous improvement, and highlight opportunities to deploy automation to monitor and manage services and/or products
Develop tests and implements changes to optimize code and improve the observability, reliability, and operability of platforms, systems, and products
Our requirements
Ability and experience to build and introduce SRE from scratch
At least 5 years of experience in the field of DevOps or SRE
Familiarity with one or more general purpose programming languages
Experience with Azure Cloud
Hands-on experience with security assurance tools
Knowledge of AKS, Infrastructure as code (IAC) patterns and principles
Understanding of Microservice architecture
Knowledge of Helm / Service Mesh / Kubernetes and Docker containerization
Knowledge of PowerShell or Bash
Good command of English (min. B2)