We’re looking for a Senior SRE Automation Engineer to lead and drive automation across the operations lifecycle. The ideal candidate will be responsible for identifying and implementing automation opportunities to reduce manual intervention, minimise service tickets, and enable self-service capabilities. You will lead efforts to build self-healing infrastructure, eliminate repetitive tasks, and incorporate AI-based tools for proactive issue resolution and auto-remediation—ultimately enhancing platform stability and customer success.
Design and implement automation pipelines to eliminate manual operational tasks and improve service efficiency.
Automate all manual patching activity
Integrate AI/ML-based tools for incident detection, root cause analysis, and automated remediation to enhance platform resilience.
Build and maintain self-healing scripts and workflows using Infrastructure-as-Code (IaC) and event-driven automation frameworks.
Analyze recurring incidents to identify patterns and opportunities for automation and optimization.
Identify and automate standard operating procedures and repetitive day-to-day tasks to reduce ticket volume and manual intervention.
Lead service improvement initiatives through automation to improve overall team performance and customer satisfaction.
Own and continuously improve observability and alerting strategies to support proactive operations.
Effectively communicate with users to build trust and drive timely resolution of issues within SLA.
Collaborate with cross-functional teams to resolve complex problems and align on operational goals.
Handle escalations and critical incidents in a fast-paced environment with clear communication and swift action.
Mentor junior engineers, fostering a DevOps-first culture and encouraging skill development.
Demonstrate strong analytical and troubleshooting skills, including real-time issue identification and resolution in live environments.
Maintain thorough and accurate documentation of automation implementations, including known gaps and future opportunities.
Excellent analytical and problem-solving skills to diagnose, troubleshoot, and resolve complex technical issues.
Proficient in scripting and programming languages such as Python, Go, and Bash.
Strong hands-on experience with automation frameworks and tools including Terraform, Ansible, Chef, and Puppet.
Familiarity with automation scripting tools for infrastructure and operations (e.g. Python, Terraform, Ansible).
Experience working with AI-driven operations tools and AIOps platforms such as Moogsoft, BigPanda, Dynatrace, or custom ML-based pipelines.
In-depth knowledge of CI/CD, GitOps, and event-driven systems for modern DevOps practices.
Solid background in Linux systems and containerized environments like Docker and Kubernetes.
Proven experience in designing resilient, self-healing systems for high availability and operational efficiency.
Deep understanding of cloud platforms and technologies, including Microsoft Azure, Amazon Web Services (AWS), as well as on-premises and data center environments
Experience integrating with LLMs for operational tasks or incident summarization.
Certifications in cloud platforms or DevOps tools (e.g., AWS Certified DevOps Engineer).
Exposure to service mesh, service discovery, or modern networking stacks.
At OneAdvanced, we are at the forefront of delivering sector-focused technology solutions that simplify complexity, drive meaningful progress, and help build a fairer, more inclusive society. We’re much more than a software company. We deliver SaaS workflow applications and IT services that power organisations across Education, Government, Healthcare, Legal, Manufacturing, Housing, Retail, and more.
OneAdvanced is one of the UK’s largest business software and services companies. Based in Birmingham (The Mailbox), operating across the UK, Ireland, India, and Australia. Our secure, scalable platform, including OneAdvanced AI, our private AI service for UK organisations, powers connectivity and innovation across critical sectors. Alongside our software are our IT services, including hosting, managed services, and application modernisation.
We strive to create an inclusive workplace that drives innovation and collaboration, championing diverse perspectives and ideas. Our Environmental, Social and Governance (ESG) strategy is embedded in everything we do, guiding us to create meaningful impact for our people, our customers and the planet.
Join us and become part of a team that’s powering the world of work and making a real difference.
Learn more at www.oneadvanced.com
Software Powered by iCIMS
www.icims.com