Overview
A Product DevOps Engineer is responsible for designing, deploying, and operating the cloud-native infrastructure that supports software applications and services, with a strong focus on Kubernetes, Terraform, GitOps, and Flux. This includes configuring and managing Kubernetes clusters, terraform infrastructure as code, and GitOps workflows to ensure seamless and automated deployment of applications.
The role will work closely with development teams to ensure continuous integration and continuous delivery of software applications and will take ownership of the operational maintenance and support of our Shared infrastructure estate.
Responsibilities
- Design and management of cloud-native system architectures using Kubernetes, guaranteeing high availability, top performance, and reliability, with a focus on continuous operation and improvements.
- Implement and manage Terraform infrastructure as code to provision and manage cloud resources, ensuring consistency and repeatability across environments.
- Develop and maintain GitOps workflows using tools like Flux or similar, to automate deployment of applications and ensure consistency across environments.
- Design and operation of container-based hosting environments using Kubernetes, including management of pods, services, and deployments.
- Management and configuration of cloud-native networking, including load balancers, ingress controllers, and service meshes.
- Support incident and problem management to identify root cause and fix, define and put in place improvements to avoid the problem occurring again, with a focus on automation and self-healing systems.
- Interact with development teams to understand their requirements, help them to find the best technological solution for their needs, and ensure seamless integration with cloud-native infrastructure.
- Analyze and improve systems' performance & scalability, and automate systems management using tools like Prometheus, Grafana, and Kubernetes dashboard.
- Undertake regular review of IT Security vulnerabilities related to Infrastructure, participate in remediation work to improve Infra security scores, and ensure compliance with organizational security policies.
- Provide 3rd line support where required in an on-call roster covering one or more of our applications, with a focus on resolving complex technical issues and improving overall system
Requirements
Must have:
- Strong experience with Kubernetes, including cluster management, Networking fundamentals, pod management, and service management.
- Experience with infrastructure as code, including provisioning and managing cloud resources.
- Experience with GitOps workflows.
- Strong understanding of cloud-native architectures, including microservices, service meshes, and containerization.
- Experience with continuous integration and continuous delivery (CI/CD) pipelines, including Docker builds.
- Strong problem-solving skills, with the ability to troubleshoot complex technical issues and improve overall system reliability.
- Excellent communication skills, with the ability to work closely with development teams and other stakeholders to ensure seamless integration with cloud-native infrastructure.
Nice to have:
- Experience with Terraform/ OpenTofu
- Experience with FluxGitOps tool.
- Familiarity with NGINX Ingress Controller, including configuration, deployment, and troubleshooting, with knowledge of features such as path-based routing, SSL/TLS termination, and load balancing.
- CI/CD tools like GitHub Actions / Azure DevOps pipelines or similar.
- Experience with cloud providers like AWS, GCP, or Azure, including management of cloud resources and services.
- Understanding of IT security principles, including network security, identity and access management, and compliance.
We’re looking for someone who is:
- driven to push the boundaries and lead change and performance
- communicative to leave no-one in the dark and to work with your team successfully
- reliable so we know that we can call on you to meet deadlines
- passionate about the latest technologies and standards
- proactive to suggest improvements, identify and fix potential issues
- solid technically speaking, to advise both Clients and internal teams
Our people are key to our success and we pride ourselves on offering a dynamic, creative, innovative and supportive environment. Having the right combination of a 'can-do' approach, strong work ethic, integrity, friendliness and attention to detail is crucial.
Even if you don’t tick all the boxes for one particular role, but you have a keen interest in what we do, send us your details, we may find a suitable match during the interview process.
Deltatre consciously nurtures an environment where each and every team member feels safe to bring their whole selves to work, in which everyone is valued and respected for who they are and what they bring. Everyone has the opportunity to reach their full potential, and every team member is expected to treat everyone with dignity and respect, value different perspectives, use inclusive language and work in alignment with Deltatre's commitment to diversity and inclusion. At Deltatre, everyone is welcome and celebrated.