Overview
Job Overview:
We are seeking a skilled Site Reliability Engineer (SRE) with 3 to 5 years of experience in alert monitoring, application monitoring, and automation. The ideal candidate should have hands-on experience in any programming language such as Java, Python, or PHP and a strong understanding of AWS microservices architecture. Knowledge of Nginx, Apache, MySQL, and MongoDB will be an added advantage.
Key Responsibilities:
[Must Have] Design, implement, and maintain monitoring solutions for applications and infrastructure.
[Must Have] Develop and maintain alerting mechanisms to proactively detect and resolve system issues.
[Must Have] Automate operational tasks using scripting and programming languages.
[Must Have] Troubleshoot production issues, perform root cause analysis, and ensure quick resolution.
[Must Have] Implement logging, tracing, and observability solutions for microservices.
Work with cloud-based architectures, particularly AWS, to optimize system performance.
Ensure reliability, availability, and performance of applications through effective monitoring.
Collaborate with development and operations teams to implement SRE best practices.
Optimise database performance and troubleshoot issues in MySQL and MongoDB.
Manage and configure web servers like Nginx and Apache for scalability and performance.
Required Skills & Qualifications:
3 to 5 years of experience in Site Reliability Engineering (SRE), DevOps, or related roles.
Proficiency in any one programming language: Java, Python, or PHP.
Hands-on experience with AWS services and microservices architecture.
Strong understanding of application and infrastructure monitoring tools.
Experience with alerting and observability platforms like Prometheus, Grafana, ELK, or Datadog.
Knowledge of containerization and orchestration (Docker, Kubernetes is a plus).
Familiarity with CI/CD pipelines and automation tools.
Experience in managing and optimizing databases such as MySQL and MongoDB.
Exposure to Nginx and Apache configuration and troubleshooting.
Strong problem-solving skills and ability to work in a fast-paced environment.
Good to Have:
Certification in AWS, Kubernetes, or DevOps practices.
Experience with Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
Familiarity with security best practices for cloud and application monitoring
Job Types: Full-time, Permanent
Pay: ₹654,318.19 - ₹2,522,957.03 per year
Schedule:
- Day shift
- Morning shift
Work Location: In person