Overview
We are seeking a highly motivated and skilled Cloud Infrastructure Engineer to join our dynamic team. This role is responsible for the monitoring, maintenance, and support of our critical infrastructure, including FTP, DNS, Storage (File Block and Object Storage), and related systems. The ideal candidate will possess a strong understanding of these technologies, experience with scripting and monitoring tools, and the ability to work effectively in a shift-based environment. This position involves L1/L2 troubleshooting and escalation, ensuring the stability and performance of our services.
Responsibilities
- :Monitoring and Maintenance
:Proactively monitor the health and performance of FTP, DNS, File Block Storage, and Object Storage systems using various monitoring tools, including shell scripts and Grafana.Perform routine maintenance tasks, including patching, upgrades, and configuration changes.Identify and resolve performance bottlenecks and capacity issues
- .Incident Management
:Respond to alerts and incidents, performing initial triage and troubleshooting (L1/L2 support).Escalate complex issues to senior engineers or other teams as needed.Create and manage tickets for incidents and service requests.Document troubleshooting steps and resolutions in a knowledge base
- .Shift Work
:Work in a rotating shift schedule to provide 24/7 coverage for critical systems.Follow established procedures and protocols for incident handling and escalation during off-hours
- .Automation and Scripting
:Develop and maintain shell scripts for monitoring, automation, and reporting.Identify opportunities to automate manual tasks and improve efficiency
- .Documentation
:Create and maintain accurate and up-to-date documentation of systems, configurations, and procedures
- .Collaboration
:Work closely with other teams, including development, security, and networking, to ensure the smooth operation of our infrastructure. Participate in team meetings and knowledge-sharing sessions
.
Qualification
- s:Experienc
e:5+ years of experience in a systems administration, cloud infrastructure, or related role.Experience with monitoring and supporting FTP, DNS, File Block Storage, and Object Storage systems.Experience with L1/L2 troubleshooting and incident managemen
- t.Technical Skill
s:Strong understanding of Linux/Unix operating systems.Proficiency in shell scripting (e.g., Bash, Python).Experience with monitoring tools such as Grafana (or similar).Familiarity with ticketing systems (e.g., Jira, ).Knowledge of networking concepts (TCP/IP, DNS, routing
- ).Soft Skill
s:Excellent problem-solving and troubleshooting skills.Strong communication and interpersonal skills.Ability to work independently and as part of a team.Ability to prioritize tasks and manage time effectively.Ability to remain calm and focused under pressur
e.Bonus Points (Optional
- ):Experience with K8s (e.g., AWS, Azure, GCP
- ).Experience with configuration management tools (e.g., Ansible, Puppet, Chef
- ).Relevant certifications (e.g., Linux+, AWS Certified SysOps Administrator