Overview
Position : Technical Lead – HPC (High Performance Computing)
Location : Chennai, India
Experience : 6 – 8 Years
Education : BE/BTech or Master's degree
Mandatory : Minimum 4 years of experience in the skills below
- HPC
- Hands-on Linux system administration expertise is a MUST.
- Core Linux expertise to administer, set up, and configure Linux clusters on bare metal (on-prem clusters) is a MUST. Resumes that highlight on-prem cluster setup would likely be a good fit.
- Any cloud: AWS (Amazon Web Services), Microsoft Azure, Google Cloud Platform (GCP), IBM Cloud, Oracle Cloud
- Cloud computing technologies (gRPC, Kafka, Kubernetes, ZeroMQ, Redis, Ceph, etc.).
- Performance tuning
Key Responsibilities:
· Design, implementation & support of high-performance compute clusters
· Solid knowledge of HPC systems, including CPU/GPU architecture, scalable and robust storage, high-bandwidth interconnects, and cloud-based computing architectures
· Apply attention to detail to generate HW BOMs for the HPC clusters, provide vendor management, and oversee HW release activities.
· Use their strong skills with the Linux OS to configure appropriate operating systems for the HPC system
· Understand and assemble the project specifications and performance requirements at the subsystem and system levels. Adhere to and drive project timelines to ensure program milestones are completed on time.
· Support design and release of new products to manufacturing and ultimately the customer, providing quality golden images, procedures, scripts and documentation to the manufacturing team and customer support team.
Required Qualifications:
· Validated, in-depth, and flavor-agnostic knowledge of Linux systems (SUSE, Red Hat, Rocky, Ubuntu)
· Experience designing and maintaining robust storage systems
· Strong HPC HW knowledge, especially in the server, GPU, networking, storage, BIOS, and BMC areas.
· Experience with systemd, network boot (PXE), and Linux HA.
· Strong understanding of TCP/IP fundamentals and knowledge of protocols such as DNS, DHCP, HTTP, LDAP, and SMTP.
· Ability to code and develop Shell and Python scripts.
· Experience with one or more configuration management tools (Salt, Chef, Puppet, etc.).
Preferred Qualifications:
· Strong DevOps focus: knowledge of setting up a continuous development pipeline (Jenkins), repository software (Git-based), and Singularity and Docker containers.
· Kubernetes, Prometheus & Grafana experience
· Knowledge of Apache/Nginx, setting up proxies and reverse proxies, application server routing, and load balancing (HAProxy)
· BS or MS degree plus 6 to 10 years of validated experience
· Computer Engineering, Electrical Engineering, or related fields
Skills and Abilities:
· Team Orientation & Interpersonal – Highly motivated teammate with ability to develop and maintain collaborative relationships with all levels within and external to the organization.
· Organization & Time Management – Able to plan, schedule, organize, and follow up on tasks related to the job to achieve goals within or ahead of established time frames.
· Multi-tasking – Ability to organize, coordinate, manage, prioritize, and perform multiple tasks simultaneously, and to swiftly assess a situation, determine a logical course of action, and apply the appropriate response.
· Adaptability to Change – Able to be flexible and supportive, and able to assimilate change positively and proactively in a rapid-growth environment.
· Outstanding teammate with excellent written and verbal communication skills.
Job Type: Full-time
Pay: ₹1,500,000.00 - ₹3,500,000.00 per year
Work Location: In person