Pune, Maharashtra, India
Information Technology
Full-Time
Cummins Inc.
Description
Although the role category specified in the GPP is Remote, this position requires a hybrid working arrangement.
Key Responsibilities
- Design & Development: Design and automate the deployment of distributed systems for ingesting and transforming data from various types of sources (relational, event-based, unstructured).
- Data Quality & Integrity: Design and implement frameworks for continuously monitoring and troubleshooting data quality and integrity issues.
- Data Governance: Implement data governance processes, ensuring effective management of metadata, access, and retention for both internal and external users.
- ETL Pipelines: Design and provide guidance on building reliable, efficient, and scalable data pipelines that integrate data from diverse sources using ETL/ELT tools or custom scripting (a minimal PySpark sketch follows this list).
- Database Optimization: Develop and implement physical data models and optimize database performance using efficient indexing and table relationships.
- Cloud Data Solutions: Create and manage large-scale data storage and processing solutions using cloud-based platforms such as Azure Databricks, Data Lakes, Hadoop, and NoSQL databases (e.g., Cassandra, MongoDB).
- Automation & Productivity: Leverage modern tools and techniques to automate repeatable data preparation and integration tasks, minimizing manual effort and error-prone processes.
- Agile Methodologies: Participate in agile development practices such as DevOps, Scrum, and Kanban to ensure timely delivery of critical analytics initiatives.
- Mentorship & Collaboration: Mentor junior developers, collaborate with cross-functional teams, and contribute to the overall success of the data platform.
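To ground the ETL Pipelines responsibility, here is a minimal PySpark sketch of a batch pipeline that extracts from a relational source over JDBC, applies basic transformations, and loads partitioned files into a data lake. The connection details, table names, and storage paths are illustrative placeholders, not details from this posting.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("orders_batch_etl").getOrCreate()

# Extract: read a table from a relational source over JDBC.
# URL, table, and credentials are illustrative placeholders.
orders = (
    spark.read.format("jdbc")
    .option("url", "jdbc:sqlserver://example-host:1433;databaseName=sales")
    .option("dbtable", "dbo.orders")
    .option("user", "etl_user")
    .option("password", "********")
    .load()
)

# Transform: deduplicate, filter bad rows, derive a partition column.
clean = (
    orders.dropDuplicates(["order_id"])
    .filter(F.col("order_total") > 0)
    .withColumn("order_date", F.to_date("order_ts"))
)

# Load: write partitioned files to a data lake path (placeholder).
(
    clean.write.mode("overwrite")
    .partitionBy("order_date")
    .parquet("abfss://lake@exampleacct.dfs.core.windows.net/curated/orders")
)
```

On Azure Databricks the final write would typically target format("delta") rather than Parquet so the table participates in the Lakehouse.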
Qualifications:
Knowledge/Skills
- Proven track record in developing efficient data pipelines and mentoring junior developers.
- Hands-on experience with Spark (Scala/PySpark), SQL, and Spark Streaming (a streaming-to-Delta sketch follows this list).
- Proficient in troubleshooting and optimizing batch/streaming data pipeline issues.
- Expertise in Azure Cloud Services (Azure Databricks, ADLS, EventHub, EventGrid, etc.).
- Strong understanding of data models (SQL/NoSQL), including Delta Lake or Lakehouse.
- Experience with CI/CD tools for automating deployments.
- Knowledge of big data storage strategies, performance optimization, and database indexing.
- Familiarity with Agile software development methodologies.
- Understanding of the machine learning lifecycle and experience integrating ML models into data pipelines.
- Exposure to open-source big data technologies and IoT.
- Familiarity with building analytical solutions in cloud environments.
- Experience with large-file movement and data-extraction tools.
- A degree in Computer Science, Engineering, Information Technology, or a related field, or equivalent relevant experience is required.
- Additional certifications in Azure, Spark, or cloud-based data engineering solutions are a plus.
- System Requirements Engineering: Ability to translate stakeholder needs into verifiable system requirements, ensuring alignment with project goals.
- Collaboration: Strong ability to build partnerships and work effectively within cross-functional teams to achieve shared objectives.
- Effective Communication: Skilled in delivering clear communications to diverse audiences, both technical and non-technical.
- Customer Focus: Dedicated to building strong customer relationships and delivering solutions that meet their needs.
- Problem Solving: Proficient in using systematic analysis and industry-standard methodologies to solve complex technical challenges.
- Data Quality: Knowledgeable in identifying, understanding, and correcting data quality issues across operational processes.
- Solution Documentation & Testing: Thorough in documenting solutions and validating them through structured testing practices to ensure they meet business requirements.
- Decision Making: Able to make timely, data-driven decisions that maintain project momentum.
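As a sketch of the Spark Streaming and Delta Lake items above: the snippet below reads events from an Azure Event Hub via its Kafka-compatible endpoint and appends them to a Delta table. The namespace, topic, schema, and paths are assumptions for illustration, and the SASL authentication options Event Hubs requires are omitted for brevity.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F
from pyspark.sql.types import StructType, StructField, StringType, DoubleType

spark = SparkSession.builder.appName("telemetry_stream").getOrCreate()

# Assumed payload schema for the JSON events (placeholder fields).
schema = StructType([
    StructField("device_id", StringType()),
    StructField("reading", DoubleType()),
    StructField("event_ts", StringType()),
])

# Read from Event Hubs through its Kafka-compatible endpoint.
# SASL/SSL auth options are required in practice but omitted here.
raw = (
    spark.readStream.format("kafka")
    .option("kafka.bootstrap.servers", "example-ns.servicebus.windows.net:9093")
    .option("subscribe", "telemetry")
    .option("startingOffsets", "latest")
    .load()
)

# Kafka delivers raw bytes; parse the JSON payload into typed columns.
events = (
    raw.select(F.from_json(F.col("value").cast("string"), schema).alias("e"))
    .select("e.*")
    .withColumn("event_ts", F.to_timestamp("event_ts"))
)

# Append to a Delta table; the checkpoint enables restart without data loss.
query = (
    events.writeStream.format("delta")
    .option("checkpointLocation", "/mnt/lake/_checkpoints/telemetry")
    .outputMode("append")
    .start("/mnt/lake/bronze/telemetry")
)
```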
Skills & Experience:
- Experience:
  - 6-8 years of hands-on experience in data engineering, with a focus on building data pipelines and working with cloud-based data solutions (preferably Azure Databricks).
  - Advanced knowledge of Spark (Scala/PySpark), SQL, and cloud platforms such as Azure.
  - Familiarity with the design, development, and maintenance of large-scale data storage solutions (Hadoop, NoSQL databases, Data Lakes).
  - Experience in mentoring junior developers and working in Agile development teams.
- Technical Skills:
  - Advanced proficiency in SQL and Spark.
  - Expertise in data pipeline design and automation.
  - Knowledge of cloud data services (Azure Databricks, ADLS, EventHub, EventGrid).
  - Experience with CI/CD tools for pipeline deployment automation.
  - Familiarity with big data tools such as Hive, Kafka, HBase, and the use of Delta Lake.
  - Experience in building and optimizing ETL/ELT pipelines (a Delta Lake upsert sketch follows this list).
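As a sketch of an optimized ELT pattern on Delta Lake (assuming a Databricks or delta-spark runtime): an incremental batch is merged into a curated table as a single atomic upsert instead of a full rewrite. The key column and paths are hypothetical.

```python
from delta.tables import DeltaTable
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("orders_upsert").getOrCreate()

# Incremental batch of new/changed rows (placeholder staging path).
updates = spark.read.parquet("/mnt/lake/staging/orders_changed")

# Existing curated Delta table (placeholder path).
target = DeltaTable.forPath(spark, "/mnt/lake/silver/orders")

# Upsert in one atomic commit: update matched keys, insert new ones.
(
    target.alias("t")
    .merge(updates.alias("s"), "t.order_id = s.order_id")
    .whenMatchedUpdateAll()
    .whenNotMatchedInsertAll()
    .execute()
)
```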
This role requires collaboration with stakeholders in the US, with an expected overlap of 2-3 hours during EST working hours on an as-needed basis.
Job Systems/Information Technology
Organization Cummins Inc.
Role Category Remote
Job Type Exempt - Experienced
ReqID 2412312
Relocation Package No