
Overview
Job Summary:
We are looking for a skilled and motivated Python Data Engineer with strong hands-on experience in Pydantic, PySpark, and NumPy to join our growing data engineering team. The ideal candidate has a solid understanding of data processing, data validation, and scalable data pipelines. This role requires availability during Central Standard Time (CST) business hours.
Key Responsibilities:
Design, build, and maintain scalable data processing pipelines using PySpark.
Utilize Pydantic for structured data validation and serialization in Python applications.
Work with large-scale datasets using NumPy and other Python data libraries.
Develop efficient, reusable, and testable code with a focus on performance and maintainability.
Collaborate with cross-functional teams including data scientists, product managers, and backend developers.
Participate in code reviews, design discussions, and system architecture planning.
Troubleshoot and optimize data workflows for speed, reliability, and scalability.
Ensure adherence to best practices in data engineering and software development.
Required Skills & Qualifications:
4+ years of experience in Python programming with a focus on data engineering or backend systems.
Strong expertise in:
- Pydantic (data models, validation, schemas)
- PySpark (DataFrames, RDDs, performance tuning)
- NumPy (numerical computations, large dataset handling)
Familiarity with version control (Git), CI/CD pipelines, and agile development practices.
Experience working in cloud environments (AWS, GCP, or Azure) is a plus.
Must be able to work and collaborate during CST business hours.
Preferred Qualifications:
Experience with additional Python libraries (e.g., Pandas, FastAPI, SQLAlchemy).
Knowledge of distributed computing frameworks and data lake architectures.
Exposure to containerized environments (Docker, Kubernetes).
Bachelor's or Master's degree in Computer Science, Data Engineering, or a related field.
Job Type: Full-time
Pay: ₹25,000.00 - ₹70,000.00 per month
Schedule:
- Day shift
Experience:
- Python: 3 years (Required)
- PySpark: 3 years (Required)
Work Location: Remote