Overview
Cloudwick is a leading AWS-certified data lake and advanced analytics partner. We provide build and buy data lake services and solutions that solve an organizational need for production-ready, data lake powered analytics. Cloudwick has customers mainly in the USA and EMEA. Cloudwick is headquartered in the USA with offices in California, London and Bangalore.
Cloudwick is committed to employing a diverse workforce. Qualified applicants will receive consideration without regard to race, color, religion, sex, national origin, age, sexual orientation, gender identity, gender expression, veteran status, or disability.
Please visit our website: https://cloudwick.com
Job Summary:
About the Job:
The Community You Will Join:
At Cloudwick, you’ll be part of a dedicated team of professionals passionate about fostering a dynamic and inclusive workplace culture, with a specific focus on transforming healthcare through data.
A Typical Day:
● Design, develop, and maintain ETL processes and data pipelines with Scala/PySpark, ensuring seamless integration of healthcare data formats such as HL7 and FHIR.
● Collaborate with data scientists, analysts, and healthcare stakeholders to understand data requirements and model Electronic Health Record (EHR) and Electronic Case Report (ECR) data for high-quality, compliant data solutions.
● Optimize and tune data pipelines for performance and scalability, ensuring rapid access to critical healthcare information.
● Ensure data quality and integrity through robust testing and validation processes, adhering to healthcare regulations.
● Implement data governance and security best practices to protect sensitive patient information.
● Monitor and troubleshoot data pipelines to ensure continuous data flow, promptly addressing any issues to maintain operational efficiency.
● Stay up to date with the latest trends and technologies in data engineering, particularly in the healthcare sector.
What You Bring to the Table:
● B.E/B.Tech, preferably in Computer Science or Engineering.
● 5+ years of experience in handling data and designing ETL pipelines, including a mandatory 4+ years of experience writing PySpark code, specifically in healthcare applications.
● Proven experience working on AWS using services like S3, Glue, Redshift, Lambda, IAM, and DynamoDB, focusing on healthcare data management.
● Demonstrated experience in capturing data requirements from product owners and business users within the healthcare domain, particularly around EHR and ECR systems.
● Good to have: exposure to data modeling, data analytics, and design in both batch processing and real-time streaming, especially for healthcare applications.
● Solid understanding of data mapping, data processing patterns, distributed computing, and building applications for real-time and batch analytics in a healthcare context, including HL7 and FHIR standards.
● Strong programming skills in design and implementation using Python and PySpark.
● Good exposure to database architecture with Redshift, specifically for healthcare datasets.
● Experience with multiple file formats such as Avro, Parquet, ORC, and JSON, particularly in the context of healthcare data analytics.
● Experience developing, constructing, testing, and maintaining architectures for data lakes, data pipelines, data warehouses, and large-scale data processing systems on AWS, focusing on healthcare applications.
● Extensive experience using Spark, Scala, PySpark, Python, and SQL to handle complex healthcare data transformations.
● Hands-on experience in using AWS services like S3, Glue, Lambda, Redshift, and IAM for healthcare solutions.
● Experience with client interactions in the healthcare domain, ensuring the delivery of data solutions that meet regulatory standards.
● Proficient in writing complex SQL queries for healthcare data analysis.
● Preferred: experience in the healthcare domain, with knowledge of EHR and ECR and familiarity with HL7 and FHIR standards.
● AWS certifications relevant to data engineering and healthcare solutions are a plus.
Location: Bangalore (complete work from office)
Job Type: Full-time
Pay: ₹558,324.90 - ₹1,938,521.88 per year
Schedule:
- Morning shift
Education:
- Bachelor's (Preferred)
License/Certification:
- HL7/FHIR (Preferred)
Work Location: In person
Expected Start Date: 24/04/2025