Free cookie consent management tool by TermsFeed Data Engineer-Python,PySpark,SQL ,Spark Architecture,Azure Databricks | Antal Tech Jobs
Back to Jobs
1 Week ago

Data Engineer-Python,PySpark,SQL ,Spark Architecture,Azure Databricks

decor
Coimbatore, Tamil Nadu, India
Information Technology
Full-Time
Siemens Healthineers

Overview

As a Data Engineer, you are required to:

Design, build, and maintain data pipelines that efficiently process and transport data from various sources to storage systems or processing environments while ensuring data integrity, consistency, and accuracy across the entire data pipeline.

Integrate data from different systems, often involving data cleaning, transformation (ETL), and validation. Design the structure of databases and data storage systems, including the design of schemas, tables, and relationships between datasets to enable efficient querying. Work closely with data scientists, analysts, and other stakeholders to understand their data needs and ensure that the data is structured in a way that makes it accessible and usable.

Stay up-to-date with the latest trends and technologies in the data engineering space, such as new data storage solutions, processing frameworks, and cloud technologies. Evaluate and implement new tools to improve data engineering processes.


Qualification
: Bachelor's or Master's in Computer Science & Engineering, or equivalent. Professional Degree in Data Science, Engineering is desirable.


Experience level
: At least 3 - 5 years hands-on experience in Data Engineering


Desired Knowledge & Experience
:

  • Spark: Spark 3.x, RDD/DataFrames/SQL, Batch/Structured Streaming
    • Knowing Spark internals: Catalyst/Tungsten/Photon
  • Databricks: Workflows, SQL Warehouses/Endpoints, DLT, Pipelines, Unity, Autoloader
  • IDE: IntelliJ/Pycharm, Git, Azure Devops, Github Copilot
  • Test: pytest, Great Expectations
  • CI/CD Yaml Azure Pipelines, Continuous Delivery, Acceptance Testing
  • Big Data Design: Lakehouse/Medallion Architecture, Parquet/Delta, Partitioning, Distribution, Data Skew, Compaction
  • Languages: Python/Functional Programming (FP)
  • SQL: TSQL/Spark SQL/HiveQL
  • Storage: Data Lake and Big Data Storage Design

additionally it is helpful to know basics of:

  • Data Pipelines: ADF/Synapse Pipelines/Oozie/Airflow
  • Languages: Scala, Java
  • NoSQL: Cosmos, Mongo, Cassandra
  • Cubes: SSAS (ROLAP, HOLAP, MOLAP), AAS, Tabular Model
  • SQL Server: TSQL, Stored Procedures
  • Hadoop: HDInsight/MapReduce/HDFS/YARN/Oozie/Hive/HBase/Ambari/Ranger/Atlas/Kafka
  • Data Catalog: Azure Purview, Apache Atlas, Informatica


Required Soft skills & Other Capabilities
:

Great attention to detail and good analytical abilities.

Good planning and organizational skills

Collaborative approach to sharing ideas and finding solutions

Ability to work independently and also in a global team environment.

Share job
Similar Jobs
View All
1 Hour ago
MTS II - Software Engineer
Information Technology
  • 4 - 7 Yrs
  • Pune
MAJOR RESPONSIBILITIES • Design, implement, integrate, and verify software applications and tools using JavaScript, NodeJS, and C++. • Enhance, optimize, and improve the efficiency and robustness of current software, with a particular focus on OSS ...
decor
1 Day ago
Business Advisory Analyst
Information Technology
  • Bangalore, Karnataka, India
Skill required: Banking Services - Core BankingDesignation: Business Advisory AnalystQualifications:BBA/BCom/Master of Business AdministrationYears of Experience:3 to 5 yearsAbout AccentureAccenture is a global professional services company with lea...
decor
1 Day ago
Front End Developer
Information Technology
  • Bangalore, Karnataka, India
Position Title: Front End DeveloperCompany: Johnson Controls (JCI)Location: BangaloreJob Summary: We are seeking a talented Front End Developer with 4-7 years of experience to join our dynamic team. The ideal candidate will have a strong background ...
decor
1 Day ago
Database Engineer III (Big Data)
Information Technology
  • Bangalore, Karnataka, India
LivePerson (NASDAQ: LPSN) is the global leader in enterprise conversations. Hundreds of the world’s leading brands — including HSBC, Chipotle, and Virgin Media — use our award-winning Conversational Cloud platform to connect with millions of consume...
decor
1 Day ago
Data Scientist Manager
Information Technology
  • Bangalore, Karnataka, India
Job DescriptionLeads a team of people who design, develop and program methods, processes, and systems to consolidate and analyze unstructured, diverse “big data” sources to generate actionable insights and solutions for client services and product e...
decor
1 Day ago
Data Scientist Manager
Information Technology
  • Bangalore, Karnataka, India
Job DescriptionLeads a team of people who design, develop and program methods, processes, and systems to consolidate and analyze unstructured, diverse “big data” sources to generate actionable insights and solutions for client services and product e...
decor
1 Day ago
Sr. QA Engineer
Information Technology
  • Bangalore, Karnataka, India
Role Summary:Picarro is seeking an exceptional Sr. QA Engineer for functional testing of Picarro Analyzers. This role expects you to analyze requirements, create and execute test-plan, and record results in test-repo. This person is also expected to...
decor
1 Day ago
C++ Graphics and Windowing System Software Engineer - Mir
Information Technology
  • Bangalore, Karnataka, India
We build a high-performance, high-efficiency stack for window managers and display subsystems in C++, called Mir. We're growing the team and looking for new colleagues who share our passion for precision, performance and user experience.Our goal is ...
decor

Talk to us

Feel free to call, email, or hit us up on our social media accounts.
Social media