Data Engineer - data42

Join Novartis as a Data Engineer - data42

Discover an exciting opportunity to work with leading data scientists and domain experts on the data42 platform. Your role will be pivotal in answering scientific questions using multi-modal data. As a Data Engineer, your responsibilities will range from gathering use-case requirements to building the ETL processes and data pipelines that support data analysis.

Key Responsibilities:

  • Collaborate with domain experts, data scientists, and stakeholders to meet specific data needs.
  • Design, develop, test, and maintain ETL processes and data pipelines for data extraction, preparation, and iteration.
  • Implement and maintain data quality checks to ensure accurate and high-quality data.
  • Identify and rectify data inconsistencies and irregularities.
  • Promote a culture of transparency and communication about data modifications and lineage.
  • Advocate for data engineering best practices by ensuring ETL processes are efficient, well-documented, and tested.
  • Contribute to knowledge-sharing efforts across the data42 platform and the broader Novartis data engineering community.
  • Ensure compliance with security and governance principles.

Minimum Requirements:

  • Bachelor’s degree in computer science or another quantitative field (Mathematics, Statistics, Physics, Engineering, etc.), or equivalent practical experience.
  • Proven experience as a data engineer, data wrangler, or similar role.
  • Exceptional programming skills with expertise in Python, R, and Spark.
  • Experience with diverse data types, including images, tabular data, text, and other unstructured data.
  • Proficiency with scalable data processing engines and with data ingestion, extraction, and modeling.
  • Strong statistical knowledge and the ability to assess data quality and resolve inconsistencies.
  • Excellent communication and stakeholder management skills.
  • Ability to work independently and as part of global Agile teams.

Desirable Additional Skills:

  • Hands-on experience with Palantir Foundry (Code Repository, Code Workbook, Contour, Data Lineage, etc.).
  • Knowledge of CDISC data standards (SDTM, ADaM).
  • Experience using AI (e.g., GenAI/LLMs) for data wrangling.
  • Experience with pooling of clinical trial data.
  • High-level understanding of the drug discovery and development process.

Skills Desired:

Algorithms, Computer Programming, Computer Science, Computer Vision, Data Science, People Management, Project Management, Research and Development (R&D).

Company: Novartis

Job Title: Data Engineer - data42

Step into a role where your data engineering skills can drive scientific breakthroughs. Join our global team and contribute to transforming the future of data science. Apply today to become a Data Engineer at Novartis!