Join Novartis as a Data Engineer - data42
Discover an exciting opportunity to work with leading data scientists and domain experts on the data42 platform. Your role will be pivotal in answering scientific questions using multi-modal data. As a Data Engineer, your responsibilities will span from gathering use-case requirements to building ETL processes and data pipelines for seamless data analysis.
Key Responsibilities:
- Collaborate with domain experts, data scientists, and stakeholders to meet specific data needs.
- Design, develop, test, and maintain ETL processes and data pipelines for data extraction, preparation, and iteration.
- Implement and maintain data quality checks to ensure accurate and high-quality data.
- Identify and rectify data inconsistencies and irregularities.
- Promote a culture of transparency and communication about data modifications and lineage.
- Advocate for data engineering best practices by ensuring ETL processes are efficient, well-documented, and tested.
- Contribute to knowledge-sharing efforts across the data42 platform and the broader Novartis data engineering community.
- Ensure compliance with security and governance principles.
Minimum Requirements:
- Bachelor’s degree in computer science or other quantitative fields (Mathematics, Statistics, Physics, Engineering, etc.) or equivalent practical experience.
- Proven experience as a data engineer, data wrangler, or similar role.
- Exceptional programming skills with expertise in Python, R, and Spark.
- Experience with diverse data types, including images, tabular, unstructured, and text data.
- Proficiency in scalable data processing engines, data ingestion, extraction, and modeling.
- Strong statistical knowledge and the ability to assess data quality and resolve inconsistencies.
- Excellent communication and stakeholder management skills.
- Ability to work independently and as part of global Agile teams.
Desirable Additional Skills:
- Hands-on experience with Palantir Foundry (Code Repository, Code Workbook, Contour, Data Lineage, etc.).
- Knowledge of CDISC data standards (SDTM, ADaM).
- Experience using AI (e.g., GenAI/LLMs) for data wrangling.
- Experience with pooling of clinical trial data.
- High-level understanding of the drug discovery and development process.
Skills Desired:
Algorithms, Computer Programming, Computer Science, Computer Vision, Data Science, People Management, Project Management, Research and Development (R&D).
Company: Novartis
Job Title: Data Engineer - data42
Step into a role where your data engineering skills can drive scientific breakthroughs. Join our global team and contribute to transforming the future of data science. Apply today to become a Data Engineer at Novartis!