Data Engineer

  • Full Time
Job expired!
About Us Exscientia is an AI-driven pharmatech firm dedicated to the discovery, design, and development of the best possible drugs using the most effective and fastest methods. Exscientia pioneered the first-ever operational precision oncology platform capable of guiding treatment selection and enhancing patient outcomes in a forward-looking, interventional clinical study. It also advances the integration of AI-designed small molecules into clinical settings. Our product pipeline showcases our capability to swiftly turn scientific ideas into precision-designed therapeutic candidates, with more than 15 projects currently in progress. By creating superior drugs at a quicker pace, we believe the best scientific concepts can quickly become the best medicines for patients. The Role We are looking for an experienced Data Engineer to join the Tech Group of the Precision Medicine unit. You will develop innovative data pipelines and warehousing solutions serving as the base for our pharmacoscopy image analysis platform and supporting Exscientia's ground-breaking research. You will build systems that can accommodate tens of millions of high-resolution images and sophisticated biological omics and clinical metadata. Your work will ensure not only that our data is stored and utilized according to FAIR principles, but more importantly, support the scientists at Exscientia who are working tirelessly to deliver the right drug to the right patients. You will have the opportunity to: - Work on highly impactful data problems, in support of drug discovery. - Operate at the intersection of big data and biotech research. - Use modern, cloud native tools. - Contribute to design decisions. - Work in a collaborative and supportive team environment. - Receive training on the latest novel technologies. Requirements Essential: - BSc in Computer Science, Data Engineering or a related field. - 3+ years of experience as a Data Engineer in an agile environment. - Basic understanding of general machine learning principles. - Familiarity with FAIR data principles. - Experience with technologies such as Python 3.6+, relational databases (eg. PostgreSQL, MySQL), orchestration frameworks (eg. Airflow, Luigi, Nextflow, Prefect), REST APIs, Docker + Linux, GCP, AWS or Azure. - Highly motivated with an ownership mindset and a proactive approach to innovation and creative problem-solving. - Strong team working skills and ability to work in a multidisciplinary team environment. - Excellent written and verbal communication skills. Desirable: - Understanding of clinical data management and imaging data. - Knowledge of Next Generation Sequencing data. - Experience with technologies such as BaseRow, FastAPI, Django, NextFlow, PySpark, GCP, or the R programming language. Benefits - Join our team to make a positive impact on patients' lives by revolutionizing the pharmaceutical industry through AI-driven discovery. - Begin or further expand your career, with the opportunity to gain valuable skills, work on meaningful problems and learn from leading technological and scientific professionals. - A highly competitive compensation package in line with the Austrian collective bargaining agreement, beginning from €44,000 gross per year. We will negotiate an attractive package based on your background and work experience. - We prioritize the health and well-being of our team, thus offer a generous vacation allowance, along with wellness and mindfulness support. - We provide daily lunches, regular team events, and ample opportunities for casual networking. - An opportunity to join an inclusive, collaborative, and intellectually stimulating environment. - Hear directly from our team about why they enjoy working with us [here](https://www.exscientia.ai/careers).