Data Architect

  • Full Time

Are you passionate about precision medicine and eager to advance the healthcare industry?

Recent advancements in core technology have finally enabled Artificial Intelligence (AI) to significantly impact clinical care. Tempus' proprietary platform connects a whole ecosystem of real-world evidence, delivering live, actionable insights to doctors. This provides crucial information about the appropriate treatments for the right patients, at the right time.

Tempus is on a mission to create the world's largest combined dataset of molecular and clinical data. At Tempus, products are managed and developed by small, self-reliant teams made up of developers, designers, data scientists, and product managers. You and your team set the objectives, develop the software, deploy the code, and contribute to a rapidly growing software platform that will have a lasting impact on the field of cancer research and treatment.

Tempus develops software that is as flexible as our teams. Our modern tech stack – React, NodeJS, and Python on GCP – empowers our teams to iterate quickly and lead our industry in innovation. Our decentralized microservice structure and focus on automation allow us to deliver advanced solutions with confidence and at scale.

What You’ll Do

  • Manage an enterprise data model in collaboration with engineers, product managers, scientists, and operators to blend structured data from various complex domains (clinical records, genomics, NGS lab, radiology, etc.).
  • Create and maintain entity-relationship diagrams, data dictionaries, API specifications, and data translation documentation at various levels of abstraction (conceptual, logical, physical) and across multiple data storage technologies (relational, NoSQL).
  • Champion and educate engineering team members on data modeling rules, standards, and best practices.
  • Evaluate the completeness of source system data models and data by profiling partner data.
  • Implement solutions for proactive data quality monitoring with traceability to source systems.

Why we’re looking for you:

  • You have domain knowledge in healthcare and next-generation sequencing data.
  • You have solid experience with, and knowledge of, 3NF, dimensional (star schema), and data vault modeling techniques.
  • You have demonstrated exceptional SQL skills in an enterprise data warehouse environment.
  • You are familiar with ETL/ELT and BI architectures, concepts, and frameworks.
  • You understand and can clearly articulate the long-term impacts of key decisions between database technologies (relational, MPP, NoSQL). You also have experience designing solutions across multiple technologies.
  • You have worked with data modeling tools such as Erwin, Vertabelo, or Lucidchart.

Bonus points for:

  • Experience with GCP architecture
  • Experience working with clinical and/or genomic data
  • Experience writing and debugging Python
  • Experience implementing master, reference, or metadata management solutions

 

We are an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.