Tech Lead Manager (TLM) - Supercomputing Scheduling
- Other
- San Francisco
- 06/12/2024
- -
About the Team: The Supercomputing Scheduling Pillar at OpenAI focuses on reliability, scalability, and user-friendliness in job lifecycle management. We pride ourselves on providing efficient and flexible job scheduling, quota management, and streamlined job execution workflows. Our goal is to enhance researcher productivity by ensuring high goodput, efficient packing, and a consistent, ergonomic training workflow, scaling up to larger supercomputers while minimizing operational load.
About the Role: As a Tech Lead Manager (TLM) / Engineering Manager within our Scheduling Pillar, you will lead a dynamic team that designs, deploys, and manages job lifecycle management systems for model training on some of the world's largest supercomputers. This role offers an immense scale, tight timelines, and the chance to significantly impact OpenAI’s mission. A deep technical understanding is essential, though not specifically in ML/DL.
This position is based in San Francisco, CA, and follows a hybrid work model with three days in-office per week. Relocation assistance is available for qualified candidates.
You might be a perfect fit if you:
Experience with AI/ML workloads is an asset but not required.
OpenAI is committed to advancing AI technology that can profoundly benefit all of humanity. Our core mission is to ensure that the development of artificial intelligence is conducted with safety and public welfare in mind. We welcome diverse perspectives and are proud to be an equal opportunity employer.
If you’re ready to shape the future of technology, apply today to join our team at OpenAI!
For more information on our privacy policies and employment regulations, please visit our career page.