Senior Machine Learning Engineer- GenAI Runtime

Job expired!
Founded in late 2020 by a group of machine learning engineers and researchers, MosaicML enables companies to securely fine-tune, train and deploy bespoke AI models on their data, ensuring maximum security and control. Compatible with all major cloud providers, MosaicML provides maximum flexibility for AI development. By 2023, MosaicML’s pre-trained transformer models became a new standard for open source, commercially usable LLMs, and have been downloaded over 3 million times. MosaicML believes that a company’s AI models are just as valuable as any other core IP and that top-quality AI models should be accessible to all. Now part of Databricks since July 2023, we are passionate about enabling our customers to solve the world's most challenging problems — from developing the next mode of transportation to accelerating the development of medical breakthroughs. We achieve this by creating and operating the world's finest data and AI platform, allowing our customers to use deep data insights to improve their business. Summary: We are seeking top-tier Machine Learning Engineers to empower businesses to deploy LLMs and advanced generative models in production. Your role will include building and maintaining Mosaic’s ML Runtime, which enables customers to employ their unique data to develop models with optimal quality, performance, and cost. Expectations: - Design and implement tools and open source technologies to enable the development of automated ML pipelines for data preprocessing, model training, hyperparameter tuning, and model evaluation for Databricks' customers. - Design and implement robust, scalable ML infrastructure and model serving components. - Implement advanced optimization techniques to reduce the resource footprint of models while retaining their performance. - Collaborate with product managers and cross-functional teams to drive technology-first initiatives. - Support our user community through documentation, talks, tutorials, and collaborations. - Contribute to the broader AI community. Position requirements: - 4+ years of full time industry experience - Experience with deep learning frameworks (e.g. PyTorch, TensorFlow) - Experience with GPUs and alternative deep learning accelerators - Strong sense of design and usability - Effective communication skills. - Prior history of contributing to or developing open source projects is appreciated but not mandatory. About Databricks Databricks is the data and AI company. More than 9,000 organizations worldwide rely on the Databricks Lakehouse Platform to unify their data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe. Founded by the initial creators of Apache Spark™, Delta Lake and MLflow, Databricks aims to help data teams solve the world’s most challenging problems. Our Commitment to Diversity and Inclusion At Databricks, we are committed to fostering a diverse and inclusive culture. We ensure our hiring practices are inclusive and meet equal employment opportunity standards. Individuals seeking employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics. Local Pay Range: $166,000 – $225,000 USD.