Senior Applied Machine Learning Engineer, NeMo Microservices

Join NVIDIA: Senior Applied Machine Learning Engineer for NeMo Microservices Suite Platform

For over 25 years, NVIDIA has been at the forefront of computer graphics, PC gaming, and accelerated computing innovation. Our technological advancements, driven by amazing teams, are now pushing the boundaries of AI to define the future of computing. Our GPUs are the brains behind computers, robots, and self-driving cars that understand the world like never before.

Why Join Us?

At NVIDIA, you'll immerse yourself in a diverse and supportive environment where innovation thrives. Our mission is to turn visionary ideas into groundbreaking products. Joining our team means working alongside the industry’s best talent to make a lasting impact on the world.

About the Role: Senior Applied Machine Learning Engineer

We are seeking a Senior Applied Machine Learning Engineer to join the team building the NeMo Microservices Suite Platform. Our objective is to create next-generation AI services and interfaces that enhance and deploy AI at scale. Our focus includes developing cutting-edge speech, vision, and NLP technologies to solve real-world problems. We actively participate in all phases of the machine learning lifecycle, from initial conception through applied research to optimized inference and deployment.

What You'll Be Doing:

  • Developing a new generation of Compound AI Systems platforms with reasoning capabilities across multiple modalities such as images, videos, audio, and text.
  • Creating distributed cloud applications, microservices, and MLOps platforms that scale up to huge models.
  • Building microservices for task-specific AI cloud services.
  • Implementing core infrastructure for cloud-native AI training and inference.
  • Pursuing optimal performance under high load conditions.

What We Need to See:

  • BS, Master's, or equivalent experience in computer science, computer architecture, or a related field.
  • 5+ years of experience in the field.
  • Exceptional coding skills aimed at high-quality software creation.
  • Capability of working independently, defining project goals and scope, interacting with the open-source community, and managing development efforts.
  • Experience with implementing microservices and cloud-native applications using technologies like HTTP REST, gRPC, protobuf, and JSON.
  • Proficiency in deploying applications on Kubernetes platforms and familiarity with Helm charts, Kustomize, and Kubernetes operators.
  • In-depth understanding of performance, security, and reliability in complex distributed infrastructure.
  • Excellent programming and software design skills in Python or Golang, including debugging, performance, service health analysis, and test design.

Ways to Stand Out from the Crowd:

  • Experience deploying machine learning or statistical models into production environments using frameworks like PyTorch, TensorFlow, ONNX Runtime, and TensorRT.
  • Knowledge of deep learning frameworks such as Megatron Core, NeMo, Hugging Face Accelerate, Hugging Face Transformers, DeepSpeed, and similar.
  • Experience with MLOps orchestration platforms such as Seldon Core, KServe, BentoML, and similar.
  • Expertise with inference engines like vLLM, TensorRT-LLM, and similar.
  • Background in developing production NLP systems and operating high-availability environments.

Compensation and Benefits

The base salary range for this role is 148,000 USD - 276,000 USD, depending on your location, experience, and comparable salaries within the company. Additionally, you will be eligible for equity and benefits.

Equal Opportunity Employment

At NVIDIA, we are deeply committed to fostering a diverse work environment. We are proud to be an equal opportunity employer and welcome applications on an ongoing basis. Our hiring and promotion practices do not discriminate based on race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

Company Name: NVIDIA

Join us and be part of a team that's changing the world through technology!