Senior Software Engineer, Deep Learning Inference Workflows

Job expired!

Join NVIDIA as a Senior Software Engineer in Deep Learning Inference Workflows

We are seeking a talented Senior Software Engineer to join our team, focusing on Deep Learning Inference Workflows. If you are passionate about making a significant impact in Deep Learning by developing a state-of-the-art inference framework to accelerate Deep Learning models, especially Large Language Models, on NVIDIA GPUs, this opportunity is for you!

Key Responsibilities

As a member of our Deep Learning software team, you will:

  • Develop components for TensorRT, NVIDIA’s SDK for high-performance deep learning inference.
  • Utilize C++ and Python to create graph parsers, optimizers, and tools for the efficient deployment of trained deep learning models.
  • Collaborate with deep learning experts, GPU architects, and DevOps engineers across diverse teams.

Qualifications

To be successful in this role, you should possess:

  • A Bachelor's, Master's, PhD, or equivalent in Computer Science, Computer Engineering, Electrical Engineering, or a related field.
  • 6+ years of software development experience.
  • Proficiency with C++11/C++14.
  • A strong understanding of Machine Learning concepts, particularly Natural Language Processing.
  • Excellent communication skills and an aptitude for collaboration and teamwork.

Preferred Skills

Ways to stand out from the crowd include:

  • Experience in developing System Software.
  • Proficiency in Python and background in GPU kernel programming using CUDA or OpenCL.
  • Experience with software performance benchmarking, profiling, and optimizations.
  • Background in compiler development.
  • Experience with TensorRT, PyTorch, TensorFlow, ONNX Runtime, or other ML frameworks.

Why NVIDIA?

NVIDIA is widely regarded as one of the tech world's most desirable employers. We boast some of the most forward-thinking and hard-working individuals. If you are creative, autonomous, and love challenges, we want to hear from you. Join our TensorRT Workflows team and contribute to building the real-time, cost-effective computing platform driving our success in this exciting and rapidly growing field.

Compensation and Benefits

The base salary range for this position is $180,000 - $339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar roles. Additionally, you will be eligible for equity and a comprehensive benefits package.

Diversity and Inclusion

NVIDIA is committed to fostering a diverse work environment and is proud to be an equal-opportunity employer. We highly value diversity in our current and future employees and do not discriminate based on race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

Apply today to join NVIDIA’s cutting-edge team and drive the next generation of Deep Learning Inference Workflows!

Company name: NVIDIA

Job title: Senior Software Engineer, Deep Learning Inference Workflows