Senior Systems Engineer, Site Reliability - Autonomous Vehicles

  • Full Time
NVIDIA is looking for a motivated Site Reliability Engineer to join our Tegra Solution Engineering team, which develops hardware and software systems to power the DRIVE Sim and DRIVE Constellation platforms. NVIDIA DRIVE Sim and DRIVE Constellation provide a digital testing environment for autonomous vehicle developers to create, develop, deploy, and validate their applications.

Our Site Reliability Team focuses on enhancing the reliability of our platform by carefully measuring the user experience, linking it to the health of our platform, actively responding to outages, and collaborating with our internal and external partners for continuous improvement.

As a Site Reliability Engineer for DRIVE Sim and DRIVE Constellation, you'll work alongside the DRIVE Constellation platform team to design cloud deployment plans for customers' digital testing platforms. You'll establish best practices, choose and develop the tools and automation that enhance the platform's reliability, and drive roadmaps.

On the NVIDIA Tegra Solution Engineering team, we expect everyone to be highly independent, an exceptional teammate, and passionately committed to the mission. We work independently and team up when necessary, and everyone here is focused on their life's work.

What you'll be doing:
- Collaborating with customers to define digital test architecture based on the NVIDIA DRIVE Constellation platform
- Coordinating with the Constellation Platform team and SRE leadership to understand customer needs and translate them into deployment plans
- Leading and maintaining physical servers, switches, and storage devices in lab and data-center environments
- Installing and provisioning new hardware and software for Linux-based systems (Ubuntu)
- Automating configuration management, software updates, and maintenance of system availability using modern DevOps tools (Ansible, GitLab, etc.)
- Providing technical support for production system deployments
- Planning and maintaining new systems that support the NVIDIA DRIVE Sim and Automated Driving Software stacks
- Working closely with software engineers and hardware architects to troubleshoot problems, identify new needs, and improve workflows
- Building, deploying, and providing production support for any services you work on
- Diagnosing and resolving hardware, network, and software problems

What we need to see:
- BS or equivalent in a computer-related field
- Minimum of 3 years of experience
- Proven ability to script in Bash and at least one high-level language (preferably Python)
- Experience working with Linux servers and technologies such as Ansible, Git, and Docker
- Profound understanding of operating systems, computer networks, and high-performance applications
- Excellent verbal and written communication skills

Ways to stand out from the crowd:
- A passion for providing high-quality support for your users
- Experience maintaining cloud infrastructure applications
- Exceptional teamwork skills across various barriers
- Experience with computer algorithms and an ability to select the best possible algorithms to tackle the scaling challenge

The base salary range is 144,000 USD - 224,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits.

NVIDIA accepts applications on an ongoing basis.