With NVIDIA Metropolis, we are constructing the AI platform and technological bases of the next-generation of smart factories, cities, airports, fulfillment centers, stores, and more. By using LLMs, LMMs, Computer Vision, and sensor fusion, we are able to bring sensors to life, improving our comprehension, accelerating processes, and driving efficiency across all operations in our surroundings. Through the use of GenAI and Synthetic data, we are facilitating the creation of foundation models to transform operations across retail, supply chain, transportation, and digital industrialization. Collaborating with every industry leader and innovative startup, we are leading the charge in innovation for some of the largest economic activities and processes. We are seeking an AI Developer Technology Engineer to join our team and help bring our groundbreaking technologies to market and work with our key customers.
What you'll be doing:
- Using AI and NVIDIA SDKs, you will construct groundbreaking applications to support a global developer ecosystem.
- You will train, fine-tune, and quantify AI models with a variety of DL and NVIDIA frameworks like NeMo and TAO.
- You will be able to swiftly adjust to development requirements for edge applications to cloud-native APIs.
- You will work directly with key customers to understand current and future problems, assist in resolving issues, and enable SOTA performance.
- You will publish white papers, reference architectures, and OSS projects on Git Hub.
- You will develop and run benchmarks for future AI algorithms and collaborate in brainstorming and implementing optimization strategies on GPUs.
- You will closely collaborate with the architecture, research, and system software teams to build exciting demos.
- Some travel may be required.
What we need to see:
- B.S. or M.S. degree in Computer Science or a similar field with a proven track record.
- 5+ years of experience.
- Deep expertise in deep learning technologies and one or more frameworks.
- Experience building computer vision solutions with GPUs.
- Strong C++/Python programming skills.
- Familiarity with Apache Kafka, Apache Spark, Redis, Cassandra, and ElasticSearch.
- Experience in building and deploying containers in K8 environments.
- Experience in building REST APIs and web applications.
- Strong communication, presentation, and organizational skills, coupled with a logical approach to problem-solving, good time management, and task prioritization.
Ways to stand out from the crowd:
- Understanding of Large Language Models, Large Multi-Modal Models, Foundation Models, ViTs, their architectures, and trade-offs.
- Some experience building LLM applications using the Langchain framework with vector store.
- Experience with AI model and algorithm performance optimization.
- Familiarity with one or more of the following - Postman, Django, Flask, and Gradio.
- Ability to adapt to constantly changing environments and technology and to work directly with customers or partners.
The base salary range is 124,000 USD - 230,000 USD. Your base salary will be determined by your location, experience, and the wages of employees in similar positions. You will also be eligible for equity and benefits. NVIDIA accepts applications on a continual basis.