Senior Solutions Architect - Generative AI

Job expired!

Senior Solutions Architect - Generative AI at NVIDIA

Are you passionate about cutting-edge technology and AI innovations? NVIDIA is seeking a dynamic and experienced Generative AI Solution Architect with specialized expertise in training Large Language Models (LLMs) and implementing workflows based on Pretraining, Finetuning LLMs & Retrieval-Augmented Generation (RAG).

About the Role

As a key member of our AI Solutions team, you will play a pivotal role in architecting and delivering cutting-edge solutions that leverage NVIDIA's powerful generative AI technologies. This position requires a deep understanding of language models, particularly open-source LLMs, and a strong proficiency in designing and implementing RAG-based workflows.

Key Responsibilities

  • Architect end-to-end generative AI solutions focusing on LLMs training, deployment, and RAG workflows.
  • Collaborate closely with customers to understand their language-related business challenges and design tailored solutions.
  • Support pre-sales activities, including technical presentations and demonstrations of LLM and RAG capabilities.
  • Work closely with NVIDIA engineering teams to provide feedback and contribute to the evolution of generative AI software.
  • Engage directly with customers/partners to understand their requirements and challenges.
  • Lead workshops and design sessions to define and refine generative AI solutions focused on LLMs and RAG workflows.
  • Lead the training and optimization of Large Language Models using NVIDIA’s hardware and software platforms.
  • Implement strategies for efficient and effective training of LLMs to achieve optimal performance.
  • Design and implement RAG-based workflows to enhance content generation and information retrieval.
  • Work closely with customers to integrate RAG workflows into their applications and systems.
  • Stay abreast of the latest developments in language models and generative AI technologies.
  • Provide technical leadership and guidance on best practices for training LLMs and implementing RAG-based solutions.

Required Qualifications

  • Master's or Ph.D. in Computer Science, Artificial Intelligence, or equivalent experience.
  • 7-11+ years of hands-on experience in a technical AI role, with a strong focus on generative AI and training Large Language Models (LLMs).
  • Proven track record of successfully deploying and optimizing LLM models for inference in production environments.
  • In-depth understanding of state-of-the-art language models, including GPT-3, BERT, or similar architectures.
  • Expertise in training and fine-tuning LLMs using popular frameworks such as TensorFlow, PyTorch, or Hugging Face Transformers.
  • Proficiency in model deployment and optimization techniques for efficient inference on various hardware platforms, with a focus on GPUs.
  • Strong knowledge of GPU cluster architecture and the ability to leverage parallel processing for accelerated model training and inference.
  • Excellent communication and collaboration skills with the ability to articulate complex technical concepts to both technical and non-technical stakeholders.
  • Experience leading workshops, training sessions, and presenting technical solutions to diverse audiences.

Desirable Skills

  • Experience in deploying LLM models in cloud environments (e.g., AWS, Azure, GCP) and on-premises infrastructure.
  • Proven ability to optimize LLM models for inference speed, memory efficiency, and resource utilization.
  • Familiarity with containerization technologies (e.g., Docker) and orchestration tools (e.g., Kubernetes) for scalable and efficient model deployment.
  • Deep understanding of GPU cluster architecture, parallel computing, and distributed computing concepts.
  • Hands-on experience with NVIDIA GPU technologies and GPU cluster management.
  • Ability to design and implement scalable and efficient workflows for LLM training and inference on GPU clusters.

With competitive salaries and a generous benefits package, NVIDIA is widely considered one of the technology world’s most desirable employers. Our teams are comprised of some of the most forward-thinking and hardworking individuals. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!

NVIDIA is committed to fostering a diverse work environment and proudly stands as an equal-opportunity employer. We highly value diversity in our current and future employees and do not discriminate based on race, religion, color, national origin