Senior Solutions Architect - Generative AI at NVIDIA
Are you passionate about cutting-edge technology and AI innovations? NVIDIA is seeking a dynamic and experienced Generative AI Solution Architect with specialized expertise in training Large Language Models (LLMs) and implementing workflows based on Pretraining, Finetuning LLMs & Retrieval-Augmented Generation (RAG).
About the Role
As a key member of our AI Solutions team, you will play a pivotal role in architecting and delivering cutting-edge solutions that leverage NVIDIA's powerful generative AI technologies. This position requires a deep understanding of language models, particularly open-source LLMs, and a strong proficiency in designing and implementing RAG-based workflows.
Key Responsibilities
- Architect end-to-end generative AI solutions focusing on LLMs training, deployment, and RAG workflows.
- Collaborate closely with customers to understand their language-related business challenges and design tailored solutions.
- Support pre-sales activities, including technical presentations and demonstrations of LLM and RAG capabilities.
- Work closely with NVIDIA engineering teams to provide feedback and contribute to the evolution of generative AI software.
- Engage directly with customers/partners to understand their requirements and challenges.
- Lead workshops and design sessions to define and refine generative AI solutions focused on LLMs and RAG workflows.
- Lead the training and optimization of Large Language Models using NVIDIA’s hardware and software platforms.
- Implement strategies for efficient and effective training of LLMs to achieve optimal performance.
- Design and implement RAG-based workflows to enhance content generation and information retrieval.
- Work closely with customers to integrate RAG workflows into their applications and systems.
- Stay abreast of the latest developments in language models and generative AI technologies.
- Provide technical leadership and guidance on best practices for training LLMs and implementing RAG-based solutions.
Required Qualifications
- Master's or Ph.D. in Computer Science, Artificial Intelligence, or equivalent experience.
- 7-11+ years of hands-on experience in a technical AI role, with a strong focus on generative AI and training Large Language Models (LLMs).
- Proven track record of successfully deploying and optimizing LLM models for inference in production environments.
- In-depth understanding of state-of-the-art language models, including GPT-3, BERT, or similar architectures.
- Expertise in training and fine-tuning LLMs using popular frameworks such as TensorFlow, PyTorch, or Hugging Face Transformers.
- Proficiency in model deployment and optimization techniques for efficient inference on various hardware platforms, with a focus on GPUs.
- Strong knowledge of GPU cluster architecture and the ability to leverage parallel processing for accelerated model training and inference.
- Excellent communication and collaboration skills with the ability to articulate complex technical concepts to both technical and non-technical stakeholders.
- Experience leading workshops, training sessions, and presenting technical solutions to diverse audiences.
Desirable Skills
- Experience in deploying LLM models in cloud environments (e.g., AWS, Azure, GCP) and on-premises infrastructure.
- Proven ability to optimize LLM models for inference speed, memory efficiency, and resource utilization.
- Familiarity with containerization technologies (e.g., Docker) and orchestration tools (e.g., Kubernetes) for scalable and efficient model deployment.
- Deep understanding of GPU cluster architecture, parallel computing, and distributed computing concepts.
- Hands-on experience with NVIDIA GPU technologies and GPU cluster management.
- Ability to design and implement scalable and efficient workflows for LLM training and inference on GPU clusters.
With competitive salaries and a generous benefits package, NVIDIA is widely considered one of the technology world’s most desirable employers. Our teams are comprised of some of the most forward-thinking and hardworking individuals. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!
NVIDIA is committed to fostering a diverse work environment and proudly stands as an equal-opportunity employer. We highly value diversity in our current and future employees and do not discriminate based on race, religion, color, national origin