Sr. Distinguished Engineer, Generative AI, Platform Agents and Tooling (Remote Eligible)

Job expired!

Join Capital One: Sr. Distinguished Engineer, Generative AI, Platform Agents and Tooling (Remote Eligible)

Location: 201 Third Street, San Francisco, California, United States

At Capital One, we're on a mission to create trustworthy, reliable, and human-in-the-loop AI systems to revolutionize banking. Leading the industry in applying machine learning, we strive to create real-time, intelligent, and automated customer experiences. From notifying customers about unusual charges to providing real-time assistance, our AI applications are simplifying and humanizing banking.

Thanks to our investments in public cloud infrastructure and machine learning platforms, we are uniquely positioned to harness the transformative power of AI. We are dedicated to building exceptional applied science and engineering teams, continuing our industry-leading capabilities, and delivering scalable, high-performance AI infrastructure.

About the Role

We are seeking an experienced Sr. Distinguished Engineer in AI Platforms to help us build the foundation of our enterprise AI capabilities. In this role, you will develop generic platform services to support applications powered by Generative AI, create SDKs and APIs, develop information retrieval agents, and models as a service for optimizing LLMs via RAG, among other tasks. You will also manage end-to-end coordination with operations, curate high-quality datasets, and help productionize models. Your collaboration with applied research and product teams will identify and prioritize ongoing and upcoming services.

Responsibilities

  • Develop abstracted platform services to support applications powered by Generative AI.
  • Create SDKs and APIs for applications including information retrieval, fraud detection, AI assistants, recommendations, and more on our AI platforms.
  • Design and build RAG service platform orchestrations, including prompt engineering, guardrails, vector databases, and API grounding.
  • Build a prompt management service through cross-organizational partnerships.
  • Stay updated with the latest advancements in operationalizing machine learning and GenAI technologies.
  • Implement capabilities to support MLOps for foundation models.

Basic Qualifications

  • Bachelor's degree in Computer Science, Computer Engineering, or a related technical field.
  • 9+ years of programming experience with Python, Go, Scala, or C/C++.
  • 6+ years of experience designing and deploying enterprise AI or ML applications.
  • 3+ years of experience implementing full lifecycle ML automation using MLOps.
  • 4+ years leading teams developing Machine Learning solutions.
  • 1+ year of experience with LLM-based conversational AI systems.

Preferred Qualifications

  • Master’s degree or PhD in Engineering, Computer Science, or a related technical field with a focus on modern AI techniques.
  • Strong problem-solving and analytical skills, with the ability to work independently and as part of a team.
  • Experience with Graph or Network Theory and Graph ML, including frameworks like Deep Graph Learning (DGL) or NetworkX.
  • Experience designing large-scale distributed platforms in cloud environments like AWS, Azure, or GCP.
  • Experience building abstracted SDKs and familiarity with Haystack or Langchain.
  • Experience architecting cloud systems for security, performance, scalability, and cost.
  • Experience delivering large models through the MLOps life cycle from exploration to serving.
  • Experience using Kubeflow Pipelines to deliver models to production.
  • Ability to navigate an environment with competing priorities and deadlines, preferably with experience in tech and product-driven companies/startups.
  • Experience in AI technology stack areas including prompt engineering, guardrails, vector databases, LLM hosting, and fine-tuning.
  • Experience with vertical integration of LLMs with enterprise applications.

Compensation and Benefits

The minimum and maximum full-time annual salaries for this role are:

  • New York City (Hybrid On-Site): $321,500 - $366,900 for Sr Distinguished Machine Learning Engineer; $274,800 - $313,600 for Distinguished Machine Learning Engineer
  • San Francisco, California (Hybrid On-Site): $340,500 - $388,700 for Sr Distinguished Machine Learning Engineer; $291,100 - $332,300 for Distinguished Machine Learning Engineer