Staff AI Infrastructure Engineer

AI/ML Infrastructure Engineer Job Opening at XPeng Motors

XPeng Motors is a leading smart electric vehicle (EV) company in China. We design, develop, and manufacture smart EVs, seamlessly integrating advanced Internet, AI, and autonomous driving technologies. Committed to in-house R&D and intelligent manufacturing, we aim to revolutionize mobility for our customers through technology and data.

Job Title: AI/ML Infrastructure Engineer

We are seeking a talented AI/ML Infrastructure Engineer to improve the productivity of our ML teams. In this role, you will identify and resolve infrastructure gaps so that our research and development operations run on reliable, efficient, and scalable systems.

Key Responsibilities:

  • Identify and resolve infrastructure gaps for reliable and scalable solutions.
  • Develop AI/ML infrastructure to enhance the efficiency of our ML teams.
  • Design solutions for critical areas such as distributed storage, scheduling systems, high availability, and core reliability for large-scale GPU clusters.
  • Monitor and optimize AI/ML infrastructure performance, ensuring high availability and efficient resource utilization.
  • Develop and deploy automation tools, monitoring solutions, and operational strategies to streamline infrastructure management.
  • Collaborate with ML developers, data engineers, and DevOps professionals to create a cohesive AI/ML infrastructure ecosystem.

Minimum Skill Requirements:

  • Bachelor's degree in Computer Science, Engineering, or a related technical field.
  • 5-8+ years of experience in software engineering with strong expertise in large-scale distributed systems, preferably within the AI/ML domain.
  • Proficiency in programming languages such as Python, Go, or C++, and knowledge of cloud platforms like AWS or Azure.
  • Strong communication and collaboration abilities for working with diverse teams.

Preferred Skill Requirements:

  • Deep understanding of AI/ML workflows including model training, data processing, and inference pipelines.
  • Experience with containerization technologies (Docker, Kubernetes), automation tools (Ansible, Terraform), and monitoring solutions (Prometheus, Grafana).
  • Exceptional problem-solving skills to analyze complex systems and implement scalable solutions.
  • A passion for continuous learning and staying updated with new technologies and best practices in AI/ML infrastructure.

What We Offer:

  • A fun, supportive, and engaging work environment.
  • The opportunity to significantly impact autonomous driving and the transportation revolution.
  • The chance to work with cutting-edge technologies and top talent in the field.
  • Competitive compensation package.
  • Snacks, lunches, and fun activities.

The base salary range for this full-time position is $180,000-$300,000, in addition to bonus, equity, and benefits. Salary ranges are determined by role, level, and location. Within this range, individual pay is determined by work location and other factors like skills, experience, and relevant education or training.

XPeng Motors is an Equal Opportunity Employer, dedicated to providing equal employment opportunities to all qualified individuals without regard to race, age, color, sex, sexual orientation, religion, national origin, disability, veteran status, or marital status.
