Software Engineer - Machine Learning Infrastructure

  • Full Time

Who we are

About Stripe

Stripe is a financial infrastructure platform for businesses. Millions of companies, from the largest global enterprises to the boldest startups, use Stripe to accept payments, grow their revenue, and accelerate new business opportunities. Our mission is to increase the GDP of the internet, and there is a monumental amount of work ahead. This means you have an unprecedented opportunity to make the global economy accessible to everyone while doing the most important work of your career.

About the team

The Machine Learning Infrastructure group at Stripe strives to provide cutting-edge infrastructure and support for building and operationalizing AI/ML models across all company verticals. These include, but are not limited to, models that reduce risk across Stripe’s products and services globally and models that help our customers combat fraud through Stripe’s user-facing products such as Radar and Identity. ML is a major focus for Stripe in the coming years. With the incredible advances taking place in AI, we are prepared to accelerate the adoption of AI/ML across the company by building highly scalable and reliable foundational infrastructure.

What you’ll do

You will work closely with machine learning engineers, data scientists, and platform infrastructure teams to build powerful, flexible, and user-friendly systems that significantly increase velocity across the company.

Responsibilities

  • Construct powerful, flexible, and user-friendly infrastructure that fuels all of ML at Stripe.
  • Design and build fast, reliable services for ML model training and serving, and scale that infrastructure across multiple regions.
  • Develop services and libraries that enable ML engineers at Stripe to seamlessly shift from experimentation to production across Stripe’s systems.
  • Collaborate with product teams and ML engineers to develop easy-to-use infrastructure for production ML models.

Who you are

We’re looking for people with a strong background or interest in building successful products or systems. You’re enthusiastic about solving business problems and making an impact, you’re comfortable managing many moving parts, and you’re comfortable learning new technologies and systems. Several of our engineers work remotely from the US and Canada, and we’d be willing to discuss the possibility of remote work with you.

We don’t expect any single candidate to be proficient in all of these areas. For instance, we have wonderful team members who are deeply focused on their customers’ needs and on building amazing user experiences, but who didn’t join with a wealth of systems knowledge.

Minimum requirements

  • More than five years of experience building software applications in large-scale distributed systems.
  • A strong sense of curiosity and a desire to both learn and share knowledge with your peers. We appreciate working in a collaborative environment and we hope you do too.
  • A solid engineering background and experience with infrastructure and/or distributed systems. You’ll work mostly in Python and Java but we care more about your overall engineering skills than your knowledge of a particular language.
  • Comfort with the full life cycle of software development, from design and implementation to testing and deployment.
  • Experience building and maintaining low-latency, high-availability systems, with attention to reliability, testing, and observability.
  • A sense of practicality: you know when to strive for the ideal solution and when to adjust your approach.

Preferred qualifications

  • More than two years of experience supporting machine learning infrastructure.
  • Experience optimizing the end-to-end performance of distributed systems.
  • Experience training and deploying machine learning models to production to solve crucial business issues.