Senior Software Engineer, Data

Job expired!

Join AssemblyAI: Transforming AI with Cutting-Edge Audio Data Transcription

About AssemblyAI

At AssemblyAI, we’re pioneering the field of Applied AI by developing advanced models to transcribe and understand audio data seamlessly through a straightforward web API. With the rapid advancements in AI technology, we constantly stay attuned to the latest research breakthroughs, integrating these innovations into our production-ready AI models. Our Automatic Speech Recognition (ASR) technology surpasses industry leaders like Google, AWS, and Microsoft, making our API a preferred choice for hundreds of companies and thousands of developers who transcribe and analyze millions of videos, podcasts, phone calls, and Zoom meetings daily.

Backed by top-tier investors such as Accel, Insight Partners, Y Combinator’s AI Fund, Patrick and John Collison, Nat Friedman, and Daniel Gross, AssemblyAI is poised to become a leader in the burgeoning applied AI market. Join our exceptional remote team and contribute to building a landmark AI company!

About the Role: Senior Software Engineer, Data

AssemblyAI is seeking a talented Senior Software Engineer to join our Data Infrastructure team. This position offers the opportunity to shape the vision, scope, and structure of the team and to influence the architecture and capabilities you build. Candidates should have a robust background in Data Engineering coupled with experience in software engineering, including best practices such as testing strategies and code reviews.

Because this is a cross-functional role, it requires close collaboration with our Research and Data Operations teams. The ideal candidate should be adept at managing relationships with different stakeholders and at communicating complex ideas effectively to varied audiences.

Responsibilities

  • Develop and maintain Data Platforms for our Research Team, managing tools like Airflow, BigQuery, Dataproc, and Dataflow.
  • Design and implement highly scalable data pipelines on distributed computing platforms using Google Cloud Platform (GCP).
  • Contribute to the development of our multimedia AI Lakehouse.
  • Enhance our Data Lineage System.
  • Create internal tools to help other teams visualize, use, and understand large datasets.
  • Implement guardrails to optimize cost, data quality, usability, and speed.

Requirements

  • 5+ years of professional software engineering experience, demonstrating clean, maintainable, and well-tested code.
  • 3+ years of experience as a Data Engineer or in a similar role.
  • Proficiency with Bigtable, BigQuery, Dataproc, Dataflow, Dataplex, and Cloud Composer, among other GCP services.
  • Familiarity with distributed data processing frameworks like Apache Beam and Apache Spark, with a deep understanding of both batch and stream processing.
  • Experience with workflow automation tools such as Airflow or managed solutions like Composer and Astronomer.
  • Fluent in Python and SQL.
  • Experience building internal applications and tools for developers or researchers.
  • Proven experience with Data Lineage systems.
  • Hands-on experience with Terraform, Docker, Kubernetes, and CI/CD.
  • Knowledge of GCP IAM patterns and best practices.
  • Experience with Mage or Prefect is a plus.

Pay Transparency

At AssemblyAI, we strive to attract and retain exceptional talent from diverse backgrounds with a commitment to pay equity. Our salary ranges are designed to be competitive within our size, stage, and industry while forming part of a holistic set of compensation and benefits designed to reward our team.

Salary determinations consider factors such as experience, skill level, qualifications, and internal equity. While the range provided is a general expectation for the listed role, we remain open to candidates with varying levels of experience and will communicate any updates to the range as necessary. Please note that the salary range is specified for U.S. candidates; international candidates may see adjustments relevant to their location:

Salary Range: $180k - $240k

Life at AssemblyAI

We are a dedicated team of problem solvers, innovators, and AI researchers with over 20 years of expertise in Machine Learning, Speech Recognition, and NLP. As a fully remote team, we seek ambitious, curious, and self-motivated individuals. We emphasize trust and autonomy, ensuring everyone has the opportunity to excel.

We are committed to creating an inclusive work environment