Join Groupon as a Software Development Engineer III - ETL Engineer
Company: Groupon
Groupon is a leading marketplace that connects people with the best their city or destination has to offer. Through real-time mobile commerce across local businesses, live events, and travel destinations, Groupon helps individuals discover experiences that make life full, fun, and rewarding. By supporting local businesses, we foster strong customer relationships and vibrant communities. Despite having thousands of employees worldwide, our culture inspires innovation, rewards risk-taking, and celebrates success.
Why Work at Groupon?
At Groupon, we understand that great people make great companies. We blend the best of both worlds by combining the resources and scale of a big company with the autonomy and impact potential of a smaller organization. We are curious, fun, a little intense, and deeply committed to helping local businesses thrive.
About the Data Ingestion Team
The Data Ingestion team at Groupon is central to all things data. We define and build next-generation, cloud-based solutions to ingest and curate petabytes of data into our data lake and data warehouse. Our mission is to empower data analysts and data scientists across all business units to make informed business decisions. In this role, you will have the chance to combine your skills in computer science, distributed systems, and scalable, high-performance production systems.
Your Responsibilities
- Design and develop high-volume batch/streaming data ingestion pipelines and frameworks spanning AWS and GCP data platforms.
- Conceive, code, and launch next-generation data ingestion and curation platforms.
- Participate in defining requirements, system, and data architectural discussions.
- Technically lead and mentor junior engineers in software development and data engineering best practices.
- Collaborate with cross-functional agile teams, including software engineers, data engineers, ML experts, and data analysts.
What We're Looking For
We’re excited about you if you have:
- BS/MS in Computer Science with 5+ years of experience in distributed systems, data engineering, or software engineering.
- Strong coding skills in languages such as Python and Java.
- Experience with SQL and NoSQL databases (Teradata, MySQL, Postgres, Cassandra, Elasticsearch, HBase, etc.).
- Familiarity with UNIX command line.
- Experience in big data technologies like Apache Spark, Hadoop, Hive, etc.
- In-depth experience with ETL, data lineage, and data quality issues.
- Experience in batch and streaming data pipelines.
- Experience with AWS or GCP cloud technologies, especially related to data processing at scale.
- Strong understanding of production services with tight SLAs and CI/CD practices.
- Excellent written and verbal communication skills.
- Passion for building data platforms and a desire to innovate.
Nice to Haves
- Experience with Airflow and Apache Kafka.
- Knowledge of Scala.
- Experience with HBase and Redis cache.
- Experience in Google Data Streams and Google Dataproc.
- Experience with change data capture (CDC) technologies.
- Experience with modern data warehouse technologies like Delta Lake and Hudi frameworks.
- Experience developing distributed applications with central logging, metrics, monitoring, and alerting tools.
Important Traits
- Customer-focused: Doing what’s right for the customer drives our business forward.
- Quality-obsessed: Your code just works and scales linearly.
- Team players: You work well with others, provide and receive constructive feedback.
- Fast learners: You adapt quickly to new technologies and apply them to solve problems.
- Pragmatic: You understand when it's appropriate to take shortcuts without sacrificing quality.
- Owners: You know how to make a positive impact on the business.
Groupon’s purpose is to build strong communities through thriving small businesses. To learn more about the