Junior Big Data Engineer

Job expired!

Join Similarweb as a Junior Big Data Engineer in Prague!

About Similarweb

At Similarweb, we are revolutionizing the digital landscape by providing businesses with unparalleled insights into online activities. Our cutting-edge platform and unique data empower over 4,300 customers globally, including industry giants like Google, eBay, and Adidas, to make game-changing decisions that drive their digital strategies. We went public on the New York Stock Exchange in 2021 and have been growing rapidly ever since!

Why Work with Us?

We're expanding our global footprint with a brand-new site in Prague, a hub of Europe's top tech talent. We're in search of exceptional pioneers to help build a dynamic team. At Similarweb, you’ll innovate swiftly, collaborate with brilliant minds, tackle significant challenges, work with cutting-edge technologies and data at scale, and make a tangible impact on some of the world's most innovative companies.

Position: Junior Big Data Engineer

Are You the Right Fit?

Whether you are an experienced software engineer looking for a new challenge or are interested in transitioning into the dynamic world of data engineering, we have the perfect role for you. We’re seeking passionate individuals with a background in software engineering to join our team as Data Engineers. Don’t worry if you lack experience in data engineering – we’ll provide you with a comprehensive bootcamp to equip you with all the necessary skills.

Reporting to our Team Manager, R&D, this role offers the perfect opportunity to leverage your backend engineering experience and dive into the exciting field of big data and cutting-edge technologies.

Why is This Role Critical?

As a Big Data Engineer at Similarweb, you’ll work closely with data scientists to optimize machine learning algorithms, streamline code workflows, and architect resilient infrastructure solutions. Your backend engineering expertise will be instrumental in shaping the future of our data-driven initiatives. Our bootcamp program gives you the support you need to succeed, so that your contributions can play a pivotal part in extracting actionable insights from vast datasets and driving innovation in the digital realm.

Key Responsibilities

As part of the R&D team, your daily tasks may include:

  • Collaborating with data scientists to understand algorithmic requirements and optimize code for performance and scalability.
  • Designing and implementing robust data pipelines to efficiently process and analyze large volumes of structured and unstructured data.
  • Architecting scalable and resilient infrastructure solutions for data-intensive workloads and machine learning workflows.
  • Continually evaluating and integrating new technologies to enhance our data processing capabilities.
  • Collaborating with cross-functional teams to drive innovation and provide impactful data-driven solutions.

About the R&D Team

As a member of our R&D team, you will tackle challenging algorithmic problems without a playbook, offering solutions for real customers and working alongside top engineers and scientists in the industry.

Ideal Candidate

This role is perfect for someone who:

  • Is passionate about data.
  • Has excellent communication skills (in English), enabling constant dialogue within and across data teams.
  • Has at least 2 years of development experience in software or data engineering using languages like Python, Java, or Scala.
  • Possesses strong programming skills and knowledge of Data Structures, Design Patterns, and Object-Oriented Programming.
  • Has experience with containerization technologies like Docker.
  • Understands CI/CD practices and has experience with Git.
  • Is familiar with SQL and NoSQL databases.
  • Is acquainted with a cloud provider (AWS, Azure, or GCP).
  • Can prioritize tasks effectively and work independently or as part of a team.
  • Has a strong sense of ownership over the team's products.
  • Thrives in a fast-paced, dynamic environment.

Advantages: Experience with Big Data technologies and frameworks such as Spark, Airflow, Kafka, Parquet, Databricks, EMR, and Kubernetes.

Why You’ll Love Working at Similarweb

  • Product Passion: Our employees are our best advocates. When polled, 99% cited "the product" as a key reason for joining us. Imagine working with the world's most powerful