Job Description
The range for this position (mid) is 12,300 - 17,600 PLN gross (employment contract)
A hybrid work model that incorporates solutions developed by the leader and the team
The Data Science Hub (DSH) is where we solve various business problems using analytical techniques and machine learning. We deliver insights and make decisions based on terabytes of data processed daily. Our team is an excellent place for people who seek continual development opportunities and a unique chance to gain interdisciplinary knowledge about how e-commerce platforms work. The variety of impacted business domains is best described by a diverse portfolio of projects, including:
- logistics - delivery time prediction, logistic network optimization
- marketing - category recommendation, next purchase prediction
- pricing - price optimization
- finance - sales forecasting
- and many more…
The Data Science Hub consists of 5 teams:
- 3 Data Science teams
- Data Analytics team
- Data Engineering team
We are hiring for the Data Engineering team, where we focus on data processing and preparation, deployment and maintenance of our projects, and sharing our skills with the rest of the team.
Join our team to enhance your skills related to deploying advanced data processing techniques and Machine Learning approaches.
We are looking for people who:
- Have the ability to fluently work with SQL in traditional engines (e.g., MySQL, PostgreSQL) or cloud engines (e.g., BigQuery, Snowflake). You will be working with SQL on a daily basis.
- Have experience in Python programming and are familiar with software engineering best practices (PEP8, clean architecture, code review, CI/CD etc.)
- Have a positive attitude and ability to work in a team
- Are eager to continuously develop and broaden their knowledge
Additionally, it would be advantageous if you have:
- Experience with Big Data ecosystem (Spark, Airflow)
- Knowledge of BigData tools in Google Cloud Platform or other public clouds (e.g AWS, Azure)
- Commercial experience in DevOps and CI/CD practice (e.g. GitHub Actions) in the area of ML/AI
- Experience with cloud applications architecture
Our tech stack:
- Python
- Google Cloud Platform (AirFlow, BigQuery, Composer)
- GitHub (code storage, CI/CD, hosting our own Data Science Python library)
What we offer:
- A hybrid work model that you will agree with your leader and the team. We have well-located offices (with fully equipped kitchens and bicycle parking facilities) and excellent working tools (height-adjustable desks, interactive conference rooms).
- Annual bonus up to 10% of the annual gross salary (depending on your annual assessment and the company's results).
- A wide selection of fringe benefits in a cafeteria plan – you choose what you like (e.g., medical, sports or lunch packages, insurance, purchase vouchers).
- Paid English classes related to the specific nature of your job.
- Work with a team you can always count on — we have top-class specialists and experts aboard.
- A high degree of autonomy in organizing your team’s work; we encourage continuous development and trying new things.
- Hackathons, team outings, training budget, and an internal educational platform, MindUp (including training courses on work organization, means of communication, motivation to work, and various technologies and subject matters).
Your responsibilities will include:
- Actively building data processing tools for modeling and analysis – in close cooperation with both Data Science teams.
- Assisting both Data Science teams in developing data sources for ad-hoc analyses and Machine Learning projects.
- Processing terabytes of data using Google Cloud Platform BigQuery, Composer, Dataflow, and , while optimizing processes in terms of their performance and GCP cloud processing costs.
- Collecting process requirements from project groups, and automating tasks related to preprocessing and data quality monitoring, prediction serving, as well as Machine Learning model monitoring and retraining.
- Maintaining engineering quality for each project and cooperating with your colleagues on engineering excellence.
Why work with us?
- You will have a significant impact on one of the world's largest e-commerce platforms through provided data and processes.
- Given the wide range of projects we involve in, you will never run out of interesting challenges.
- You will have access to vast datasets (measured in petabytes).
- You will get to work with a team of experienced engineers and BigData specialists who are willing to share their knowledge (incl. with the general public, as part of allegro.tech).
- Your professional growth will follow the most recent open-source technological trends.
- You will have an actual impact on product development directions and technology selection – we use the most recent and best technological solutions because they closely align with our needs.
- We are a full-stack provider – we design, code, test, deploy and maintain our solutions.
Apply to Allegro and see why it's #goodtobehere