You will be participating in exciting projects that cover the entire data lifecycle - from integration of raw data with primary and third-party systems, through advanced data modeling, to state-of-the-art data visualization and development of innovative data products.
You will have the opportunity to learn how to build and work with both batch and real-time data processing pipelines. You will work in a modern cloud-based data warehousing environment alongside a team of diverse, intense, and interesting coworkers. You will liaise with other departments - such as product & tech, core business verticals, trust & safety, and finance - to enable them to succeed.
Your responsibilities
- Design, implementation, and support of data warehousing
- Integration of raw data with primary and third-party systems
- Data warehouse modeling for operational and application data layers
- Development within an Amazon Redshift cluster
- SQL development as part of an agile team workflow
- ETL design and implementation in Matillion ETL
- Real-time data pipelines and applications using serverless and managed AWS services such as Lambda, Kinesis, and API Gateway
- Design and implementation of data products enabling data-driven features or business solutions
- Building data dashboards and advanced visualizations in Sisense for Cloud Data Teams (previously known as Periscope Data) with a focus on UX, simplicity, and usability
- Cooperation on data products with other departments, such as product & technology, marketing & growth, finance, core business, and advertising
- Being part of and contributing to a strong team culture and ambition to be at the forefront of big data
- Evaluation and improvement of data quality through the implementation of test cases, alerts, and data quality safeguards
- Upholding the team values: Simpler. Better. Faster.
- A strong desire to learn
Required minimum experience (mandatory)
- 1-2 years of experience in data processing, analysis, and problem-solving with large volumes of data
- Good SQL skills across various relational data warehousing technologies, especially cloud data warehouses (for instance, Amazon Redshift, Google BigQuery, Snowflake, or Vertica)
- At least 1 year of experience with one or more programming languages, preferably Python
- Ability to communicate insights and findings to a non-technical audience
- Written and verbal proficiency in English
- Entrepreneurial mindset and creative thinking capacity; highly driven and self-motivated; a strong curiosity and a constant drive to learn
- A top-class university technical degree in a field such as computer science, engineering, mathematics, or physics
Additional experience (a significant plus)
- Experience working with customer-centric data at a large scale, preferably in an online / e-commerce environment
- Experience with modern big data ETL tools (for instance, Matillion)
- Experience with AWS data ecosystem (or other cloud providers)
- A proven track record in business intelligence solutions, development and scaling of data warehouses, and data modeling
- Tagging, tracking, and reporting using Google Analytics 360
- Knowledge of modern real-time data pipelines (for instance, Serverless Framework, AWS Lambda, or Kinesis)
- Experience with modern data visualization platforms such as Periscope, Looker, Tableau, or Google Data Studio
- Knowledge of Linux, Bash scripting, JavaScript, HTML, XML
- Experience with Docker containers and Kubernetes