Data Engineer

  • Full Time
Job expired!
We are looking for risk-takers, collaborators, the inspired and the inspirational. We want individuals who are brave enough to work at the cutting edge and create solutions that will enrich and improve the lives of people across the globe. If you want to make the world say wow, let's talk. The conversation starts here.

If this role matches your ambitions and skill set, let's get started with your application. Feel free to explore our other open positions as well. Our numerous opportunities can lead to infinite possibilities.

Job Title: Data Engineer
Project Details: This project involves designing and developing data delivery on Sony Music Publishing’s data warehouse on AWS.
Technology and Sub-technology: AWS
Base Location: Bengaluru
Type: Hybrid
Qualifications: BE/B.Tech in Computer Science and 4+ years of experience.

Job Overview:
The Data Engineer is responsible for designing and developing data delivery on our data warehouse on AWS. This data will be used in visual dashboards/reports that Sony Music Publishing teams use to better understand trends and insights to improve market share/songwriter deals.

Primary Skills:
- Experience in data architecture including data modeling, data mining and data ingestion.
- Experience with AWS associated technologies (S3 buckets, Glue, Data Pipeline, DMS, RDS, Redshift, Aurora, Lambda).
- Knowledge of creating ETL scripts with languages such as Python, Node.js, SQL.
- Experience in data warehousing and big data.
- Experience with Relational databases (SQL Server).
- Experience working in Agile/Scrum teams.

AWS Services/Skills Competency:
- Python: Intermediate
- PySpark: Intermediate
- EMR/Glue: Advanced
- CI/CD: Intermediate
- Serverless Framework: Intermediate
- CloudFormation Templates: Intermediate
- Redshift: Advanced
- Lambda: Advanced
- Step Functions: Advanced
- CloudWatch: Intermediate
- Elasticsearch/OpenSearch: Advanced
- Kibana: Advanced
- Kinesis: Advanced
- Redshift Spectrum: Advanced
- DMS: Advanced

Good to have Skills:
- PySpark
- CI/CD
- CloudFormation Templates

Responsibilities and Duties:
- Work with product owners, developers and the AWS Infrastructure team to design and develop ETL processes.
- Automate and optimize processes as much as possible.
- Work in an Agile/Scrum team.
- Solve problems and propose solutions.
- Use established Analytics standards/processes in ETL processes.
- Communicate with technical and business teams.
- Learn new technology quickly.

Keywords: Python, PySpark, EMR/Glue, CI/CD, Serverless Framework, CloudFormation Templates, Redshift, Lambda, Step Functions, CloudWatch, Elasticsearch/OpenSearch, Kibana, Kinesis, Redshift Spectrum, DMS.