AWS/Apache/ETL Data Engineer


Hello, and we're glad you decided to apply for this job! We are Reveal Healthtech, a growing early-stage IT startup based in the US and India.

We are on a mission to "reveal" the transformative potential of technology in healthcare delivery. We are looking for passionate individuals like you to join us in this mission and leverage technology in the most innovative ways possible to meet our clients' needs through partnership with us.

Learn more about why we started Reveal Healthtech on our website.

If all of this resonates with you and the idea of building a business from scratch excites you, read on for the job description 😎

Please note this is a hybrid role. #LI_Hybrid.

Requirements

What are we looking for?

We are seeking a talented Data Engineer to join our growing team. As a Data Engineer, you will play a crucial role in designing, implementing, and maintaining our infrastructure. You will work cross-functionally to automate processes, improve system reliability, and scale our platform. In this role, you will get the chance to leverage your technical expertise to help us accomplish our mission of partnering with our clients and meeting their needs in the most innovative way possible.


Responsibilities

As the data engineer responsible for maintaining our data lake, you will have the following key responsibilities:

  1. Data Ingestion: You will be tasked with collecting and extracting data from multiple sources, such as databases, APIs, or files, and loading it into the data lake. This involves understanding the data sources, designing data ingestion pipelines, and ensuring data quality and integrity during the process.
  2. Data Transformation and Preparation: Once the data is in the data lake, you will need to transform and prepare it for analysis. This may involve cleaning, filtering, aggregating, or joining data sets to create a unified and reliable dataset that can be used for reporting and analysis.
  3. Data Modeling and Architecture: You will be responsible for designing and implementing the data lake's schema and structure. This includes defining data tables and partitions and optimizing storage mechanisms for efficient data retrieval and analysis.
  4. Data Governance and Security: Maintaining data privacy, security, and compliance are critical aspects of your role. You will need to establish and enforce data governance policies, implement access controls, and ensure data is handled in accordance with relevant regulations and best practices.
  5. Tableau Integration: Using Tableau, you will build interactive visualizations and reports based on the data in the data lake. This involves creating dashboards, charts, and other visual representations to enable effective data exploration and analysis for end-users.
  6. Performance Optimization: You will continuously monitor and optimize the performance of data pipelines, data lake storage, and Tableau dashboards. This may involve tuning queries, indexing data, or optimizing Tableau workbooks to ensure efficient and timely access to data.
  7. Collaboration and Documentation: Effective collaboration with other teams, such as data scientists, analysts, or business stakeholders, is crucial. You will need to understand their requirements, provide support, and document data pipelines, transformations, and Tableau configurations for future reference.

Key Skills and Qualifications

  • Qualifications: Bachelor's degree in Computer Science, Engineering, or a related field. A master's degree is a plus.
  • Proven experience as a Data Engineer or in a similar role
  • Strong programming skills in a JVM language (Java, Kotlin, and/or Scala), as well as SQL and Python
  • Experience with data integration and ELT or ETL tools like Apache Spark, Airbyte, Meltano, or Singer
  • Familiarity with big data processing frameworks like Apache Kafka, Apache Spark, or Apache Flink
  • Proficiency in working with relational databases, data warehouses, data lakes (Iceberg/S3 and Parquet), and distributed storage systems
  • Knowledge of data modeling, data warehousing concepts, and dimensional modeling


What do you bring to enrich us?

  • Energy and enthusiasm
  • Autonomous and self-motivated
  • Growth mindset
  • Enjoy facing challenges
  • Passion for building new things
  • Curiosity and deep interest in the world
  • Ability to constructively challenge the status quo

Benefits

What do we offer in return?

  • Membership in a growing, amazing team - A great opportunity to be part of the early stages of a new technology services & product engineering company, with high growth potential for its first members.
  • Trust over control - We believe in strong business fundamentals and have a vision for scaling from day one. We value our people highly and believe in their potential in this role.
  • Several learning opportunities on the job and beyond - We already have a curated list of courses you can dive right into!
  • Industry-competitive compensation and benefits

Next Steps

  • Send us your updated CV - if you can mention how you have enriched your previous company in a cover letter, that would be great!
  • If your profile fits what we're looking for, a member of our Talent team will reach out to you to understand your profile and interests and how they align with the role.
  • You will complete two tasks - a computer-based reasoning test and a technical case study, followed by a job-based interview in the subsequent rounds.
  • Finally, you will have a chat with our founder to find out more about us and determine if this is the right next career move for you!