
Principal Data Engineer - AWS
Role summary
We are seeking a Principal Data Engineer with extensive experience in data engineering and a strong background in AWS. The role involves designing and building large-scale, production-grade data platforms, leveraging core AWS data services like S3, Glue, Redshift, EMR, Lambda, Step Functions, Kinesis, and MSK. Proficiency in Python, Scala, or Java, along with deep experience in ETL/ELT frameworks and pipeline orchestration tools such as Airflow, is essential. The ideal candidate will have solid knowledge of data modeling, distributed systems, performance tuning, and experience with CI/CD and Infrastructure as Code (Terraform, CDK, CloudFormation). Strong communication skills are required to effectively convey technical information to non-technical audiences.
- 10+ years of experience in data engineering or software engineering, including 4-5 years of dedicated hands-on AWS experience
- Proven experience designing large-scale, production-grade data platforms
- Expertise with core AWS data services such as:
  - Amazon S3, Glue, Redshift, EMR
  - Lambda, Step Functions
  - Kinesis / MSK
- Strong programming skills in Python, Scala, or Java
- Deep experience with ETL / ELT frameworks and data pipeline orchestration tools (Airflow, AWS-native equivalents, etc.)
- Solid knowledge of data modeling, distributed systems, and performance tuning
- Experience with CI/CD and Infrastructure as Code (Terraform, CDK, CloudFormation)
- Strong communication skills with the ability to translate technical concepts for non-technical stakeholders