Tuppl Verified
Real Estate, Software Development, PropTech

Sr. Data Engineer (Senior Level) – AWS & Streaming (Visa-independent candidates only)

Fort Mill, South Carolina, United States | Hybrid | Contract | Senior | Posted 2 months ago | Visa sponsorship available

Role summary

Seeking a Mid-Senior Data Engineer with strong expertise in AWS-based data engineering, real-time streaming technologies, and enterprise-grade data quality frameworks. The role involves designing, building, and optimizing scalable batch and streaming data pipelines, implementing robust data validation and monitoring processes, and supporting mission-critical analytics platforms. Key responsibilities include developing ETL/ELT pipelines using AWS Glue, PySpark, and Python, building event-driven workflows with AWS Lambda, and managing real-time streaming solutions with Kafka, KSQL, and Apache Flink. The engineer will also implement and enforce data quality frameworks and optimize data processing performance, scalability, reliability, and cost in cloud environments.

Job Title: Sr. Data Engineer (Senior Level) – AWS & Streaming

Experience Level: 15+ years

Location: Fort Mill, SC (3 days hybrid)

Visa-independent candidates only

Employment Type: Contract

Role Summary:

We are seeking a Mid-Senior Data Engineer with strong expertise in AWS-based data engineering, real-time streaming technologies, and enterprise-grade data quality frameworks. The ideal candidate will design, build, and optimize scalable batch and streaming data pipelines, implement robust data validation and monitoring processes, and support mission-critical analytics platforms.

Key Responsibilities:

• Develop and maintain scalable ETL/ELT pipelines using AWS Glue, PySpark, and Python

• Build event-driven workflows using AWS Lambda

• Design and manage real-time streaming solutions using Kafka, KSQL, and Apache Flink

• Implement and enforce comprehensive data quality frameworks, including validation, profiling, monitoring, and reconciliation

• Optimize data processing performance, scalability, reliability, and cost in cloud environments

• Collaborate with cross-functional teams to deliver reliable, production-grade data platforms and ensure data integrity across the pipeline

Required Skills:

• Strong hands-on experience with Python and PySpark

• Proven expertise in AWS Glue, Lambda, and other cloud-native data services

• Solid experience with the Kafka ecosystem (topics, partitions, consumer groups, streaming patterns)

• Demonstrated experience building and supporting data quality frameworks (validation rules, reconciliation checks, profiling, anomaly detection)

• Strong understanding of distributed data processing and scalable architecture patterns

Good-to-Have Skills:

• Experience with Apache Flink for real-time stream processing and stateful computations

• Knowledge of KSQL or other streaming SQL engines

• Exposure to CI/CD pipelines, IaC (Terraform/CloudFormation), and DevOps practices

• Familiarity with data lake/lakehouse architectures and table formats such as Iceberg, Delta, or Hudi

• Experience working in enterprise or financial data environments
