
Sr. AWS Data Engineer
Role summary
We are seeking a Senior AWS Data Engineer to design, build, and operate scalable, cloud-native data platforms for batch and streaming use cases, emphasizing governance, performance, and reliability. The role requires strong hands-on experience with Python, PySpark, SQL, and a suite of AWS services including Glue ETL, Lake Formation, Step Functions, Lambda, Aurora, DynamoDB, and S3. Responsibilities include developing ETL jobs, optimizing Spark performance, implementing data quality checks, and managing data pipelines. A working knowledge of Informatica PowerCenter and SQL Server is needed for migration support. This is an onsite, full-time position requiring 8+ years of IT experience, with a strong emphasis on AWS Cloud, SQL, and Python, along with good experience in Kafka/Flink and Airflow.
- Please note, this role is not able to offer visa transfer or sponsorship now or in the future\*
Role Tile: Senior AWS Data Engineer
About The Role
Design, build, and operate scalable, cloud‑native data platforms supporting batch and streaming use cases, with strong focus on governance, performance, and reliability.
Responsibilities
- Programming
- Python: Strong hands-on experience with Python for data engineering tasks, including scripting, automation, and ETL logic development
- PySpark: Proficiency in writing and optimizing PySpark jobs for large-scale data transformations
- SQL: Advanced SQL skills for data querying, transformation logic, and stored procedure conversion from SQL Server
- Big Data Processing Frameworks
- Apache Spark: Strong experience with Spark core concepts — RDDs, DataFrames, Datasets, partitioning, and performance tuning
- Data partitioning and optimization: Experience with data skew handling, broadcast joins, caching strategies, and Spark tuning
- AWS Services (Hands-On Experience Required)
- AWS Glue ETL: Developing and deploying Glue jobs (Python Shell and Spark), job bookmarks, dynamic frames, and custom connectors
- AWS Glue Data Catalog: Managing databases, tables, crawlers, classifiers, and schema versioning
- AWS Lake Formation: Configuring data lake permissions, fine-grained access control, and data filtering
- AWS Step Functions: Designing and implementing state machines for ETL workflow orchestration, error handling, and retry logic
- AWS Lambda: Writing serverless functions for event-driven triggers, lightweight transformations, and pipeline utilities
- Amazon Aurora: Working with Aurora PostgreSQL compatible for relational data storage and query optimization
- Amazon DynamoDB: Designing and querying NoSQL tables
- Amazon S3: Proficiency in S3 data lake design — partitioning strategies, storage classes, lifecycle policies, and S3 event notifications
- AWS IAM: Understanding of roles, policies, and least-privilege access patterns relevant to data pipeline security
- ETL Development & Migration
- Informatica PowerCenter (working knowledge): Ability to read and interpret Informatica workflows, sessions, mappings, and transformations to support conversion to AWS Glue
- ETL framework development: Experience building reusable, configurable ETL frameworks with logging, error handling, retry mechanisms, and metadata-driven execution
- Data pipeline design patterns: Familiarity with incremental loads, CDC (Change Data Capture), full loads, and SCD (Slowly Changing Dimensions)
- SQL Server (working knowledge): Ability to understand SQL Server schemas, stored procedures, and SSIS packages for migration analysis
- Data Engineering Best Practices
- Data quality and validation: Implementing data quality checks, reconciliation logic, and exception handling within pipelines
- Metadata-driven frameworks: Building configurable pipelines driven by metadata stored in Aurora or DynamoDB
- Logging and observability: Integrating CloudWatch logging, custom metrics, and alerting into data pipelines
- Unit and integration testing: Writing test cases for ETL logic using frameworks such as pytest
- Version control: Proficiency with Git for source code management, branching strategies, and code reviews
What You Need To Have To Be Considered
- 8+ years in IT related role
- Strong hands on experience in AWS Cloud, SQL and Python
- Good experience with Kafka/Flink, AWS Glue and Airflow
Work model:
On Site
At Cognizant, we strive to provide flexibility wherever possible, and we are here to support a healthy work-life balance though our various wellbeing programs. Based on this role’s business requirements, this is an onsite position requiring 5 days a week in a client or Cognizant office in Charlotte, NC.
Sample Cognizant interview questions
- 1
Implement a platform for handling live user authentication.
system designmedium - 2
How would you explain the purpose and functionality of GitHub to someone unfamiliar with coding or version control systems?
technicalmedium - 3
Determine if a string can be a palindrome after deleting at most one character.
codingmedium - 4
Maximize the minimum distance between aggressive cows in stalls.
codingmedium - 5
Unique Combinations that Sum to a Target Find all unique combinations in an array that sum to a target. Input: candidates = [2,4,6], target = 6 Output: [[2,2,2], [2,4], [6]] Explanation: Uses backtracking to find all valid combinations that sum to 6, allowing for explicitly repeated elements.
codingmedium
Sign up for a personalized interview prep pack tailored to this role.
Similar roles
- AWS Data EngineerOpenkyber · Alaska, Alaska, United States · Remote
- AWS Data EngineerThe Phoenix Group · United States · Remote
Senior AWS Data EngineerAdastra · Markham, Ontario, Canada · Hybrid
AWS Data EngineerIris Software Inc. · Toronto, Ontario, Canada · Hybrid
AWS Data EngineerIris Software Inc. · Toronto, Ontario, Canada · Hybrid