Realign logo
Realign Verified
Software, Business Process Management, Enterprise Architecture

Hadoop Data Engineer-7

Addison, Texas, United StatesOnsiteFull Time$115,000–$115,000 /yrPosted 2 months ago

Is this role right for you?

Upload your resume and get a skill-by-skill breakdown — see exactly where you match, where you're close, and what to highlight. Not a mystery percentage.

Get a tailored resume highlighting what this role needs.

Role summary

We are seeking a seasoned Hadoop Data Engineer with a minimum of 10 years of experience to join our team in Addison, Texas. The role involves designing, implementing, and optimizing scalable data pipelines using Hadoop ecosystem tools (HDFS, Hive, Spark) and MS SQL Server. You will be responsible for developing data ingestion processes, building and maintaining datasets, performing performance tuning for distributed workloads, and creating complex SQL queries and ETL workflows. Collaboration with data scientists for ML model feature readiness, implementing monitoring, and documenting architecture are key aspects of this role. Experience with AI/ML is preferred.

Addison, Texas 75001 Posted March 29th, 2026

Looking for more job opportunities? Click here!

Job Type: Full Time

Job Category: IT

Job Description

Role: Hadoop Data Engineer

Location: Addison, TX

FTE only

Job Description

Must Have Technical/Functional Skills

Primary Skill: Hadoop (HDFS, Hive, Spark), Big Data ETL/ELT, Distributed Processing, MS SQL Server,

Secondary: Artificial Intelligence/ Machine learning

Experience: Minimum 10 years

Roles & Responsibilities

? Design and implement scalable batch and/or streaming data pipelines using Hadoop ecosystem tools.

? Develop and optimize data ingestion processes from multiple sources (RDBMS, files, APIs, logs).

? Build and maintain datasets in HDFS/Hive and ensure data quality, lineage, and governance.

? Perform performance tuning for distributed workloads (partitioning, file formats, resource management).

? Create and optimize complex queries, stored procedures, and ETL workflows in MS SQL Server.

? Collaborate with data scientists/analysts to deliver feature-ready datasets for ML models.

? Implement monitoring and alerting for pipeline health and data SLAs.

? Document architecture, workflows, data dictionaries, and operational runbooks.

? Support production deployments, incident triage, and root cause analysis.

Required Skills & Qualifications

? Strong hands-on experience with Hadoop components (e.g., HDFS, Hive, YARN, MapReduce/Spark).

? Experience with data modeling and data warehousing concepts.

? Solid proficiency in MS SQL Server (T-SQL, query optimization, indexing, stored procedures).

? Experience with ETL/ELT design patterns and job scheduling (e.g., Oozie/Airflow/Control-M).

? Strong understanding of distributed computing concepts and performance tuning.

? Familiarity with Python/Scala/Java for data processing (any one preferred).

? Bachelor’s degree in Computer Science, Engineering, or equivalent experience.

Required Skills

DEVOPS ENGINEER

SENIOR EMAIL SECURITY ENGINEER

Ready to apply?
You'll be redirected to Realign's application page.