Hadoop Data Engineer-8

Addison, Texas, United StatesOnsiteFull Time$115,000–$115,000 /yrPosted 2 months ago

Is this role right for you?

Upload your resume and get a skill-by-skill breakdown — see exactly where you match, where you're close, and what to highlight. Not a mystery percentage.

Get a tailored resume highlighting what this role needs.

Role summary

A Hadoop Data Engineer with a minimum of 10 years of experience is sought to design, implement, and optimize scalable data pipelines using the Hadoop ecosystem (HDFS, Hive, Spark). The role involves developing data ingestion processes, building and maintaining datasets, and performing performance tuning for distributed workloads. Proficiency in MS SQL Server for complex queries and ETL workflows is essential. Collaboration with data scientists to prepare datasets for ML models, implementing monitoring, and documenting architecture are also key responsibilities. Familiarity with Python, Scala, or Java for data processing is preferred.

Addison, Texas 75001 Posted March 29th, 2026

Looking for more job opportunities? Click here!

Job Type: Full Time

Job Category: IT

Job Description

Role: Hadoop Data Engineer

Location: Addison, TX

FTE only

Job Description

Must Have Technical/Functional Skills

Primary Skill: Hadoop (HDFS, Hive, Spark), Big Data ETL/ELT, Distributed Processing, MS SQL Server,

Secondary: Artificial Intelligence/ Machine learning

Experience: Minimum 10 years

Roles & Responsibilities

? Design and implement scalable batch and/or streaming data pipelines using Hadoop ecosystem tools.

? Develop and optimize data ingestion processes from multiple sources (RDBMS, files, APIs, logs).

? Build and maintain datasets in HDFS/Hive and ensure data quality, lineage, and governance.

? Perform performance tuning for distributed workloads (partitioning, file formats, resource management).

? Create and optimize complex queries, stored procedures, and ETL workflows in MS SQL Server.

? Collaborate with data scientists/analysts to deliver feature-ready datasets for ML models.

? Implement monitoring and alerting for pipeline health and data SLAs.

? Document architecture, workflows, data dictionaries, and operational runbooks.

? Support production deployments, incident triage, and root cause analysis.

Required Skills & Qualifications

? Strong hands-on experience with Hadoop components (e.g., HDFS, Hive, YARN, MapReduce/Spark).

? Experience with data modeling and data warehousing concepts.

? Solid proficiency in MS SQL Server (T-SQL, query optimization, indexing, stored procedures).

? Experience with ETL/ELT design patterns and job scheduling (e.g., Oozie/Airflow/Control-M).

? Strong understanding of distributed computing concepts and performance tuning.

? Familiarity with Python/Scala/Java for data processing (any one preferred).

? Bachelor’s degree in Computer Science, Engineering, or equivalent experience.

Required Skills

DEVOPS ENGINEER

SENIOR EMAIL SECURITY ENGINEER

Ready to apply?

You'll be redirected to Realign's application page.