Hadoop Data Engineer-8
Role summary
A Hadoop Data Engineer with a minimum of 10 years of experience is sought to design, implement, and optimize scalable data pipelines using the Hadoop ecosystem (HDFS, Hive, Spark). The role involves developing data ingestion processes, building and maintaining datasets, and performing performance tuning for distributed workloads. Proficiency in MS SQL Server for complex queries and ETL workflows is essential. Collaboration with data scientists to prepare datasets for ML models, implementing monitoring, and documenting architecture are also key responsibilities. Familiarity with Python, Scala, or Java for data processing is preferred.
Addison, Texas 75001 Posted March 29th, 2026
Looking for more job opportunities? Click here!
Job Type: Full Time
Job Category: IT
Job Description
Role: Hadoop Data Engineer
Location: Addison, TX
FTE only
Job Description
Must Have Technical/Functional Skills
Primary Skill: Hadoop (HDFS, Hive, Spark), Big Data ETL/ELT, Distributed Processing, MS SQL Server,
Secondary: Artificial Intelligence/ Machine learning
Experience: Minimum 10 years
Roles & Responsibilities
? Design and implement scalable batch and/or streaming data pipelines using Hadoop ecosystem tools.
? Develop and optimize data ingestion processes from multiple sources (RDBMS, files, APIs, logs).
? Build and maintain datasets in HDFS/Hive and ensure data quality, lineage, and governance.
? Perform performance tuning for distributed workloads (partitioning, file formats, resource management).
? Create and optimize complex queries, stored procedures, and ETL workflows in MS SQL Server.
? Collaborate with data scientists/analysts to deliver feature-ready datasets for ML models.
? Implement monitoring and alerting for pipeline health and data SLAs.
? Document architecture, workflows, data dictionaries, and operational runbooks.
? Support production deployments, incident triage, and root cause analysis.
Required Skills & Qualifications
? Strong hands-on experience with Hadoop components (e.g., HDFS, Hive, YARN, MapReduce/Spark).
? Experience with data modeling and data warehousing concepts.
? Solid proficiency in MS SQL Server (T-SQL, query optimization, indexing, stored procedures).
? Experience with ETL/ELT design patterns and job scheduling (e.g., Oozie/Airflow/Control-M).
? Strong understanding of distributed computing concepts and performance tuning.
? Familiarity with Python/Scala/Java for data processing (any one preferred).
? Bachelor’s degree in Computer Science, Engineering, or equivalent experience.
Required Skills
DEVOPS ENGINEER
SENIOR EMAIL SECURITY ENGINEER