Data Engineer
Role summary
We are seeking a Data Engineer to build and maintain integrations for collecting security data from various third-party platforms. The role involves developing pipelines to parse, standardize, and normalize log data, implementing log ingestion tools, and defining data schema mappings. You will optimize ingestion workflows for performance and reliability, produce technical documentation, and collaborate with engineering teams on automation and AI-driven parsing. The ideal candidate has 3-6 years of experience in data engineering or security data pipelines, strong knowledge of ETL, API development, schema design, and experience with log collection technologies, security data platforms, programming languages like Python, and cloud environments.
Responsibilities
- Build and maintain integrations that collect security data from third party platforms such as EDR tools, cloud services, and compliance systems
- Develop pipelines to parse, standardise, and normalise both structured and unstructured log data into a unified internal format
- Implement and manage log ingestion using tools such as Fluentd, Logstash, Vector, or similar collectors and forwarders
- Define and maintain mappings between external data schemas and internal data models to ensure consistency and usability
- Optimise ingestion workflows to improve performance, reliability, and data quality across all pipelines
- Produce clear technical documentation and support internal teams or customers with integration-related guidance
- Partner with backend and platform engineers to enable end-to-end data flow and contribute to automation initiatives, including AI-driven parsing capabilities
Your Skills
- 3–6 years of experience in data engineering, integration engineering, or security data pipelines
- Strong understanding of data integration concepts including ETL processes, API development, and schema design
- Hands-on experience parsing and transforming log data using regex, Grok patterns, or similar techniques across formats like JSON, syslog, or CSV
- Practical experience with log collection or forwarding technologies such as Fluent Bit, Logstash, Elastic Beats, OpenTelemetry, or equivalent tools
- Familiarity with security data platforms such as Splunk, Microsoft Sentinel, QRadar, or similar SIEM solutions
- Proficiency in at least one programming language such as Python, Go, or TypeScript
- Experience working in cloud environments (AWS, Azure, or GCP), with exposure to streaming or messaging systems like Kafka being advantageous
Similar roles
- Senior Data EngineerExperion Technologies · Plano, Texas, United States · Hybrid
- Lead Data EngineerSmart IT Frame LLC · Los Angeles, California, United States · Hybrid
Principal Data EngineerRS21: A Data Science and Visualization Company · United States · Remote
Senior Data EngineerRaag Solutions · Bellevue, Washington, United States · Onsite- Lead Data EngineerRetail Insight Ltd · Illinois, United States · Hybrid