
Data Engineer (Remote)
Role summary
CrowdStrike, a global leader in cybersecurity, seeks a Principal Data Engineer with deep expertise in LLMs and AI platforms. This remote role involves designing, building, and deploying exabyte-scale data infrastructure for AI-driven security products. Responsibilities include developing scalable, fault-tolerant solutions, providing technical leadership, mentoring engineers, and transforming prototypes into production services. The ideal candidate has 10+ years of data engineering experience, including 3+ years in AI/ML platforms, with hands-on LLM engineering, RAG, and distributed systems design. Strong coding skills in Python/JVM, knowledge of MLOps, containerization, cloud platforms, and data processing frameworks are essential. This is a strategic opportunity to shape the future of AI-powered cybersecurity.
About The Company
CrowdStrike is a global leader in cybersecurity dedicated to stopping breaches with an advanced AI-native platform. Renowned for its innovative approach, CrowdStrike leverages cutting-edge technology to provide comprehensive security solutions that protect organizations from sophisticated cyber threats. With a focus on innovation, integrity, and customer-centricity, the company empowers businesses worldwide to defend their digital assets effectively. CrowdStrike's commitment to excellence and its dynamic work environment make it an ideal place for professionals seeking to make a meaningful impact in the cybersecurity landscape.
About The Role
We are seeking a highly skilled Principal Data Engineer with deep expertise in Large Language Models (LLMs) and AI platforms to join our team. In this strategic role, you will be responsible for designing, building, and deploying robust data infrastructure that underpins our next-generation AI-driven security products. Your work will involve developing scalable, fault-tolerant, and cost-effective data solutions at exabyte scale, enabling CrowdStrike to stay at the forefront of cybersecurity innovation. You will provide technical leadership across teams, mentor engineers, and collaborate closely with research and product teams to transform prototypes into production-grade services. This is an exceptional opportunity for a seasoned data engineering professional to influence the future of AI-powered cybersecurity solutions and lead complex projects that have a global impact.
Qualifications
- Master’s or PhD in Computer Science, Data Engineering, or a related STEM field, or equivalent practical experience.
- 10+ years of progressive experience in data engineering or platform engineering, with at least 3 years focused on AI/ML or data science platforms at large scale.
- Hands-on experience with LLM engineering, including fine-tuning, prompt engineering, and deployment.
- Strong expertise in Retrieval-Augmented Generation (RAG) and agentic workflows.
- Proven track record in designing and delivering large-scale distributed systems, including sharding, partitioning, and concurrency management.
- Exceptional coding skills in high-level programming languages such as Python and JVM technologies, with a focus on performance, maintainability, and testing.
- Deep understanding of engineering best practices, including code reviews, resilient architecture, and comprehensive testing strategies.
- Experience in leading engineering teams, providing mentorship, and conducting technical workshops and reviews.
- Familiarity with MLOps tools (MLflow, SageMaker, Vertex AI), containerization (Docker, Kubernetes), and cloud platforms (AWS, GCP, OCI).
- Knowledge of distributed data processing frameworks like Spark, Dask, and Flink, as well as data warehousing solutions such as Snowflake and BigQuery.
- Experience with message queuing and streaming technologies like Kafka and Pulsar.
Responsibilities
- Architect, implement, and optimize data platforms and pipelines for LLMs, RAG, and AI agentic systems at exabyte scale.
- Drive the adoption and deployment of agentic workflows and techniques to create autonomous, data-driven security features.
- Design scalable, fault-tolerant, and cost-effective data solutions that support rapid iteration and high-quality deployment.
- Write production-ready code emphasizing performance, maintainability, and rigorous testing to ensure reliable delivery.
- Provide technical leadership in data modeling, normalization, and semantic cataloging for AI/ML workloads.
- Establish and promote best practices for MLOps and DataOps, including monitoring, observability, and zero-touch recovery mechanisms.
- Mentor engineering teams through workshops, design reviews, and technical guidance to strengthen AI platform capabilities.
- Collaborate with research and product teams to transition prototypes into scalable, production-grade services.
- Manage the entire lifecycle of critical data services from development through deployment and ongoing monitoring.
Benefits
- Market-leading compensation and equity awards.
- Comprehensive physical and mental wellness programs.
- Competitive vacation and holiday schedule.
- Paid parental and adoption leave.
- Opportunities for professional development and continuous learning.
- Employee networks, regional groups, and volunteer initiatives to foster community engagement.
- Vibrant office culture with world-class amenities.
- Recognition as a Great Place to Work across the globe.
Equal Opportunity
CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike participates in the E‑Verify program. We do not discriminate on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical or mental disability, medical condition, genetic information, or any other characteristic protected by law. Employment decisions are based solely on valid job requirements.
Similar roles
Data Engineer (Remote)Sundayy · Canada · Remote- Senior Data Engineer (Remote)Claritev · United States · Remote
Data Engineer (Remote)Sundayy · United States · Remote- Data Engineer (Remote)FCT · Louisiana, Louisiana, United States · Remote
- Data Engineer (Remote)Kforce Inc · Birmingham, Alabama, United States · Remote