
Data Engineer (Remote)
Role summary
CrowdStrike is seeking a Principal Data Engineer with expertise in LLMs and AI platforms to design, build, and deploy scalable data infrastructure for AI-driven security products. This senior role involves developing sophisticated data pipelines and systems to enable advanced security features. The engineer will lead technical initiatives, influence platform architecture, and collaborate with cross-functional teams to transform research prototypes into production-grade services. The position requires 10+ years of data/platform engineering experience, 3+ years in AI/ML platforms, and proficiency in Python, JVM technologies, RAG, and MLOps tools. Experience with distributed systems, cloud platforms, and streaming technologies is essential.
About The Company
CrowdStrike is a global leader in cybersecurity, renowned for its innovative approach to stopping breaches through an advanced AI-native platform. With a focus on delivering cutting-edge security solutions, CrowdStrike combines expertise in threat intelligence, endpoint protection, and cloud security to safeguard organizations worldwide. The company's commitment to innovation, integrity, and excellence has established it as a trusted partner for enterprises seeking robust cybersecurity defenses. CrowdStrike's dedication to leveraging the latest technology ensures that clients stay ahead of evolving cyber threats, making it a pioneer in the cybersecurity industry.
About The Role
The Principal Data Engineer at CrowdStrike is a senior-level position that requires deep expertise in Large Language Models (LLMs) and AI platforms. This role involves designing, building, and deploying scalable data infrastructure to support next-generation AI-driven security products. The successful candidate will be instrumental in developing sophisticated data pipelines and systems that enable advanced security features through AI and machine learning technologies. This position offers an exciting opportunity to lead technical initiatives, influence platform architecture, and collaborate with cross-functional teams to transform research prototypes into production-grade services that enhance CrowdStrike’s cybersecurity offerings.
Qualifications
- Master’s or PhD in Computer Science, Data Engineering, or a related STEM field, or equivalent practical experience
- 10+ years of progressive experience in data engineering or platform engineering
- At least 3 years of focused experience in AI/ML or data science platforms at a large scale
- Hands-on experience with Large Language Models (fine-tuning, prompt engineering, deployment)
- Proficiency in RAG (Retrieval-Augmented Generation) and agentic workflows
- Proven track record in designing and delivering large-scale distributed systems
- Exceptional coding skills in Python, JVM technologies, and related languages
- Strong understanding of engineering best practices including code reviews, resilient architecture, and comprehensive testing
- Experience in leading technical teams, mentoring engineers, and conducting workshops and design reviews
- Knowledge of MLOps tools such as MLflow, SageMaker, Vertex AI
- Experience with distributed data processing frameworks like Spark, Dask, Flink
- Familiarity with cloud platforms (AWS, GCP, OCI) and container orchestration tools (Docker, Kubernetes)
- Experience with message queuing, streaming technologies, and data warehousing solutions
Responsibilities
- Architect, implement, and optimize data platforms and pipelines for LLMs, RAG, and AI agentic systems at exabyte scale
- Drive the adoption of agentic workflows and techniques to create autonomous, data-driven security features
- Design scalable, fault-tolerant, and cost-effective data solutions emphasizing rapid iteration and high-quality deployment
- Develop production-ready code focused on performance, maintainability, and rigorous testing standards
- Provide technical leadership in data modeling, normalization, and semantic cataloging for AI/ML workloads
- Establish best practices for MLOps/DataOps, including monitoring, observability, and zero-touch recovery mechanisms
- Mentor engineering teams, conduct workshops, and lead design reviews to enhance platform knowledge
- Collaborate with research and engineering teams to transform prototypes into scalable, reliable services
- Manage end-to-end lifecycle of critical data services: development, testing, deployment, and monitoring
Benefits
- Market-leading compensation and equity awards
- Comprehensive physical and mental wellness programs
- Competitive vacation and holiday schedule
- Paid parental and adoption leave
- Opportunities for professional development and continuous learning
- Employee networks, regional groups, and volunteer initiatives to foster community engagement
- Vibrant office culture with world-class amenities
- Recognition as a Great Place to Work across the globe
Equal Opportunity
CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike participates in the E-Verify program. We do not discriminate based on race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, physical or mental disability, medical condition, genetic information, or any other characteristic protected by law. Employment decisions are made solely based on valid job requirements.
Similar roles
Data Engineer (Remote)Sundayy · Canada · Remote- Senior Data Engineer (Remote)Claritev · United States · Remote
Data Engineer (Remote)Sundayy · United States · Remote- Data Engineer (Remote)FCT · Louisiana, Louisiana, United States · Remote
- Data Engineer (Remote)Kforce Inc · Birmingham, Alabama, United States · Remote