AI Software Engineer
Role summary
This Principal Solutions Architect role focuses on designing and developing high-performance 'AI On-Prem Appliances' for Generative AI inference and computer vision at the industrial edge. The position requires leading the architecture of GenAI and Agentic AI solutions, including LLM, VLM, and VLA model integration, and conducting hands-on prototyping for enterprise customers. Key responsibilities involve optimizing AI pipelines for efficiency, architecting multi-agent workflows, and driving hybrid AI strategies. The role demands extensive systems engineering experience with a strong emphasis on AI/ML architectures, production deployments, Python, C++, AI runtimes, and containerization technologies.
Principal Solutions Architect
We're working with a global leader in connected intelligent edge and wireless innovation on this exciting opportunity.
As a Principal Engineer, you will spearhead the development of "AI On-Prem Appliances," high-performance hardware designed for Generative AI inference and computer vision at the industrial edge. You will lead the creation of scalable blueprints for GenAI and Agentic AI workloads, utilizing bleeding-edge hardware acceleration to transform industries like healthcare, retail, and smart manufacturing.
The Role
• Own the end-to-end architecture of solution blueprints for GenAI and Hybrid-AI deployments, focusing on LLM, VLM, and VLA model integration.
• Lead hands-on prototyping and proof-of-concepts (POCs) for enterprise customers, building pilot systems that demonstrate the power of on-premises AI inference.
• Architect multi-agent workflows (Agentic AI) including planning, tool-use, and function calling using dedicated AI hardware platforms.
• Optimize complex AI pipelines for latency, throughput, and power efficiency, performing model quantization, pruning, and distillation for edge deployment.
• Drive hybrid AI strategies by partitioning workloads across on-device, on-prem, and cloud environments to maximize privacy and performance.
What You'll Need
• 15+ years of hands-on systems engineering experience, with at least 10 years focused on AI/ML architectures and production-grade deployments.
• Expert-level proficiency in Python and C++ with deep knowledge of AI runtimes and hardware-aware optimization (TensorRT, ONNX, etc.).
• Proven track record shipping complex AI systems involving LLMs, Computer Vision (CNNs), RAG pipelines, and Vector Databases.
• Deep experience with industrial IoT (IIoT) protocols, video analytics pipelines, and containerization (Docker/Kubernetes).
• Strong background in heterogeneous computing and leveraging AI accelerators to handle massive inference workloads locally.
What's On Offer
• Highly competitive base salary range of $220,200 - $330,400.
• Robust total compensation package including annual discretionary bonuses and restricted stock units (RSUs).
• Opportunity to shape the future of industrial AI at a company defining the "Connected Intelligent Edge."
• Comprehensive benefits package designed to support health, wealth, and work-life balance.
Apply via Haystack today!
Similar roles
AI Software EngineerNumerator · United States · Remote- Senior AI Software EngineerRemoteHunter · United States · Remote
- Junior AI Software EngineerAgility PR Solutions · Ontario, Canada · Remote
AI Software EngineerBroadcom · Georgia, United States · Onsite- AI Software EngineerAgility PR Solutions · Ontario, Canada · Remote