Staff Site Reliability Engineer, Production Engineering
Role summary
Dropbox is seeking a Staff Site Reliability Engineer to lead the company-wide reliability strategy, focusing on stability, observability, and operational excellence in the era of AI-assisted software development. This role involves defining multi-year reliability goals, roadmaps, and standards across key areas like observability, debugging, and incident management. The engineer will lead cross-team initiatives to mitigate reliability risks associated with increased development velocity and complexity, and partner with engineering leaders to enhance monitoring, alerting, and incident response systems at scale. Key responsibilities include identifying and mitigating AI-introduced reliability risks, providing technical leadership and mentorship, and ensuring clear communication with senior stakeholders on reliability priorities and progress. A BS in Computer Science or equivalent experience, 12+ years in software/site reliability engineering, and deep experience with distributed systems and production operations are required. Experience with AI-assisted development workflows and scaling developer productivity platforms is preferred.
Role Description
As a Site Reliability Engineer focused on company-wide reliability strategy, you will play a crucial role in advancing Dropbox’s stability, observability, incident response, and operational excellence as AI technologies reshape how software is built and operated. You will help define the reliability strategy for a new chapter of agentic development and AI-enabled software delivery, including preparing Dropbox for increases in pull request volume, service complexity, incident patterns, and demand for debugging and monitoring tools. You will partner across Engineering, Product, and leadership teams to raise the bar for reliability, guide long-term platform investments, and ensure Dropbox continues to deliver dependable experiences for millions of users.
Our Engineering Career Framework is viewable by anyone outside the company and describes what’s expected for our engineers at each of our career levels. Check out our blog post on this topic and more here.
Responsibilities
Many teams at Dropbox run Services with on-call rotations, which entails being available for calls during both core and non-core business hours. If a team has an on-call rotation, all engineers on the team are expected to participate in the rotation as part of their employment. Applicants are encouraged to ask for more details of the rotations to which the applicant is applying.
Requirements
Preferred Qualifications
Compensation
US Zone 1
This role is not available in Zone 1
Similar roles
- Staff Site Reliability Engineer, Production EngineeringDropbox · Canada: Select
- Senior Site Reliability Engineer, Production EngineeringAnduril · Costa Mesa, California, United States · Onsite
- Senior Site Reliability Engineer, Production EngineeringAnduril · Costa Mesa, California, United States · Onsite
- Senior Site Reliability Engineer, Production EngineeringAnduril · Seattle, Washington, United States · Hybrid