Lead GCP Cloud Engineer
Role summary
Seeking a Lead Data Engineer to design and build a next-generation data platform using a modern lakehouse approach with Databricks and PySpark. This hands-on leadership role involves architecting scalable data pipelines, implementing canonical data models, and ensuring data quality on Google Cloud Platform (BigQuery, Dataproc, Cloud Storage). The role requires strong Python and SQL proficiency, experience with Airflow for orchestration, and the ability to collaborate with business stakeholders. You will also mentor a team of data engineers and contribute to best practices for scalability and reliability. Experience with data lake/lakehouse architectures and advanced data modeling is essential.
Role: Lead Data Engineer
Location: Remote (U.S. preferred)
Compensation: ~$150K base (flexible for the right candidate)
Type: Full-time or Contract-to-Hire
The Opportunity
We are seeking a Lead Data Engineer to play a critical role in designing and building a next-generation data platform. This is a highly impactful, hands-on leadership role focused on modernizing data architecture using Databricks, PySpark, and a Lakehouse approach. You will work closely with both technical teams and business stakeholders to define requirements, design scalable solutions, and drive end-to-end implementation.
What You'll Do
- Lead the design and implementation of scalable data pipelines and data platforms using modern lakehouse architecture
- Build and optimize production-grade PySpark pipelines in Databricks
- Design and implement canonical data models across multiple data sources
- Apply medallion architecture (bronze/silver/gold layers) for structured and unstructured data
- Drive data quality improvements in a complex, mixed-format environment (JSON, CSV, XML/DDEX)
- Partner with business stakeholders to run workshops, gather requirements, and translate needs into technical solutions
- Architect solutions leveraging Google Cloud Platform (BigQuery, Dataproc, Cloud Storage) and orchestration with Airflow (Astro)
- Mentor and guide a team of data engineers, including contractors
- Contribute to best practices around scalability, reliability, and performance
- Leverage modern tools (including AI) to improve engineering productivity
Required Qualifications
- 6+ years of data engineering experience, including 2+ years in a lead or senior-lead capacity
- Deep, production-level experience with Databricks and PySpark (not just PoC work)
- Strong experience with Google Cloud Platform, including BigQuery, Dataproc, and Cloud Storage (GCS)
- Experience designing data lake / lakehouse architectures (GCS as system-of-record is a plus)
- Advanced data modeling expertise, including dimensional modeling, canonical/domain modeling, and entity resolution
- Strong proficiency in Python and SQL, with a focus on production-quality, reliable code
- Experience with Airflow (or Astro managed Airflow) for orchestration
- Proven ability to work directly with business stakeholders and lead requirements-gathering sessions
- Experience delivering end-to-end data platform implementations
Nice to Have
- Experience with dbt (especially within medallion architectures)
- Background in music, media, digital rights, or royalties
- Exposure to AWS environments
- Experience working in startup or consulting environments
What We're Looking For
- Strong ownership mindset - someone who takes initiative and drives outcomes
- Ability to operate as a technical leader and business translator
- Comfortable working in fast-paced, evolving environments
- Focus on building scalable, maintainable systems, not just quick fixes
- Engineers who think beyond pipelines - problem solvers, not just implementers
Our benefits package includes:
- Comprehensive medical benefits
- Competitive pay
- 401(k) retirement plan
- And much more!
About OpenKyber
Technology is our focus and quality is our commitment. As a national expert in delivering flexible technology and talent solutions, we strategically align industry and technical expertise with our clients' business objectives and cultural needs. Our solutions are tailored to each client and include a wide variety of professional services, project, and talent solutions. By always striving for excellence and focusing on the human aspect of our business, we work seamlessly with our talent and clients to match the right solutions to the right opportunities. Learn more about us at inspyrsolutions.com.
OpenKyber provides Equal Employment Opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability, or genetics. In addition to federal law requirements, OpenKyber complies with applicable state and local laws governing nondiscrimination in employment in every location in which the company has facilities.
For applications and inquiries, contact: hirings@openkyber.com