
DATA211: Senior Data Engineer, ETL and Data Platform
Role summary
JerseySTEM is seeking experienced, pro-bono Data Engineers to stabilize and scale core data pipelines for analytics and reporting. This role requires ownership of production-grade ETL workflows across multiple data sources, focusing on operational reliability, sound data modeling, and platform sustainability. Responsibilities include designing and maintaining ETL pipelines using MySQL, integrating third-party systems and APIs, implementing efficient data refresh strategies, managing schema changes, and building analytical models, data marts, and a centralized data warehouse. The ideal candidate will ensure data quality, implement monitoring and documentation, and provide technical leadership in data engineering best practices. This is a remote, volunteer position requiring a minimum six-month commitment of approximately six flexible hours per week.
All JerseySTEM roles are pro-bono (unpaid) positions.
JerseySTEM is a mission-driven professional network of pro-bono contributors dedicated to improving access to STEM education and career pathways for underserved middle school girls in New Jersey.
Members contribute their professional skills and leverage their networks in service of the organization’s gender-equity agenda.
Membership is a minimum six-month commitment of approximately six flexible hours per week and includes a $100 refundable deposit, returned after six months of active membership. K-12 educators, retirees, veterans, interns, and students are exempt from the deposit.
JerseySTEM is seeking experienced Data Engineers to stabilize and scale the core data pipelines that power analytics and reporting. Given the platform's current complexity, the role calls for engineers who can take ownership of production-grade workflows across multiple data sources: designing, implementing, and maintaining them.
This role focuses on operational reliability, sound data modeling, and long-term platform sustainability.
Responsibilities
- Design, build, and maintain production-grade ETL pipelines using MySQL and external data sources
- Integrate third-party systems and APIs, including Integrate.io
- Implement change data capture (CDC) and incremental loading strategies for efficient and reliable data refresh
- Manage schema changes, late-arriving data, and source inconsistencies
- Design and maintain analytical models, including fact and dimension tables
- Build and evolve data marts and a centralized data warehouse
- Implement monitoring, documentation, and pipeline standards
- Ensure data quality, consistency, and operational resilience
- Provide technical leadership and define data engineering best practices
Qualifications
- Seven or more years of hands-on experience in data engineering or data platform roles
- Strong experience working with MySQL in analytical or hybrid environments
- Proven experience integrating external APIs and third-party systems
- Demonstrated experience implementing CDC or incremental load patterns
- Deep understanding of dimensional modeling and warehouse architecture
- Advanced SQL skills and strong proficiency in Python or similar languages
- Ability to operate independently and own pipelines end to end
What This Role Is Not
- Not limited to ad hoc scripts or one-off fixes
- Not a purely advisory position
- Not a passive oversight role
What Success Looks Like
- ETL pipelines are stable, incremental, and predictable
- API ingestion runs with minimal manual intervention
- Data models are trusted and analytics-ready
- Analytics teams focus on insights rather than resolving data issues