Stefanini Group logo
Stefanini Group Verified
Information Technology & Services, IT Consulting, Outsourcing

Data Engineer

San Francisco, California, United StatesHybridFull TimePosted 2 months agoVisa sponsorship available

Is this role right for you?

Upload your resume and get a skill-by-skill breakdown — see exactly where you match, where you're close, and what to highlight. Not a mystery percentage.

Get a tailored resume highlighting what this role needs.

Role summary

Stefanini Group is seeking a Contract Data Engineer for hybrid roles across the USA, with assignments lasting one year. The role involves implementing and managing data products, ensuring scalable, secure, and efficient data pipelines within a cloud-based Common Data Platform (CDP). Responsibilities include collecting, parsing, managing, analyzing, and visualizing large datasets to derive actionable insights. The Data Engineer will design, develop, and maintain data pipelines, participate in Agile rituals, resolve data pipeline issues, deploy monitoring and alerting, and collaborate with cross-functional teams to meet data requirements. A Bachelor's degree in a related field or equivalent experience is required, along with specific experience in Python, PySpark, SQL, cloud data warehousing, and modern data stack technologies.

Job Description
Stefanini Group is hiring!
Stefanini is looking for a Data Engineer for various location across USA (Hybrid).
For quick apply, please connect with Prakhar Goel: 248 263 5255/ Prakhar.goel@stefanini.com
W2 Candidates Only!
Position Summary
We currently have multiple openings for Contract Workers to join us in Data Engineer roles for one year assignments to implement and manage data products, ensuring that our data pipelines are scalable, secure, and efficient. We are working to modernize how we manage and leverage it. The Common Data Platform (CDP) is an exciting new, multi-district program to create a cloud based, end-to-end data management platform to reduce data cost and improve user experience. Initially CDP was developed in partnership with the Supervision and Regulation business, but this system is now positioned to become 'the standard' data management platform.
As a Data Engineer, this CW will be responsible for collecting, parsing, managing, analyzing, and visualizing large sets of data to turn information into actionable insights. They will work across multiple platforms to ensure that data pipelines are scalable, repeatable, and secure, capable of serving multiple users.
Responsibilities

  • Design, develop, and maintain robust and efficient data pipelines to ingest, transform, catalog, and deliver curated, trusted, and quality data from disparate sources into our Common Data Platform.
  • Actively participate in Agile rituals and follow Scaled Agile processes as set forth by the CDP Program team.
  • Deliver high-quality data products and services following Safe Agile Practices.
  • Proactively identify and resolve issues with data pipelines and analytical data stores.
  • Deploy monitoring and alerting for data pipelines and data stores, implementing auto-remediation where possible to ensure system availability and reliability.
  • Employ a security-first, testing, and automation strategy, adhering to data engineering best practices.
  • Collaborate with cross-functional teams, including product management, data scientists, analysts, and business stakeholders, to understand their data requirements and provide them with the necessary infrastructure and tools.
  • Keep up with the latest trends and technologies, evaluating and recommending new tools, frameworks, and technologies to improve data engineering processes and efficiencies.

Qualifications

  • Bachelor's degree in Computer Science, Information Systems, or a related field, or equivalent experience.
  • Our ideal candidate would have all of these skills but as many as possible will suffice: Databricks - PySpark, SQL (Starburst is a bonus), Gitlab, -CI/CD Pipelines, Python, and Tableau.
  • 2+ years' experience with tools such as Databricks, Collibra, and Starburst.
  • 3+ years' experience with Python and PySpark.
  • Experience using Jupyter notebooks, including coding and unit testing.
  • Recent accomplishments working with relational and NoSQL data stores, methods, and approaches (STAR, Dimensional Modeling).
  • 2+ years of experience with a modern data stack (Object stores like S3, Spark, Airflow, Lakehouse architectures, real-time databases) and cloud data warehouses such as RedShift, Snowflake.
  • Overall data engineering experience across traditional ETL & Big Data, either on-prem or Cloud.
  • Data engineering experience in AWS (any CFS2/EDS) highlighting the services/tools used.
  • Experience building end-to-end data pipelines to ingest and process unstructured and semi-structured data using Spark architecture.

Listed salary ranges may vary based on experience, qualifications, and local market. Also, some positions may include bonuses or other incentives.
Stefanini takes pride in hiring top talent and developing relationships with our future employees. Our talent acquisition teams will never make an offer of employment without having a phone conversation with you. Those face-to-face conversations will involve a description of the job for which you have applied. We also speak with you about the process including interviews and job offers.
About Stefanini Group
The Stefanini Group is a global provider of offshore, onshore and near shore outsourcing, IT digital consulting, systems integration, application, and strategic staffing services to Fortune 1000 enterprises around the world. Our presence is in countries like the Americas, Europe, Africa, and Asia, and more than four hundred clients across a broad spectrum of markets, including financial services, manufacturing, telecommunications, chemical services, technology, public sector, and utilities. Stefanini is a CMM level 5, IT consulting company with a global presence. We are CMM Level 5 company.

Ready to apply?
You'll be redirected to Stefanini Group's application page.

Similar roles