NEED ONLY US CTITZENS :: Site Reliability Engineer

Jersey, New Jersey, United StatesOnsiteContractPosted 2 months agoVisa sponsorship available

Is this role right for you?

Upload your resume and get a skill-by-skill breakdown — see exactly where you match, where you're close, and what to highlight. Not a mystery percentage.

Get a tailored resume highlighting what this role needs.

Role summary

A Site Reliability Engineer is needed on a contract basis in Jersey City, NJ, to ensure the reliability, observability, and operational hygiene of a telecom program (RRT + STIR/SHAKEN). This role involves maintaining platform stability across SBC telemetry and hybrid contact center touchpoints (Genesys + Amazon Connect migration). Key responsibilities include defining and monitoring SLIs/SLOs, building observability integrations, driving proactive alerting and incident response, implementing operational automation, and enforcing production readiness and change management discipline. The role requires 7-10+ years in SRE/DevOps or infrastructure engineering, with strong experience in monitoring, alerting, and incident response within regulated BFS environments.

Title: Site Reliability Engineer

Location: Jersey City,NJ(Onsite)

Job Type: Contract

Role Summary

The SRE Engineer will own reliability, observability, and operational hygiene for the combined telecom program (RRT + STIR/SHAKEN). Ensures platform stability across SBC telemetry and hybrid contact center touchpoints (Genesys + Amazon Connect migration), with disciplined incident response and change governance through disciplined SRE and DevOps practices.

Why this role exists

To ensure the combined program remains reliable, observable, and operationally disciplined—particularly important in BFS environments where low latency voice paths, SBC telemetry, and a hybrid contact center stack (Genesys + Amazon Connect migration) must meet stringent uptime, incident response, and change governance requirements.

Key Responsibilities

• Define and monitor SLIs/SLOs for latency, availability, error rates, and signing success/failure signals

• Build observability integrations (metrics/logs/alerts) aligned to enterprise monitoring standards and approved platforms

• Drive proactive alerting, incident response, root cause analysis, and post incident reliability improvements

• Implement operational automation, including monitoring/alerting workflows, runbook automation, and repeatable diagnostics

• Enforce production readiness gates and change management discipline; support after hours change windows as needed

Required Qualifications

• 7–10+ years in SRE / DevOps or infrastructure engineering

• Strong experience with monitoring, alerting, and incident response and post incident improvements

• Familiarity with highly available, low latency systems in regulated environments.

• Experience operating in regulated BFS environments

Preferred Qualifications

• SRE experience supporting telecom/voice platforms or real time systems

•
Familiarity with hybrid on prem + cloud observability patterns (Genesys + Amazon Connect coexistence)

• Automation and resiliency testing experience; reliability engineering playbooks

• Background in compliance driven operational environments

•
Experience working in BFS Sector

Thanks

Aatmesh

aatmesh.singh@ampstek.com

Ready to apply?

You'll be redirected to Ampstek's application page.