Site Reliability Engineer / DevOps Engineer
Role summary
We are seeking a dynamic and proactive Site Reliability Engineer / DevOps Engineer to join our innovative technology team. In this role, you will be at the forefront of ensuring the stability, scalability, and security of our cloud-based and on-premise IT infrastructure. Your expertise will drive the automation of deployment pipelines, optimize system performance, and enhance disaster recovery strategies. If you thrive in a fast-paced environment and are passionate about building resilient systems, this is your opportunity to make a significant impact!
Job Summary
We are seeking a dynamic and proactive Site Reliability Engineer / DevOps Engineer to join our innovative technology team. In this role, you will be at the forefront of ensuring the stability, scalability, and security of our cloud-based and on-premise IT infrastructure. Your expertise will drive the automation of deployment pipelines, optimize system performance, and enhance disaster recovery strategies. If you thrive in a fast-paced environment and are passionate about building resilient systems, this is your opportunity to make a significant impact!
Duties
- Design, implement, and maintain scalable cloud infrastructure using platforms such as AWS, Google Cloud Platform, and OpenStack.
- Automate deployment processes with tools like Jenkins, GitLab CI/CD, Ansible, Puppet, Chef, and Terraform to streamline software releases.
- Manage container orchestration platforms including Docker, Kubernetes, and OpenShift to support microservices architectures.
- Monitor system health and performance using tools such as New Relic, Splunk, Elasticsearch, and Nagios; analyze logs for troubleshooting and optimization.
- Develop scripting solutions in Bash (Unix shell), PowerShell, Python, Groovy, Perl, Ruby, Go, and C# to automate routine tasks and improve system reliability.
- Implement security best practices for cloud infrastructure and IT systems including firewall management, identity & access management (IAM), DNS security, and cloud security protocols.
- Lead incident response efforts by diagnosing outages or performance issues swiftly to minimize downtime; perform disaster recovery planning and testing.
Skills
- Extensive experience with cloud computing platforms such as AWS, Azure, Google Cloud Platform, and OpenStack.
- Proficiency in containerization technologies including Docker and Kubernetes; experience with virtualization using VMware or similar solutions.
- Strong knowledge of enterprise software including WebSphere, Weblogic, JBoss, Tomcat, Microsoft SQL Server, MySQL, Oracle Database, and Microsoft Windows Server environments.
- Hands-on expertise with configuration management tools like Ansible, Puppet, Chef; familiarity with CI/CD pipelines using Jenkins or TFS.
- Skilled in scripting languages such as Bash (Unix shell), PowerShell, Python, Groovy; ability to write robust automation scripts.
- Familiarity with distributed systems architecture involving RESTful APIs, microservices design patterns and TCP/IP networking protocols.
- Experience with monitoring tools like New Relic or Splunk for log analysis; understanding of system testing and software troubleshooting techniques.
- Knowledge of disaster recovery planning including incident management procedures; experience with incident response workflows. Join us to be part of a forward-thinking team dedicated to delivering reliable IT solutions! Your expertise will help shape our infrastructure’s future while ensuring seamless service delivery across diverse environments.
Pay: $50.00 - $55.00 per hour
Benefits:
- Health insurance
- Paid time off
Work Location: In person

