Site Reliability Engineer
Optomi, in partnership with one of our premiere Financial clients, are seeking a highly skilled Site Reliability Engineer for their Charlotte, NC location. As a key member of our Digital Space team, you will play a critical role in setting up and maintain applications acting as a trusted team member!
What the right candidate will enjoy:
- 12 month contract, with potential full time conversion.
- Hybrid working environment!
- Charlotte, NC location!
Required Qualifications of the right candidate:
- Experience using DevOps tools and technologies such as GitLab, and Infrastructure as Code tools such as Terraform
- Strong troubleshooting skills and building and enhancing the observability using monitoring tools
- Proactive approach to Observability maturity, identifying problems, performance bottlenecks, and areas for improvement for observability
- Leading incident response and supporting application teams. Blameless postmortems
- Developer feedback for enhanced logging, runbooks and addressing technical debt. Promoting observability best practices
- Experience in monitoring tools Dynatrace & Splunk
- Experience in public cloud platforms, preferably AWS and Api gateways
- Experience developing API or Microservices or Frontend is a plus
- Experience using source version control (SVC) such as Git
Responsibilities
- Run the production environment by monitoring availability and taking a holistic view of system health
- Support the applications with OnCall rotation support.
- Provide stability to our applications and facilitates rapid feature development by taking active control on direction of the service and be proactive
- Automate and eliminate manual work and look for opportunities for automation
- Maintaining and implementing the SLO implementation adoption and automation
- Production Readiness/Health Scoring & Error Budget Tracking
- Runbook standards, maintenance, and updates