Title: Site Reliability Engineer
Location: San Francisco, CA(Local candidates only)
Duration: Contract
Responsibilities:
- Guide architecture and development teams on how to make applications highly available, reliable, and performant at a global scale
- Partner with architecture teams to ensure operability, measurability, and manageability are accounted for in business features and enablers
- Collaborate with product owners and managers to Implement and monitor key metrics to meet SLOs and SLA
- Collaborate with development team members to troubleshoot and resolve problems
- Drive the Root Cause Analysis of production issues and other failures within the product software, pipeline, or other DevOps support processes or technology
- Additional responsibilities may be required for the various roles
Qualifications:
- 7+ years of experience in Automation Programming in one or more of the following scripting programming languages: Python, Go, Java, Ruby, Rust, and JavaScript (with priority being given to Python and Go but not required). Bash is not a programing language.
- 7+ years of experience working with Linux terminal tools and writing shell scripts within a Linux environment
- Strong understanding of public cloud service concepts
- Strong understanding of Unix/Linux operating systems internals and administration (Debian understanding is preferred but not required)
- Strong understanding of networking (e.g. TCP/IP, routing, network topologies, and hardware), storage systems, and database systems
- Strong experience in debugging and optimizing code and automating routine tasks
Additional skills/experience may be required for the various roles