Overview
A mission-driven health and wellness company is seeking a mid to senior-level DevOps Engineer to help drive a culture focused on reliability and performance. This company specializes in delivering innovative digital health solutions that support both mental and physical well-being. With strong backing from top investors and a focus on cutting-edge technology, the company offers a unique opportunity to work on scalable platforms that make a meaningful impact.
What You Will Do
- Foster a culture of reliability within the engineering team, emphasizing monitoring, alerting, and scaling practices.
- Participate in architectural discussions with a focus on site reliability engineering.
- Manage and optimize CI/CD pipelines and deployment processes.
- Enhance the developer experience by improving tooling and processes.
- Design, implement, and maintain scalable and secure cloud infrastructure.
- Collaborate closely with software engineers to ensure efficient operations and resolve production issues.
- Implement and maintain monitoring and alerting systems using industry-standard tools.
- Continuously improve system reliability, performance, and security.
- Ensure compliance with regulatory requirements and industry best practices.
What You Will Need to Succeed
- 4+ years of experience in DevOps, Site Reliability Engineering, or a similar role.
- Experience working as a Founding DevOps Engineer or part of a very small DevOps team with broad responsibility
- Strong proficiency with cloud platforms, particularly AWS and GCP.
- Experience with containerization technologies (e.g., Docker) and orchestration tools (e.g., Kubernetes).
- Expertise in CI/CD tools and practices, with a focus on automation.
- Familiarity with monitoring and logging tools such as Datadog and Sentry.
- Strong knowledge of infrastructure-as-code principles and tools (e.g., Terraform, CloudFormation).
- Experience managing production environments for web applications.
- Solid understanding of network security principles.
- Excellent problem-solving skills with the ability to troubleshoot complex systems.
- Strong communication skills and the ability to collaborate with cross-functional teams.