Be a part of our success story. Launch offers talented and motivated people the opportunity to do the best work of their lives in a dynamic and growing company. Through competitive salaries, outstanding benefits, internal advancement opportunities, and recognized community involvement, you will have the chance to create a career you can be proud of. Your new trajectory starts here at Launch!
The Role:
As the Director of our DevOps Discipline, you will lead the design, implementation, and management of our infrastructure and operations. Your role is crucial in ensuring the reliability, scalability, and performance of our systems. You will drive the adoption of DevSecOps practices and Site Reliability Engineering principles, fostering a culture of automation, efficiency, and continuous improvement. Your expertise will be critical in automating operations, enhancing system resilience, improving deployment processes, and providing technical leadership to the DevOps team.
Responsibilities Include:
Discipline Leadership:
- Define and refine the modern age DevOps best practices for our clients
- Lead and mentor a team of DevOps and SRE professionals, fostering a culture of automation and continuous improvement.
- Set technical standards and best practices, ensuring high reliability and efficient operations.
- Build and empower the team to create standardized methodologies within the DevOps discipline along with the flexibility to scale for our clients.
- Drive innovation and explore new technologies.
Infrastructure and Operations Management:
- Design, implement, and manage scalable, reliable, and secure cloud infrastructure (e.g., AWS, Azure, GCP).
- Automate infrastructure provisioning, configuration management, and deployment processes using tools like Terraform, Ansible, and CI/CD pipelines.
- Monitor system performance, identify and resolve issues, and ensure high availability and disaster recovery.
DevOps Practices:
- Drive the adoption of DevOps practices, including continuous integration, continuous delivery, and automated testing.
- Develop and maintain CI/CD pipelines, ensuring efficient and reliable software deployments.
- Implement and manage containerization and orchestration technologies (e.g., Docker, Kubernetes).
- Integrate the full DevOps processes from Plan, Code, Build, Test, Release, Deploy, Operate, and Monitor.
Site Reliability Engineering:
- Apply SRE principles to improve system reliability, scalability, and performance.
- Develop and implement monitoring, logging, and alerting solutions to proactively identify and address issues.
- Conduct performance tuning, capacity planning, and system optimization to meet service level objectives (SLOs).
Security and Compliance:
- Ensure the security and compliance of our infrastructure and operations, implementing best practices for data protection and access control.
- Conduct regular security assessments, vulnerability scans, and penetration tests.
- Develop and enforce security policies, procedures, and incident response plans.
Stakeholder Engagement:
- Collaborate with architecture, development, QA, and product teams to understand their needs and provide technical solutions that meet business objectives.
- Present technical concepts and project status updates to non-technical stakeholders in a clear and compelling manner.
- Build strong relationships with clients, ensuring their technical requirements are met and their expectations are exceeded.
Skill Development:
- Foster a culture of continuous learning and professional growth within the team.
- Stay updated on industry trends, emerging technologies, and best practices in DevOps and SRE.
Preferred Qualifications:
- Proven experience (10+ years) in DevOps, SRE, or a related field, with a track record of leading technical teams.
- Professional IT consulting services experience required
- Strong proficiency in cloud platforms (e.g., AWS, Azure, GCP) and infrastructure-as-code tools (e.g., Terraform, Ansible).
- Extensive experience with CI/CD pipelines, containerization (e.g., Docker), and orchestration (e.g., Kubernetes).
- Strong understanding of monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK stack).
- Excellent problem-solving, critical-thinking, and analytical skills.
- Strong communication and presentation skills, with the ability to convey technical concepts to non-technical audiences.
- Experience with agile methodologies and project management tools (e.g., Jira, Confluence).
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field
Compensation & Benefits:
As an employee at Launch, you will grow your skills and experience through a variety of exciting project work (across industries and technologies) with some of the top companies in the world! Our employees receive full benefits—medical, dental, vision, short-term disability, long-term disability, life insurance, and matched 401k. We also have an uncapped, take-what-you-need PTO policy. The anticipated base wage range for this role is $190,000-$210,000. Education and experience will be highly considered, and we are happy to discuss your wage expectations in more detail throughout our internal interview process.