Job Description:
Run our infrastructure with Terraform, Azure PaaS and/or Kubernetes.
- Make monitoring and alerting notify on symptoms and not on outages.
- Document so your findings turn into repeatable actions-and then into automation.
- Improve the deployment process to make it as boring as possible.
- Independently debug production issues across services and levels of the stack.
- Proactive communication with issues and propose ideas and solutions within the product team to reduce the workload by automation.
- Plan, design and execute solutions within product team to reach specific goals agreed within the team.
- Plan and execute configuration change operations both at the application and the infrastructure level.
- Actively look for opportunities to improve the availability and performance of the system by applying the learnings from monitoring and observation
- Complete Root Cause Analysis (RCA) investigations
- Responsible for gaining a deep understanding of the portfolio and understand the integrations
- Improving DevOps practices and accelerating delivery and take a lead role in troubleshooting technical issues and recommending changes to improve resiliency
- Develop strategic technology roadmaps
- Respond to TechOps incidents and provide support for customer incidents.
All you'll need for success
Minimum Qualifications- Education & Prior Job Experience:
Bachelors degree in Computer Engineering, Computer Science, Electrical Engineering or related field, and 5 years of experience
General knowledge of the following areas with deep knowledge in 2 areas:
- Implement "Infrastructure as Code" using Terraform in Azure and on-prem infrastructure resources
- Implement Github , GHA CI/CD and ADO cloud for automation
- Load balancing the application including Proxies and CDN (automate)
- Implementing monitoring, observability in AKS and K8S
- Monitoring and Metrics in Dynatrace, Prometheus, Grafana and integrations with Moogsoft/xMatters
- Open source Logging infrastructure
- Able to script Automated performance testing scenarios for APIs and Web front ends and embed in CI/CD pipelines dashboarding/reporting query languages
- Backend storage management and scaling
Preferred Qualifications- Education & Prior Job Experience:
Masters degree in Computer Engineering, Computer Science, Electrical Engineering or related field, and 3 years of experience
- Airline Industry experience helpful
Skills, Licenses & Certifications
Proficiency and demonstrated experience in the following technologies:
- Experienced in technology transformations and migration to one or more Cloud platforms such as AWS, Azure or GCP
- Hands-on experience with Infrastructure as a Service (IaaS), Platform as a Service (PaaS) tools and platforms, and containers and container orchestration platforms (aka Docker & Kubernetes)
- Expertise in one or more cloud native relational databases such as MySql, PostgreSql and NoSQL databases such as Cassandra and MongoDB and databases and migration to/from enterprise class databases highly desired
- Strong technical knowledge and skills that are broad and deep, covering various hardware, software, and technology platforms
- Nodejs, Typescript, JavaScript
- Database and persistence frameworks: Mongo, Oracle, Object/Relational Mapping, Query performance tuning
- Experience with Mongo Schema Design and Mongo Aggregation Framework
- Develop, implement, and maintain applications and systems that integrate MongoDB
- Web Services: Graph QL, REST/SOAP (JSON/WSDL/XML)
- DB Admin/SQL Server
- Terraform
- SysAdmin
- Troubleshooting Network Issues
- VM Management
- Dynatrace
- Ping Federate
- Airwall
- Security Vulnerabilities (remediation/compliance)
- IIS