Ops Engineer
Richardson, TX (on-site)
As an Ops Engineer, you are responsible for the health, performance, and availability of multiple production systems in a variety of remotely hosted environments. To keep these systems performing at the highest level, you design, implement, and maintain tools that allow the team to monitor their health, and when issues are detected, you take corrective action as needed to maintain availability.
On an ongoing basis, Ops Engineers also deploy updates to the operational software and systems. This involves fielding new features and bug fixes, building and testing the software, coordinating the updates with the customer base and ISPs, and deploying and verifying those updates.
Required Skills:
- Linux, Windows, and networking
- Shell scripting, Python programming
- Git, Bamboo, GitLab
- Containerization, Kubernetes/Rancher
- Experience with cloud platforms (preferably AWS)
- Testing, automation
- Agile and DevOps principles
- Collaborative team spirit
- Excellent communication skills, both written and verbal
- Strong analytical and problem-solving skills
- Ability to work well under pressure
- Ability to recognize needs and willingness to take initiative to find resolutions
Desired Skills:
- Elasticsearch, Kibana, Prometheus, Grafana, Loki, Kafka
- Security+ certification
Qualifications:
- Degree: BS
- Major(s): Computer Science, Computer Engineering, Electrical Engineering, Physics, Mathematics
- Experience: 5 or more years
- Clearance: Position requires TS and ability to obtain SSBI (including polygraph)
- Applicants must be US citizens