Short Description:
Sr Cloud Engineer
Location: Santa Clara, CA
100% On-site
9am - 5pm plus lunch
Description:
The Sr Cloud Engineer is an experienced professional who helps maintain the Lab infrastructure and handles daily management of the site’s environment. The individual acts as an administrator and ensures environment stability, while also supporting development teams in the creation and migration of projects. The individual also supports the security posture with respect to encryption management and deployment.
Key Responsibilities:
- AWS Infrastructure Provisioning and Management: Ensures development teams have required infrastructure, environments, and access in place to execute development plans through collaboration with shared services team.
- Manage and maintain Linux image creation and deployment with associated systems, ensuring high levels of performance, availability, and security including vulnerability management.
- Configure and manage AWS virtual private clouds (VPCs), subnets, security groups, and other AWS networking components and services.
- Install, configure, and troubleshoot Linux operating systems and software applications in both on-premises and AWS cloud environments.
- Supports the whole infrastructure (servers, databases, file shares, robot machines, orchestrator, and application installation) by setting up and managing the lab environment.
- Works with team to provision automation infrastructure, including provisioning of new AWS formations and associating them to different environments.
- Platform Administration: Manage and maintain the orchestrator platform, including user access, license management, scheduling, environments, etc.
- Assists in the automation lifecycle by managing production environments and code migrations, ensuring separation of duties.
- Performance monitoring and optimization: Supports infrastructure operations and strategy, monitoring health, the status of deployed machines, and system performance.
- Works with the team on proper usage and maintenance of Automation Cloud and installed software so that they remain secure, stable, and up to date with the latest features.
Security and Compliance:
- Supports escalation requests regarding Infrastructure/Security by working with relevant teams. Supports the creation and deployment of an encryption baseline for workstations and assets, and manages the existing encryption infrastructure.
- Backup and disaster recovery: Implement backup and disaster recovery solutions to ensure the availability and integrity of data together with the corresponding teams.
Continuous Improvement:
- Identify, analyze, and implement enhancements to optimize performance, effectiveness, and cost associated with the environment.
- Training and Support: Provide basic guidance and support to users, developers, and other stakeholders involved in using AWS and Linux environments.
Documentation:
- Maintain updated documentation of configurations and procedures related to AWS infrastructure, environments, and access.
- Monitor AWS and on-prem Linux servers, analyze usage patterns, and optimize costs and performance.
- Monitors incoming support tickets raised by business users or resulting from automation defects.
- Troubleshoot issues related to automation tools, processes, and infrastructure.
Job Knowledge and Skills:
Must have:
- Familiarity with the principle of least privilege.
- Understanding of Software Development Lifecycle.
- Strong understanding of AWS concepts, principles, and best practices.
- Quick technical learner – able to pick up sets of responsibilities quickly.
- Knowledge of LDAP (Lightweight Directory Access Protocol).
- Expertise with AWS CloudFormation.
- Experience with cross-regional teams.
- Experience with Linux servers (Ubuntu, Debian).
- Hardware/software compatibility and build history.
Nice to have:
- Working knowledge of other AWS services and of other cloud platforms (Google Cloud, Azure).
- Project Management Methodologies
- DNS, DHCP, and IP protocols.
- Network protocols and network administration knowledge, network security.
- Good knowledge of security as it relates to cloud-based infrastructure.
- Understanding of cloud computing concepts.
- WinMagic SecureDoc.
Should have implementation experience in most of the below technology areas (breadth) and deep technical expertise in some of the below technologies:
- Serverless AWS technologies like CloudFormation.
- AWS API technologies like API Gateway and its integration with Cognito.
- Containerization technologies like Docker and Kubernetes.
- Authentication services like Cognito. Must have knowledge of OAuth 1.0, OAuth 2.0, and SAML authentication.
- Experience with Amazon S3.
- Data encryption in AWS.
- Exposure to multiple AWS components like Lambda, EC2, SQS, SNS, SES, CloudWatch, RDS, etc.
- Experience in the cloud data ecosystem, specifically with AWS. Must have at least minimum working knowledge of or experience with AWS-managed databases like RDS, Aurora, Redshift, etc.
Competencies:
Focus on results, customer orientation, negotiation and conflict management, effective communication, strong analytical and problem-solving skills, teamwork, planning and organization, and the ability to work under pressure.
Education/Certifications:
- Bachelor’s degree in information technology, computer science or a related field.
- Postgraduate degree – Desirable.
- 3-5 years of relevant experience with AWS, including use of the CloudFormation platform and infrastructure.
- Infrastructure Management (servers, databases, file shares, robot machines, orchestrator, and application installation)
Additional Notes:
- The position supports approximately 70 on-site employees in addition to remote workers. Many are high-level PhD researchers, all highly capable employees working in a start-up environment.
- Strong systems engineer who understands and is willing to do admin work
- Work with software that SV has available, make sure it works, push out updates, patches, etc.
- Continually deploy, integrate, update, etc. Support team in deploying applications
- Should have implemented CloudFormation.
- Receive laptops, build them to spec, and create images; covers the whole team.
- Advanced knowledge of what Linux runs.
- Deal with security: encryption for systems and how it is executed. Some procedures exist and need to be adapted, documented, etc.
- System deployment, re-imaging
- Plan so they have a sense of what is needed, purchase systems, make sure they meet company requirements
- Know what is under warranty; be comfortable with hands-on work on systems.
- What needs to be replaced, what needs to be ordered
- End user system support
- Framework in the cloud: make sure it is upgraded. Will work with the team to see how it is supposed to work and how security will be deployed, and with the business team in this area.