Responsibilities
- Plan, design, manage, maintain and support cloud infrastructure for high-traffic, enterprise-scale workloads
- Drive the automation of tasks and the implementation of infrastructure services
- Identify and rectify potential infrastructure, network and security risks
- Research, set up, test and implement technologies and solutions that improve the performance, reliability, availability, security and efficiency of infrastructure on AWS
- Troubleshoot, perform root cause analysis and work closely with developers to implement corrective and preventive actions during and after an incident
Requirements
- 8+ years of experience with a broad range of AWS technologies (e.g. EC2, IAM, VPC, CloudWatch, EKS, ECS, Security Hub, DynamoDB, Secrets Manager, GuardDuty)
- Solid experience with Terraform for Infrastructure as Code
- Experience operating a 24x7x365 AWS environment using Git repositories and CI/CD tools such as Jenkins
- Ability to analyze and resolve complex infrastructure resource and application deployment issues (e.g. by using APM tools such as New Relic or Dynatrace)
- Knowledge of cloud security and best practices (AWS Well-Architected Framework)