About the Role:
● This position is responsible for end to end management of Shopalyst's cloud infrastructure platform (AWS and Google Cloud).
● Candidate should have a bachelor's degree in Computer Science and Engineering (or equivalent) plus minimum 3+ years of experience in managing AWS cloud infrastructure.
Core responsibilities:
Infra management
● Review infrastructure change requests, suggest & implement optimal solution in terms
of performance, maintainability & cost
● Keep the infrastructure updated/upgraded. This includes servers, operating system,
packages and application software
● Maintain all infrastructure configuration as IaC (Infrastructure as Code)
Infra monitoring and site reliability
● Continuous monitoring of infrastructure and ensuring 24x7 availability
● Implement/maintain automated alerting mechanisms for handling infra outages
● Document and maintain infra recovery procedures, ensure successful backup of critical
data
● Continuously monitor infra utilization and alert on when to scale up/scale down
instances
Infra Security, Compliance & Certifications
● Infrastructure access management - ensure least privilege as the default policy. Review
infra access periodically. Manage access to critical systems like AWS/Open VPN
● Adherence to CIS/AWS security benchmarks on AWS Security Hub
● Implement controls required for different attestations/certifications like SOC 2 Type 2,
ISO 27xxx, PCI DSS etc
DevOps
● Enable CI/CD process for application deployment
Infra cost monitoring and optimisation
● Services-wise infrastructure cost monitoring analysis and optimisation
Infra trends and new services
● Evaluate new cloud services, analyse feasibility of adoption
Required Skills and Experience:
● AWS: 8+ years experience with using a broad range of AWS technologies (e.g. EC2,S3, ELB, VPC,Route 53, IAM, CloudWatch, CloudFront, RDS, Lambda, Glacier) to develop and maintain an Amazon AWS based cloud solution
● DevOps: Solid experience as a DevOps Engineer in a 24x7 uptime Amazon AWS environment, including automation experience with configuration management tools.
● Infra automation: Experience in Terraform/Ansible
● Scripting Skills: Strong scripting (e.g. Python) and automation skills.
● Operating Systems: Linux system administration and strong shell scripting skills
● Monitoring Tools: Experience with web servers (e.g. Nginx).
● Problem Solving: Ability to analyze and resolve complex infrastructure resource and
application deployment issues.
Requirements
Desired Skills (Not essential but beneficial to have):
● DB Skills: Basic DB administration experience - Cassandra, RDS, MySQL
● Search engines: Experience with search engines such as Apache Solr/Elastic Search
● Version Control: Experience administrating version control systems such as Git