Job Description
We are offering a long-term contract opportunity for a Site Reliability Engineer in Naperville, Illinois. This role involves the design and implementation of resilient cloud infrastructure using various AWS services and the establishment of observability frameworks.
Responsibilities:
• Design and enforce strategies for infrastructure-as-code (IaC) using Terraform or CloudFormation.
• Use AWS services such as EC2, ECS/EKS, RDS, Lambda, S3, CloudFormation, etc to architect and implement robust cloud infrastructure.
• Establish and maintain observability frameworks utilizing tools like CloudWatch, Prometheus, Grafana, ELK, or Datadog.
• Develop and sustain CI/CD pipelines for automated deployment and testing.
• Utilize your skills in automation, AWS technologies, Python scripting, Bash scripting, and Amazon CloudWatch.• Candidate must possess a minimum of 5 years of experience as a DevOps Engineer or in a similar role.
• Proficiency in automation is mandatory for this role.
• Expertise in AWS technologies is a crucial requirement.
• Strong Python scripting skills are essential.
• Demonstrable experience with Bash scripting is required.
• General scripting experience is necessary.
• Familiarity with various scripting languages is important.
• Comprehensive understanding of Amazon Web Services (AWS) is a must.
• Experience with Amazon CloudWatch is desirable.
• Proven track record in software delivery is a significant asset.
• Knowledge of Kubernetes is necessary for this role.
• Experience with Prometheus is a key requirement.
• Proficiency in Grafana is highly beneficial for this position.