Post Job Free
Sign in

Systems Engineer - AWS Cloud - (Remote)

Company:
Vaco by Highspring
Location:
San Diego, CA
Posted:
May 06, 2025
Apply

Description:

Job Title: Linux Systems Engineer

Position Summary:

We're on the hunt for a seasoned AWS Cloud Linux Systems Engineer to strengthen our IT/DevOps team. This role is ideal for someone who thrives in a cloud-first environment and brings deep experience in managing Linux systems, automating infrastructure, and maintaining high-performing, secure cloud environments. You'll be responsible for architecting, deploying, and supporting critical AWS services while ensuring system reliability and performance.

Key Responsibilities

Cloud Infrastructure & System Management:

Deploy and manage scalable, secure Linux environments across AWS and, occasionally, Azure or GCP.

Handle core AWS services such as EC2, S3, VPCs, Load Balancers, and RDS; bonus if you're comfortable with GCP equivalents like Compute Engine and Cloud Functions.

Ensure the infrastructure is robust, cost-effective, and designed for high availability.

Collaborate with cross-functional teams to design and integrate cloud-native solutions into CI/CD pipelines. Automation & Infrastructure as Code:

Build and maintain Infrastructure as Code (IaC) using tools like Terraform and AWS CloudFormation.

Streamline operations through automation with Ansible, Chef, or Puppet.

Write and maintain scripts that help manage and scale cloud environments efficiently. Cloud Security & Compliance:

Apply best practices in cloud security-controlling access, managing IAM roles, securing endpoints, and encrypting data.

Conduct regular audits and vulnerability assessments.

Work alongside security teams to address compliance, access controls, and infrastructure hardening. Monitoring, Performance & Optimization:

Implement and manage tools such as Prometheus, Grafana, AWS CloudWatch, Datadog, or similar.

Identify and resolve performance bottlenecks, manage resource usage, and optimize costs.

Configure alerts and manage logs using solutions like ELK Stack, Splunk, or native cloud logging services. Linux Systems & Application Management:

Administer Linux-based environments including updates, patching, and troubleshooting.

Configure and manage essential services like Apache, NGINX, MySQL, and PostgreSQL.

Manage storage solutions such as EBS volumes and S3 buckets. Collaboration & Documentation:

Partner with Dev, QA, and Ops teams to ensure reliable application deployment and system stability.

Maintain comprehensive documentation covering infrastructure, processes, and policies.

Support and mentor junior team members through knowledge sharing and technical guidance. Incident Response & Root Cause Analysis:

Respond promptly to critical incidents affecting cloud infrastructure.

Conduct root cause analysis and implement long-term solutions to recurring problems.

Qualifications & Skills

Required Education & Experience:

Bachelor's degree in Computer Science, IT, or equivalent professional experience.

7-9 years of experience in cloud infrastructure, specifically AWS.

5-7 years of deep Linux systems engineering experience (Ubuntu, CentOS, or RedHat). Core Technical Skills:

Expertise in AWS cloud technologies and infrastructure management.

Strong Linux administration background.

Hands-on experience with IaC tools such as Terraform and CloudFormation.

Proficiency in automation tools like Ansible, Puppet, or Chef.

Solid grasp of cloud networking: DNS, VPNs, CIDR, subnets, and VPCs.

Knowledge of cloud security principles and IAM management.

Experience with monitoring/logging tools like CloudWatch, Datadog, LogicMonitor, or similar. Preferred Qualifications:

Relevant certifications (AWS, ITIL, CAMP, etc.).

Familiarity with asset management and compliance in cloud environments.

Experience with cloud-native monitoring and alerting systems. Soft Skills:

Strong communication skills and a proactive mindset.

Exceptional troubleshooting ability and a customer-service-first attitude.

Ability to work independently and as part of a team in a fast-paced, evolving environment.

Flexibility for after-hours or weekend work during updates, outages, or project rollouts.

Apply