Post Job Free
Sign in

DevOps Operations Engineer

Company:
Certu Systems
Location:
Pittsburgh, PA, 15289
Posted:
June 19, 2025
Apply

Description:

DevOps Operations Engineer

Location: Pittsburgh, PA

Department: Operations

Job Type: Full Time/in person

Company Description

Join an exciting AI startup on a mission to revolutionize logistics operations through actionable insights and real-time predictions that drive reliability, efficiency, transparency, and cost savings.

Certu Systems was founded by a passionate group of innovators and problem-solvers committed to solving the persistent supply chain challenges impacting global industries. Our goal is simple: bring supply and demand closer together by leveraging intelligent data analytics, automated workflows, and continuous process improvements.

Role Description

As a DevOps Operations Engineer, you will be responsible for the infrastructure, deployment pipelines, and overall operational health of our AI-driven logistics platform. Your work will directly contribute to the optimization and scalability of our solutions, helping customers manage logistics and supply chain operations more efficiently. You’ll work closely with both the engineering and Operations teams to streamline processes, automate workflows, and maintain a high level of operational efficiency.

As part of a lean, fast-growing organization targeting a high-impact niche within the logistics and delivery space, you’ll play a pivotal role in delivering disruptive solutions that transform how global supply chains operate.

Key Responsibilities:

Infrastructure Management & Automation:

Set up, configure, and maintain cloud-based infrastructure to support AI-driven logistics operations, ensuring high availability, scalability, and reliability.

Automate internal infrastructure and services provisioning and management, including ACL and permissions management.

Optimize server and storage configurations.

CI/CD Pipeline Management:

Design, implement, and maintain continuous integration and continuous deployment (CI/CD) pipelines.

Integrate deployment and testing into the pipeline, automating workflows.

Ensure seamless and automated versioning, rollback strategies, and continuous delivery of updates to production.

Monitoring, Performance, and Incident Management:

Implement and configure system monitoring and logging tools to track both application and performance in real-time.

Proactively monitor the operational health of system models and the logistics platform, quickly identifying and addressing performance issues.

Troubleshoot and resolve incidents, performing root cause analysis and improving operational resilience over time.

Collaboration with Engineering Teams:

Partner closely with internal teams to ensure smooth deployment, monitoring, and scaling of machine learning models.

Ensure continuous alignment between infrastructure, operations, and customer-facing features to deliver end-to-end solutions.

Disaster Recovery and Backup:

Implement and regularly test disaster recovery plans for critical logistics systems ensuring business continuity during system failures.

Set up automated backup processes for important data, including real-time operational data and historical logistics records.

Cost Optimization:

Optimize cloud resources to minimize costs without compromising system performance or scalability, especially during peak demand periods.

Continuously evaluate infrastructure usage and recommend optimizations to ensure cost-efficiency.

Key Qualifications:

Education: Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).

Experience:

5+ years of experience in a DevOps, systems operations, or cloud infrastructure management role, particularly in scaling and managing cloud-based systems.

Hands-on experience with cloud platforms (AWS, GCP, or Azure) and associated services.

Strong knowledge of containerization technologies (e.g., Docker, Kubernetes) for efficient deployment and scaling of applications.

Experience with CI/CD pipelines, version control systems, and related tools (Jenkins, GitLab CI, CircleCI).

Technical Skills:

Proficiency in scripting to automate operational tasks.

Familiarity with monitoring tools for real-time monitoring of infrastructure and AI model performance.

Experience working with data storage systems.

Soft Skills:

Problem-solving mindset with the ability to troubleshoot complex systems.

Strong collaboration skills to work across teams, particularly with engineering and data science teams.

Ability to balance operational efficiency with system-specific needs, ensuring that the logistics platform is both robust and responsive.

Preferred Qualifications:

Certifications:

AWS Certified DevOps Engineer, Google Cloud Professional Cloud Architect, or similar certifications.

Automation Tools:

Familiarity with tools like Jenkins, GitLab, or CircleCI for managing deployment pipelines in an automated manner.

Benefits

Competitive compensation packages

Employee stock options

Opportunity for rapid advancement and career growth

Be part of a mission-driven, high-impact startup shaping the future of logistics

Apply