
Site Reliability Engineer

Company: ZeOmega
Location: Plano, TX, 75023
Posted: December 25, 2025

Description:

Position Summary:

Site reliability engineering (SRE) combines software engineering practices with IT engineering practices to create highly reliable systems.

Site reliability engineers are responsible for the reliability of the full stack, from the front-end, customer-facing applications to the back-end database and hardware infrastructure.

Responsible for the design and delivery of the mission-critical stack, with a focus on security, resiliency, scale, and performance.

Authority for end-to-end performance and operability.

Partner with development teams in defining and implementing improvements in service architecture.

Work involves defining and documenting technical architecture of complex and highly scalable products.

A minimum of 8 years of experience running large-scale, customer-facing web services.

PRINCIPAL JOB RESPONSIBILITIES:

* Engage in and improve the whole lifecycle of services, from inception and design through deployment, operation, and refinement.

* Understanding of, implementation experience with, and troubleshooting of database technology.

* Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.

* Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.

* Scale systems sustainably through mechanisms like automation; evolve systems by pushing for changes that improve reliability.

* Practice sustainable incident response.

* Strong communication and analytical skills.

* Familiarity with security practices in web application delivery and general knowledge of network topology.

* Experience with configuration management tools.

Skills:

* Coding, CI/CD, databases, microservices, version control, Docker, Kubernetes, and monitoring tools.

* Good scripting skills (Python/Perl/Ruby/Bash).

* Hands-on experience with Linux and Windows environments (intermediate).

* Experience with databases (Oracle/MSSQL).

* Experience with standard continuous integration tools (Jenkins/Quick Build/Bamboo/Go/TeamCity/Cruise Control).

* Version control systems (Git/SVN/CVS/Perforce/ClearCase/Mercurial).

* Experience working with build tools (Buildout/Ant/Maven).

* Experience with software configuration management systems (Puppet/Chef/Salt/Ansible).

* Network administration.

Experience:

A minimum of 8 years of experience running large-scale, customer-facing web services.

Education:

B Tech/BE/M Tech/MCA/BSc/MSc from a reputed university.

Skills:

* BC - Dependability and Reliability

* BC - Initiative

* BC - Time Management

* TC - Build Creation & Validation (Buildout)

Competencies:

* Adaptability/Flexibility

* Communication Skills

* BC - Time Management

* DC - US Healthcare Domain Knowledge

* Customer Service

* Dependability/Reliability

* FC - Managing Project Issues, Critical Path
