Site Reliability Risk Management

Location: Dallas, TX

Salary: 135000

Posted: April 02, 2025

Resume:

Key Skills

• Experience in SRE, Support, System Administration, Infrastructure Automation, Release & Change Management.

• Hosting apps in AWS & Azure Cloud, Implementation, Reviewing, Mentoring, DevOps, Java, Microservices, REST API, SaaS & PaaS.

• Focuses on ensuring system reliability, observability, and performance optimization using ELK Stack.

• Migrations, Tech refresh, Patch Management, Vulnerability & Configuration Assessment, Margining, Risk and Workflow Application Development.

• Agile Test-Driven Development, Tech Lead, Production Support and end-to-end Delivery. Skilled in cost and effort analysis, off-shore team handling and vendor management.

• Expert-level knowledge in Risk & Compliance, Infrastructure, Application & Data Protection, Cyber Transformation & Operations, and Digital Identity.

• Application integration with TIBCO Suite of Products. Developing and deploying workflow procedures using TIBCO iProcess and AMX BPM.

Profile Summary

• Performance-driven professional with 14+ years of experience in Development, Design, Delivery, Support, and Site Reliability Engineering, along with IT Compliance, Change & Release Management for diverse applications.

• Enterprise Solution Architect with expertise in cloud-based application design, delivery, and support across AWS and Azure.

• Architecting Cloud & Infrastructure Automation using DevOps, CI/CD pipeline implementation, Deploying Solutions in a serverless model.

• Implementing Infrastructure as Code (IaC) using Terraform & CloudFormation for scalable cloud solutions.

• Strong background in IT operations, production support management, and project lifecycle management.

• Hands-on experience in infrastructure setup, upgrades, maintenance, and deployment, covering the full software development lifecycle from design to release and support.

• Expertise in cloud migrations from on-premises to AWS/Azure, ensuring optimized infrastructure and contingency planning.

• Proficient in Terraform, AWS CDK, scripting languages, and network programming, with strong knowledge of TCP/IP technologies.

• Driving resource utilization and cost efficiency for both cloud and on-premises services.

• Setting up centralized logging with the ELK Stack (Elasticsearch, Logstash, Kibana) for log aggregation and system health measurement.

• Optimizing system performance by analyzing logs, metrics, and traces from ELK.

• Single point of contact for Application Management, covering administration, support, infrastructure availability, enhancements, and performance monitoring.

• Extensive experience in setting up multiple clusters, Load Balancing, High Availability, and Failover functionality.

• Manage and optimize clusters for high performance and scalability.

• Proven track record of migrating and hosting on-premises systems into AWS & Azure with 24/7 availability and 100% uptime.

• Leading Vulnerability & Risk Assessment, troubleshooting, cost analysis, and cloud security to ensure robust hosting and data protection.

• Process enhancement expert with experience managing vendor relationships, on-site technical teams, and SMEs.

• Possess expert-level knowledge, experience, and proficiency across the Security portfolio, including Risk & Compliance, Infrastructure, Application & Data Protection, Cyber Transformation & Operations, and Digital Identity.

• Expertise in A2A/B2B/A2B integration with TIBCO BW, EMS, and Administrator tools.

• Developing and maintaining knowledge documentation for the Site Reliability Engineering team and its stakeholders.

Suresh RK

Senior Level Assignments

Solution Architecture / Site Reliability Engineering / Production Support Lead

Industry Preference: Cloud Computing, DevOps, Support, Incident & Release Management

*******.******@*****.*** +1-551-***-****

https://www.linkedin.com/in/suresh-rk-9aa41a289/

Career Timeline

Apr’11-Feb’13, Mar’13-Jul’13, Aug’13-Mar’19, Apr’19-May’22, Jun’22-Feb’23, Sep’23-Feb’25

Certification

• AWS Certified Solution Architect – Associate

• AWS Certified Developer - Associate

• AWS Certified SysOps Administrator – Associate

• Certified PG Program in Cloud Computing

Education

• Master of Computer Applications

Notable Accomplishments Across the Career

• Received SPOT, STAR, and remuneration awards.

• Recognized for securely hosting enterprise applications on the cloud.

• Best employee of the year award in multiple organizations.

Technical competency

• Cloud skills: AWS, Azure, GCP

• Support skills: L2 and L3 production support 24x7, BAU support.

• AWS skills: Amazon EC2, S3, VPC, Route53, DynamoDB, Lambda, Serverless, CloudWatch, IAM, ELB, API Gateway, CloudFront, SQS, SNS, RDS, Direct Connect, Step Functions, SAM, Kinesis, Athena, Glue, etc.

• Azure skills: VMs, VM Scale Sets, Data Store, Service Management, SQL, etc.

• DevOps skills: AWS CodeCommit, CodeBuild, CodeDeploy, CodePipeline, ECS, EKS, Jenkins, GitHub, BitBucket, Docker, K8S, JFrog, JIRA, Grafana, Elasticsearch, Logstash, Kibana & Wily.

• Standard programming skills: Java, Python, Maven, Groovy, Microservices, JMS, Spring, Hibernate, NodeJS, SQL, PL/SQL, MS SQL Server

• Data Analytics: NoSQL, DynamoDB, Apache Druid

• Monitoring Tools: Grafana, ELK Stack, Dynatrace, Splunk & Wily

• IaC Tools: CloudFormation, Terraform

• Web technologies: CSS, HTML, JSON.

• SOA governance: Webservices (SOAP, REST) and WSDL.

• Integration/Middleware solutions: TIBCO BW, Administrator, EMS, Hawk, Adapters (ADB/MQ/File), iProcess, AMX BPM, Activiti BPM, WebLogic Administration, Tomcat.

• Utilities: Eclipse, SVN, ClearCase, SonarQube, Confluence, XML, XSLT, XPath, WLST, Continuous Integration, Quality Centre and Shell scripting.

• Incident & Problem Management Tools: ServiceNow, Manage Now, CA Service Management, iChamp and Remedy

Career timeline employers (in chronological order): Optimum Solutions, Singapore; TCS, Singapore; Optimum Solutions, Singapore; DBS Bank, Singapore; SGX, Singapore; IDFC First Bank, India

Soft Skills: Planner, Problem Solver, Collaborator, Innovator, Decision Maker

Work Experience

Currently working as a Cloud, DevOps & SRE Lead at Realtech Services, LLC, USA.

Previous Work Experience

Project#6 : Retail Assets and Shared Services

Company : IDFC First Bank

Role : Assistant Vice President Sep 2023 – Feb 2025

Environment : Lambda, DynamoDB, S3, EC2, Route 53, API Gateway, VPC, RDS, CloudWatch, Athena, Glue, ELB, Auto-Scaling, IAM, CloudFormation, Docker, Git, BitBucket, Jenkins, K8S, JIRA, JFrog, Java, Spring Boot, Microservices, Maven, Groovy, Python, Terraform, Dynatrace, ELK Stack, ServiceNow, SQL, PL/SQL.

This project involved designing and implementing a comprehensive retail asset management system for a leading bank to streamline loan processing, enhance risk assessment, and improve customer experience. The solution aimed to automate end-to-end loan lifecycle management, from application to disbursement, underwriting, and collections.

Roles:

o Design and deliver highly scalable cloud-native and serverless applications on AWS using services like EC2, Lambda, API Gateway, DynamoDB, S3, and CloudWatch.

o Develop and deploy Infrastructure as Code (IaC) using Terraform & CloudFormation to automate cloud infrastructure management.

o Identify and implement cloud automation solutions to enhance efficiency and reduce manual effort.

o Lead the hosting and migration of enterprise applications from on-premises to AWS, minimizing data center footprint and improving scalability.

o Ensure 100% uptime of cloud-based applications, managing administration, support, and performance enhancements.

o Act as the single point of contact for designing, implementing, and successfully hosting applications on AWS.

o Oversee ITSM processes, including Incident, Problem, Change, Release, and Service Level Management, ensuring minimal service disruptions.

o Set up centralized logging with the ELK Stack (Elasticsearch, Logstash, Kibana) for log aggregation and system health measurement.

o Optimize system performance by analyzing logs, metrics, and traces from ELK Stack and Dynatrace.

o Monitor infrastructure performance, security, and compliance, implementing mitigation strategies where needed.

o Manage and maintain the ServiceNow platform, ensuring smooth operations, updates, and enhancements.

o Oversee the design, implementation, and optimization of CI/CD pipelines to enable automated build, test, and deployment processes.

o Utilize DevOps tools such as AWS CodeCommit, CodeBuild, CodeDeploy, CodePipeline, Jenkins, Docker, Kubernetes, Ansible, SonarQube, JFrog, BitBucket, Dynatrace and ELK for seamless development and deployment.

o Define CI/CD strategies for major releases, ensuring process compliance and integration with Agile development workflows.

o Lead vulnerability and risk assessment, ensuring secure hosting and data protection in all environments.

o Develop and oversee compliance controls, aligning with legal and internal policies while regularly revising risk mitigation strategies.

o Define and implement non-standard testing approaches for Penetration Testing, Performance Testing, and Vulnerability Scanning.

o Drive onshore, offshore, and vendor scrum teams to meet project timelines using Agile methodologies.

o Oversee change and release management by planning, evaluating risks, and ensuring compliance with security and organizational policies.

o Work closely with development, operations, and QA teams to ensure smooth implementation of changes and releases.

o Monitor project baselines, controlling costs, resource allocation, and timelines to ensure high-quality project execution.

o Design and deliver custom dashboards using AWS Kinesis Streams for real-time traffic monitoring.

Project#5 : Derivatives Clearing Interfaces

Company : Singapore Exchange Group

Role : Assistant Vice President Jun 2022 – Feb 2023

Environment : Lambda, DynamoDB, S3, EC2, Route 53, VPC, RDS, CloudWatch, Athena, Glue, ELB, Auto-Scaling, IAM, CloudFormation, Docker, Git, BitBucket, Jenkins, K8S, JFrog, JIRA, Java, Spring Boot, Microservices, Struts, Apache Tomcat, Apache Druid, Maven, Groovy, Python, Dynatrace, ELK Stack, Grafana, Terraform, ServiceNow, SQL, PL/SQL, MS SQL Server, Linux/Windows OS, MQ and Remedy.

Derivatives Clearing Interfaces is made up of a set of systems responsible for computing settlement prices, computing risk margin, and performing post-trade, reporting, clearing and settlement activities. The core post-trade and clearing system is implemented with the Genium INET clearing engine, an integrated trading/clearing solution provided by Nasdaq OMX. Besides Genium INET, there are also in-house solutions such as COSMOS (Collateral Submission Management and Optimization System), MOS (Mutual Offset System), GPS (Generic Pricing System) and Report Server, developed by SGX and vendor staff.

Roles:

o Provided BAU support for mission-critical applications.

o Led analysis to identify issues or problems that may require changes to procedures, standards or systems. Ensured issues or problems were responded to, resolved, redirected or escalated to the right levels.

o Provided guidance, assistance, coordination and follow-up on complex problems and ensured resolution.

o Worked independently with minimal supervision, escalating in a timely and appropriate manner.

o Engaged with end users and business stakeholders to identify problem areas and provide strategic resolutions. Also produced management reports and statistics for trending issues.

o Worked with Product Stream leads in prioritizing issue fixes, and was accountable for people management and project allocation for the team.

o Actively participated in requirements gathering, business analysis, and technical design discussions, and created data flow diagrams. Also prepared and reviewed functional design and detailed technical design documentation for the projects.

o Set up centralized logging with the ELK Stack (Elasticsearch, Logstash, Kibana) for log aggregation and analysis.

o Optimized system performance by analyzing logs, metrics, and traces from Dynatrace and ELK Stack.

o Developed automated alerting and remediation workflows to reduce system downtime.

o Participated in on-call rotations and conducted root cause analysis (RCA) after incidents.

o Managed and maintained the ServiceNow platform, ensuring smooth operations, configuration, and enhancements based on business needs.

o Provided L2 & L3 application support, delivering high-quality support and leadership to the team. Also actively participated in production/incident calls.

o Took ownership of user problems and collaborated with various teams to resolve end-user issues proactively and in a timely manner.

o Investigated issues to fully understand the root cause and suggested corrective actions to prevent recurrence. Escalated any issues that could not be resolved to the appropriate resource within the team.

o Recommended solutions based on analysis and redesign of business processes and procedures.

o Supported scheduled system software and hardware upgrades as directed through the Change Management System.

o Provided teams with expertise in Spring-based microservices, Docker, K8S, Continuous Integration and Continuous Delivery.

o Virtualized the servers using Docker for test and development environments, and automated configuration using Docker containers.

o Containerized all the modules (Spring Boot and Java applications) using Docker.

o Used CI/CD tools Jenkins, Git, Jira and Docker daemon for configuration management and automation using Ansible.

Project#4 : Service Request Revamp

Company : Development Bank of Singapore

Role : Assistant Vice President Apr 2019 – May 2022

Environment : Lambda, DynamoDB, S3, EC2, Route 53, VPC, RDS, CloudWatch, Athena, Glue, ELB, Auto-Scaling, IAM, Docker, RESTful Web Services (Spring Boot), Java, Microservices, TIBCO Suite of Products, Activiti BPM, K8S, Git, BitBucket, Jenkins, JIRA, Nexus v3, Python, Maven, Groovy, ELK Stack, Grafana, Jasper Reports, UV, IDE, MQ, Apache Tomcat, MariaDB, SQL & PL/SQL.

This project is intended to establish the customer business requirements for the revamp of Service Requests. As part of this enhancement, the Integrated Workflow (IWF) system will build a new workflow application to handle Service Requests between multiple systems across countries. This application provides Orchestration Layer services for STP/NSTP processing of SRs, applicable to all channels such as iServe, ChatBot, CRM, PWEB, IB/MB, etc.

Roles:

o Worked as part of the Cloud DevOps team on internal automation and build configuration management.

o Developed Groovy scripts for automation of the build and release process.

o Automated the front-end platform into a highly scalable, consistent, repeatable infrastructure using Jenkins, CloudFormation and Terraform.

o Set up and maintained automated environments using Maven and Groovy scripts.

o Provided tools and services to enable build and deployment of microservices.

o Responsible for build and deployment automation using Docker containers. Deployed Spring Boot applications through JBoss servers.

o Implemented Docker on the production side, worked on Docker images and containers, and deployed web applications and microservices.

o Developed, maintained and enhanced pre- and post-build scripts (Groovy and Python).

o Created S3 buckets, managed policies, and utilized S3 and Glacier for storage and backup on AWS.

o Created a Continuous Delivery process that includes building Docker images and publishing them to a private repository (Nexus v3).

o Implemented a continuous delivery framework using Jenkins and Maven on multiple environments.

o Implemented a production-ready, load-balanced, highly available and fault-tolerant infrastructure.

o Created CloudWatch alerts for instances and used them in Auto Scaling launch configurations.

o Monitored server alerts through Nagios and CloudTrail, and troubleshot the alerts.

o Designed and implemented monitoring solutions focused on ensuring system reliability, observability, and performance optimization using the ELK Stack.

o Optimized system performance by analyzing logs, metrics, and traces from ELK.

o Participated in on-call rotations and conducted root cause analysis (RCA) after incidents.

o Expertise in EAI, SOA, BPM, Cloud, and DevOps for Business Integration Requirements using TIBCO BusinessWorks, EMS, and Administrator tools.

o Developed business process automation using Activiti BPM, Java, and Microservices to enhance efficiency.

Project#3 : IWF (Imaging and Workflow Services)

Company : Optimum Solutions Pte Ltd, Singapore

Client : Development Bank of Singapore

Role : BPM & Integration Analyst Aug 2013 – Mar 2019

Environment : Activiti BPM, TIBCO iProcess, AMX BPM, BW, EMS, Administrator, Hawk, GI, Vaadin, IBM MQ, WebLogic Server, JBoss, MariaDB, MySQL, Oracle, IDE, UV, Grafana, Wily, iChamp, JIRA, Invensys, CyberArk, FileNet, Scan Client, STDP, etc.

The IWF Enterprise Business Process Management Platform supports various end-to-end process initiatives and enables DBS to move towards becoming a more process-centric organization by:

o Providing a standard tool for process design, simulation, process execution, process monitoring, process audit, process analysis and business rules management.

o Enabling embedding of process controls and alerts, thereby ensuring SLA adherence and regulatory compliance.

o Providing flexibility to effect process change with minimal time and effort, whether to incorporate best practices or regulatory changes or to respond to changing market conditions.

o Laying the foundation for continuous business process improvement.

Project#2 : TIBCO BPM and SOA Products Technical Support

Company : TCS Asia Pacific Pte Ltd, Singapore

Client : Citi Bank, Singapore.

Role : Senior TIBCO Analyst Mar 2013 – Jul 2013

Environment : TIBCO AMX BPM, iProcess, BW, EMS, Administrator, Hawk, Oracle DB, ADB Adapter and MQ Adapter.

Project#1 : TIBCO BPM and SOA Products Technical Support

Company : Optimum Solutions Pte Ltd, Singapore

Client : Citi Bank, Singapore.

Role : Senior Consultant Apr 2011 – Mar 2013

Environment : TIBCO iProcess, BW, EMS, Administrator, Hawk, Oracle DB, ADB Adapter and MQ Adapter.


