Post Job Free
Sign in

Manager Aws

Location:
Cupertino, CA
Posted:
December 14, 2020

Contact this candidate

Resume:

Mobile Ph: 408-***-**** Hari Muthuswamy email: adionq@r.postjobfree.com

Summary:

Six years experience managing technical teams

Six years experience working as SRE/Devops lead

Three years of experience working with Ansible and Puppet

Six months experience using Terraform for Cloud Provisioning in AWS.

Five plus years of scripting in Perl and two years of scripting in Python

Eight years of experience as a Unix Specialist

Four years of experience using Git for version control

Six months experience working with Container Orchestration using Kubernetes

Two years of experience as a Database Administrator, working in the capacity of a physical and a logical Database Administrator

Four years of experience as a Team Lead

Two years of experience in supporting E-commerce environments, troubleshooting, performance tuning, installation and administration of web sites running Iplanet and Apache, both secure and non secure, setting up SSL certificates, renew SSL certificates etc as part of the overall system administration and support

Six months of experience in distributed computing using J2EE technologies

Six years of experience in C programming in a Unix environment

Six years of experience in scripting in unix environment, using Perl, Expect, AWK, SED, Ksh, Bash etc

Six months experience in JAVA programming and Python scripting

Proficient in meeting tight deadlines, performance and client interaction. Excellent Presentation skills

Excellent overall Project life cycle experience and team spirit

Technical Skills:

Operating Systems: Ubuntu, Redhat, Centos, Solaris, AIX, Windows 2008, Windows 2012,Windows 2003

Package Management: apt-get, yum

Virtualization: Docker, ECS, KVM, VirtualBox,Vagrant, Vmware

Monitoring: Nagios, Victorops, Sentry, Zabbix

Cloud Based Technologies: AWS (EC2,S3,Route53,RDS,EFS,VPC,EMR, Cloudwatch,Lambda,CDN)

Log Management: Sumologic, Logstash

Automation: Ansible, Puppet

Cloud Provisioning: Terraform

Container Orchestration: Kubernetes

Build Tools: Maven

Continuous Integration: Jenkins

Task Queue Management: Celery

RDBMS: Mysql,Informix, Oracle,Postgres

Big Data databases: Hbase,Mongo,Cassandra

Web Technology: Apache, Nginx, Haproxy, Websphere, Redis,CGI Scripting, SSL Certificate Management

Languages: C/C++, Java, SQL, Perl, Expect, Ksh, Bash, Python, AWK, SED, HTML, Java Script

Version Control Tools: Git ( Stash,Bitbucket, Github), Phabricator

Problem Management: Jira

Name Server: Bind/DNS

Backup Management: Tivoli Storage Manager (TSM), Bacula

Storage Arrays: Data Domain, Symmetrix, Netapp, Celerra and FastT

SAN and StorageManagement: Mcdata, Connectrix Manager, ECC, Fast T Manager, Data Ontap, DD OS

EXPERIENCE

Devops Lead Consultant (May 2014 - Current )

Provided Devops consulting services to multiple startups in the Bay Area. I have listed below the various consulting assignments I have taken on.

Sr Devops Engineer at Automatic Labs- Sirius XM owned company ( Aug 2019 – Present)

● Refactor the Ansible / Rundeck environment, Upgraded ansible to version 2.5, update Ansible playbooks to work with the upgraded version.

● Developed scripts in Python to manage AWS instances, Sumologic search manipulations.

● dockerized the location services application and built out a Kubernetes cluster in AWS, setup a deployment process for this microservices.

● Enhance the existing terraform infrastructure by pulling in parts of AWS infrastructure that had been created manually in the past.

● Setup a Continous Integration environment using Jenkins, creating webhooks in the git repositories and triggering build jobs in Jenkins.

● Develop training material and run bootcamps, training team members in Ansible and Terraform and get them upto speed

● Streamline the monitoring infrastructure, reviewed and migrated alerts from various monitoring systems, consolidating to just 2 monitoring systems (Sentry and VictorOps)

● Rollout Sumologic based logging to various micro services thereby increasing ability to troubleshoot and identify issues faster.

● Provided oncall support for the production environment.

SRE Consultant at Pinterest ( Oct 2018 – July 2019 )

● Provided SRE oncall support for the Data Storage and Caching Team.

● Automate provisioning of HBASE clusters using Terraform and Puppet

● Develop puppet modules for deployment of code for HBASE clusters.

● As embedded SRE in the Storage team, provide infrastructure support to the Storage team.

● Develop terraform modules to deploy hbase clusters in AWS

Sr.Devops Consultant at Adaptv/Oath (Formerly Yahoo ) ( Jan 2017 - Sep 2018 )

● Provided enhancements to their current homegrown CICD process, incorporating additional functionalities.

● Developed automation process using ansible to bring up EMR clusters on demand

● Built a Jenkins based CICD framework based for deploying software on a daily basis.

● Setup custom metrics in Cloudwatch to enable Route53 based load balancing of elbs across regions.

● Migrate reporting applications associated with the Adserving platform from datacenters associated with Adaptv/AOL to AWS

● Enhanced their Nagios monitoring, removing outdated alerts, adding new alerts for new features.

● Maintain and enhance their Jenkins build and deploy environment.

● Remediate cloud security issues and violations, based on audits performed by third party vendors.

● Provide oncall support once every 4 weeks, troubleshoot production issues and ensure 24 by7 site availability.

● Identify and resolve performance issues with regards to their ad serving site.

Devops Lead Consultant at Sentient Energy, Burlingame ( Mar 2016 – Dec 2016)

● refactored Ansible and Jenkins based deployment process for Sentient software.

● Set up Jenkins server and build jobs, scheduled builds overnight to support development needs using Jenkins, Git and Maven.

● Troubleshoot Jenkins build failures.

● Provide enhancements to the Terraform based cloud provisioning for new Sentient clients.

● Review and create new security policies for the aws environment and implement those policies

● Troubleshoot and fix failures on new customer terraform based environment deployments in AWS.

● added enhancements to the terraform based AWS provisioning environment. Incorporated the requirements for a separate postgres instance for each customer environment, developed scripts for quick deployment of single instances in AWS using Boto.

Devops Lead Consultant at Ampush, San Francisco ( Jan 2015– Feb 2016 ).

● Automate the deployment of Ampush software using Puppet.

● Wrote puppet manifests for new features of the Ampush software.

● Redesigned the puppet environment, separating data from code, using Hiera as a data store and wrote external node classifiers to pull data from Hiera.

● Put in place a process to manage users,authentication using IAM for their AWS environment

● Put together a process to manage their AMIs in the AWS EC2 environment.

● Provide day to day support to Development, QA and their Production environment.

● Developed scripts to provision new servers and shutdown new servers based on requirements.

● Supported a Zabbix based monitoring system

Devops Lead Consultant at Incapture Technologies, San Francisco ( May 2014 – Jan 2015)

● Migrate guest servers using VirtualBox to to KVM

● Setup a Nagios based monitoring environment to monitor Production, QA and Research machines

● Write puppet modules to integrate nagios monitoring scripts into the deployment process. Develop puppet modules to deploy Rabbitmq, Redis, Logstash. Supported a puppet environment comprising of 3 puppet instances and clients running Ubuntu, Redhat and Windows.

● Manage an AWS EC2 environment, deploying new servers, decommissioning servers, perform disaster recovery tests in EC2 environment.

● Setup a full fledged backup system for the environment using Bacula

● Setup a backup process and developed scripts for backing up and restore of Cassandra and Mongo databases.

● Build out new Ubuntu and Redhat based virtual servers in KVM and VirtualBox.

● Troubleshoot performance issues with Cassandra queries.

● Setup slave mongo databases to replicate and use that process to reduce mongo database filesystem space usage.

● Develop and test out process to upgrade mongo from 2.4.5 to 2.6.4

● Provide support for the network environment comprisiing of 3 cisco switches and a firewall.

● Basic Administration of Cisco network switches.

Manager, Storage Area Network Services at IBM ( June 2008 – April 2014)

● Managed a team of 5 to 8 technical staff who were in different geographic locations.

● Worked at multiple IBM accounts ( Philips Medical, Hitachi Data Systems, Nisource, Blue Cross Blue Shield of Massachussetts, VF Corporation ) as a systems specialist.

● Built out offshore teams to provide server support at multiple IBM accounts

● Developed and maintained relationships with clients, ensuring service level agreements are met with client satisfaction.

● Acted as an SME in troubleshooting SAN issues.

● Ensure team is properly staffed with the required skill sets at all times.

●Periodically doing hands on work such as developing scripts to monitor and manage Storagetek tape libraries running ACSLS and Gresham.

●Performed Disaster Recovery tests for multiple IBM accounts at the Sterling Forest site.

●Developed plans and performed migration of TSM servers from version 5 to version 6.

Systems Integration Specialist, North Shore LIJ Hospital, Westbury, NY (Mar 2004 – June 2008 )

Working as an outsourced consultant, I was involved in developing and expanding the enterprise infrastructure as relates to the Unix and Enterprise backup environment.

Was the primary architect and implementor for

expanding the TSM ( Tivoli Storage Manager) environment. Expansion included

a. Moving from a single instance TSM server running on 1 Aix machine to 8 instances of TSM servers running on 4 AIX partitions .

b. Moving to a Library – Client relationship.

c. Migrate media from LTO1 to LTO4.

d. Periodic upgrades to the TSM servers and the storage agents supporting LAN free backups, to keep them in supported levels

e. Develop reports for monthly SLA reporting, tracking growth in terms of Occupancy and daily backup/restore traffic, resource utilization

f. Evaluate technologies such as Virtual Tape Libraries, Data Deduplication and identify vendor products that would satisfy current and future business requirements

g. Prepare and update Disaster Recovery documentation for recovering TSM using the TSM DR module

h. Architected solution to backup the Celerra NAS to TSM which involved using NDMP for DR purposes

creating and maintaining the NIM environment for AIX machines

maintain 6 hacmp clusters running versions 4.5 and 5.3

Partition and manage 3 P570 machines, 4 P550 machines using VIO, micropartitioning and Partition Load Manager (PLM).

Perform AIX OS upgrades as and when required

Administer DNS using BIND

Developed Disaster Recovery Procedures to recover the TSM infrastructure and AIX servers. Performed annual Disaster Recovery tests at the Disaster Recovery site in Sterling Forest.

Provision Clariion Storage to hosts involving zoning using the Connectrix Manager, lun masking using Navisphere, create and expand luns.

EDUCATION

The University of Texas at Arlington, Arlington, Texas - Master of Science in Information Systems



Contact this candidate