
Devops Engineer Cloud Infrastructure

Location:
Daly City, CA, 94015
Posted:
June 02, 2025

Resume:

Charles Lee

***Crestview Circle, Daly City, CA **015, USA

Mobile: 408-***-**** Email: ********@*****.***

LinkedIn: https://www.linkedin.com/in/charles-lee-84002a15/

OBJECTIVE Seeking a position as a Senior/Staff DevOps Engineer

SUMMARY

•CI/CD Pipeline Architecture: Bash, Python, Helm charts, GitHub, GitLab, GitHub Actions, Jenkins, ArgoCD, JFrog, Bitbucket, AWS CDK, GCP Build.

•Cloud Infrastructure Management: JVM, .NET, Docker, Kubernetes (on-prem, EKS, GKE), Terraform, Ansible, AWS ECS/S3/R53/SSM/SES/SNS/Ingress Controller/Lambda/CloudFront/CloudFormation, GCP Compute/Storage/Profiler/Build/Trace, on-prem VMware/OpenStack, Linux (CentOS/Ubuntu/RedHat), Apache/Nginx, Kafka, Windows.

•Observability & Monitoring: SLA/SLO/SLI/RCA, chaos engineering, AWS CloudWatch, GCP Monitoring, PagerDuty, AppD, New Relic, Datadog, Prometheus, Grafana, ELK, Fluent Bit, Splunk.

•Security and Compliance: AWS Config/IAM, firewall, ZTNA, SD-WAN, SIEM, EDR, FedRAMP, DevSecOps (DAST, SAST, SCA) security scanning with Coverity, Black Duck, SonarQube, and Snyk.

•Database Management: AWS RDS/Aurora, GCP Cloud SQL, PostgreSQL, MySQL, MongoDB, Redis, Kafka.

EDUCATION

San Jose State University, Bachelor of Science, Computer Science, San Jose, CA

Current Certifications:

Google Professional DevOps Engineer

Certified Kubernetes Administrator

Certified Kubernetes Application Developer

Certified Kubernetes Security Specialist

Kubernetes and Cloud Native Associate

Kubernetes and Cloud Native Security Associate

AWS SysOps Administrator

Terraform Cloud Engineer

Prometheus Certified Associate

Fortinet Certified Professional – Security Operations

https://www.credly.com/users/charles-lee.8480321e/edit#badge-portfolio

EXPERIENCE

Senior DevOps Engineer, Fortinet, Sunnyvale CA, 04/2023 – present

•As project leader, collaborate with various business units to onboard applications onto the Kubernetes platform. Since I joined, Fortinet’s stock price has risen from $50+ to $100+.

•Implement security policies to protect users, repositories, merge requests, and data in CI/CD environments.

•Design CI build pipelines in GitLab CI, Jenkins, and GitHub Actions, and CD deployment pipelines in ArgoCD targeting AWS, GCP, and on-prem OpenStack.

•Create Terraform and Ansible scripts to build robust, secure, multi-region, and scalable EKS/ECS, GKE, and Kubernetes clusters on AWS, GCP, and OpenStack.

•Set up SLOs and evaluate SLIs so that Fortinet’s customers have 100% trust in Fortinet products.

•Automate DevSecOps (DAST, SAST, SCA, SBOM) scripts to ensure Fortinet’s cybersecurity products are delivered to customers quickly, reliably, and securely.

•Maintain security best practices and compliance: implement AWS Security Hub and GCP Security Command Center, IAM best practices, Velero backups of all Kubernetes clusters to AWS S3/Google Cloud Storage, and Fortinet security devices such as firewall, VPN, SIEM, EDR, and ZTNA for protection.

•Participate in the 24/7 on-call rotation and troubleshoot incidents to ensure rapid resolution and minimal service disruption. Diagnose root causes of system failures and isolate the failing components and failure scenarios in EKS/GKE, OpenSearch (including high-latency OpenSearch), AWS Lambda, Kafka, RDS, and S3; examine IAM policies, DNS resolution, network traffic, and SG settings; implement Prometheus, Grafana, Zabbix, and Datadog (APM).

•Perform postmortems/RCA, set up SLOs, and evaluate SLIs, focusing on sustainable incident response and identifying contributing causes.

•Practice chaos engineering using AWS SSM fault injection to create disruptive events that stress applications, observe their behavior under stress, identify system weaknesses, and improve system robustness.

Senior DevOps Engineer, Everlaw, Oakland CA, 10/2021 – 04/2023

•Everlaw is a startup data company; my main job was to support Everlaw’s customers at the US Dept. of Justice, state DOJs, global law firms, and university law schools.

•Joined the 24/7 on-call rotation to administer, automate, and manage CI/CD deployments using Python, Bash, Terraform, Ansible, Jenkins, GitHub Actions, ArgoCD, and Helm on AWS EKS, GCP GKE, and AWS GovCloud.

•Used Terraform to build PagerDuty incident management integrated with Grafana and AWS CloudWatch.

•Managed GCP BigQuery, Cloud SQL, Cloud Logging, and GKE.

•Designed code security and code quality policies; implemented SonarQube and Snyk for DAST, SAST, and SCA security scans on builds of Go, Java, C, JS, and Python code.

Staff DevOps Engineer, Aktana, San Francisco CA, 05/2019 – 09/2021

•Aktana is a healthcare startup; my main job was to support Aktana’s global ETL system, which downloads medical data 24/7 from Salesforce to AWS US, UK, EU, JP, AU, and CA regions. The JVM process runs on Jenkins runners located in AWS EKS. Downloaded data is first stored in MySQL/S3, then sent to EMR for analysis, and the results are emailed to pharma sales teams through MSK/Kafka. During on-call, we monitored every step in this process.

•Designed Terraform scripts to improve infrastructure operations with automation and tooling.

•Optimized CI/CD pipelines and DevOps tooling by replacing Bash with Python.

•Designed Ansible playbooks and Python and Bash scripts on Jenkins servers to speed up Aktana’s global production deployment.

•As SRE on the 24/7 on-call support rotation, implemented Datadog, Grafana, and Prometheus monitoring metrics for Aktana’s service-oriented infrastructure.

•As infrastructure engineer, migrated Aktana’s Docker Compose clusters to AWS EKS clusters.

Senior DevOps Engineer, Certain Inc, San Francisco CA, 07/2016 – 05/2019

•Certain is a startup; my first job was to migrate the Certain event management system from VMware servers in a datacenter to the AWS EC2 environment.

•As infrastructure engineer, built Elasticsearch, Logstash, and Kibana in AWS so that system logs, application logs, CloudWatch logs, CloudFront logs, flow logs, and CloudTrail logs are all sent to ELK hosted in AWS.

•As SRE, built a Datadog monitoring system to monitor AWS EC2, VPC, RDS, and Certain’s JVM applications.

•When on call, I was the first person to join the bridge to troubleshoot and worked to resolve customer issues ASAP.

•Worked on root cause analysis to improve service delivery, maturity, and scalability.

•Tied automation into a site reliability framework; on the 24/7 on-call rotation, supported and troubleshot Certain’s production SaaS in AWS Linux/Windows environments.

•Implemented Ansible playbooks to build all components (ELK, DataDog, JVM deployments) in AWS.

Senior DevOps Engineer, eBay Inc, San Jose CA, 02/2014 – 07/2016

•As site reliability engineer on the 24/7 rotation, managed and administered the eBay Advertising Network.

•As key contributor, set up an on-prem ELK stack (Elasticsearch, Logstash, Kibana) central logging environment to collect logs, created documentation, and trained users.

•As DevOps engineer, supported eBay publisher JVM production and QA environments.

•Created web, API, TCP, HTML code, DNS, and transaction tests in the Catchpoint monitoring system.

•Created Salt states/scripts in dev, QA, and production environments for product deployment.

Principal DevOps Engineer, NextBio, Santa Clara CA, 04/2007 – 01/2014

•NextBio was a startup, later acquired by Illumina. I was NextBio’s key DevOps engineer and managed and maintained 99% uptime for its SaaS big data cloud Research/Clinical/Patient/Datahub/Ontology applications on clustered servers.

•As Hadoop system admin, managed data pipelines that transformed human genome data from AWS S3 to NextBio datacenter Hadoop servers.

•Migrated AWS default VPCs with hosts in branch offices to consolidate billing.

•Managed JBoss, clustered Apache HTTP, clustered Tomcat Solr, clustered Hadoop, clustered MySQL, clustered Secure FTP, NFS/NetApp servers and clustered Barracuda load balancers.


