PARTHA PRATIM SARMA
******.*******@*****.***
Seattle, WA https://www.linkedin.com/in/partha-sarma/
SENIOR SOFTWARE DEVELOPER
Highly accomplished professional with extensive experience in design and development of distributed systems and massively scalable architectures. Well-versed in Micro Services, Cloud infrastructure and Observability. History of success in monitoring, analytics platforms, incident management, and change management.
TECHNICAL SKILLS
Programming Languages: Java, Python, Shell
SDLC methodologies and techniques: Distributed Systems, Cloud Computing, Microservices, Agile, Continuous Integration and Deployment, Infrastructure as Code, DevOps, Site Reliability Engineering.
Cloud Technologies (AWS): Lambda (Serverless), EC2, DynamoDB (Non-Relational Db), SQS (Message processing), SNS (Event Processing), S3, CloudTrail and CloudWatch (Logging and Monitoring), CodeDeploy, API Gateway, Cloudformation, Athena (Query Service), Quicksight (Dashboarding), Lookout for Metrics (Machine Learning powered Anomaly Detection).
EXPERIENCE
AMAZON, Bellevue, WA April 2014 - Present
SDE III and lead – Alexa Voice Service, Product Quality Management Systems, October 2020 – Present
Developed the 3-year and 5-year strategic project roadmaps with clearly defined goals and success criteria working closely with Product Managers and senior leadership. Led a team of 5 engineers and contributed to design and development.
•Designed and developed services using Machine Learning powered anomaly detection to detect Alexa customer facing product quality, security and policy violating issues bringing defects down by ~30%.
•Led the design and Tech Strategy of unification of Alexa Skills monitoring and 3P Device monitoring platforms in a single multi-tenant platform.
•Contributed to critical path code and provided code reviews to increase code quality.
•Led an Operational Excellence programs aimed at reducing resolution time of customer impacting issues and increasing developer productivity across the organization.
•Actively participated in mentoring programs to help Support and System Engineers with career growth opportunities and helping them role-switch to developer roles.
•Analyzed integration defects in Alexa 3P products and defining strategies to ensure Alexa integration in production devices are done right before they are launched or before any Over-the-Air change by recommending changes to public facing API specifications and documentation.
•Acted as an Alexa Operational Readiness Review bar raiser to ensure service is designed correctly and followed the operational best practices before they are launched.
SDE II – Alexa Voice Service, Traffic Management Systems, January 2018 – September 2020
Led design and development of services which can detect Alexa integration issues in 3P Alexa products like Sonos speakers, Samsung Smart TVs etc.
•Designed and developed services to automatically manage throttle values of 3P Alexa products to ensure bad players cannot brown-out the underlying Alexa systems but allows legitimate traffic.
•Successfully co-led migration of multiple critical-path high traffic services to (native) AWS, to ensure services can scale successfully for high velocity events like Super Bowl, Christmas and New Year peaks.
•Led the development of tools to provide forecasting of 3P Alexa traffic to ensure downstream Alexa services are correctly scaled every month.
•Drove initiatives to determine how backend service metrics map to customer engagement behavior of Alexa products to establish business rules and thresholds around risk monitoring.
•Led initiatives to lower mean time to resolution by 10% YoY of customer facing issues working closely with Support Engineers, Product Managers and Solution Architects.
Partha Pratim Sarma 206-***-**** Page Two
Site Reliability Engineer IV – Amazon eCommerce Retail, October 2015 – December 2017
Designed and developed tools that perform automated load test on production servers and determine the best infrastructure type suited for a particular service.
•Designed and developed tools to perform automated scaling and descaling of hosts as a part of peak traffic season readiness.
•Developed tools to generate data lakes to enable analytics on Issue management KPIs to find bottlenecks and reduce mean time to resolution of high severity issues.
•Provided strategic plans and executed migrations of services to newer server instance types to increase performance and reduce baseline operating costs.
•Provided On-call support for mission critical retail website facing services and developing strategies to reduce issue volume.
Site Reliability Engineer III – Amazon eCommerce Retail, April 2014 – September 2015
Developed tools to automate reporting of server kernel configurations to enable planning of server instance migrations.
•Performing load test on services to benchmark service performance across different instance types.
•Derived the best practices of load testing on production hosts which later became the source of truth across the wider organization.
•Wrote standard operating procedures on issue management during large scale impact incidents.
AMDOCS January 2014 – Mar 2014
Software Engineer II- Flexible Billing Formatter
Designed and developed components which allows for easy configuration of different bill formats.
•Identified and fixed bugs which reduced bill failures by 23% bring a savings of ~300 hours of operational work monthly.
•Developed scripts to easily retrieve billing rejects and job failures to enable faster issue mitigation.
•Wrote tests to increase the code coverage of the service from 55 – 80%.
TECH MAHINDRA May 2011 – December 2013 Software Engineer II – Migration Order Gateway Interchange
Developed components to handle migration orders from new services to make the service the authoritative source to handle migration orders.
•Developed features to support new services like British Telecom Sports.
•Migrated the service to newer infrastructure to enable scaling up to 5X.
•Handled Live customer impacting incidents, solved application SQL queries, Service Requests and provided On-call Support.
•Performed the responsibilities of an Oracle Junior Database Administrator and conducted database maintenance activities like Index rebuilding and table reorganizations.
EDUCATION
Computer Science and Engineering, Sikkim Manipal Institute of Technology, Sikkim, India