Yi Fan
Redmond, Washington, United States *********@*****.***
Summary
Senior Software Engineer with over 8 years of experience designing, upgrading, and maintaining large-scale backend and platform systems with a strong emphasis on Go-based ecosystems, developer tooling, and performance optimization. Proven expertise in enhancing SDKs, managing complex dependency graphs in monorepos, and building internal tooling to improve developer productivity, observability, and debugging workflows. Deep background in systems-level programming, performance profiling, and runtime optimization, with prior exposure to C/C++ libraries and OS-level concepts. Adept at collaborating with infrastructure and platform teams, driving best practices, and providing reliable business-hours support for mission-critical Go services in enterprise environments.
Professional Experience
Senior Engineer
Microsoft Nov 2021 – Oct 2025
Led the upgrade and standardization of Go SDKs and runtime environments across multiple internal services within a large enterprise monorepo, reducing build inconsistencies by over 45% and ensuring compatibility with evolving internal platform requirements using Go 1.20–1.22, Bazel 6.x, and internal artifact repositories.
Designed and maintained Go dependency management tooling integrated with Bazel, enabling deterministic builds and automated version pinning, which decreased dependency-related production incidents by 38% and improved CI pipeline reliability across more than 200 microservices.
Enhanced Go debugging and developer tooling by integrating Delve, improved IDE support for VS Code and GoLand, and custom static analysis rules, resulting in a 30% reduction in mean time to diagnose production issues during business hours support rotations.
Built and optimized internal linters and static analysis pipelines using golangci-lint, staticcheck, and custom AST-based analyzers, enforcing best practices and reducing code review cycles by approximately 25% across distributed teams.
Drove performance profiling initiatives using pprof, runtime tracing, and flame graphs, identifying CPU and memory bottlenecks in high-throughput services and achieving up to 40% latency reduction and 28% memory footprint savings.
Collaborated closely with platform and infrastructure teams to integrate Go services with container orchestration platforms, such as Kubernetes 1.26+, improving autoscaling efficiency and reducing infrastructure costs by an estimated $1.2M annually.
Authored and maintained comprehensive Go documentation and internal best-practice guides covering concurrency patterns, memory management, error handling, and dependency hygiene, significantly improving onboarding speed for new engineers by over 35%.
Implemented CI/CD pipelines using Jenkins, GitLab CI, and Bazel remote caching, accelerating build times by 50% and improving release cadence while maintaining strict quality gates through unit and integration testing.
Provided business-hours Go ecosystem support, acting as a subject-matter expert for runtime upgrades, tooling failures, and dependency conflicts, resolving critical issues with a 99.9% SLA compliance rate.
Integrated message-driven architectures using Kafka and internal event buses, ensuring Go services handled high-volume asynchronous workloads with backpressure control and fault tolerance, supporting millions of daily transactions.
Senior Engineer II
Rally Health Jun 2019 – Nov 2021
Developed and maintained backend services primarily in Go and C++ for large-scale infrastructure management platforms, focusing on SDK stability, API reliability, and performance optimization within distributed systems handling global traffic.
Upgraded Go runtimes and foundational libraries across shared services, ensuring backward compatibility and minimizing downtime, which reduced post-upgrade incidents by 42% through proactive testing and staged rollouts.
Built internal Go tooling for dependency analysis and version auditing within a monorepo environment, improving visibility into transitive dependencies and eliminating security vulnerabilities 60% faster during audits.
Applied advanced performance tuning techniques, such as CPU affinity, memory pooling, and syscall optimization, leveraging deep knowledge of OS internals to improve throughput by up to 33% under peak load conditions.
Integrated Bazel-based build systems with Go modules and legacy C/C++ libraries, streamlining cross-language builds and reducing build failures by 47% across heterogeneous codebases.
Enhanced observability through structured logging, metrics, and distributed tracing using Prometheus, Grafana, and OpenTelemetry, enabling faster root-cause analysis and reducing MTTR by 29%.
Designed RESTful APIs and internal RPC interfaces with strict backward compatibility guarantees, supporting long-lived enterprise clients and ensuring seamless SDK upgrades.
Implemented robust unit, integration, and load testing strategies using Go testing frameworks, custom harnesses, and CI pipelines, improving overall test coverage to over 85%.
Collaborated with infrastructure teams to deploy and operate services on Kubernetes and private cloud platforms, optimizing resource utilization and improving service resilience.
Mentored junior engineers on Go best practices, concurrency patterns, and performance profiling, contributing to higher code quality and reduced onboarding time across teams.
Software Development Engineer Intern
Huawel Jun 2018 – Sep 2018
Developed full-stack applications with a strong backend focus, leveraging Go and C++ components for performance-critical services while integrating frontend systems to support operational dashboards and internal tools.
Designed and optimized backend APIs and service layers, improving response times by 35% through efficient data access patterns, caching strategies, and concurrent processing models.
Built early internal tooling for debugging and monitoring Go services, laying the groundwork for standardized observability practices adopted by later teams.
Implemented CI/CD pipelines using Jenkins and Git-based workflows, reducing deployment times by 40% and improving overall release stability.
Integrated message queues such as RabbitMQ for asynchronous processing, ensuring reliable data delivery and fault tolerance under variable traffic loads.
Performed performance profiling and memory analysis on backend services, identifying inefficiencies and reducing infrastructure costs through optimized resource usage.
Collaborated closely with network and infrastructure engineers to align application behavior with underlying system constraints and OS-level optimizations.
Technical Skills
Backend: Go (Golang), C#, C, C++, Python, Django, Java, Ruby, JS/TS, GraphQL, REST APIs
Databases: PostgreSQL, MySQL, MongoDB, Redis, Elasticsearch, SQL
Cloud/DevOps: AWS (EC2, S3, Lambda, ECS, CloudFormation), Azure, Docker, CI/CD, Kubernetes, Docker
AI & Machine Learning: LangChain, OpenAI API, Pandas, FastAPI-based AI Service
Education
Master’s degree in Information Science from the University of Pittsburgh(2017-2019)
Bachelor's degree in Computer Science from the University of Xidian (2013 – 2017)