Hoang Ngo Senior Software Engineer Python, Java & AI Specialist
*****.******@*****.*** Milpitas, CA LinkedIn
SUMMARY
Senior Software Engineer with 10+ years of experience architecting and scaling high-throughput distributed systems. Expert in the Python and Java ecosystems, with a proven track record at Google building backend orchestration for Gemini LLMs and mission-critical fraud engines handling 3k+ QPS. Specialist in high- performance Python (Asyncio, Pydantic, Memory Tuning) and enterprise Java (Guice, Spring Boot, JVM Optimization). Adept at bridging the gap between Generative AI research and production-grade, multi-region cloud architectures.
TECHNICAL SKILLS
• Expert Python (High Performance): Python 3.12+, Asyncio (Event Loop Internals), Multiprocessing, Metaprogramming, Type Hinting (Pydantic/Mypy), Memory Management (__slots__, GC Tuning), Performance Profiling (cProfile/Py-spy).
• Senior Java (Enterprise Scale): Java 17+, JVM Performance Tuning, Multithreading & Concurrency, Guice (Dependency Injection), Spring Boot, Hibernate/JPA, JUnit/Mockito, Maven/Bazel.
• AI & LLM Infrastructure: Inference Serving Optimization (TTFT/TPOT), Dynamic Request Batching, Vector Databases (Pinecone, Milvus), Model Evaluation Frameworks, TPU/GPU Resource Orchestration, LangChain.
• Distributed Systems & Middleware: gRPC & Protocol Buffers, Service Mesh Architecture, Message Brokers (Cloud Pub/Sub, Kafka, RabbitMQ), Distributed Caching (Redis, Memorystore), Event-Sourcing, Rate Limiting (Token Bucket).
• Databases & Storage: Google Spanner (Distributed SQL/External Consistency), PostgreSQL
(Performance Tuning, CTEs, Row-Level Security), Bigtable, BigQuery, Redis Distributed Locks.
• Architecture & SRE: System Design Authority, Multi-region Active-Active Deployment, Canary Analysis
(Mann-Whitney U), SLIs/SLOs/Error Budgets, Infrastructure-as-Code (Terraform), Borg/Kubernetes.
• Compliance & Security: HIPAA & GDPR Implementation, OAuth2/OIDC, JWT Token Rotation, Encryption at Rest/Transit (AES-256, TLS 1.3), Audit Logging & Data Integrity. PROFESSIONAL EXPERIENCE
GOOGLE Software Engineer Sunnyvale, CA
Google Gemini Platform (Generative AI Ecosystem) Apr 2023 - Present
• Architected the Python-based serving infrastructure and orchestration layers for Gemini LLMs, implementing asynchronous streaming with asyncio to reduce Time-to-First-Token (TTFT) by 30%.
• Developed high-reliability Java (gRPC) sidecars to integrate Python-based inference services with core internal authentication, quota management, and audit logging systems.
• Optimized TPU v5e throughput and compute efficiency by 15% through a dynamic request batching layer and multi-tier caching for LLM embeddings and frequent prompt responses.
• Built an automated LLM evaluation framework with model-based grading and A/B testing infrastructure, doubling the experimentation velocity for ML research teams.
• Engineered a fault-tolerant API layer with application-level admission control and structured logging, reducing customer-facing errors by 25% during 10x traffic spikes.
Nov 2020 – Mar 2026
Google Pay Risk and Fraud Platform (Internal Analytics) Nov 2020 – Mar 2023
• Optimized core payment backend logic using Java and Guice, refactoring database access patterns in Spanner to reduce P99 transaction latency from 220ms to 160ms.
• Constructed distributed fraud detection pipelines in Python processing 3k+ QPS with a P99 latency
<160ms, contributing to an 8% reduction in annual fraud losses.
• Designed a versioned, zero-downtime rule engine using Python and manual partition assignment in event streams, enabling the deployment of risk-scoring signals via atomic module swaps.
• Developed immutable event-sourcing infrastructure using memory-efficient Python dataclasses and Protobufs, leveraging Bigtable for state management and BigQuery for tamper-evident logs.
• Automated incident response by building a Python-based canary analysis service using Mann-Whitney U statistical tests to detect latency regressions, reducing MTTR by 40%. BRIGHTINSIGHT Software Architect San Francisco Bay Area Jan 2019 – Feb 2020
• Architected a multi-tenant, HIPAA-compliant backend using Python (FastAPI) and SQLAlchemy, implementing row-level PHI segmentation via PostgreSQL policies to ensure 100% data isolation.
• Engineered modular Infrastructure-as-Code (IaC) with Terraform for AWS ECS (Fargate) and RDS Multi-AZ, automating partner environment provisioning and reducing onboarding time by 25%.
• Designed secure IoT data ingestion pipelines using MQTT over TLS and Python asyncio consumers, handling high-frequency telemetry from connected medical devices with zero data loss.
• Established engineering standards for a Python-centric team by implementing static analysis (mypy, Flake8) and achieving >90% test coverage with pytest. FPT SOFTWARE Software Engineer United States Jun 2014 – Sep 2018
• Developed enterprise Java/Spring Boot applications for global banking clients, implementing secure transaction processing modules and migrating legacy SOAP services to RESTful microservices.
• Implemented high-throughput ETL pipelines using Python (Django) and Celery with Redis, utilizing background workers for asynchronous transaction processing and reporting.
• Optimized PostgreSQL performance for large-scale financial datasets by utilizing Common Table Expressions (CTEs) and complex indexing strategies, reducing nightly reconciliation times.
• Engineered secure authentication flows using the OAuth2 Authorization Code Grant and JWTs, enabling delegated access for external retail partners while maintaining strict audit trails.
• Led the containerization of legacy VM-based services using Docker, creating consistent development environments and streamlining the deployment pipeline via Jenkins CI/CD.
• Developed comprehensive test suites using JUnit/Mockito (Java) and unittest/pytest (Python), utilizing Mock objects to achieve >85% code coverage for mission-critical banking logic. EDUCATION
Bachelor’s degree Computer Science University of Greenwich 2011 – 2014