Post Job Free
Sign in

Senior Software Engineer Backend & AI Infra Expert

Location:
Leander, TX, 78641
Posted:
April 29, 2026

Contact this candidate

Resume:

Hoang Ngo Senior Software Engineer

213-***-**** *****.******@*****.*** Leander, TX LinkedIn

Summary

Senior Software Engineer with over 10 years of experience specializing in backend and AI infrastructure development. Expertise in building scalable, high-performance systems across industries like tech, finance and healthcare. Proficient in Python, Java, Typescript and cloud platforms, with a strong focus on system design, machine learning infrastructure, and distributed systems. Skilled in delivering secure, robust software solutions that drive business success, leveraging modern technologies and development best practices. Technical Skills

• Programming Languages: Python, Go, Java, TypeScript, SQL, NoSQL

• Frontend Development: React, Angular, Vue.js, HTML, CSS, TailwindCSS, Material-UI, Webpack, Responsive Design

• Backend Development: FastAPI, Flask, Django, Golang, Pydantic, Spring Boot, Node.js, Express, NextJS, GraphQL

• AI & LLM Infrastructure: TensorFlow Serving, Triton Inference Server, ONNX Runtime, Kubernetes, Docker, AWS SageMaker, Google AI Platform, Dynamic Request Batching, Databricks, PySpark, Model Pruning, Quantization, Pinecone, Milvus, FAISS, Horovod, Ray, Dask, MLflow, TensorBoard, A/B Testing, Kafka

• Databases: Google Spanner, PostgreSQL (Performance Tuning, Row-Level Security), MySQL, Bigtable, BigQuery, Redis, MongoDB, Amazon S3, Google Cloud Storage, Snowflake, Redshift

• System Design & Architecture: Microservices, Service Mesh, gRPC, REST APIs, Multi-region Active-Active, Canary Analysis, Load Balancing, Caching (Redis, Memcached), Rate Limiting, Horizontal Scaling, Auto-scaling, Fault Tolerance, High Availability

• Cloud & DevOps: Terraform, Kubernetes, Borg, AWS (ECS, RDS), GCP, Azure, Docker, CI/CD, Jenkins, Cloud Computing, Ansible, Prometheus

• Security & Compliance: OAuth2/OIDC, JWT, AES-256 Encryption, HIPAA & GDPR, TLS 1.3, Audit Logging

• Performance Optimization: Memory Management, GC Tuning, Profiling (cProfile/Py-spy), Java JVM Performance Tuning

• Version Control: Git, GitHub, GitLab

• Soft Skills: Mentoring, Team Collaboration, Agile Methodology, Fast-paced Environments, Cross-functional Professional Experience

Google Gemini Platform Software Engineer L5 Sunnyvale, CA Apr 2023 – Mar 2026

• Designed and implemented high-performance Python-based serving infrastructure for Gemini LLMs, optimizing asynchronous streaming with asyncio to improve Time-to-First-Token (TTFT) by 30%.

• Developed and deployed machine learning models with advanced optimization techniques, improving model inference speed by 20% and accuracy for large-scale language model tasks.

• Enhanced model training pipelines, implementing automated error detection, and data augmentation, resulting in a 15% improvement in model performance and training efficiency.

• Optimized TPU v5e throughput and compute efficiency by 15%, utilizing dynamic request batching and multi- tier caching to enhance LLM embeddings and prompt response time.

• Built an automated LLM evaluation framework with A/B testing capabilities, increasing experimentation velocity for ML research teams by 2x, driving faster innovation and deployment.

• Engineered a fault-tolerant API layer with application-level admission control and structured logging, reducing customer-facing errors by 25% during 10x traffic spikes, ensuring high availability and reliability.

• Mentored 4 junior engineers, improving team performance by 30% through hands-on guidance on Python and Java best practices, leading to improved code quality and faster development cycles. Google Pay Risk and Fraud Platform Software Engineer L4 Sunnyvale, CA Nov 2020 – Mar 2023

• Developed a real-time fraud detection dashboard using React, TypeScript, and Material-UI, enabling transaction risk tracking and fraud prevention metrics visualization for improved decision-making.

• Integrated machine learning model outputs into the frontend, delivering predictive fraud insights, which increased the detection accuracy by 18% and improved operational efficiency.

• Applied responsive design with TailwindCSS, optimizing the user interface across devices, improving user experience and engagement by 25%.

• Optimized payment backend logic using Java, Guice, and Spanner, refactoring database access patterns to reduce P99 transaction latency from 220ms to 160ms, improving system responsiveness.

• Built high-performance distributed fraud detection pipelines in Python, processing 3k+ QPS with P99 latency under 160ms, contributing to an 8% reduction in annual fraud losses.

• Designed and implemented a versioned, zero-downtime rule engine using Python and manual partition assignment in event streams, improving deployment reliability and uptime by 99.9%. BrightInsight Software Architect San Francisco Bay Area Jan 2019 – Feb 2020

• Architected a full-stack, HIPAA-compliant platform using Python (FastAPI) and Node.js for secure medical data handling, ensuring 100% data isolation through row-level security in PostgreSQL.

• Designed and developed a secure IoT data ingestion pipeline for telemetry data from medical devices, utilizing Python (asyncio) and encryption (AES-256) to meet healthcare compliance standards.

• Implemented a React-based frontend to securely display real-time health metrics, allowing users to interact with medical data while adhering to strict privacy regulations (HIPAA & GDPR).

• Automated partner environment provisioning using Terraform, reducing onboarding time by 25% while ensuring the scalability and security of cloud infrastructure (AWS).

• Worked in a fast-paced startup environment, where I led the development of a scalable, modular architecture for handling sensitive medical data, contributing to the company’s rapid growth and funding success.

• Built secure and efficient data pipelines and APIs, ensuring high availability and security for medical data used in regulatory-compliant healthcare applications.

FPT Software Software Engineer United States Jun 2014 – Sep 2018

• Developed secure transaction processing modules using Java (Spring Boot) and RESTful microservices, improving system efficiency and scalability.

• Designed and implemented high-throughput ETL pipelines in Python (Django, Celery) with Redis for asynchronous processing, handling large-scale transaction data.

• Optimized PostgreSQL performance with advanced indexing strategies and complex queries, reducing nightly reconciliation time by 30%.

• Led the migration of legacy SOAP services to REST APIs, improving service maintainability and reducing integration complexity for external partners.

Education

Bachelor’s degree Computer Science University of Greenwich 2011 – 2014 Certifications

• Google AI Essentials

• Cloudera Certified Developer for Apache Hadoop (CCDH)

• MongoDB for DBAs



Contact this candidate