Sr. Software Engineer (AI Development)

Company: QUICK USA, Inc.
Location: Irvine, CA
Posted: May 15, 2025

Description:

Position

Sr. Software Engineer (AI Development)

Summary

A growing startup IT company is seeking a highly skilled Senior Software Engineer to join their team. The ideal candidate will play a key role in designing, developing, and deploying API services powered by Large Language Models (LLMs) while integrating seamlessly with existing TypeScript-based backend systems.

Essential Duties

Design, develop, and deploy a Python-based API server with LLM integration

Build API endpoints using FastAPI

Implement LLM workflows using LangChain/LangGraph libraries

Develop vector search functionality with Qdrant DB

Implement real-time communication with the frontend using WebSockets

Effectively integrate with the existing TypeScript backend

Optimize LLM functionality for performance and cost

Collaborate with team members to improve the overall system

Working Hours, Working style

Monday - Friday; 8 hours a day

Core hours: 9:30 AM – 2:30 PM (Flexible schedule outside core hours)

Working Location

Irvine, CA

Salary/Benefits

$100K - $140K DOE

Health insurance

Retirement plan (SIMPLE IRA)

Paid time off (PTO) & sick leave

Holidays

Saturdays, Sundays, and major US holidays

Qualifications

Required:

Business-level proficiency in English (spoken and written)

5+ years of professional experience in Python development

Proven experience building and deploying production-level applications using LLMs (e.g., GPT, Claude, Azure OpenAI, or Google Gemini)

Solid understanding of RESTful API design and development

Experience integrating AI features into real-world applications

Experience or interest in the following technologies: FastAPI, LangChain / LangGraph, Qdrant or other vector databases

WebSocket communication

Basic knowledge of TypeScript and React for integration with frontend systems

Preferred/Plus:

Familiarity with prompt engineering, fine-tuning, or retrieval-augmented generation (RAG)

Hands-on experience with OpenAI, Anthropic, or other model APIs in scalable environments

Awareness of AI safety, latency management, and cost optimization strategies
