Post Job Free
Sign in

Machine Learning Engineer

Company:
The Cypress Group
Location:
Manhattan, NY, 10261
Posted:
June 29, 2025
Apply

Description:

My Client is a fast-moving startup based in NYC building cutting-edge AI-powered solutions to solve real-world problems in the building industry.

Our mission is to harness the latest advances in large language models (LLMs) to build intuitive, human-centric tools.

We believe small teams can make a big impact—and we’re looking for people who thrive in dynamic, high-ownership environments.

Role Overview We’re seeking a Machine Learning Engineer with a strong focus on prompt engineering to help us develop, fine-tune, and deploy LLM-driven applications.

You’ll work closely with our product, engineering, and design teams to iterate quickly on new features, prototypes, and research-backed implementations that bring language models to life in production.

What You’ll DoDesign, test, and optimize prompts for large language models (e.g., GPT-4, Claude, Gemini) across a range of tasks.Fine-tune open-source or proprietary language models using instruction tuning, RLHF, or LoRA-based approaches.Develop robust evaluation pipelines to assess model performance, safety, and reliability.Collaborate with product and UX teams to integrate LLM functionality into real-world user flows.Research and experiment with new prompting techniques (e.g., chain-of-thought, self-reflection, retrieval-augmented generation).Write reusable components and tools to streamline LLM integration and testing.Monitor model behavior in production and iterate on prompts/models based on user feedback and performance data.

What We’re Looking For2–5+ years of experience in machine learning, NLP, or applied AI.Experience working with large language models (OpenAI, Anthropic, Hugging Face, Cohere, etc.).Strong Python skills and familiarity with ML frameworks like PyTorch or TensorFlow.Deep understanding of prompt engineering techniques and LLM behavior.Experience with tools for LLMOps (e.g., LangChain, Weights & Biases, MLflow, etc.) is a plus.Ability to balance research exploration with practical product needs.Excellent communication skills and a collaborative, startup-minded attitude.

Nice to HaveExperience with vector databases (e.g., Pinecone, Weaviate, FAISS).Familiarity with API-based LLM deployments and serverless infrastructure.Published research or blog posts in the NLP/LLM space.Previous startup experience or working on small, agile teams.

What We OfferCompetitive salary and early-stage equityFlexible hybrid schedule with a beautiful workspace in NYCOpportunity to work on cutting-edge AI applications with real user impactA small, ambitious, and collaborative teamGenerous vacation, healthcare, and wellness benefits

Apply