We are looking for an experienced Python Developer with expertise in using TensorFlow/PyTorch, LangChain, OpenAI API, Elasticsearch and a deep understanding of Natural Language Processing to help us develop and optimize high-performance applications. You will be responsible for implementing, testing, and maintaining data pipelines, machine learning models, and NLP techniques to extract valuable insights from data.
Key Responsibilities:
● Design, develop, and maintain Python-based data analysis and machine learning applications with clean and well-documented code
● Develop, optimize and deploy ML models for information retrieval, LLM-based agents, embeddings (FAISS, Pinecone, Weaviate), predictive analytics, and Retrieval-Augmented Generation (RAG)
● Research and implement NLP algorithms for text classification, sentiment analysis, named entity recognition (NER), and topic modeling, including troubleshooting and debugging to ensure reliable performance at scale
● Implement data pipelines and ETL processes for big data processing
● Collaborate with cross-functional teams to understand business requirements and build scalable tech Required
Skills & Qualifications:
● Strong proficiency in Python with hands-on experience in libraries like Pandas, NumPy, scikit-learn, TensorFlow, PyTorch
● Expertise in information retrieval, statistical analysis, data visualization and developing LLM-based agents, embeddings (FAISS, Pinecone, Weaviate), predictive analytics, and Retrieval-Augmented Generation (RAG)
● Hands-on experience with Natural Language Processing (NLP) libraries such asNLTK, spaCy, Hugging Face, or similar tools
● Experience with data wrangling techniques, including cleaning, transforming, and merging data sets from various sources
● Familiarity with machine learning algorithms and frameworks (supervised,
● unsupervised learning, and deep learning techniques)
● Solid understanding of text analytics such as text pre-processing, tokenization, stemming, lemmatization, and part-of-speech tagging
● Experience with cloud platforms (AWS, GCP, or Azure) and containerization (Docker, Kubernetes) is a plus.
● Knowledge of data visualization tools (Matplotlib, Seaborn, ggplot2, Plotly, etc.)
● Strong problem-solving skills and attention to detail with ability to work in an agile, fast-paced environment and deliver results under tight deadlines
Preferred Qualifications:
● 4 year Bachelor’s degree in Computer Science, Information Technology, Data Science, Statistics or related domains, or equivalent qualification
● 4+ years in developing scalable ML models, NLP models and systems from 0 to 1 and deploying them to production
● Strong knowledge of RESTful APIs and GraphQL for frontend-backend communication
● Familiarity with version control using Git, CI/CD tools, and deployment pipelines
● Knowledge of big data tools and platforms (Spark, Hadoop, etc.) and experience with managing databases
Benefits:
● Competitive salary and performance-based incentives
● Flexible working hours and remote work opportunities
● Opportunities for professional growth and skill development
● A collaborative and dynamic work environment
● Health and wellness benefits, including medical, dental, and vision insurance