Data Scientist - Python/ SQL- Fully Remote
Location
🇮🇳 Kanpur, India
Type
full_time
Salary
Undisclosed
Posted
2d ago
Job Description
Data Scientist - Gen AI, NLP, Databricks India, Remote Growth through diversity, equity, and inclusion. As an ethical business, we do what is right — including ensuring equal opportunities and fostering a safe, respectful workplace for each of us. We believe diversity fuels both personal and business growth. We're committed to building an inclusive community where all our people thrive regardless of their backgrounds, identities, or other personal characteristics. Tasks/
Responsibilities
: Implementing end-to-end GenAI powered RAG & multi-agent systems: Providing guidance on system architecture & components Building TTD system components: LLM Agents LLM Agents setup & prompt engineering / prompting strategies Routing State management Vector stores and more generally context providing Rerankers Validation and assessment rules Supporting project delivery: Running end-to-end initiative (Business understanding, Data understanding/preparation, Modeling, Evaluation and Deployment) Analyzing and interpreting the findings Delivering high quality technical solution to the customer Drawing conclusions and recommendations- including expected
benefits
What We're Looking For
: Must Have: Familiarity with theory behind various deep learning concepts Experience with Machine Learning (ML), especially in the area of Generative AI (LLM/LMM) with focus on Natural Language Processing (NLP) or multimodal models. Experience with business
requirements
gathering, transforming them into technical plan, data processing, feature engineering, models evaluation, hypothesis testing and model deployment Fluency in Python and object programing, working knowledge of SQL and vector database Solid experience with Databricks is a must Knowledge of specific Deep Learning and GenAI libraries like: NumPy, PyTorch, HuggingFace, LangChain, LangGraph and GenAI APIs i.e. OpenAI/Gemini/Claude