Remote Data Scientist (Python)
Location
🇮🇳 Solapur, India
Type
full_time
Salary
Undisclosed
Posted
1d ago
Job Description
About Firstsource Firstsource Solutions Limited, an RP-Sanjiv Goenka Group company (NSE: FSL, BSE: 532809, Reuters: FISO.BO, Bloomberg: FSOL:IN), is a specialized global business process services partner, providing transformational solutions and services spanning the customer lifecycle across Healthcare, Banking and Financial Services, Communications, Media and Technology, Retail, and other diverse industries. With an established presence in the US, the UK, India, Mexico, Australia, South Africa, and the Philippines, we make it happen for our clients, solving their biggest challenges with hyper-focused, domain-centered teams and cutting-edge tech, data, and analytics. Job Location: Remote We are seeking a skilled AI/ML professional to develop and fine-tune NLP models tailored to the mortgage industry.
The role
involves end-to-end data analysis, model training (including instruction tuning and RLHF), and algorithm optimization. The ideal candidate will collaborate with domain experts, conduct rigorous experimentation, and uphold ethical AI practices to deliver accurate, relevant, and bias-mitigated solutions. Data Analysis and Preprocessing: Analyze and preprocess diverse datasets relevant to the mortgage industry, ensuring data quality and relevance for model training. Research and implement state-of-the-art NLP models, focusing on pre-training as well instruction tuning pre- trained LLMs for mortgage-specific applications. Algorithm Implementation: Develop and optimize machine learning algorithms to enhance model performance, accuracy, and efficiency. Ethics and Bias Mitigation: Ensure responsible AI practices are followed by identifying potential biases in data and models, implementing strategies to mitigate them. Strong background in machine learning, deep learning, and NLP, Proficiency in Python and experience with ML frameworks such as TensorFlow or PyTorch. Experience with NLP frameworks and libraries (e.g., Hugging Face Transformers) for developing language models. Data Handling: Proficiency in handling large datasets, feature engineering, and statistical analysis Strong analytical skills with the ability to solve complex problems using data-driven approaches. Excellent communication skills to effectively collaborate with technical teams and non-technical stakeholders. D. in Data Science, Computer Science, Statistics, or a related field. Cloud Computing: Familiarity with cloud platforms (e.g., AWS, Azure) for scalable computing solutions. Understanding of ethical considerations in AI development, including bias detection and mitigation.