

WorkTrust Solutions
Data Scientist with Gen AI Experience
⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Data Scientist with Generative AI experience on a 6-month remote contract, offering a competitive pay rate. Candidates need 6+ years in Data Science, 2+ years in Generative AI, and insurance domain expertise.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
October 14, 2025
🕒 - Duration
More than 6 months
-
🏝️ - Location
Remote
-
📄 - Contract
Fixed Term
-
🔒 - Security
Unknown
-
📍 - Location detailed
United States
-
🧠 - Skills detailed
#MLflow #Libraries #Data Modeling #Storage #Data Pipeline #Classification #Scala #PyTorch #ML (Machine Learning) #Data Science #Documentation #Langchain #NLP (Natural Language Processing) #AI (Artificial Intelligence) #Model Evaluation #Data Storage #Data Engineering #Datasets #NER (Named-Entity Recognition) #SpaCy #Python #Automation #Hugging Face #TensorFlow #Databases #"ETL (Extract #Transform #Load)"
Role description
Data Scientist With Gen AI - 6 month contract - remote
• We are seeking an experienced Data Scientist with deep expertise in Generative AI and a strong background in the Insurance domain to join our advanced analytics team. The ideal candidate will focus on building and optimizing classification models and information extraction systems, enabling automation and intelligence across insurance workflows such as claims processing, underwriting, and document analysis.
• You will collaborate closely with data engineers responsible for orchestration, data pipelines, and storage infrastructure, allowing you to focus on model innovation, experimentation, and accuracy.
Key Responsibilities:
• Design, build, and deploy machine learning classifiers and information extraction models tailored for insurance use cases (e.g., claims triage, policy text extraction, risk categorization).
• Develop and fine-tune Generative AI and LLM-based models (e.g., using LangChain, LlamaIndex, or custom embeddings) to extract insights from unstructured data such as claim notes, policy documents, and correspondence.
• Collaborate with ML engineers to integrate models into production workflows while maintaining scalability and reliability.
• Evaluate and improve model precision, recall, and interpretability using rigorous statistical and ML methodologies.
• Partner with domain experts to identify patterns, define taxonomies, and establish ground truth datasets for supervised and semi-supervised training.
• Build and maintain evaluation frameworks for model performance tracking and continuous improvement.
• Work alongside orchestration and data storage teams to ensure clean, accessible data pipelines that support rapid experimentation.
• Stay current on advancements in GenAI, document understanding, and insurance analytics to guide internal innovation.
Required Skills & Experience:
• 6+ years of professional experience in Data Science / Machine Learning, with 2+ years in Generative AI / NLP / LLM projects.
• Strong experience developing classification and entity extraction models (NER, text classification, OCR-based extraction).
• Proven track record working with insurance datasets — claims, underwriting, or policy text.
• Proficiency in Python and ML/NLP libraries such as PyTorch, TensorFlow, Hugging Face, spaCy, Scikit-learn.
• Familiarity with vector databases (Pinecone, FAISS, Chroma) and LLM frameworks (LangChain, LlamaIndex, RAG).
• Solid grasp of data modeling and feature engineering for structured and unstructured data.
• Experience with model evaluation, MLOps collaboration, and versioning tools (MLflow, DVC, etc.).
• Excellent communication and documentation skills, especially translating model results into business impact for insurance stakeholders.
Data Scientist With Gen AI - 6 month contract - remote
• We are seeking an experienced Data Scientist with deep expertise in Generative AI and a strong background in the Insurance domain to join our advanced analytics team. The ideal candidate will focus on building and optimizing classification models and information extraction systems, enabling automation and intelligence across insurance workflows such as claims processing, underwriting, and document analysis.
• You will collaborate closely with data engineers responsible for orchestration, data pipelines, and storage infrastructure, allowing you to focus on model innovation, experimentation, and accuracy.
Key Responsibilities:
• Design, build, and deploy machine learning classifiers and information extraction models tailored for insurance use cases (e.g., claims triage, policy text extraction, risk categorization).
• Develop and fine-tune Generative AI and LLM-based models (e.g., using LangChain, LlamaIndex, or custom embeddings) to extract insights from unstructured data such as claim notes, policy documents, and correspondence.
• Collaborate with ML engineers to integrate models into production workflows while maintaining scalability and reliability.
• Evaluate and improve model precision, recall, and interpretability using rigorous statistical and ML methodologies.
• Partner with domain experts to identify patterns, define taxonomies, and establish ground truth datasets for supervised and semi-supervised training.
• Build and maintain evaluation frameworks for model performance tracking and continuous improvement.
• Work alongside orchestration and data storage teams to ensure clean, accessible data pipelines that support rapid experimentation.
• Stay current on advancements in GenAI, document understanding, and insurance analytics to guide internal innovation.
Required Skills & Experience:
• 6+ years of professional experience in Data Science / Machine Learning, with 2+ years in Generative AI / NLP / LLM projects.
• Strong experience developing classification and entity extraction models (NER, text classification, OCR-based extraction).
• Proven track record working with insurance datasets — claims, underwriting, or policy text.
• Proficiency in Python and ML/NLP libraries such as PyTorch, TensorFlow, Hugging Face, spaCy, Scikit-learn.
• Familiarity with vector databases (Pinecone, FAISS, Chroma) and LLM frameworks (LangChain, LlamaIndex, RAG).
• Solid grasp of data modeling and feature engineering for structured and unstructured data.
• Experience with model evaluation, MLOps collaboration, and versioning tools (MLflow, DVC, etc.).
• Excellent communication and documentation skills, especially translating model results into business impact for insurance stakeholders.