Data Scientist

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Senior/Principal Applied Data Scientist on a 3-month full-time remote contract. Key skills include Python, unstructured data extraction, and MLOps. Experience in healthcare data workflows and cloud platforms is preferred.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
-
πŸ—“οΈ - Date discovered
August 8, 2025
πŸ•’ - Project duration
3 to 6 months
-
🏝️ - Location type
Remote
-
πŸ“„ - Contract type
Fixed Term
-
πŸ”’ - Security clearance
Unknown
-
πŸ“ - Location detailed
United States
-
🧠 - Skills detailed
#Automation #"ETL (Extract #Transform #Load)" #Programming #Classification #Cloud #Base #Python #Lean #Compliance #Kubernetes #GCP (Google Cloud Platform) #Azure #ML (Machine Learning) #Langchain #Docker #Data Engineering #Deployment #Data Extraction #Monitoring #Data Science #pydantic #AI (Artificial Intelligence) #AWS (Amazon Web Services) #Data Processing #Containers #CRM (Customer Relationship Management)
Role description
Senior/Principal Applied Data Scientist Join a fast-paced, seed-stage AI startup backed by top-tier investors, working at the cutting edge of intelligent automation. The company is seeking an Applied Data Scientist to support a key client engagement and drive forward core technical initiatives. This role offers exposure to advanced agentic workflows, unstructured data processing, and the opportunity to shape real-world AI deployments. This is a 3-month full-time remote contract role, with EST working hours. What you’ll do: β€’ Data science and solutions engineering to build and design agentic workflows β€’ Solve novel, complex data science problems in document classification and structured data extraction β€’ Write code to extract structured information from unstructured patient records β€’ Leverage deep technical expertise to support customers and troubleshoot complex issues β€’ Develop and expand a robust customer knowledge base to answer common questions, significantly reducing onboarding time for new customers. β€’ Work with the product team to prioritize the roadmap based on customer feedback β€’ Communicate findings to stakeholders in a clear and concise way β€’ Ensure privacy and Compliance standards (HIPAA, PHI/PII) are upheld throughout the ML lifecycle About you: β€’ Background in data science, agentic frameworks, machine learning, and engineering β€’ Strong programming skills in Python β€’ Deep experience extracting structured data from unstructured PDFs using frameworks like: DSPy, Pydantic, Trustcall, LangGraph, LangChain, LlamaIndex, and Docling β€’ Deep understanding of LLM models β€’ Work closely with subject matter experts to create and refine evaluations sets β€’ Experience working with both technical and non-technical stakeholders β€’ Excellent communication, collaboration, and customer relationship management skills β€’ Strong grasp of the full MLOps lifecycle, including data engineering, training/evaluation, serving, and monitoring β€’ Proficiency in creating Docker containers Nice to have: β€’ Experience with cloud platforms (AWS, GCP, Azure), Kubernetes β€’ Background in insurance or healthcare data workflows β€’ Familiarity with models from Bedrock and Anthropic β€’ Prior startup or lean team experience