

Principal Data Scientist
β - Featured Role | Apply direct with Data Freelance Hub
This role is a fully remote Principal Data Scientist position with a contract length of "unknown" and a pay rate of "unknown." Candidates must have 8+ years of experience in predictive modeling within healthcare, strong skills in Python or R, and expertise in disease onset modeling.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
-
ποΈ - Date discovered
July 31, 2025
π - Project duration
Unknown
-
ποΈ - Location type
Remote
-
π - Contract type
W2 Contractor
-
π - Security clearance
Unknown
-
π - Location detailed
New York City Metropolitan Area
-
π§ - Skills detailed
#Deep Learning #Predictive Modeling #Programming #R #"ETL (Extract #Transform #Load)" #Leadership #Statistics #TensorFlow #Deployment #PyTorch #Data Science #Libraries #ML (Machine Learning) #Computer Science #Datasets #Python #AI (Artificial Intelligence) #BERT
Role description
Heading 1
Heading 2
Heading 3
Heading 4
Heading 5
Heading 6
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.
Block quote
Ordered list
- Item 1
- Item 2
- Item 3
Unordered list
- Item A
- Item B
- Item C
Bold text
Emphasis
Superscript
Subscript
Fully REMOTE Contract
MUST work EST hours
W2 ONLY - not able to sponsor at this time
MUST have experience with Disease onset or prognostic modeling
We are seeking an experienced and visionary Principal Data Scientist to lead our efforts in developing advanced predictive models and AI solutions for healthcare. The ideal candidate will possess a deep understanding of machine learning methodologies, a proven track record of delivering impactful data-driven solutions in a real-world setting, and the ability to drive innovation across diverse therapeutic areas.
Main responsibilities.
β’ Lead the design, development, and deployment of cutting-edge predictive models using various machine learning and AI techniques, including tree-based models (e.g., XGBoost) and transformer-based architectures (e.g., BERT), for early disease detection and proactive interventions.
β’ Drive the strategic direction of data science initiatives across multiple therapy areas, identifying opportunities to leverage real-world data (e.g., open claims data, EHR) for improved patient outcomes and drug development, including the use of federated analytics and federatML.
β’ Provide technical leadership and mentorship to a team of data scientists, fostering a culture of innovation, rigorous experimentation, and best practices in MLOps.
β’ Evaluate and select appropriate modeling techniques and performance metrics (e.g., Precision, Recall, Bayes factor, NNT) based on specific problem statements and business objectives.
β’ Collaborate closely with cross-functional teams including business owners, payers, clinicians, epidemiologists, statisticians, and IT to translate complex business problems into tractable data science solutions for deployment in real world.
β’ Stay abreast of the latest advancements in machine learning, deep learning, and AI, and proactively integrate novel approaches into our predictive modeling capabilities.
β’ Communicate complex analytical findings and their implications clearly and concisely to both technical and non-technical audiences.
About you
β’ PhD or Master's degree in a quantitative field (e.g., Computer Science, Statistics, Biomedical Informatics, Engineering, Physics).
β’ 8+ years of progressive experience in data science, with a significant portion focused on predictive modeling and advanced analytics in healthcare or life sciences.
β’ Demonstrated expertise in machine learning algorithms and deep learning architectures, including strong practical experience with transformer models (e.g., BERT).
β’ Proficiency in programming languages such as Python or R, and experience with relevant data science libraries (e.g., scikit-learn, TensorFlow, PyTorch, XGBoost).
β’ Experience working with large-scale, real-world healthcare datasets such as claims data, electronic health records (EHR), or clinical trial data.
β’ Strong understanding of statistical concepts and experimental design.
β’ Proven ability to lead complex data science projects from conception to deployment, with a focus on delivering measurable business impact.
β’ Excellent communication, interpersonal, and leadership skills, with the ability to influence and collaborate effectively across all levels of the organization.