CloudseedAi

Senior Data Scientist / Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Senior Data Scientist/Engineer on a contract basis, focusing on agriculture and water datasets. Requires 5-8 years of experience, proficiency in Python and SQL, and strong data engineering skills. Compensation is equity-based. Expected duration: over 6 months.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
Unknown
-
πŸ—“οΈ - Date
November 6, 2025
πŸ•’ - Duration
More than 6 months
-
🏝️ - Location
Unknown
-
πŸ“„ - Contract
Unknown
-
πŸ”’ - Security
Unknown
-
πŸ“ - Location detailed
United States
-
🧠 - Skills detailed
#Python #Data Management #Scala #Cloud #Data Science #SQL (Structured Query Language) #Strategy #Compliance #AI (Artificial Intelligence) #Web Scraping #Data Engineering #"ETL (Extract #Transform #Load)" #Security #Data Pipeline #Spatial Data #Metadata #Time Series #Leadership #Deep Learning #Version Control #Batch #Transformers #Datasets #AWS (Amazon Web Services) #Statistics #Data Processing #Forecasting #GDPR (General Data Protection Regulation) #Documentation #ML (Machine Learning) #Model Validation
Role description
Company Description Cloudseed Inc is dedicated to solving the problem of global food and water security through the use of Artificial Intelligence (AI) and other emerging technologies. Our mission is to develop tools for consumers and the food industry to accelerate adaptation of land and water practices for a changing climate. Cloudseed Inc is a bootstrapped company. Co-Founders are unpaid during this growth phase. We do this because we think we have a compelling product and we want to help the planet. THIS IS AN EQUITY COMPENSATION ROLE on a Contract Basis. You would be joining the co-founders and the initial founding engineering team of Cloudseed Inc. The position is un-salaried while Cloudseed Inc is raising capital and establishing revenue. Equity is negotiable based on experience and fit. Ideally you are an engineer with extra time minimum 15 hours / week ideally 20 - 40 to devote to a startup in the beginning with the expectation that you'll join full time when the company is funded - equity package will be structured accordingly. We offer an accelerated vesting schedule and a % of company ownership. Overview We’re looking for an experienced Data Scientist to quantify agricultural production risk, water scarcity, and access/affordability across regions and crops. This role combines deep domain expertise in agriculture, hydrology, and risk with advanced data engineering and modeling skills to deliver production-ready insights and tools. Key Responsibilities 1. Quantify Agricultural and Water Risk β€’ Analyze production risk, water scarcity, and affordability across geographies and crops. β€’ Develop early-warning systems, forecasts, and scenario models to anticipate disruptions and inform decisions. 1. Model Development β€’ Build and operationalize yield, price, and risk models using time-series, Bayesian, econometric, and ML/DL methods for forecasts, scenarios, and early-warning signals. β€’ Implement robust model validation, uncertainty quantification, and performance tracking. 1. Data Engineering & Pipelines β€’ Independently collect, ingest, and stage complex datasets (APIs, scrapers, bulk downloads, satellite archives). β€’ Build reproducible ETL/ELT pipelines in Python and SQL, including metadata, QC, lineage, and version control. β€’ Manage large-scale datasets (TB-level) across formats such as COG, GeoTIFF, NetCDF, Zarr, and Parquet. 1. Core Datasets (Required Expertise) β€’ Agriculture: USDA NASS/CDL, farmer-level data, and satellite-derived production metrics. β€’ Water & Climate: NOAA, PRISM, USGS, DWR, SMAP, Sentinel, Landsat. β€’ Soils: SSURGO, STATSGO, SoilGrids, HWSD; derive attributes like AWC, pH, salinity, texture, and drainage. β€’ Insurance & Risk: Loss/trigger frameworks (NDVI/PRF/MPCI), catastrophe curves, basis-risk, and stress tests. 1. Open-Source & Delivery β€’ Develop and deliver APIs, notebooks, and dashboards using open-source tools for enterprise and mobile applications. β€’ Contribute to reproducible, transparent data science through documentation, governance, and MLOps best practices. 1. Client & Strategy Leadership β€’ Lead high-value, bespoke client analyses and translate findings into actionable insights and SLAs. β€’ Define and evolve the organization’s data science vision, standards, and technical roadmap. Required Skills β€’ 5–8+ years delivering production data science with agriculture + water datasets (required) and soil datasets (required). β€’ Engineering skill set: Independently source, ingest, clean, and stage multi-TB datasets. Comfortable with COG/GeoTIFF/NetCDF/Zarr/Parquet, APIs, web scraping, and batch/distributed processing. Required Skills β€’ Experience: 5–8+ years in production-grade data science working with agriculture, water, and soil datasets. β€’ Data Engineering: Ability to source, ingest, clean, and stage multi-terabyte datasets independently.APIs, web scraping, and batch/distributed processing. β€’ Technical Stack: β€’ Python (required): Production grade scalable data pipelines, ML model training at scale, and analytics workflows. β€’ SQL & Postgres/PostGIS: Advanced spatial queries and data management. β€’ Proficiency with APIs, web scraping, distributed/batch processing, and geospatial data formats. β€’ Cloud-Optimized Geospatial Data Science / Engineering: Comfortable with COG/GeoTIFF/NetCDF/Zarr/Parquet, β€’ AWS Cloud Infrastructure knowledge. β€’ Analytics & AI/ML Techniques: β€’ Time Series Analysis and Forecasting β€’ Spatio-temporal Modeling and Geostatistics β€’ ML-based Data Imputation β€’ Classical & Modern ML Methods (Random Forests, XGBoost, Gradient Boosting, SVMs) β€’ Deep Learning Architectures (CNNs, RNNs, LSTMs, Transformers for geospatial/time-series data) β€’ Bayesian Inference and Probabilistic Modeling β€’ Global Data Operations: Operationalize global datasets; Handle CRS/ellipsoids, multilingual data, time zones, units, currencies, and privacy compliance (GDPR), and global refresh/versioning cadence. β€’ Communication: Excellent ability to translate technical insights for executives, engineers, and non-technical audiences. Preferred Qualifications β€’ Bayesian hierarchies, causal inference. β€’ Remote sensing (Landsat/Sentinel/VIIRS), feature engineering from COG/Zarr. β€’ Crop models (APSIM/DSSAT) and climate scenarios (CMIP6/SSPs). β€’ Insurance analytics (rating, loss costing, reinsurance) What You'll Learn β€’ Agricultural domain expertise and crop modeling algorithms β€’ Large-scale geospatial data processing techniques β€’ AI/ML applications in agriculture and conversational interfaces β€’ Production scientific computing and optimization strategies β€’ Startup culture and growth strategy and emphasis navigating an environment of pivots with an emphasis on cash flow and technical leadership opportunities Level: Senior (4-7 years experience) - High impact role with growth potential as we scale our platform and team. Cloudseed is an Equal Opportunity employer committed to diversity in the workplace. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, age, national origin, disability, protected veteran status, gender identity or any other factor protected by applicable federal state or local laws. Cloudseed is also committed to working with and providing reasonable accommodations to individuals with disabilities. Privacy is a top priority for Cloudseed. We build it into our products and view it as an essential part of our business.