Athenaworks

Senior Applied Scientist

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Senior Applied Scientist with a contract length of "unknown" and a pay rate of "unknown," focused on remote work. Key skills include applied ML/NLP, Python, and experience with FOIA and legal compliance. A Bachelor's or Master's degree is required.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
Unknown
-
πŸ—“οΈ - Date
February 8, 2026
πŸ•’ - Duration
Unknown
-
🏝️ - Location
Remote
-
πŸ“„ - Contract
Unknown
-
πŸ”’ - Security
Unknown
-
πŸ“ - Location detailed
Washington, United States
-
🧠 - Skills detailed
#SpaCy #GDPR (General Data Protection Regulation) #Classification #Normalization #Athena #ML (Machine Learning) #Datasets #Python #Automation #AI (Artificial Intelligence) #Data Extraction #PyTorch #Compliance #Computer Science #NLP (Natural Language Processing) #Leadership #TensorFlow #"ETL (Extract #Transform #Load)" #Metadata #Langchain #Quality Assurance #Data Science
Role description
We deliver technology to some of the best startups and companies in the world through diverse empowered teams of technologists that want to create change in the digital world. We are an organization that was created to be a safe place for all people from all areas of life. A place where teams are gender-balanced and compensated equally. And a place where careers are unrestrained regardless of cultural, educational, or geographical background. We value people with strong technical skills that are collaborative, curious, results-driven, and take ownership. We embrace people that want to be themselves, have daily flexibility, grow, learn, and make a difference wherever the opportunity presents itself. So We Hope This Sounds Like You. Because We Are Always Looking For Exceptional Senior Applied Scientist Engineers To Work In Immersive Client Projects That Will Challenge Your Abilities. This Position Requires Role Summary We are seeking a Senior Applied Scientist to design and deploy AI systems and agentic workflows that safely automate document understanding and decision-making across FOIA, Privacy Act, eDiscovery, and Data Breach workflows. This role focuses on applied AI for high-risk, regulated documents, where systems must make or assist decisions related to review, classification, entity extraction, redaction, and disclosure, while remaining explainable, policy-aware, auditable, and defensible. You will work above core OCR and ingestion pipelines, transforming document content, metadata, layout, policy rules, and reviewer behavior into bounded automation that can be trusted in production. You will help define what decisions can be automated, under what conditions, and with what safeguards. What You’ll Own Automation of Document Decisions Design AI systems that safely automate or assist decisions related to: β€’ Document review, including: β€’ Responsiveness and relevance β€’ Privilege and confidentiality β€’ Sensitivity and disclosure risk β€’ Entity and data extraction, including: β€’ Identifying, normalizing, and consolidating people, organizations, locations, and attributes β€’ Resolving entities across documents and datasets β€’ Redaction decisions, including: β€’ What information requires protection β€’ Where and how it should be applied β€’ Under which legal or policy authority Define Automation Boundaries And Confidence Thresholds, Including β€’ Auto-apply vs. recommend vs. require human review β€’ Document-, page-, and entity-level decision logic β€’ Conditions under which automation must defer to reviewers Establish human-in-the-loop controls so all automated decisions are reviewable, reversible, and auditable. Applied GenAI, Agentic Workflows & Document Intelligence β€’ Build LLM-powered workflows for: β€’ Document and case summarization with citations and traceability β€’ Review assistance for responsiveness, privilege, and confidentiality β€’ Redaction and extraction rationale generation (the β€œwhy” behind decisions) β€’ Design and deploy Retrieval-Augmented Generation (RAG) systems grounded in: β€’ Document text β€’ Metadata and layout signals β€’ Entity models and prior reviewer decisions β€’ Design agentic AI workflows that orchestrate multi-step processes (e.g., ingest β†’ classify β†’ extract β†’ review β†’ redact β†’ QA), with: β€’ Policy-aware reasoning β€’ Confidence gating β€’ Human approval loops β€’ Implement guardrails and grounding to ensure GenAI outputs remain accurate, explainable, and policy-aligned Policy-Aware AI Systems β€’ Translate legal and regulatory requirements into clear, testable AI behavior β€’ Align systems with: β€’ FOIA exemptions and disclosure rules β€’ Privacy Act requirements β€’ eDiscovery standards for responsiveness, privilege, and confidentiality β€’ Data breach impact assessment and notification workflows β€’ Partner with legal, privacy, and compliance stakeholders to validate and refine decision logic Quality Assurance, Trust & Defensibility β€’ Build QA frameworks for automated decisions, including: β€’ Review consistency and conflict detection β€’ Entity completeness and normalization checks β€’ Redaction over- and under-coverage analysis β€’ Monitor model performance, drift, and automation outcomes in production β€’ Ensure all decisions are traceable, explainable, and defensible for audits, litigation, and public scrutiny Cross-Functional Leadership β€’ Collaborate with Product, Engineering, Legal, Privacy, and Operations teams β€’ Influence product direction across Legal, Breach, and Government markets β€’ Own AI outcomes in productionβ€”not just model metrics or prototypes Required Qualifications β€’ Bachelor’s or Master’s degree in Computer Science, AI, Data Science, or a related field β€’ 5+ years of experience building and deploying applied ML / NLP systems in production, particularly for document understanding, classification, or extraction β€’ 2+ years of hands-on experience with large language models or modern GenAI techniques, including prompt design, RAG pipelines, or LLM-assisted workflows β€’ Strong Python proficiency and experience with frameworks such as: β€’ PyTorch or TensorFlow β€’ HuggingFace, spaCy, LangChain (or equivalent) β€’ Experience building explainable, auditable AI systems β€’ Experience designing human-in-the-loop automation β€’ Comfortable working in ambiguous, high-risk environments β€’ U.S. Citizen eligible for Public Trust clearance Preferred Qualifications β€’ Experience with FOIA, Privacy Act, HIPAA, GDPR, or breach response workflows β€’ Background in eDiscovery, legal tech, or regulated government systems β€’ Experience with: β€’ Layout-aware document models β€’ OCR confidence and degradation analysis β€’ Large-scale entity resolution across document collections β€’ Prior technical leadership or mentoring experience β€’ Experience scaling AI systems to millions of documents or pages A happy team makes a huge difference, that's why we provide: β€’ Payment in USD or in your local currency β€’ A truly flexible work schedule β€’ Holiday and performance bonuses β€’ An excellent paid time off policy β€’ 4 free Udemy courses a year β€’ Home exercise & wellness membership β€’ An opportunity for you to help create change in the industry β€’ And more! ATHENAWORKS is an inclusive safe organization that only considers your technical ability, work experience, ability to collaborate, your capacity to grow to the next level of your career, and ability to deliver great work. This means that we also embrace/welcome self-taught people as well! We will NEVER consider any other personal or professional aspects of your life. We hope that you choose to have a conversation with us today and find out what makes us different from any company that you have experienced.