

Athenaworks
Senior Applied Scientist
Featured Role | Apply direct with Data Freelance Hub
This role is for a remote Senior Applied Scientist; the contract length and pay rate are not specified. Key skills include applied ML/NLP, Python, and experience with FOIA and legal compliance. A Bachelor's or Master's degree is required.
Country: United States
Currency: $ USD
Day rate: Unknown
Date: February 8, 2026
Duration: Unknown
Location: Remote
Contract: Unknown
Security: Unknown
Location detailed: Washington, United States
Skills detailed: #SpaCy #GDPR (General Data Protection Regulation) #Classification #Normalization #Athena #ML (Machine Learning) #Datasets #Python #Automation #AI (Artificial Intelligence) #Data Extraction #PyTorch #Compliance #Computer Science #NLP (Natural Language Processing) #Leadership #TensorFlow #ETL (Extract, Transform, Load) #Metadata #Langchain #Quality Assurance #Data Science
Role description
We deliver technology to some of the best startups and companies in the world through diverse, empowered teams of technologists who want to create change in the digital world.
We are an organization that was created to be a safe place for all people from all areas of life. A place where teams are gender-balanced and compensated equally. And a place where careers are unrestrained regardless of cultural, educational, or geographical background.
We value people with strong technical skills who are collaborative, curious, and results-driven, and who take ownership. We embrace people who want to be themselves, have daily flexibility, grow, learn, and make a difference wherever the opportunity presents itself.
So we hope this sounds like you, because we are always looking for exceptional Senior Applied Scientists to work on immersive client projects that will challenge your abilities. This position requires:
Role Summary
We are seeking a Senior Applied Scientist to design and deploy AI systems and agentic workflows that safely automate document understanding and decision-making across FOIA, Privacy Act, eDiscovery, and Data Breach workflows.
This role focuses on applied AI for high-risk, regulated documents, where systems must make or assist decisions related to review, classification, entity extraction, redaction, and disclosure, while remaining explainable, policy-aware, auditable, and defensible. You will work above core OCR and ingestion pipelines, transforming document content, metadata, layout, policy rules, and reviewer behavior into bounded automation that can be trusted in production. You will help define what decisions can be automated, under what conditions, and with what safeguards.
What You'll Own
Automation of Document Decisions
Design AI systems that safely automate or assist decisions related to:
• Document review, including:
    • Responsiveness and relevance
    • Privilege and confidentiality
    • Sensitivity and disclosure risk
• Entity and data extraction, including:
    • Identifying, normalizing, and consolidating people, organizations, locations, and attributes
    • Resolving entities across documents and datasets
• Redaction decisions, including:
    • What information requires protection
    • Where and how it should be applied
    • Under which legal or policy authority
Define automation boundaries and confidence thresholds, including:
• Auto-apply vs. recommend vs. require human review
• Document-, page-, and entity-level decision logic
• Conditions under which automation must defer to reviewers
Establish human-in-the-loop controls so all automated decisions are reviewable, reversible, and auditable.
Applied GenAI, Agentic Workflows & Document Intelligence
• Build LLM-powered workflows for:
    • Document and case summarization with citations and traceability
    • Review assistance for responsiveness, privilege, and confidentiality
    • Redaction and extraction rationale generation (the "why" behind decisions)
• Design and deploy Retrieval-Augmented Generation (RAG) systems grounded in:
    • Document text
    • Metadata and layout signals
    • Entity models and prior reviewer decisions
• Design agentic AI workflows that orchestrate multi-step processes (e.g., ingest → classify → extract → review → redact → QA), with:
    • Policy-aware reasoning
    • Confidence gating
    • Human approval loops
• Implement guardrails and grounding to ensure GenAI outputs remain accurate, explainable, and policy-aligned
Policy-Aware AI Systems
• Translate legal and regulatory requirements into clear, testable AI behavior
• Align systems with:
    • FOIA exemptions and disclosure rules
    • Privacy Act requirements
    • eDiscovery standards for responsiveness, privilege, and confidentiality
    • Data breach impact assessment and notification workflows
• Partner with legal, privacy, and compliance stakeholders to validate and refine decision logic
Quality Assurance, Trust & Defensibility
• Build QA frameworks for automated decisions, including:
    • Review consistency and conflict detection
    • Entity completeness and normalization checks
    • Redaction over- and under-coverage analysis
• Monitor model performance, drift, and automation outcomes in production
• Ensure all decisions are traceable, explainable, and defensible for audits, litigation, and public scrutiny
Cross-Functional Leadership
• Collaborate with Product, Engineering, Legal, Privacy, and Operations teams
• Influence product direction across Legal, Breach, and Government markets
• Own AI outcomes in production, not just model metrics or prototypes
Required Qualifications
• Bachelor's or Master's degree in Computer Science, AI, Data Science, or a related field
• 5+ years of experience building and deploying applied ML / NLP systems in production, particularly for document understanding, classification, or extraction
• 2+ years of hands-on experience with large language models or modern GenAI techniques, including prompt design, RAG pipelines, or LLM-assisted workflows
• Strong Python proficiency and experience with frameworks such as:
    • PyTorch or TensorFlow
    • HuggingFace, spaCy, LangChain (or equivalent)
• Experience building explainable, auditable AI systems
• Experience designing human-in-the-loop automation
• Comfortable working in ambiguous, high-risk environments
• U.S. Citizen eligible for Public Trust clearance
Preferred Qualifications
• Experience with FOIA, Privacy Act, HIPAA, GDPR, or breach response workflows
• Background in eDiscovery, legal tech, or regulated government systems
• Experience with:
    • Layout-aware document models
    • OCR confidence and degradation analysis
    • Large-scale entity resolution across document collections
• Prior technical leadership or mentoring experience
• Experience scaling AI systems to millions of documents or pages
A happy team makes a huge difference. That's why we provide:
• Payment in USD or in your local currency
• A truly flexible work schedule
• Holiday and performance bonuses
• An excellent paid time off policy
• 4 free Udemy courses a year
• Home exercise & wellness membership
• An opportunity for you to help create change in the industry
• And more!
ATHENAWORKS is an inclusive, safe organization that considers only your technical ability, work experience, ability to collaborate, capacity to grow to the next level of your career, and ability to deliver great work. This means that we also welcome self-taught people! We will NEVER consider any other personal or professional aspects of your life. We hope that you choose to have a conversation with us today and find out what makes us different from any company that you have experienced.