
AI Python Developer – (OCR, NLP, Tokenization)
⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for an AI Python Developer with 5+ years in AI development, focusing on OCR, NLP, and tokenization. It is a remote contract position with a pay rate of “$X/hour”, requiring experience in Python and structured JSON output; a background in healthcare AI is preferred.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
-
🗓️ - Date discovered
June 9, 2025
🕒 - Project duration
Unknown
-
🏝️ - Location type
Remote
-
📄 - Contract type
Unknown
-
🔒 - Security clearance
Unknown
-
📍 - Location detailed
United States
-
🧠 - Skills detailed
#Monitoring #NLP (Natural Language Processing) #Python #Hugging Face #Libraries #ETL (Extract, Transform, Load) #PHP #SpaCy #TensorFlow #OpenCV (Open Source Computer Vision Library) #JSON (JavaScript Object Notation) #NER (Named-Entity Recognition) #API (Application Programming Interface) #ML (Machine Learning) #Compliance #AI (Artificial Intelligence)
Role description
AI Python Developer – (OCR, NLP, Tokenization)
Company: Care Monitoring System
Location: Remote (USA-based)
Employment Type: Contract (with potential to extend)
Experience Level: Mid to Senior Level (5+ years in AI development)
About the Role
We are seeking an experienced AI Python Developer to help finalize an existing AI module designed to auto-populate medication data from various pharmacy PDF formats into our HIPAA-compliant eMAR system.
You’ll take over a partially completed system and enhance its accuracy and performance using OCR, NLP, tokenization, and structured JSON output.
Responsibilities
· Review and improve the current AI model and codebase.
· Extract medication data from various types of PDFs (scanned, typed, structured, unstructured).
· Apply OCR tools (e.g., Tesseract, Google Document AI).
· Use tokenization and NLP techniques to parse instructions, frequency, and medication names.
· Convert parsed data into a clean JSON structure for integration into our Laravel-based eMAR system.
· Collaborate with backend developers for smooth API/data flow.
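As a rough illustration of the parse-to-JSON step in the responsibilities above, here is a minimal sketch in Python using only the standard library. It assumes the text has already been extracted by an OCR tool and that each medication sits on one line shaped like "name, strength, unit, instructions". The function name `parse_medication_line`, the JSON field names, and the small frequency vocabulary are all hypothetical and do not reflect the existing module or the eMAR system's actual schema.

```python
import json
import re

# Hypothetical frequency vocabulary: phrase -> doses per day.
# A production system would need a much fuller list (or an NLP model).
FREQUENCIES = {
    "once daily": 1,
    "twice daily": 2,
    "three times daily": 3,
}

# Assumed line shape: "<name> <strength> <unit> <free-text instructions>".
LINE_PATTERN = re.compile(
    r"^(?P<name>[A-Za-z][A-Za-z\- ]*?)\s+"
    r"(?P<strength>\d+(?:\.\d+)?)\s*"
    r"(?P<unit>mg|mcg|g|ml)\b"
    r"(?P<rest>.*)$",
    re.IGNORECASE,
)


def parse_medication_line(line):
    """Return a dict for one OCR'd medication line, or None if it does not match."""
    match = LINE_PATTERN.match(line.strip())
    if match is None:
        return None
    rest = match.group("rest").lower()
    # Pick the first known frequency phrase found in the instructions, if any.
    frequency = next((phrase for phrase in FREQUENCIES if phrase in rest), None)
    return {
        "medication_name": match.group("name").strip(),
        "strength": float(match.group("strength")),
        "unit": match.group("unit").lower(),
        "frequency": frequency,
        "doses_per_day": FREQUENCIES.get(frequency),
    }


if __name__ == "__main__":
    sample = "Lisinopril 10 mg tablet, take 1 tablet by mouth twice daily"
    print(json.dumps(parse_medication_line(sample), indent=2))
```

Returning `None` for lines that do not match, rather than guessing, keeps uncertain extractions out of the JSON output so they can be flagged for review instead.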
Requirements
· 5+ years of experience in AI/ML development using Python.
· Proficient in Python with expertise in OCR and NLP.
· Familiar with libraries such as spaCy, PyPDF2, Hugging Face, TensorFlow, OpenCV, and Tesseract.
· Solid understanding of tokenization, named entity recognition (NER), and NLP pipelines.
· Ability to produce structured JSON outputs.
· Experience working with or integrating into Laravel/PHP systems is a plus.
· Ability to work independently and deliver production-quality results.
Preferred
· Background in healthcare AI or eMAR systems.
· Understanding of HIPAA compliance and secure data handling.