Realign LLC

Unstructured.io Developer-5

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for an Unstructured.io Developer with a contract length of 6–12 months, offering a competitive pay rate. Remote work is available, requiring 8+ years of IT experience, 3+ years with Unstructured.io, and strong Python skills.
🌎 - Country
United States
💱 - Currency
Unknown
-
💰 - Day rate
Unknown
-
🗓️ - Date
November 8, 2025
🕒 - Duration
More than 6 months
-
🏝️ - Location
Remote
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
📍 - Location detailed
Massachusetts
-
🧠 - Skills detailed
#GIT #HTML (Hypertext Markup Language) #SQL (Structured Query Language) #Azure #Docker #Data Processing #Python #API (Application Programming Interface) #Scala #Langchain #Microservices #AI (Artificial Intelligence) #Databases #Indexing #ML (Machine Learning) #Cloud #AWS (Amazon Web Services) #NLP (Natural Language Processing) #Data Ingestion #Data Engineering #"ETL (Extract #Transform #Load)"
Role description
Job Type: Contract Job Category: IT Hiring: Unstructured.io Developer Location: Remote (Boston, MA) Contract: 6–12 Months (Extendable) Job Summary: We are seeking an experienced Unstructured.io Developer to work on enterprise-grade data ingestion and document processing solutions. The ideal candidate will have strong hands-on experience with Unstructured.io framework, data transformation pipelines, and integration with LLM / Vector DB / Search platforms. In this role, you will develop and optimize workflows for parsing, cleaning, and indexing complex enterprise documents. Key Responsibilities Develop and enhance data processing pipelines using Unstructured.io for converting unstructured data (PDF, DOCX, HTML, Emails, Scans) into structured formats. Integrate extracted data with Vector Databases or Search Indexing workflows for LLM/RAG applications. Optimize parsing performance, accuracy, and consistency across various document formats. Work with Python-based microservices, APIs, and orchestration frameworks. Collaborate with Data Engineering, ML, and Product teams to design scalable ingestion architectures. Implement best practices for scalable, reusable pipeline components. Monitor, debug, and resolve pipeline issues across staging and production environments. Required Skills & Experience Overall IT Experience: 8+ Years 3+ years hands-on experience implementing Unstructured.io in production environments. Strong experience with Python, including parsing, data transformation, and API development. Experience building RAG (Retrieval-Augmented Generation) or Document AI workflows. Hands-on with Vector Databases (Pinecone, Weaviate, Chroma, FAISS, Milvus, etc.). Familiarity with Cloud Platforms (AWS preferred). Experience with Docker, Git, CI/CD pipelines. Nice to Have Experience with frameworks like LangChain / LlamaIndex. Knowledge of NLP, embeddings, and tokenization. Experience integrating with LLM providers (OpenAI, Anthropic, Azure OpenAI, etc.). Familiarity with document OCR tools (Tesseract, Azure Form Recognizer, AWS Textract). Required Skills CLOUD DEVELOPER SQL APPLICATION DEVELOPER