Sr. Data Engineer (Azure, PostgreSQL & OCR Pipelines) - REMOTE

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Sr. Data Engineer specializing in Azure, PostgreSQL, and OCR pipelines, offering a 3-month remote contract at $57/hr. Key skills required include ETL/ELT experience, Python proficiency, and familiarity with procurement systems like SAP Ariba.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
456
-
πŸ—“οΈ - Date discovered
September 27, 2025
πŸ•’ - Project duration
3 to 6 months
-
🏝️ - Location type
Remote
-
πŸ“„ - Contract type
W2 Contractor
-
πŸ”’ - Security clearance
Unknown
-
πŸ“ - Location detailed
Illinois, United States
-
🧠 - Skills detailed
#AI (Artificial Intelligence) #"ETL (Extract #Transform #Load)" #ADF (Azure Data Factory) #Indexing #Agile #Cloud #Storage #Scala #Data Pipeline #Scripting #SAP #Data Extraction #Azure #Databases #PostgreSQL #Python #Azure Data Factory #Security #Automation #Data Engineering #Documentation #Database Schema #ML (Machine Learning) #API (Application Programming Interface) #Schema Design #GitHub #REST (Representational State Transfer) #Logging #Monitoring #Metadata
Role description
Please send me your updated resume to midhun.t@thefountaingroup.com Pay range starts at $57/hr - Fully REMOTE! Position: Cloud Data Engineer (Azure, PostgreSQL & OCR Pipelines) Industry : Pharmaceutical (ABOJP00042373) 3-Month contract to start! This is a W2 Contract, with a possibility of extension or conversion beyond the original duration based on performance & budget at that time. What to look for: Data Engineer / ETL background using Azure data engineering: Azure Data Factory, Blob Storage, CI/CD basics; building scalable ETL/ELT with retries, logging, monitoring. Relational design & performance: Azure PostgreSQL schema design (contracts, OCR outputs, spend), indexing/partitioning, query tuning. β€’ β€’ β€’ OCR pipeline is the end-to-end workflow that takes raw supplier contract files (PDFs/scans) and turns them into structured, validated data your AI and analytics can use Python automation: pipeline orchestration, API clients, data validation, error handling. Familiarity with procurement systems such as SAP Ariba and its integration helpful. Key Deliverables β€’ β€’ Ingestion Pipeline: Build and deploy a robust ETL/ELT pipeline using Azure to ingest 50,000+ contracts. β€’ β€’ Metadata Extraction: Configure and run OCR workflows (e.g., OlmOCR/Azure Document Intelligence) to extract key contract fields such as dates, parties, terms etc. β€’ β€’ Scalable Database Schema: Design and implement a schema in Azure PostgreSQL to store contract metadata, OCR outputs, and supplier spend data. Collaborate with the Software Developer to design a future-ready schema for AI consumption. Required Skills & Experience β€’ Data Engineering & ETL/ELT β€’ β€’ Experience with Azure PostgreSQL or similar relational databases β€’ β€’ Skilled in building scalable ETL/ELT pipelines (preferably using Azure) β€’ β€’ Proficient in Python for scripting and automation β€’ OCR Collaboration β€’ β€’ Ability to work with internal Machine Learning Engineering teams to validate and structure extracted data β€’ β€’ Familiarity with OCR tools (e.g., Azure Document Intelligence, Tesseract) is a plus β€’ SAP Ariba Integration β€’ β€’ Exposure to cXML, ARBCI, SOAP/REST protocols is a plus β€’ β€’ Comfortable with API authentication (OAuth, tokens) and enterprise-grade security β€’ Agile Collaboration & Documentation β€’ β€’ Comfortable working in sprints and cross-functional teams β€’ β€’ Able to use Github Copilot to document practices for handover β€’ Preferred Qualifications β€’ β€’ Experience with large-scale contract ingestion projects β€’ β€’ Familiarity with procurement systems and contract lifecycle management β€’ β€’ Background in integrating data pipelines with AI or analytics platforms β€’ Why Join Us? β€’ β€’ Focused Scope with Future Impact: Client the foundation for an AI-driven negotiation platform β€’ β€’ Cutting-Edge Tools: Work with SAP Ariba, OCR, Azure, and advanced analytics β€’ β€’ Collaborative Environment: Partner with Software Developers and AI specialists By applying for this job, you agree to receive calls, AI-generated calls, text messages, or emails from and its affiliates, and contracted partners. Frequency varies for text messages. Message and data rates may apply. Carriers are not liable for delayed or undelivered messages. You can reply STOP to cancel and HELP for help. You can access our privacy policy at Privacy Policy