GuruSchools LLC

AI Data Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for an AI Data Engineer with a long-term remote contract, offering a competitive pay rate. Key skills required include proficiency in Python/Java, microservices, AI systems, and cloud integration. A BSc/BA in Computer Science or related field is mandatory.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
June 25, 2026
🕒 - Duration
Unknown
-
🏝️ - Location
Remote
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
📍 - Location detailed
Newark, NJ
-
🧠 - Skills detailed
#Microsoft SQL #PostgreSQL #Scala #Docker #Python #Cloud #Data Engineering #Security #SQL Server #Ansible #GIT #Terraform #Distributed Computing #Java #Infrastructure as Code (IaC) #Microsoft SQL Server #Leadership #"ETL (Extract #Transform #Load)" #SQL (Structured Query Language) #Kubernetes #Monitoring #AWS (Amazon Web Services) #GitHub #React #Azure cloud #SaaS (Software as a Service) #IP (Internet Protocol) #Databases #Microservices #MS SQL (Microsoft SQL Server) #AI (Artificial Intelligence) #Datadog #Istio #Computer Science #Azure #SAML (Security Assertion Markup Language) #Automation #Linux #Logging #Spring Boot
Role description
AI Data Engineer Location: Remote Duration: Long Term Key Responsibilities • Proven experience as a software engineer with strong proficiency in Python and/or Java, writing clean, scalable, production-grade code. • Solid experience designing and building microservices and RESTful APIs in distributed, cloud-based environments. • Experience designing, implementing, and extending AI agentic systems, including tool use, planning, and autonomous decision-making workflows. • Experience building multi-agent systems where multiple agents collaborate, delegate, and coordinate to complete complex tasks. • Hands-on experience building conversational or chat systems with both short-term (session) and long-term (persistent) context management. • Experience building Retrieval-Augmented Generation (RAG) systems - including document ingestion, chunking strategies, vector stores, and retrieval pipelines. • Experience building MCP servers • Experience integrating agentic systems with external APIs, third-party services, and enterprise data sources. • Strong understanding of security in agentic systems - authentication, authorization, least-privilege access, prompt injection defense, and audit logging. • Knowledge of relational databases (e.g. PostgreSQL, Microsoft SQL Server) and vector databases (e.g. Qdrant, Pinecone, pgvector, Weaviate). • Experience using system and performance monitoring tools (e.g. New Relic, Datadog). • Proficient in Git and comfortable working in CI/CD-driven development workflows. • Excellent critical-thinking, communication, and personal leadership skills. • Self-starter with the ability to deliver with minimal supervision. • BSc/BA in Computer Science or a related degree. Bonus Points • Experience with distributed computing. • Experience writing code/scripts in Python. • Experience with Spring Boot. • Nice to have: React, Selenium automation and cloud experience. • Nice to have: document parsing systems, including extraction from PDFs, structured/unstructured data sources, and handling diverse file formats. • Experience with Docker, Kubernetes and Istio. • Experience with Ansible. • Experience with CI/CD pipelines using Spinnaker and/or GitHub Actions. • Linux and IP networking knowledge. • Experience with AWS/Azure cloud services or equivalent. • Experience with Terraform for infrastructure as code and cloud provisioning • Nice to have: Experience with SAML, OAuth and OpenID Connect. • Experience working on a SaaS product. • Experience with Service Oriented Architecture. • On-call experience with production grade systems. • Has mentored others in a professional setting. Generative AI Code Assistants - Use of Generative AI Code Assistants (e.g. GitHub Copilot) and knowledge of latest Generative AI model capabilities