

GuruSchools LLC
AI Data Engineer
⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for an AI Data Engineer with a long-term remote contract, offering a competitive pay rate. Key skills required include proficiency in Python/Java, microservices, AI systems, and cloud integration. A BSc/BA in Computer Science or related field is mandatory.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
June 25, 2026
🕒 - Duration
Unknown
-
🏝️ - Location
Remote
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
📍 - Location detailed
Newark, NJ
-
🧠 - Skills detailed
#Microsoft SQL #PostgreSQL #Scala #Docker #Python #Cloud #Data Engineering #Security #SQL Server #Ansible #GIT #Terraform #Distributed Computing #Java #Infrastructure as Code (IaC) #Microsoft SQL Server #Leadership #"ETL (Extract #Transform #Load)" #SQL (Structured Query Language) #Kubernetes #Monitoring #AWS (Amazon Web Services) #GitHub #React #Azure cloud #SaaS (Software as a Service) #IP (Internet Protocol) #Databases #Microservices #MS SQL (Microsoft SQL Server) #AI (Artificial Intelligence) #Datadog #Istio #Computer Science #Azure #SAML (Security Assertion Markup Language) #Automation #Linux #Logging #Spring Boot
Role description
AI Data Engineer
Location: Remote
Duration: Long Term
Key Responsibilities
• Proven experience as a software engineer with strong proficiency in Python and/or Java, writing clean, scalable, production-grade code.
• Solid experience designing and building microservices and RESTful APIs in distributed, cloud-based environments.
• Experience designing, implementing, and extending AI agentic systems, including tool use, planning, and autonomous decision-making workflows.
• Experience building multi-agent systems where multiple agents collaborate, delegate, and coordinate to complete complex tasks.
• Hands-on experience building conversational or chat systems with both short-term (session) and long-term (persistent) context management.
• Experience building Retrieval-Augmented Generation (RAG) systems - including document ingestion, chunking strategies, vector stores, and retrieval pipelines.
• Experience building MCP servers
• Experience integrating agentic systems with external APIs, third-party services, and enterprise data sources.
• Strong understanding of security in agentic systems - authentication, authorization, least-privilege access, prompt injection defense, and audit logging.
• Knowledge of relational databases (e.g. PostgreSQL, Microsoft SQL Server) and vector databases (e.g. Qdrant, Pinecone, pgvector, Weaviate).
• Experience using system and performance monitoring tools (e.g. New Relic, Datadog).
• Proficient in Git and comfortable working in CI/CD-driven development workflows.
• Excellent critical-thinking, communication, and personal leadership skills.
• Self-starter with the ability to deliver with minimal supervision.
• BSc/BA in Computer Science or a related degree.
Bonus Points
• Experience with distributed computing.
• Experience writing code/scripts in Python.
• Experience with Spring Boot.
• Nice to have: React, Selenium automation and cloud experience.
• Nice to have: document parsing systems, including extraction from PDFs, structured/unstructured data sources, and handling diverse file formats.
• Experience with Docker, Kubernetes and Istio.
• Experience with Ansible.
• Experience with CI/CD pipelines using Spinnaker and/or GitHub Actions.
• Linux and IP networking knowledge.
• Experience with AWS/Azure cloud services or equivalent.
• Experience with Terraform for infrastructure as code and cloud provisioning
• Nice to have: Experience with SAML, OAuth and OpenID Connect.
• Experience working on a SaaS product.
• Experience with Service Oriented Architecture.
• On-call experience with production grade systems.
• Has mentored others in a professional setting.
Generative AI Code Assistants - Use of Generative AI Code Assistants (e.g. GitHub Copilot) and knowledge of latest Generative AI model capabilities
AI Data Engineer
Location: Remote
Duration: Long Term
Key Responsibilities
• Proven experience as a software engineer with strong proficiency in Python and/or Java, writing clean, scalable, production-grade code.
• Solid experience designing and building microservices and RESTful APIs in distributed, cloud-based environments.
• Experience designing, implementing, and extending AI agentic systems, including tool use, planning, and autonomous decision-making workflows.
• Experience building multi-agent systems where multiple agents collaborate, delegate, and coordinate to complete complex tasks.
• Hands-on experience building conversational or chat systems with both short-term (session) and long-term (persistent) context management.
• Experience building Retrieval-Augmented Generation (RAG) systems - including document ingestion, chunking strategies, vector stores, and retrieval pipelines.
• Experience building MCP servers
• Experience integrating agentic systems with external APIs, third-party services, and enterprise data sources.
• Strong understanding of security in agentic systems - authentication, authorization, least-privilege access, prompt injection defense, and audit logging.
• Knowledge of relational databases (e.g. PostgreSQL, Microsoft SQL Server) and vector databases (e.g. Qdrant, Pinecone, pgvector, Weaviate).
• Experience using system and performance monitoring tools (e.g. New Relic, Datadog).
• Proficient in Git and comfortable working in CI/CD-driven development workflows.
• Excellent critical-thinking, communication, and personal leadership skills.
• Self-starter with the ability to deliver with minimal supervision.
• BSc/BA in Computer Science or a related degree.
Bonus Points
• Experience with distributed computing.
• Experience writing code/scripts in Python.
• Experience with Spring Boot.
• Nice to have: React, Selenium automation and cloud experience.
• Nice to have: document parsing systems, including extraction from PDFs, structured/unstructured data sources, and handling diverse file formats.
• Experience with Docker, Kubernetes and Istio.
• Experience with Ansible.
• Experience with CI/CD pipelines using Spinnaker and/or GitHub Actions.
• Linux and IP networking knowledge.
• Experience with AWS/Azure cloud services or equivalent.
• Experience with Terraform for infrastructure as code and cloud provisioning
• Nice to have: Experience with SAML, OAuth and OpenID Connect.
• Experience working on a SaaS product.
• Experience with Service Oriented Architecture.
• On-call experience with production grade systems.
• Has mentored others in a professional setting.
Generative AI Code Assistants - Use of Generative AI Code Assistants (e.g. GitHub Copilot) and knowledge of latest Generative AI model capabilities



