Kelly

OMS Platform Reliability Lead

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for an OMS Platform Reliability Lead, offering a contract length of "unknown," with a pay rate of "unknown," and is remote. Requires 5+ years in OMS Technical Operations, expertise in Fluent Commerce, Java, GraphQL, and strong analytical skills.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
Unknown
-
πŸ—“οΈ - Date
June 26, 2026
πŸ•’ - Duration
Unknown
-
🏝️ - Location
Unknown
-
πŸ“„ - Contract
Unknown
-
πŸ”’ - Security
Unknown
-
πŸ“ - Location detailed
Berkeley Heights, NJ
-
🧠 - Skills detailed
#Monitoring #Automation #Data Manipulation #Java #JSON (JavaScript Object Notation) #SaaS (Software as a Service) #Computer Science #Splunk #Debugging #GraphQL #Deployment #Datadog #API (Application Programming Interface) #Kafka (Apache Kafka) #Observability #GIT
Role description
OMS RUN Platform Reliability (or Stability) Lead Technical & Professional Requirements: β€’ Education: Bachelor’s degree in Computer Science, Software Engineering, or a related technical field. β€’ Experience: 5+ years in OMS Technical Operations or Platform Engineering, with specific experience in high-volume, event-driven SaaS environments. β€’ Fluent Commerce Expertise preferred: Advanced technical knowledge of Fluent Commerce (specifically Webhooks, Essential Rules, and the Fluent GraphQL API) β€’ Core Technical Stack: β€’ Java: Proficiency in reading, debugging, and identifying performance issues in custom Java extensions. β€’ GraphQL: Expert proficiency in query/mutation design, including the use of aliases, fragments, and variables for complex data manipulation. β€’ Integration: Comprehensive understanding of RESTful architectures, JSON schemas, and event-driven patterns (Pub/Sub, Kafka, or Event Grid). β€’ Observability: Experience with monitoring tools such as Datadog, Splunk, ELK Stack, or New Relic. β€’ GIT: Deep experience with repository management and deployment pipelines. β€’ Process Knowledge: Strong mastery of ITIL with an SRE (Site Reliability Engineering) mindsetβ€”focusing on automation over manual "toil." β€’ Analytical Skills: Ability to parse complex system logs and use data to drive proactive stability improvements. β€’ Communication: Ability to explain a "race condition" or "API timeout" to a business stakeholder in terms of revenue and customer impact.