The Fountain Group

OMS Platform Reliability Lead

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for an OMS Platform Reliability Lead, a remote position with a contract pay rate of $58-61/hour. Requires a Bachelor's in Computer Science, 5+ years in OMS Technical Operations, and expertise in Fluent Commerce, Java debugging, and RESTful architectures.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
488
-
πŸ—“οΈ - Date
June 26, 2026
πŸ•’ - Duration
Unknown
-
🏝️ - Location
Remote
-
πŸ“„ - Contract
W2 Contractor
-
πŸ”’ - Security
Unknown
-
πŸ“ - Location detailed
New Jersey, United States
-
🧠 - Skills detailed
#Monitoring #REST (Representational State Transfer) #Splunk #Debugging #Kafka (Apache Kafka) #API (Application Programming Interface) #Alation #Deployment #GIT #Documentation #Automation #Data Manipulation #JSON (JavaScript Object Notation) #Version Control #SaaS (Software as a Service) #Scala #GraphQL #REST API #AI (Artificial Intelligence) #Java #Computer Science #Datadog
Role description
Pay: $58-61/hour W2. Our company offers our consultants a suite of benefits after a qualification period including health, vision, dental, life and disability insurance. REMOTE ROLE no onsite work. W2 Candidates only Summary: β€’ The OMS Platform Reliability Lead is a highly technical role responsible for the health, stability, and automated evolution of the Fluent Commerce Order Management ecosystem. β€’ Unlike a traditional operations role, this position leans heavily into Systems Engineering, requiring the ability to read and debug Java extensions, design complex GraphQL mutations, and build automated remediation tools for the "RUN" team. β€’ Role will manage the technical RUN support team and serve as the bridge between software engineering and IT operations. β€’ Primary focus will be to transition from manual support to "Self-Healing" operations by implementing automation for order replays, data deduplication, and predictive alerting. Key Responsibilities: β€’ Design and implement automated "Order Replay" mechanisms within Fluent Commerce to resolve synchronization failures between event-driven integrations without manual intervention. β€’ Build advanced telemetry dashboards (using tools like Splunk, Datadog, or New Relic) to monitor GraphQL query performance, API latency, and webhook success rates. β€’ Design and tune threshold-based alerting for the RUN team to identify "Stuck Orders" or inventory mismatches before they impact the customer experience. β€’ Script custom utilities using the Fluent Commerce SDK or REST APIs to facilitate bulk updates and system cleanups. β€’ Act as the ultimate technical escalation point for incidents requiring code-level analysis of Java custom extensions or complex GraphQL mutations. β€’ Lead technical Root Cause Analysis (RCA) by performing deep-dives into application logs and event-driven architecture to identify architectural bottlenecks. β€’ Analyze API response times and database interaction patterns to propose platform optimizations to the development team. β€’ Oversee the incident management lifecycle, ensuring documentation includes code-level workarounds and technical "bug-fixes" for future reference. β€’ Serve as the primary technical point of contact for E-commerce and architecture teams to ensure operational requirements are included in the dev roadmap. β€’ Collaborate with Fluent Commerce product engineers to align on platform upgrades and API versioning impacts. β€’ Mentor the RUN support team in technical skills including GraphQL query optimization and Java debugging. β€’ Validate technical configurations and platform extensions during the release cycle to ensure deployment integrity and performance stability. β€’ Manage version control using GIT, ensuring proper branching strategies for operational hotfixes and configuration changes. Requirements β€’ Bachelor’s degree in Computer Science, Software Engineering, or a related technical field. β€’ 5+ years in OMS Technical Operations or Platform Engineering, with specific experience in high-volume, event-driven SaaS environments. β€’ Advanced technical knowledge of Fluent Commerce (specifically Webhooks, Essential Rules, and the Fluent GraphQL API) highly preferred β€’ Proficiency in reading, debugging, and identifying performance issues in custom Java extensions. β€’ Expert proficiency in query/mutation design, including the use of aliases, fragments, and variables for complex data manipulation. β€’ Comprehensive understanding of RESTful architectures, JSON schemas, and event-driven patterns (Pub/Sub, Kafka, or Event Grid). β€’ Experience with monitoring tools such as Datadog, Splunk, ELK Stack, or New Relic. β€’ Deep experience with repository management and deployment pipelines. β€’ Ability to explain a "race condition" or "API timeout" to a business stakeholder in terms of revenue and customer impact. Who We Are: The Fountain Group is a nationwide staffing firm with over 80 Fortune 100-500 clients. Since 2001, TFG has maintained a consistent standard of excellence, and our work is broadly recognized every year through numerous industry performance awards. Our success is a team effort. Browse our website below for additional information on our company. The Fountain Group 3407 W Martin Luther King Jr. Dr. Tampa, FL 33607 β€œWe work in Life Sciences, Clinical, Engineering, IT, and more. Above all, we specialize in people.” By applying for this job, you agree to receive calls, AI-generated calls, text messages, or emails from and its affiliates, and contracted partners. Frequency varies for text messages. Message and data rates may apply. Carriers are not liable for delayed or undelivered messages. You can reply STOP to cancel and HELP for help. You can access our privacy policy at Privacy Policy