Platform Reliability Lead
Compunnel, Inc.
JOB SUMMARY
The OMS Platform Reliability Lead is a highly technical role responsible for the health, stability, and automated evolution of the Fluent Commerce Order Management ecosystem. This position leans heavily into Systems Engineering, requiring the ability to read and debug Java extensions, design complex GraphQL mutations, and build automated remediation tools for the "RUN" team. You will manage the technical RUN support team and serve as the bridge between software engineering and IT operations. Your primary focus is to transition from manual support to "Self‑Healing" operations by implementing automation for order replays, data deduplication, and predictive alerting. Key Responsibilities Technical Automation & Self-Healing Operations Order Remediation Automation: Design and implement automated "Order Replay" mechanisms within Fluent Commerce to resolve synchronization failures between event-driven integrations without manual intervention. Enhanced Observability: Build advanced telemetry dashboards (using tools like Splunk, Datadog, or New Relic) to monitor GraphQL query performance, API latency, and webhook success rates. Smart Alerting: Design and tune threshold‑based alerting for the RUN team to identify "Stuck Orders" or inventory mismatches before they impact the customer experience. Tooling Development: Script custom utilities using the Fluent Commerce SDK or REST APIs to facilitate bulk updates and system cleanups. Technical Incident Management & Platform Monitoring Deep-Dive Troubleshooting: Act as the ultimate technical escalation point for incidents requiring code‑level analysis of Java custom extensions or complex GraphQL mutations. Root Cause Engineering: Lead technical Root Cause Analysis (RCA) by performing deep dives into application logs and event‑driven architecture to identify architectural bottlenecks. Performance Tuning: Analyze API response times and database interaction patterns to propose platform optimizations to the development team. ITSM Compliance: Oversee the incident management lifecycle, ensuring documentation includes code‑level workarounds and technical "bug‑fixes" for future reference. Stakeholder & Vendor Engineering Collaboration Technical Liaison: Serve as the primary technical point of contact for e‑commerce and architecture teams to ensure operational requirements are included in the dev roadmap. Vendor Management: Collaborate with Fluent Commerce product engineers to align on platform upgrades and API versioning impacts. Team Leadership: Mentor the RUN support team in technical skills including GraphQL query optimization and Java debugging. Change Management & Release Integrity Technical Oversight: Validate technical configurations and platform extensions during the release cycle to ensure deployment integrity and performance stability. CI/CD Awareness: Manage version control using GIT, ensuring proper branching strategies for operational hotfixes and configuration changes. Required Qualifications Education: Bachelor’s degree in Computer Science, Software Engineering, or a related technical field. Experience: 5+ years in OMS Technical Operations or Platform Engineering, with specific experience in high‑volume, event‑driven SaaS environments. Fluent Commerce Expertise preferred: Advanced technical knowledge of Fluent Commerce (specifically Webhooks, Essential Rules, and the Fluent GraphQL API). Core Technical Stack: Java: Proficiency in reading, debugging, and identifying performance issues in custom Java extensions. GraphQL: Expert proficiency in query/mutation design, including the use of aliases, fragments, and variables for complex data manipulation. Integration: Comprehensive understanding of RESTful architectures, JSON schemas, and event‑driven patterns (Pub/Sub, Kafka, or Event Grid). Observability: Experience with monitoring tools such as Datadog, Splunk, ELK Stack, or New Relic. GIT: Deep experience with repository management and deployment pipelines. Process Knowledge: Strong mastery of ITIL with an SRE (Site Reliability Engineering) mindset—focusing on automation over manual "toil." Analytical Skills: Ability to parse complex system logs and use data to drive proactive stability improvements. Communication: Ability to explain a "race condition" or "API timeout" to a business stakeholder in terms of revenue and customer impact. #J-18808-Ljbffr Compunnel, Inc.Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Platform Reliability Lead in Berkeley Heights, NJ vacancy
- We're looking for a technical OMS Platform Reliability Lead to own the health and stability of our Fluent Commerce Order Management ecosystem. This is a systems engineering role — you'll lead our RUN support team and drive the shift from reactive support to self-healing...PlatformHourly payContract workFreelanceShift work
$60 per hour
job summary: The OMS Platform Reliability Lead is a highly technical role responsible for the health, stability, and automated evolution of our enterprise cloud Order Management System (OMS) ecosystem. Unlike a traditional operations role, this position leans heavily...PlatformHourly payContract workTemporary workWork experience placement- ...This Old House, and other consumer review platforms. We are powered by extraordinary people. Our innovative products and reliable services are delivered with convenience, excellence... ...Summary: The Field Canvassing Team Lead is responsible for hiring, training, and developing...PlatformH1bWork at officeLocal areaWork from homeShift workAfternoon shift
$143k - $210k
...pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that... ...and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises... ...scale data center power access and grid reliability. In addition to U.S. responsibilities,...PlatformPermanent employmentTemporary workCasual workWork at officeFlexible hours- ...ensure on-time and accurate movement of goods Maintain accurate inventory records using Warehouse Management Systems (WMS) and ERP platforms Prepare, review, and process shipping documents (BOLs, packing lists, invoices, labels) Communicate with carriers, drivers, and...Platform
- Compunnel, Inc. is seeking an OMS Platform Reliability Lead to oversee the health and stability of the Fluent Commerce Order Management ecosystem. This role will lead the technical RUN support team and focus on transitioning operations to a 'Self-Healing' model using automated...Platform
- ...grade integration solutions, ensuring scalability, security, reliability, and high performance across cloud and hybrid environments. Lead the architectural aspects of system integration using WSO2 platforms (API Manager, Micro Integrator, Identity Server), including API...Platform
$119.5k - $163.88k
...engineers and shaping product strategy while ensuring compliance and data governance. With a focus on AI and cloud solutions, you will lead the development of scalable agent systems. A competitive salary range of $119,500 to $163,875 annually is offered, depending on...Platform$130k - $176k
.... Selective's unique position as both a leading insurance group and an employer of choice... ...Architecture, Analytics, AI/ML, and Platform teams to operationalize data products end... ...data products/data contracts, including reliability targets, change management, and consistency...PlatformWork experience placement- A leading tech solutions provider is seeking a Solution Architect in Berkeley Heights, NJ. The role involves providing technical leadership... ...within digital and e-commerce environments, expertise in cloud platforms, and excellent communication skills. Join a dynamic environment...Platform
$90.1k - $127.2k
Lead CSV Specialist - NA Kenvueは現在、a: 私たちがしていること Kenvueで、日常のケアの並外れた力を実感します。100年以上にわたる伝統を基盤に、科学に根ざした当ホテルは、ニュートロジーナ、アヴェーノ、タイレノール、リステ... ...Qualifications Proven expertise in leveraging analytics tools and CRM platforms to drive customer insights and strategic decision-making....Platform$72.8k - $130k
UnitedHealth Group is seeking an experienced Site Reliability Engineer in Basking Ridge, New Jersey. This role involves leading an SRE team, designing scalable systems with cloud technologies, and implementing monitoring strategies. Candidates should have a Bachelor's degree...Remote job- A leading waste management firm in Florham Park is seeking a Senior Turbine Reliability Specialist to develop and execute outage plans while ensuring proper quality control during turbine maintenance. This role requires effective communication with management and suppliers...
$255.48k - $309.59k
...transformation—cutting-edge science and platforms, extensive clinical and real-world datasets... ...can access cell therapies safely, reliably, and equitably.**Position Summary****The... ...Executive Director, Medical Communications Lead is the Cell Therapy Medical (CT Medical)...PlatformHourly payFull timeTemporary workPart timeFor contractorsSummer workLive inWork at officeLocal areaRemote workWorldwideFlexible hoursShift work- ...Overview Accordant Company, LLC is seeking an Acumatica Implementation Lead to join our growing, fully remote team. We are looking for... ...for process improvement and integration within the Acumatica platform. Manage, train, and mentor one dedicated implementation support...PlatformRemote work
- Job Title: Application Programmer Lead (C#) Job ID: 2024-13036 Job Location: Omaha, NE, Berkeley Heights, NJ, or Frisco, TX (100%... ...Azure Proficiency: Strong familiarity with Azure services and platforms (e.g., App Services, API Management, Key Vault, Monitor, Application...PlatformPermanent employmentLocal areaRelocation
- ...Social Media Coordinator for a 12-month contract role. The coordinator will focus on customer engagement through major social media platforms. Candidates need to demonstrate strong organizational skills and adaptability in a fast-paced environment. A keen interest in...PlatformContract work
- IEEE is seeking a Senior Manager for Member Platforms in Piscataway Township, NJ. This role will serve as a strategic leader for platforms like IEEE Collabratec and IEEE Volunteering, ensuring high value delivery to members and guiding user engagement. With 7-10 years of...Platform
- SwiftX in Carteret, New Jersey is looking for a Platform Customer Sales Manager to drive customer acquisition by developing strategic partnerships with e-commerce platforms and logistics partners. The role demands strong relationship-building skills and a focus on channel...Platform
- ...Galletas Production SLU is looking for an MDS North American Ventures Engagement Lead in East Hanover, New Jersey. This position entails strategic ownership of the company’s NetSuite ERP platform across various operational disciplines. It involves leading ERP support...Platform
- A fast-growing education and training company in New Jersey is seeking a PPC/SEM Manager to lead their paid search strategy and execution across multiple platforms. Ideal candidates will have 4+ years of experience in managing PPC or SEM campaigns, strong analytical skills...Platform
$106.4k - $155k
Colgate-Palmolive is seeking a MarTech Solution Lead based in Piscataway, NJ. This role involves acting as the primary liaison for... ...of experience in IT/Data Analytics, expertise in Customer Data Platforms, and proven leadership in Agile processes. A competitive salary...Platform- Mondelez International is seeking an MDS NAV Engagement Lead in East Hanover, NJ, to take ownership of the NetSuite ERP platform, ensuring optimal governance and strategic enhancements. This role involves working closely with executive stakeholders to align ERP capabilities...Platform
- Accellor is seeking a Transformation Program Manager to lead a large-scale enterprise modernization initiative, focusing on replacing legacy Siebel platforms with a next-generation CRM ecosystem such as Oracle or Salesforce. This role involves defining transformation strategies...Platform
- Mondelēz International is seeking an MDS NAV Engagement Lead responsible for the strategic ownership and optimization of their NetSuite ERP platform. The role involves close partnership with executive stakeholders and overseeing governance across various business functions...Platform
- ...Therapeutics in Morristown, NJ, is seeking a data analyst to support the development and governance of data dashboards and reporting platforms. The ideal candidate will have a Bachelor’s degree and at least 4 years of experience in Biotech or Pharma, with advanced Excel...Platform
- Bluecrux is looking for a SaaS Implementation Consultant to lead strategic implementations of the Binocs platform. This role involves managing end-to-end projects, conducting client workshops, and designing tailored solutions to optimize clients' business environments....Platform
- Our client is seeking an Enterprise Modernization Architect to lead large-scale application transformation initiatives focused on modernizing legacy platforms and migrating critical business systems to modern cloud-based architectures. This role will provide technical leadership...Platform
- ...integration technologies and provide effective solutions for on-premise or cloud based integrations Strong expertise in MuleSoft Anypoint Platform, CloudHub, API Manager, DataWeave, MuleSoft certifications (e.g., Certified Platform Architect) is preferred Strong knowledge and...Platform
- ...inclusion. Manage external vendors for modeling development and validation; interface with Pricing and external advisors. Lead the Economic Platform workstream for assigned projects; report to the Modeling Head within Global HEVA. Qualifications & Experience Master’s...PlatformLocal area
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Platform Reliability Lead. Be the first to apply!

