Senior VoIP Operations & Reliability Engineer (Carrier-Class Voice Platform)
Planet Networks Inc
Why This Role Matters:
Voice is unforgiving. A dropped call, a one-way audio path, or a registration storm is visible to every customer at once. Our team can build the platform, but building it and running it are two different disciplines, and ambition without operational hardening is fragile. We need someone who has lived through real VoIP failures, learned from them, and can stand shoulder to shoulder with the developers to make sure the platform survives contact with production. If you want ownership of a modern, open-source, carrier-class platform from design review through 3am incident to the postmortem that makes it stronger, we want to talk to you. About the Role: Our software team is building a next-generation carrier-class voice platform. They are strong programmers, but they are not experienced operators, and there is a world of difference between code that works and infrastructure that stays up under real carrier load. We need a seasoned operator to close that gap and work hand in hand with the development team.You are the person who has actually run this kind of system in production. You know the failure modes that do not show up in a code review, the things that break at 2am, and what it really takes to keep customers from ever noticing. Your job is to bring that operational reality into the platform from the inside: pairing with the programmers as they build, making sure the design can be operated, and then owning the platform in production with zero customer downtime. There is an architectural side to this. You will sit in design reviews and push the team toward decisions that are operable, resilient, and testable, not just elegant in code. But the core of the role is operational: you are the experienced hand who keeps every system up, who owns every failure scenario end to end, and who instills operational discipline in a team by being on-call and training juniors to handle any incidents.
You should be equally comfortable pairing with a developer to make a service observable and failure-aware, and at 3am driving an incident to resolution. We need that judgment, with years of real VoIP operations behind it.
In the meantime, this is not a future-only role. We already run a live Kamailio and Asterisk production system carrying real customer traffic today, and your first and most immediate mandate is to help harden it: shore up its reliability, close its failure gaps, and keep it solid while the next-generation platform is being built. Day-to-day production stability of the current system comes first. What You Will Do: Harden the current production system (immediate priority)
• Take ownership of the reliability of our live Kamailio and Asterisk production system from day one, while the next-generation platform is still in development.
• Assess the current system end to end and find its weak points: single points of failure, brittle failover, missing redundancy, capacity headroom, and the failure scenarios it does not yet handle gracefully.
• Close those gaps incrementally and safely, without disrupting live customer traffic: add redundancy and failover, tighten configuration, and remove fragility.
• Add the observability the current system is missing so problems are caught before customers feel them, and stand up alerting, dashboards, and SIP capture against the live fleet.
• Stabilize day-to-day operations: triage and resolve recurring issues, document the system as it actually runs, and write the runbooks that do not exist yet. Work hand in hand with the development team
• Pair with the programmers throughout development as the operational voice in the room: review designs, challenge assumptions, and find the failure modes that code reviews miss.
• Make operability a build-time requirement, not an afterthought: push for the logging, metrics, health checks, graceful shutdown, retry behavior, and failure handling that the team needs to add for the platform to survive production.
• Transfer operational knowledge to the team: help developers understand how their code behaves under load and failure, and raise the whole group's instinct for production reality..
• Map the full failure surface of the platform (node failure, data-center loss, upstream carrier outage, registration storms, partial network partitions, resource exhaustion) and make sure every scenario has a defined, tested behavior..
• Design and run a rigorous test program: functional, load, stress, soak, and failover testing, with realistic call models (concurrent calls, BHCA, registration churn).
• Build fault-injection and chaos testing into the pipeline so failure handling is proven, not assumed.
• Validate the high-availability and scalability design under real conditions: active-active and active-passive topologies, geographic redundancy, graceful degradation, automated failover with measured recovery times, and capacity limits. Keep it up (day-to-day reliability engineering)
• Own platform uptime as a daily responsibility, not a quarterly goal. Customers should experience no downtime.
• Build and own the observability stack: SIP capture (HEP/Homer), CDR and quality pipelines, metrics, dashboards, and alerting that catches problems before customers do.
• Define SLOs and SLIs for signaling, media, and registration, and hold the platform to them.
• Run incident response: detect, triage, mitigate, and resolve, then drive blameless postmortems and make sure the same failure cannot recur.
• Write and maintain runbooks, and lead disaster-recovery and failover drills so the team can execute under pressure.
• Participate in (and help design) a sustainable on-call rotation.
• Tune and operate the production fleet: Asterisk, Kamailio, OpenSIPS, and the supporting network layer, under live carrier traffic. What We Need You to Bring: Core expertise (required)
• Years of senior, hands-on experience operating and reliability-engineering production VoIP systems at carrier scale.
• Deep, protocol-level command of SIP: dialogs, transactions, registration, NAT scenarios, SDP negotiation, forking, and the failure modes that surface only under load.
• Expert-level Kamailio and/or OpenSIPS: routing logic, dispatcher and load balancing, registrar and usrloc, dialog and topology modules.
• Expert-level Asterisk: PJSIP stack, dialplan, ARI/AMI, bridging and media handling, and its role as an application and media server behind a SIP proxy.
• Media plane fluency: RTP, SRTP, RTSP, RTCP, codecs (G.711, G.729, Opus), transcoding, jitter, and the link between QoS marking (DSCP) and call quality.
• A demonstrated track record of designing for and operating reliability, scalability, and fault tolerance in carrier-class environments (five-nines thinking, failure-domain isolation, blast-radius control).
• Hands-on reliability engineering practice: SLOs and error budgets, incident command, postmortems, runbooks, and DR testing. Strongly preferred
• Performance and failure testing tooling: sipp for load and call modeling, fault injection and chaos tooling, and SIP troubleshooting with sngrep and Wireshark.
• Observability depth with Homer/HEP, plus metrics and alerting stacks (for example Prometheus, Grafana, or equivalent).
• Strong Linux operations and automation skills (Python, Lua, shell), and comfort with infrastructure-as-code and CI/CD pipelines.
• RADIUS/Diameter integration for AAA, and experience with provisioning and subscriber management.
• Fraud and security operations: detecting and stopping toll fraud, SIP scanning, and registration attacks.
• Experience interconnecting with multiple upstream carriers and managing the routing and failover complexity that brings..
• FreeSWITCH or other media servers as a complement to Asterisk. How you wor
• You assume things will fail, and you design and test so that failure is contained and invisible to customers.
• You measure before you optimize, and you instrument systems so failures are visible early.
• You are calm and decisive in an incident, and rigorous afterward about making sure it never repeats.
• You can challenge a design respectfully and precisely, and you write down the trade-offs so the team can reason about them later.
• You work well alongside developers: you can teach operational thinking without condescension, and you would rather make the team better at running their own code than be the only one who can. Why This Role Matters: Voice is unforgiving. A dropped call, a one-way audio path, or a registration storm is visible to every customer at once. Our team can build the platform, but building it and running it are two different disciplines, and ambition without operational hardening is fragile. We need someone who has lived through real VoIP failures, learned from them, and can stand shoulder to shoulder with the developers to make sure the platform survives contact with production. If you want ownership of a modern, open-source, carrier-class platform from design review through 3am incident to the postmortem that makes it stronger, we want to talk to you. All applicants are considered for all positions without regard to race, religion, color, sex, gender, sexual orientation, pregnancy, age, national origin, ancestry, physical/mental disability, medical condition, military/veteran status, genetic information, marital status, ethnicity, citizenship or immigration status, or any other protected classification, in accordance with applicable federal, state, and local laws. By completing this application, you are seeking to join a team of hardworking professionals dedicated to consistently delivering outstanding service to our customers and contributing to the financial success of the organization, its clients, and its employees. Equal access to programs, services, and employment is available to all qualified persons. Those applicants requiring an accommodation to complete the application and/or interview process should contact a management representative.
Vacancy posted 5 hours ago
Similar jobs that could be interesting for youBased on the Senior VoIP Operations & Reliability Engineer (Carrier-Class Voice Platform) in Newton, NJ vacancy
- ...Role description Role Senior Developer Azure Databricks Experience 8-10 Years Location - Branchville NJ Technical Skills... ...Secondary Nice to have NOT MANDATORY Understanding on ADF platform and framework Experience on working with XML JSON file Experience...PlatformSenior
- ...it is our foundation. The Senior Relay Technician plays a critical... ...in ensuring the safe, reliable operation of medium and highvoltage... ...electrical testing and engineering organization , Asplundh Electrical... ...with modern relay test platforms ~ Solid understanding...PlatformSeniorApprenticeshipLocal areaImmediate start
$110k - $150k
...About CP Engineers With over 40 years of industry leadership... ...is seeking a Senior Manager, Information Technology... ...will oversee IT operations, cybersecurity, enterprise... ...including ERP/CRM platforms and integrations. Cybersecurity... ...vendors to ensure reliable service delivery. AI &...PlatformSeniorWork at officeRemote work$37 - $65.5 per hour
...this role: Wells Fargo is seeking a Senior Branch Premier Banker (SAFE) to play a pivotal... ...in banking. This role offers a clear platform for career growth for experienced... ...programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes...PlatformSeniorTemporary workWork experience placementImmediate start$100k
If you've built your life around caring for others — as a nurse, social worker, mother, or in another helping profession — surrogacy offers a powerful way to extend that calling. As a gestational surrogate with Surrogacy Plus, the world's largest international surrogacy...SuggestedStart working today- Thorlabs in Newton, NJ is seeking an Inventory Control professional. This role involves monitoring inventory levels, performing audits, and ensuring compliance with inventory policies. Requires a minimum of 3 years manufacturing experience, strong attention to detail, ...Senior
- ...Thorlabs in Newton, NJ is seeking an Inventory Control Specialist to monitor inventory levels and contribute to accuracy in operations. This role requires a minimum of 3 years of manufacturing experience, basic computer skills, and attention to detail. Key responsibilities...SeniorHourly pay
$120k
Patriot Conceptions helps qualified women become gestational surrogates through a guided, supported process. Surrogates can earn up to $120k+ in total compensation and benefits. Journey-related items such as medical screening, independent legal counsel, qualifying travel...Contract work$105k - $135k
A leading engineering firm is seeking a Senior Electrical Engineer in Sparta Township, NJ. This role includes technical leadership, mentoring junior engineers, and overseeing electrical design for diverse projects. Candidates must have a degree in Electrical Engineering...SeniorRemote jobFlexible hours- A leading engineering firm in New Jersey seeks a Senior Mechanical Engineer to provide technical leadership within their mechanical engineering team. The role involves developing project scopes, preparing budget estimates, and overseeing designs for various projects, including...SeniorRemote jobFlexible hours
$26 - $48 per hour
...Prepare reports Coordinate with airport authorities Disclaimer This is not a direct TSA job offer. This promotes a preparation platform. How Our Program Helps Guided preparation Practice exams Interview support Job search assistance Apply for TSA...Platform- ...SonderMind is seeking skilled clinicians to join its mental health platform. You will have the flexibility to manage your own schedule while delivering compassionate therapy. Our approach utilizes AI tools to reduce burnout and improve patient care. Join us to gain support...PlatformFlexible hours
$100k - $120k
...Insights Manager to lead and evolve our Voice of the Customer (VoC) program, ensuring... ...experience. Partner closely with hotel operations, marketing, brand, and leadership teams... ...technology like Qualtrics and advanced analytics platforms. Grow your career—opportunities for...PlatformTemporary workWork at office$50k - $85k
Help Create Families — Become a Surrogate mother (Earn $50,000–$85,000+) Perfect for Stay-at-Home Moms · Ages 21–39 · Remote Opportunity Are you a caring mom who wants to make a meaningful difference? Join our physician-led surrogacy program and help loving families...Immediate startRemote workFlexible hours$105k - $135k
About CP Engineers With over 40 years of industry leadership, CP Engineers delivers... ...is seeking a highly skilled Senior Electrical Engineer to provide technical... ...Verify that designs meet safety, reliability, maintainability, and operational requirements. Maintain up-to-date...SeniorLocal areaRemote workFlexible hours$23.47 - $38.62 per hour
Role Overview USPS is accepting applications for PSE Sales & Services/Distribution Associate nationwide. This role ensures the smooth distribution of mail while offering professional customer support. USPS provides training to help you succeed in this role. Position...Hourly pay$23.47 - $38.62 per hour
...Role Overview USPS is accepting applications for City Carrier Assistants nationwide. This role ensures smooth postal operations by delivering mail efficiently. No prior experience is required, as USPS offers comprehensive paid training to set you up for success....Hourly pay$23.47 - $38.62 per hour
...Explore a World of Career Opportunities as a Mail Carrier with USPS Role Overview USPS is accepting applications for Rural Carrier Associates nationwide. This role involves delivering and collecting mail while interacting with customers along the route. USPS offers...Hourly pay$70 - $88 per hour
...Overview SonderMind is a leading mental health care platform revolutionizing therapy services by empowering licensed mental health therapists to deliver exceptional care with cutting‑edge, AI‑powered tools. We are seeking compassionate and highly skilled clinicians to...PlatformBi-weekly payHourly payWork at office$150k - $180k
...pets and their care Dedicated leaders who want to provide world-class medicine Collaborators who enjoy teaching, mentoring, and... ...opportunities for doctors and staff through internal learning platforms Qualifications Doctor of Veterinary Medicine (DVM or VMD) degree...PlatformFull timeLocal areaRelocation package- ...Analytics Engineering Supervisor At Thorlabs, we design and manufacture components, instruments... ...and datasets support scalable and reliable analytics solutions. Additionally, this... ...engagement with enterprise data platforms. Collaborate with Global IT and cross...PlatformPermanent employmentWorldwide
- Vanguard Medical Group in Mill Brook, NJ is seeking a Certified Medical Assistant III to support patient care and enhance workflows in a busy healthcare setting. You will be responsible for clinical tasks such as taking vital signs, administering injections, and performing...SeniorWork at office
- ...effectively. Takes initiative to assist in developing others. Operational Effectiveness: Makes sound judgments, and transfers learning... ...program, whether inside or outside. Must be able to hear whistle or alarms. Clear speaking voice is needed to command control of group....SeniorImmediate start
$16.5 - $23.91 per hour
...essential functions. • Ability to erect and stand on ladders and platforms at heights up to 30 feet. • Ability to safely work with paint... ...and aerosol forms. • Ability to paint, clean equipment, and operate motorized equipment as needed. • Ability to work in...PlatformHourly payPart timeShift work$92k - $127k
...effective communication during both routine operations and times of change or crisis. Develop... ...for different audiences using multiple platforms. Strong project management skills, with... ...content systems. Confidence partnering with senior leaders, providing counsel on sensitive...PlatformFull timeWork experience placement$27 per hour
...wheneverpossible during the crisis intervention and with follow-up contacts. Document all contacts in the approved electronic platform to include demographics and response toengagement and crisis intervention. Work with the Crisis Intervention Specialist, and the...PlatformDaily paidImmediate startRemote workNight shiftDay shiftAfternoon shift- Atlantic Union Bank is seeking an Equipment Finance Regional Director responsible for marketing equipment finance lending and leasing products to medium and large commercial clients. The role requires strong sales experience and the ability to maintain quality client relationships...Senior
- ...Junior Cloud DevOps Engineer Location: Tampa, Florida... ...with development and operations teams to streamline... ...direct mentorship from senior engineers. ~ Take... ...issues and ensuring system reliability and scalability. ~... ...least one major cloud platform (AWS, Azure, or GCP)....PlatformFull timeInternshipRemote workWork from home
$38 per hour
...family, and a safe environment to work. That's what you get as a Senior Diesel Mechanic at WM - and more. The pay and benefits you'... ..., weather extremes or similar elements often; This job operates in both inside garage, shop or gas plant conditions as well as...SeniorHourly payTemporary workWork experience placementStart working todayLocal areaImmediate startMonday to Friday$65k - $100k
A reputable insurance agency located in Sparta, NJ, is looking for a Commercial Insurance Lines Account Manager. The ideal candidate will have a valid insurance license and at least 3 years of experience in the commercial sector. Responsibilities include client management...SeniorRelocation package
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior VoIP Operations & Reliability Engineer (Carrier-Class Voice Platform). Be the first to apply!
Related searches



