Staff Software Engineer, ML Infrastructure

$146.6k - $215.1k

SimpliSafe Wireless Home Security

About SimpliSafe

We're a high-tech home security company that's passionate about protecting the life you've built and our mission of keeping Every Home Secure. And we've created a culture here that cares just as deeply about the career you're building. Ours is a no ego culture of collaboration and innovation where those seeking their next challenge can find big opportunities and make a huge impact on the lives of all those who we protect. We don't just want you to work here. We want you to grow and thrive here.

We're embracing a hybrid work model that enables our teams to split their time between office and home. Hybrid for us means we expect our teams to come together in our state-of-the-art office on two core days, typically Tuesday, Wednesday, or Thursday - working together in person and choosing where they work for the remainder of the week. We all benefit from flexibility and get to use the best of both worlds to get our work done.
Why are we hiring?

Well, we're growing and thriving. So, we need smart, talented, and humble people who share our values to join us as we disrupt the home security space and relentlessly pursue our mission of keeping Every Home Secure.
About the Role

We're looking for a Staff Software Engineer to join our Cloud ML team - the team that owns both the cloud-side ML infrastructure and the applied ML research that powers SimpliSafe's intelligent home security products. This is a senior individual contributor role for a distributed systems expert who wants to apply that craft to one of the most demanding problem domains in the company.

You'll partner closely with other Staff and Principal engineers to drive architecture, mentor across the team, and set the technical direction for our ML platform. The work spans two of our most demanding workloads: real-time computer vision inference that processes video from cameras and doorbells across our customer base, and LLM/GenAI infrastructure that will power our future generation of intelligent applications. Both are, fundamentally, distributed systems problems - high-throughput, low-latency, multi-tenant, GPU-aware, and unforgiving of regressions.

This role is for someone who has built and operated large-scale distributed services in production - high-QPS APIs, real-time platforms, low-latency serving systems - and is excited to bring that depth to ML infrastructure. Prior ML experience is a plus, not a prerequisite. If you've shipped systems that serve a lot of traffic, scale gracefully, and stay up at 3am, we want to talk to you.
What You'll Do

Set technical direction for ML infrastructure

Drive architecture decisions for our Kubernetes-based ML platform - anchored on Ray for inference, alongside KServe, Triton, and vLLM - across real-time and batch workloads.
Lead deep technical reviews on system design, capacity planning, and reliability for the highest-stakes ML systems at SimpliSafe.
Identify and remove the systemic bottlenecks in our ML deployment infrastructure - whether that's serving reliability, deployment friction, observability gaps, scaling, or cost.

Build and operate real-time CV inference at scale

Own the design and evolution of cloud-side inference systems that process live video and events from SimpliSafe devices in real time.
Drive throughput, latency, and cost improvements (batching strategies, GPU utilization, autoscaling, multi-model serving) for production CV models.
Build the feedback loops between cloud inference, edge devices, and the data flywheel that improves model quality over time.

Stand up LLM/GenAI serving infrastructure

Help shape how SimpliSafe serves LLMs in production - model serving patterns, KV-cache and batching strategies, evaluation pipelines, guardrails, and cost controls.
Partner with applied ML engineers to take new GenAI-powered product features from prototype to scaled deployment.

Raise the engineering bar across Cloud ML

Mentor engineers across the team through design reviews, code reviews, pairing, and written guidance - a meaningful uplift on everyone you work with.
Establish and evangelize best practices for model lifecycle management (registry, deployment, monitoring, rollback, drift) and on-call.
Write the documentation, runbooks, and architectural decision records that make the platform legible and durable.

Own reliability and operational excellence

Lead incident response and postmortems for critical ML systems; turn lessons learned into platform-level improvements.
Define SLOs, observability standards, and on-call practices for ML services in production.

Qualifications

8+ years of software engineering experience, with a clear track record of building and operating large-scale distributed systems in production.
Deep expertise in high-throughput, low-latency services - ad serving, recommendations, real-time APIs, online platforms, or similar - including the operational reality of running them at scale.
Strong production experience on Kubernetes and AWS (EKS, S3, IAM, networking) and with Kafka, containerized deployments, CI/CD, and infrastructure-as-code.
Demonstrated experience with the building blocks of high-scale systems: load balancing, autoscaling, batching, caching, multi-tenancy, queuing, and capacity planning.
Proficiency in Python is required; experience with a systems language (Go, C++, Rust) for performance-sensitive components is a plus.
Staff-level technical leadership : ability to drive ambiguous, cross-cutting initiatives, align senior stakeholders, and elevate the engineers around you without formal authority.
Strong written and verbal communication - you can make complex technical tradeoffs legible to ML scientists, product, and other infra teams.
ML exposure is preferred - having deployed or operated production ML systems, worked closely with ML teams, or built ML-adjacent infrastructure. Exceptional distributed systems engineers without direct ML experience are encouraged to apply; we'll help you ramp.

Bonus Points

Hands-on experience with Ray , KServe , Triton , vLLM , or other ML serving stacks.
Hands-on experience with LLM serving in production (vLLM, TGI, TensorRT-LLM, SGLang) - KV cache management, continuous batching, speculative decoding, quantization for serving.
Experience building real-time video or streaming pipelines (Kafka, Kinesis, Flink, or similar) at scale.
Experience operating GPU-based inference systems - GPU-aware scheduling, multi-model serving, accelerator utilization optimization.
Familiarity with ML fundamentals - how models are trained, evaluated, versioned, deployed, monitored, and rolled back in production.
Experience with model lifecycle tooling (MLflow, Weights & Biases, model registries, drift detection, shadow deployments).
Open source contributions to distributed systems or ML infrastructure projects.
Experience operating in environments with strong security and compliance requirements .

Why This Role

The Cloud ML team owns the full surface area - infrastructure and applied research - which means your work as a Staff infra engineer directly shapes what's possible for the science. You'll have unusual leverage: the platform you build determines how fast SimpliSafe can ship intelligent features, and the features we ship directly impact whether someone's home is safer tonight than it was yesterday.
What Values You'll Share

Customer Obsessed - Building deep empathy for our customers, putting them at the core of our work, and developing strong, long-term relationships with them.
Aim High - Always challenging ourselves and others to raise the bar.
No Ego - Maintaining a "no job too small" attitude, and an open, inclusive and humble style.
One Team - Taking a highly collaborative approach to achieving success.
Lift As We Climb - Investing in developing others and helping others around us succeed.
Lean & Nimble - Working with agility and efficiency to experiment in an often ambiguous environment.

What We Offer

A mission- and values-driven culture and a safe, inclusive environment where you can build, grow and thrive
A comprehensive total rewards package that supports your wellness and provides security for SimpliSafers and their families (For more information on our total rewards please click here)
Free SimpliSafe system and professional monitoring for your home.
Employee Resource Groups (ERGs) that bring people together, give opportunities to network, mentor and develop, and advocate for change.

The target annual base pay range for this role is $146,600 to $215,100.

This target annual base pay range represents our good-faith estimate of what we expect to pay for this role. We use a market-based compensation approach to set our target annual base pay ranges and make adjustments annually. We carefully tailor individual compensation packages, including base pay, taking into consideration employees' job-related skills, experience, qualifications, work location, and other relevant business factors.

Beyond base pay, we offer a Total Rewards package that may include participation in our annual bonus program, equity, and other forms of compensation, in addition to a full range of medical, retirement, and lifestyle benefits. More details can be found here.

We're committed to fair and equitable pay practices, as well as pay transparency. We regularly review our programs to ensure they remain competitive and aligned with our values.

We wholeheartedly embrace and actively seek applications from all individuals, no matter how they identify. We are committed to cultivating a diverse and inclusive workplace, and we believe our work is enriched when we incorporate a multitude of perspectives, backgrounds, and experiences. We want everyone who works here to thrive and contribute to not only our mission of keeping every home secure, but also to making our workplace safe and supportive for others. If a reasonable accommodation may be needed to fully participate in the job application or interview process, to perform the essential functions of a position, or to receive other benefits and privileges of employment, please contact View email address on click.appcast.io.

Apply

Vacancy posted 15 hours ago

Similar jobs that could be interesting for youBased on the Staff Software Engineer, ML Infrastructure in Boston, MA vacancy

Staff Software Engineer, ML Infrastructure
$146.6k - $215.1k
...Staff Software Engineer, ML Infrastructure We're a high-tech home security company that's passionate about protecting the life you've built and our mission of keeping Every Home Secure. And we've created a culture here that cares just as deeply about the career you...
Suggested
Work at office
Venturefizz Product Management Community
Boston, MA
3 days ago
Senior / Staff Software Engineer, Data Infrastructure
...Staff And Senior Software Engineers Suno is growing fast, and we're hiring Staff and Senior Software Engineers to work on Data Infrastructure at Suno, where you will be responsible for building and scaling... ...with Engineering, Data, and ML teams to ensure data is...
Suggested
Work at office
Local area
Immediate start
SUNO
Boston, MA
15 hours ago
Senior Staff Software Engineer, Data Platform
$253.9k - $298.7k
...The Data Platform team is the engine that makes Coinbase's data... ...financial reporting to the AI and ML systems that will define... ...visionary, hands-on Senior Staff Software Engineer to help define and... ...something that matters at the infrastructure level of one of the most...
Suggested
Local area
Coinbase
Boston, MA
3 days ago
Staff Software Engineer - ML Observability
$234k - $300k
...The ML Observability team builds cutting-edge tools to monitor... ...AI with confidence. As a Staff Engineer, you'll lead the development... ...of both AI systems and software engineering to solve open-ended... .... It brings applications, infrastructure, data, models, and security...
Suggested
Work at office
Datadog
Boston, MA
3 days ago
Staff Software Engineer
$200k - $325k
...Staff Software Engineer Iterative Health is a healthcare technology and services company powering... ...can't access because the operational infrastructure to run clinical trials efficiently doesn... ..., between integration engineering and ML infrastructure, between defining...
Suggested
Iterative Health
Cambridge, MA
15 hours ago
Staff Software Engineer, Lab Software
$192k - $256k
...Staff Software Engineer, Lab Software Cambridge, MA USA Join us in shaping the future of science... ...large-scale workloads. Cloud & Infrastructure: Leverage AWS services, Kubernetes and... ...Cross-Functional Collaboration: Work with ML researchers, engineers, and scientists...
Full time
Work at office
Local area
Flexible hours
Lila Sciences
Cambridge, MA
4 days ago
Senior/Staff Software Engineer - Perception & Sensing
$242k - $333k
...The Perception Sensing team is looking for a Senior or Staff Software Engineer to drive the evaluation and architectural design of our PCP stack... ...Qualifications Experience in performance optimization to fit complex ML stack to low-power low cost edge compute (e.g., Nvidia Thor,...
Odd job
Temporary work
Relocation package
Zoox
Boston, MA
3 days ago
Staff Software Engineer
$170k - $230k
...Staff Software Engineer Tutor Intelligence is building the technology and processes to let robots go where they've never gone before:... ...scale. You will work across backend services, data and ML infrastructure, internal tools, and customer-facing systems, while collaborating...
Tutor Intelligence
Watertown, MA
23 days ago
Staff Software Engineer - Machine Learning
$134k - $235.9k
...responsible for building the ML models and system to simulate... ...Behaviors, Perception, and Safety Engineers. The specific duties may... ..., integration, creating ML infrastructure, metrics, and data pipelines... ...ML team and contribute strong software engineering (SWE) expertise....
Local area
Remote work
Work from home
Relocation
Relocation package
Flexible hours
Shift work
General Motors
Boston, MA
15 hours ago
Staff Software Development Engineer - NodeJs and AI
$106.61k - $284.28k
...Staff Software Development Engineer We're building a world of health around every individual — shaping a... ...leadership to deliver production-grade infrastructure that makes a measurable difference.... ...Experience with managed AI/ML cloud services, modern JavaScript frameworks...
Hourly pay
Full time
Temporary work
Local area
Flexible hours
Oak St. Health
Boston, MA
2 days ago
Senior and Staff Software Engineer Openings
...Staff Full-Stack Software Engineer Financial institutions - banks and credit unions - have begun a seismic shift in how they operate and serve... ...pipelines, vector store migrations, orchestration of ML utility services Optimize applications for reliability...
Remote work
Work from home
Shift work
Roberts Recruiting
Boston, MA
1 day ago
Senior/Staff Software Engineer [Full Stack]
...We're seeking an exceptional Senior/Staff Software Engineer to build and lead our core platform as... ...analyses and deal insights generated by ML pipelines. API & Integration... ...for large, dynamic datasets. Cloud Infrastructure & DevOps: Experience deploying full-stack...
Remote work
Prudentia Sciences
Boston, MA
15 hours ago
Staff Software Development Engineer - Fulfillment
$106.61k - $284.28k
...can be in the digital world. Currently, we are seeking a Staff Software Development Engineer – Fulfillment who as both a Technical Lead and Individual... ...to propagate data from SQL/NoSQL stores to analytics and ML systems with strict latency and throughput requirements...
Hourly pay
Full time
Contract work
Temporary work
Local area
CVS Health
Boston, MA
1 day ago
Staff Software Engineer, Autonomy Evaluation
$172.8k - $251.65k
...analyses to evaluate autonomous driving software performance across the autonomy stack.... ...functional efforts with autonomy, systems engineering, simulation, and data teams to embed evaluation... ...Invent and drive new statistical and ML methods, and ML introspection techniques,...
Local area
Remote work
Work from home
Relocation
Relocation package
Flexible hours
General Motors
Boston, MA
15 hours ago
Staff Software Engineer, Applied AI
$230k - $280k
...respect, and accountability. Staff Software Applied AI EngineerLocation:... ...security . As a Staff AI Engineer , you'll help shape the... ...production-grade AI platforms and infrastructure. Must be able and... ...Experience with cloud-based AI/ML services (AWS Bedrock, GCP...
Apprenticeship
Work at office
Local area
Remote work
Flexible hours
Shift work
1 day per week
HackerOne
Boston, MA
15 hours ago
Staff Software Engineer, ML Tooling and Infrastructure
$155k - $230k
...Staff Software Engineer As a Staff Software Engineer on the Atlas team, you will be a critical engineering... ...to build the tooling, pipelines, and infrastructure that bridge the gap between... ...packaging. ~ Strong familiarity with the ML ecosystem, including PyTorch, ONNX,...
Hourly pay
Immediate start
Boston Dynamics
Waltham, MA
15 hours ago
Senior / Staff C++ Software Engineer - Perception 3D Tracking
$242k - $290k
...Machine Learning Engineer The Perception Object Detection and Tracking team at Zoox deals... ...to move. Your role is to work with the ML model teams to bring cutting-edge models... ...data in the world and an incredible infrastructure for testing and validating your algorithms...
Zoox
Boston, MA
4 days ago
Staff Software Engineer
$170k - $200k
...platform to empower small businesses. Our engineering team isn't just a support function, we'... ...of third-party vendors to fuel our AI/ML initiatives and create a more sophisticated... ...financial ecosystem. We are seeking a Staff Software Engineer to be a pivotal technical...
Work at office
Work from home
Flexible hours
Forward Financing
Boston, MA
2 days ago
Staff Software Engineer - ML Observability
$234k - $300k
The ML Observability team builds cutting‑edge tools to monitor, explain, and improve... ...to ship AI with confidence. As a Staff Engineer, you’ll lead the development of new features... ...deep understanding of both AI systems and software engineering to solve open‑ended problems...
Work at office
I did my part and supported the Regular Toilet
Boston, MA
1 day ago
Staff Software Engineer, Scientific System of Record
$144k - $288k
...Staff Software Engineer, Scientific System of Record Cambridge, MA USA; San Francisco, CA USA... ...laboratory workflows. You'll work closely with ML researchers, platform engineers, and... ...large-scale workloads. Cloud and Infrastructure: Leverage AWS services, Kubernetes,...
Full time
Work at office
Local area
Flexible hours
Lila Sciences
Cambridge, MA
4 days ago
Staff Software Engineer - Vet Care
$166k - $265k
...Opportunity Chewy is seeking a Staff Software Engineer to lead the Practice Hub engineering... ...that ingests signals from predictive ML models and recommendation engines. You... ...optimization, automation, and process infrastructure of the Chewy Vet Care business and operations...
Local area
Flexible hours
Chewy
Boston, MA
3 days ago
Staff ML Software Engineer
$140k - $210k
...innovation and creating the best experience for job seekers. (*Comscore, Total Visits, March 2025) Day to Day As a Software Engineer IV (ML) on the Machine Learning Model Platform team at Indeed, you will be responsible for leading and executing key objectives for...
Temporary work
Work experience placement
Local area
Indeed
Boston, MA
2 days ago
Staff Software Engineer - HealthTech - Series B (Hybrid)
...at the intersection of data, software engineering, and applied machine... ...rigorous environment. As a Staff Software Engineer , you’ll... ...software, data, and applied ML teams to ensure systems are... ...performance tuning ~ Cloud infrastructure experience (AWS), including...
Evolution USA
Boston, MA
4 days ago
Staff Software Engineer (Platform - Identity)
$218.03k - $256.5k
...Attendance is expected and fully supported. We're hiring a Staff Software Engineer to lead the Identity Accounts team — the platform... ...across three focused sub-teams: Foundations (authorization infrastructure), Users Platformization (decomposing the legacy monolith)...
Local area
Coinbase
Boston, MA
4 days ago
Staff Software Engineer, Backend - Platform (Payment Rails)
$218.03k - $256.5k
...is expected and fully supported. We are looking for a Staff Software Engineer to join the Payment Rails team within Coinbase's Platform... ...engineering. ~ Experience building payment systems, financial infrastructure, or high-volume transactional systems. ~ You've...
Local area
Coinbase
Boston, MA
2 days ago
Staff Software Engineer, Backend - Platform (FinHub Tooling)
$218.03k - $256.5k
...efficiency, and safety of these fund movements. Our tooling serves Engineering, Customer Experience, Risk, and Compliance teams — enabling... ...organization operates faster and more safely. As a Staff Software Engineer, you will own the technical strategy and architecture...
Local area
Coinbase
Boston, MA
1 day ago
Staff Software Engineer, Backend - Platform (Overseer)
$218.03k - $256.5k
...correctness a designed property of the infrastructure, not a coordination problem between teams. At staff-level, you’ll define what that... ...framework, real-time detection engine, and the APIs and tooling that... ...requirements) : ~8+ years of software engineering experience. ~ You...
Local area
Coinbase
Boston, MA
1 day ago
Staff Software Engineer
$170k - $195k
...Staff Software Engineer Waltham, MA Xometry powers the industries of today and tomorrow by connecting... ...the effort to integrate production ML pipelines into the application layer,... ...: Expert-level mastery of Python, AWS infrastructure, and modern frontend frameworks (React...
Work at office
3 days per week
Xometry
Waltham, MA
2 days ago
Staff Software Engineer
...Hybrid Full-time, Staff Software Engineer at Activ Surgical About the job, About the Company Activ Surgical is an early-stage medical device startup dedicated to transforming advanced surgical visualization through innovative imaging, computer vision, and AI...
Full time
Immediate start
Flexible hours
Activ
Boston, MA
2 days ago
Senior Staff Software Engineer, Solana Staking Protocol
$253.9k - $298.7k
...reliability. The Role We are looking for a Senior Staff Software Engineer to serve as Coinbase's Solana Staking Protocol CTO — the... ...Execution: Write production code. Design and build critical infrastructure for validator operations, monitoring, automation, and...
Local area
Coinbase
Boston, MA
15 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Software Engineer, ML Infrastructure. Be the first to apply!