Staff Software Engineer - ML Observability
$234k - $300kI did my part and supported the Regular Toilet
The ML Observability team builds cutting‑edge tools to monitor, explain, and improve AI systems in production, particularly those leveraging Large Language Models (LLMs) and generative AI. We provide robust, scalable observability for AI workloads, including drift detection and model evaluation, and behavior tracing, enabling customers to ship AI with confidence. As a Staff Engineer, you’ll lead the development of new features and foundational capabilities within Datadog’s LLM Observability product. You will shape product direction, drive experimentation, and apply your deep understanding of both AI systems and software engineering to solve open‑ended problems in the fast‑moving AI landscape. Your work will directly impact how our customers monitor, troubleshoot, and optimize LLM‑based applications in production. Join us in building the foundational tools that make AI systems observable, understandable, and reliable in the real world. At Datadog, we place value in our office culture – the relationships and collaboration it builds and the creativity it brings to the table. We operate as a hybrid workplace to ensure our Datadogs can create a work‑life harmony that best fits them. What You’ll Do: Drive design and implementation of LLM observability features. Ideate, prototype, and scale new product features to provide insights and drive improvements for generative AI systems. Work cross‑functionally with other engineering teams, product, UX, and applied science to iterate fast and find product‑market fit. Develop and extend tools for tracing, evaluating, and debugging LLMs. Influence architecture decisions and mentor engineers to build resilient, high‑performance systems. Stay close to customer pain points and use those insights to guide product and engineering priorities. Stay current with industry trends and advancements in machine learning and observability, driving innovation within the team. Who You Are: You have a BS/MS/PhD in a Computer Science, Engineering or related scientific field or equivalent experience. Deep understanding of distributed systems and scalable backend architectures. Hands‑on experience building and shipping LLM‑powered or GenAI applications. Understanding of model internals, inference pipelines, evaluation techniques, and prompt engineering. Ability to thrive in ambiguous, fast‑changing spaces and have a product‑oriented mindset. You’re excited to shape the next generation of AI observability tools from the ground up. Communicate clearly, think rigorously, and take pride in clean, maintainable code. Experience with observability tools/platforms. Datadog values people from all walks of life. We understand not everyone will meet all the above qualifications on day one. That’s okay. If you’re passionate about technology and want to grow your skills, we encourage you to apply. Benefits and Growth: Get to build tools for software engineers, just like yourself. And use the tools we build to accelerate our development. Have a lot of influence on product direction and impact on the business. Work with skilled, knowledgeable, and kind teammates who are happy to teach and learn. Competitive global benefits. Continuous professional development. Benefits and Growth listed above may vary based on the country of your employment and the nature of your employment with Datadog. Datadog offers a competitive salary and equity package, and may include variable compensation. Actual compensation is based on factors such as the candidate's skills, qualifications, and experience. In addition, Datadog offers a wide range of best in class, comprehensive and inclusive employee benefits for this role including healthcare, dental, parental planning, and mental health benefits, a 401(k) plan and match, paid time off, fitness reimbursements, and a discounted employee stock purchase plan. The reasonably estimated yearly salary for this role at Datadog is: $234,000 — $300,000 USD Equal Opportunity at Datadog: Datadog is an affirmative action and equal opportunity employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. Here are our candidate legal notices for your reference. #J-18808-Ljbffr I did my part and supported the Regular Toilet
$234k - $300k
...The ML Observability team builds cutting-edge tools to monitor, explain, and improve AI systems... ...to ship AI with confidence. As a Staff Engineer, you'll lead the development of new features... ...understanding of both AI systems and software engineering to solve open-ended...SuggestedWork at office$106.61k - $284.28k
...Staff Software Development Engineer We're building a world of health around every individual — shaping a... ...deployment pipelines, and production observability infrastructure. Your work will directly... ...Experience with managed AI/ML cloud services, modern JavaScript frameworks...SuggestedHourly payFull timeTemporary workLocal areaFlexible hours- ...We're seeking an exceptional Senior/Staff Software Engineer to build and lead our core platform as... ...analyses and deal insights generated by ML pipelines. API & Integration... ...practices - automated testing, CI/CD, observability, and modular architecture - while staying...SuggestedRemote work
$106.61k - $284.28k
...world. Currently, we are seeking a Staff Software Development Engineer – Fulfillment who as both a Technical... ...trunk based development, and continuous observability Troubleshoot and lead resolution... ...SQL/NoSQL stores to analytics and ML systems with strict latency and...SuggestedHourly payFull timeContract workTemporary workLocal area$230k - $280k
...respect, and accountability. Staff Software Applied AI EngineerLocation:... ...security . As a Staff AI Engineer , you'll help shape the... ...Establish meaningful metrics, observability, evaluation frameworks, and... ...Experience with cloud-based AI/ML services (AWS Bedrock, GCP...SuggestedApprenticeshipWork at officeLocal areaRemote workFlexible hoursShift work1 day per week$146.6k - $215.1k
...Staff Software Engineer, ML Infrastructure We're a high-tech home security company that's passionate about protecting the life you've built... ...whether that's serving reliability, deployment friction, observability gaps, scaling, or cost. Build and operate real-time...Work at office$170k - $200k
...empower small businesses. Our engineering team isn't just a support... ...party vendors to fuel our AI/ML initiatives and create a more... .... We are seeking a Staff Software Engineer to be a pivotal technical... ...experience. Performance & observability: Tunes for real-user performance...Work at officeWork from homeFlexible hours$234k - $300k
...The ML Observability team builds cutting-edge tools to monitor, explain, and improve AI systems... ...to ship AI with confidence. As a Staff Engineer, you’ll lead the development of new features... ...understanding of both AI systems and software engineering to solve open-ended...Work at office- ...Staff And Senior Software Engineers Suno is growing fast, and we're hiring Staff and Senior Software Engineers... ...closely with Engineering, Data, and ML teams to ensure data is reliable,... ...— building systems that are robust, observable, and easy to extend. Your work will...Work at officeLocal areaImmediate start
- ...everything else is built on. We’re hiring Staff and Senior Software Engineers to work on the systems, platforms,... ..., secure, scalable, and easy to observe Own systems end-to-end — from design... ...planes to support rapid product and ML growth Building and operating distributed...Full timeWork at officeLocal area
- ...Staff Full-Stack Software Engineer Financial institutions - banks and credit unions - have begun a seismic shift in how they operate and serve... ...pipelines, vector store migrations, orchestration of ML utility services Optimize applications for reliability...Remote workWork from homeShift work
$253.9k - $298.7k
...and reliability. The Role We are looking for a Senior Staff Software Engineer to serve as Coinbase's Solana Staking Protocol CTO — the... ...infrastructure at scale — bare metal, cloud, networking, observability. Strategic Vision: You can define year-long technical...Local area$200k - $325k
...Staff Software Engineer Iterative Health is a healthcare technology and services company powering the acceleration of clinical research to transform... ...and implementation, between integration engineering and ML infrastructure, between defining technical strategy and...$170k - $230k
...Staff Software Engineer Tutor Intelligence is building the technology and processes to let robots go where they've never gone before: the average... ...scale. You will work across backend services, data and ML infrastructure, internal tools, and customer-facing systems,...$242k - $333k
...The Perception Sensing team is looking for a Senior or Staff Software Engineer to drive the evaluation and architectural design of our PCP stack... ...Qualifications Experience in performance optimization to fit complex ML stack to low-power low cost edge compute (e.g., Nvidia Thor,...Odd jobTemporary workRelocation package$192k - $256k
...Staff Software Engineer, Lab Software Cambridge, MA USA Join us in shaping the future of science! We are seeking Staff Software Engineers... ...systems at scale. Cross-Functional Collaboration: Work with ML researchers, engineers, and scientists to integrate data...Full timeWork at officeLocal areaFlexible hours$172.8k - $251.65k
...analyses to evaluate autonomous driving software performance across the autonomy stack.... ...functional efforts with autonomy, systems engineering, simulation, and data teams to embed evaluation... ...Invent and drive new statistical and ML methods, and ML introspection techniques,...Local areaRemote workWork from homeRelocationRelocation packageFlexible hours$134k - $235.9k
...Agents group is responsible for building the ML models and system to simulate road users... ...Behaviors, Perception, and Safety Engineers. The specific duties may include ML/RL... ...part of an ML team and contribute strong software engineering (SWE) expertise. Support the...Local areaRemote workWork from homeRelocationRelocation packageFlexible hoursShift work$242k - $290k
...Machine Learning Engineer The Perception Object Detection and Tracking team at Zoox deals with perception of all people and objects that have a capability to move. Your role is to work with the ML model teams to bring cutting-edge models into the vehicle stack....$140k - $210k
...innovation and creating the best experience for job seekers. (*Comscore, Total Visits, March 2025) Day to Day As a Software Engineer IV (ML) on the Machine Learning Model Platform team at Indeed, you will be responsible for leading and executing key objectives for...Temporary workWork experience placementLocal area$166k - $265k
...Job Description: Our Opportunity Chewy is seeking a Staff Software Engineer to lead the Practice Hub engineering team, part of the Vet... ...aggregated data pipeline that ingests signals from predictive ML models and recommendation engines. You will work with...Local areaFlexible hours$144k - $288k
...Staff Software Engineer, Scientific System of Record Cambridge, MA USA; San Francisco, CA USA Your Impact at LILA We are seeking a... ...analytics and laboratory workflows. You'll work closely with ML researchers, platform engineers, and scientists to develop systems...Full timeWork at officeLocal areaFlexible hours$254k - $336k
...assembled a diverse team of experts in software, robotics, artificial intelligence,... ...defense capability. ABOUT THE JOB Staff Robotics Engineers lead the delivery of vehicle... ...the security of our candidates. We've observed a rise in sophisticated phishing and...Full timeWork experience placementImmediate startFlexible hours$218.03k - $256.5k
...Coinbase is seeking an experienced backend engineer to join our Advanced Trading team to... ...have at least 8 years of experience in software engineering. You’ve designed, built,... ...customer obsession with comprehensive observability You empower and cultivate your teammates...Local areaWorldwide$168.75k
...Staff AI Embedded Software Engineer - Connected Devices Seattle, Washington, United States Join Axon and be a Force for Good. At Axon, we... ...system design, including reliability, scalability, safety, observability, and lifecycle management. Identify and mitigate...Work experience placementWork at officeRemote work- ...into global manufacturing capacity. We are looking for a Staff Software Engineer to join our core machine learning and data platform engineering... ...the foundational infrastructure leveraged by Xometry’s AI/ML solutions, including the Instant Quoting Engine®, the...Full time
$218.03k - $256.5k
...is expected and fully supported. We are looking for a Staff Software Engineer to join the Payment Rails team within Coinbase's Platform... ...designing for fault isolation, graceful degradation, strong observability, and clear SLOs for Tier-0/Tier-1 services. Drive an AI...Local area- ...Senior Software Engineer About Datalign: Datalign Advisory is a Cambridge-based fintech building... ...Senior Software Engineers with deep AI/ML expertise to join a small, high-impact... ...inference/services and production observability (logs, metrics, traces) ~ Strong communication...Work at officeFlexible hours
- ...Overview of Job Function: As a Senior Software Engineer, you will take deep technical ownership... ...standards. Production Support and Observability Lead triage and resolution of Tier-2... ...Principal Engineer and Tech Leads. AI/ML Integration and Innovation Integrate...Contract workLocal areaShift work
$286.2k - $326.7k
...OverviewSr. Distinguished Software Engineer - AML (Remote - Eligible)As a Sr. Distinguished Engineer... ...system characteristics, such as observability, resiliency, and operational excellenceContinue... ...experience implementing AI/ML strategies for anomaly detection, entity...Full timePart timeLocal areaRemote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Software Engineer - ML Observability. Be the first to apply!
- software product owner Boston, MA
- id software Boston, MA
- software quality assurance Boston, MA
- software sales Boston, MA
- internship software Boston, MA
- remote software sales Boston, MA
- embedded software Boston, MA
- software asset management analyst Boston, MA
- software engineer - cloud services Boston, MA
- software Boston, MA


