Software Engineer, Evaluation Infrastructure
$127k - $223kWaabi
Job Description
Job Description
Waabi, founded by AI visionary Raquel Urtasun, is the leader in Physical AI. With a world-class team, we're unlocking the next era of autonomous transportation with technology that's powering commercial autonomous trucks and robotaxis. Waabi is backed by and partners with world leaders in AI, automotive, logistics, and deep tech.
With offices in Toronto, San Francisco, Dallas, and Pittsburgh, Waabi is growing quickly and looking for diverse, innovative and collaborative candidates who want to impact the world in a positive way. To learn more visit:
The Evaluation Algorithms team is responsible for building the algorithms & tooling required to comprehensively evaluate our autonomy system’s performance across all development stages. In this role, you will work closely with the systems & safety teams, responsible for defining the requirements & evaluation criteria, and simulation teams to leverage Waabi World, our highly realistic closed-loop simulation engine built with the latest in generative AI technologies to deliver the evaluation capabilities needed to support the safe development of the next generation of autonomous vehicles!
You will…
- Develop the tooling, infrastructure, and pipelines to support complex statistical analyses of driving performance scale.
- Implement metrics and tags to provide a holistic understanding of model performance and enable the discovery of interesting scenarios for training and evaluation.
- Develop and maintain a high availability query service to enable low-latency analysis and curation over large volumes of metric and tag data
- Work with large datasets from various sources including real world driving as well as Waabi World, our high-fidelity simulator.
- Champion engineering excellence, ensuring high-quality, well structured and tested code.
- Assist in project roadmap planning, prioritisation, and delivery.
Qualifications:
- MS/Bachelors degree with 2+ years of industry experience in Computer Science, Machine Learning and/or similar technical field(s) of study.
- Proficient in Python programming and strong software engineering fundamentals with real-world experience writing high quality, well-structured, and well-tested code.
- Open-minded and collaborative team player with the willingness to help others.
- Passionate about self-driving technologies, solving hard problems, and creating innovative solutions.
Bonus Points:
- Experience in data processing pipelines, ETL pipelines, distributed computing.
- Experience in building highly reliable and scalable web services
- Understanding of cloud job orchestration, monitoring, and instrumentation best-practices.
- Experience in evaluating complex ML models or self-driving software stacks.
The US yearly salary range for this role is: $127,000 - $223,000 USD in addition to competitive perks & benefits. Waabi (US) Inc.’s yearly salary ranges are determined based on several factors in accordance with the Company’s compensation practices. The salary base range is reflective of the minimum and maximum target for new hire salaries for the position across all US locations. Note: The Company provides additional compensation for employees in this role, including equity incentive awards and an annual performance bonus.
Perks/Benefits:
- Competitive compensation and equity awards.
- Health and Wellness benefits encompassing Medical, Dental and Vision coverage (for full-time employees only).
- Unlimited Vacation.
- Flexible hours and Work from Home support.
- Daily drinks, snacks and catered meals (when in office).
- Regularly scheduled team building activities and social events both on-site, off-site & virtually.
- As we grow, this list continues to evolve!
Waabi is a technology start-up building technologies to transform the way the world moves. Join our talented team to be a part of the future and to make an impact!
Waabi is an equal opportunity employer. We celebrate diversity and are committed to creating a supportive, inclusive, and accessible workplace for all our employees. We seek applicants of all backgrounds and identities, across race, color, ethnicity, national origin or ancestry, age, citizenship, religion, sex, sexual orientation, gender identity or expression, military or veteran status, marital status, pregnancy or parental status, caregiver status, disability, or any other characteristic protected by law. We make workplace accommodations for qualified individuals with disabilities as required by applicable law. If reasonable accommodation is needed to participate in the job application or interview process please let our recruiting team know.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
- ...About the Team We build the data, evaluation, and experimentation infrastructure powering next‑generation agentic... ..., top‑tier startups, and elite engineering orgs . Revenue is already in the... ...~1–3 years as a full‑stack software engineer ~ Background at a high...SuggestedRemote workFlexible hours
$50 - $150 per hour
A leading AI company is seeking a software engineer to review and evaluate model-generated code. This contract role requires several years of software engineering experience, particularly as a full-stack engineer at notable tech firms. You will assess code quality and...SuggestedHourly payContract workFlexible hours$145k - $215k
...Software Engineer In Test - Infrastructure Redwood City, CA (Hybrid); San Francisco, CA (Hybrid) At Snorkel, we believe meaningful AI doesn't start... ...AI Work at the forefront of AI development and evaluation Collaborate with top AI labs and innovative...SuggestedLocal area- ...Role Handshake is building the infrastructure layer that powers the next generation... ...AI agents across our platform. As a Software Engineer on our Agentic Infrastructure team, you... ...across Handshake's platform Design evaluation, observability, & reliability...SuggestedFull timeFreelanceInternshipWork at officeRemote workFlexible hours
$170k - $216k
...U.S. states. The Simulation Infrastructure team creates reliable, scalable, and... ...Simulation-based products that evaluate the Waymo Driver's software stack at a massive scale. We solve... ...broad range of customers Software Engineers, Product, Data Science, System Engineering...SuggestedFull timeRemote work$150k - $215k
...Space Infrastructure Software Engineer Wanna join the adventure? As a Space Infrastructure Software Engineer, you are responsible for scaling... ...The salary range for this role is intentionally wide as we evaluate individuals based on their unique experience and...Temporary workWork at officeRelocation packageFlexible hours$161k - $230k
...Software Engineer: Infrastructure at Thatch Location: Remote (US); San Francisco, California, United States. Compensation: $161,000 – $230,000 USD... ...statistical reporting purposes only and plays no role in the evaluation of your application. Your decision to provide or...Remote work- ...just as language models are changing how engineers write code. Our vision is a design... ...engineer obsessed with building systems and infrastructure that are as simple as possible while... ...product surface, model inference, and evaluation suite. You'll work closely with product...Flexible hours
- ...design teams for Google Workspace. What you’ll do As a Software Engineer, Infrastructure at Sierra, you will be responsible for designing,... ...doesn't precisely match the job description. We strive to evaluate all applicants consistently without regard to race, color...Full timeFlexible hours
$215k - $265k
...scheming mitigations. We're looking for a Software Engineer to build the platform that the rest of... ...Build and maintain Apollo's cloud infrastructure . This means IaC, networking,... ...’s assets. Frontier labs trust us to evaluate their models pre-release. It is essential...Full timeWork experience placementWork at officeImmediate startVisa sponsorshipFlexible hours- ...address real-world challenges. The Infrastructure Engineering team is crucial to the overall... ...perform deep-dive code reviews, and evaluate technical trade-offs to ensure a sustainable... ...Standards: Establish and enforce elite software engineering and DevOps standards,...Shift work
$140k - $225k
...assembling a diverse, world-class team-engineers, designers, researchers, and... ...The Role As the Senior Software Engineer, Tooling and Development Infrastructure, you will play a critical role in... ...user-facing features. Identify, evaluate, and integrate new tools that...Full timeTemporary workLocal areaFlexible hours$204k - $259k
...Senior Software Engineer, Simulation ML Infrastructure Waymo is an autonomous driving technology company with the mission to be the world's most trusted... ...-scale dataset generation, model training, and evaluation. Collaborate cross-functionally to derive performance...Full timeRemote work- ...Software Engineer, Client Infrastructure Engineering · Full-time · San Francisco; New York Our mission is to automate coding. The first step in... ...release velocity—shipping updates to users daily. Evaluating Electron vs. native approaches for critical code paths...Full timeWork at office
$200k - $400k
...a team. About the Team The ML Infrastructure team builds the systems that power every... ...training, the infrastructure for model evaluation and experimentation, and the routing layer... ...'re hiring a Senior ML Infrastructure Engineer to own the platforms powering Decagon's...Full timeWork at officeLocal area- ...Resolve AI Job Post Software maintenance and production troubleshooting... ...become a massive tax of engineering velocity. Resolve AI is... ...application security, cloud infrastructure, internal services, integrations... ...as the company grows. Evaluate and improve how we handle...
$209k - $283k
...Staff Engineer At Candid Health, we're on a mission to revolutionize... ...grittiest and most complex infrastructure challenges at Candid,... ...advancements in technology, evaluate and identify any emerging technologies... ...the next generation of software engineers on advanced...- Software Engineer (Security/Infrastructure) — AfterQuery About the Role You’ll be responsible for protecting a system that handles highly sensitive workflows... ...human-in-the-loop systems used in AI training and evaluation Contribute to core infrastructure and internal...
$175k - $215k
...Waymo Driver. The Simulator Evaluation team faces the ultimate data... ...We are looking for aSoftware Engineer to build the metrics and... ...will report to Senior Staff Software Engineering Manager and serve... ...components of our data processing infrastructure. You will write and maintain...Full timeRemote work$204k - $259k
...Senior Software Engineer, Quantitative Evaluations Waymo is an autonomous driving technology company with the mission to be the world's most trusted... ...with ML Experience with A/B experiment infrastructure Experience building and validating metrics to measure...Full timeRemote work$204k - $259k
...Senior Software Engineer, Statistical Evaluation and Sampling Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the...Full timeRemote work$204k - $259k
...create a training ground for the Waymo Driver. The Simulator Evaluation team faces the ultimate data challenge: How do you... ...that a virtual world is "real"? We are looking for aSenior Software Engineer to build the metrics and systems that grade this hybrid environment...Full timeRemote work$170k - $216k
...Software Engineer, Perception Evaluation and Test Automation Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the...Full timeRemote work$181.1k - $318.4k
...AIML - Sr. Software Development Engineer, Evaluation At Apple, we create world-class innovative products that seamlessly combine cutting-edge hardware... ...ML system is a plus. Experience with cloud-based infrastructure (AWS) and hybrid ecosystems. Experience with ETL...Immediate startRelocation$250k - $330k
...and grow as a team. About the Team The Infrastructure team builds and operates the... ...Role We’re hiring a Senior Infrastructure Engineer to design, build, and operate production... ...architect solutions, set up prototypes, evaluate performance, and scale new features. Tune...Work at office$150k - $215k
...Job Description Wanna join the adventure? As a Space Infrastructure Software Engineer, you are responsible for scaling our ability to operate a... ...The salary range for this role is intentionally wide as we evaluate individuals based on their unique experience and...Temporary workWork at officeRelocation packageFlexible hours- ...Type Hybrid Department Product Infrastructure Who are we? Our mission is... ...is a team of researchers, engineers, designers, and more, who are... ...infrastructure and tools used to train, evaluate and serve Cohere’s... ...North. We’re hiring software engineers at multiple levels...Full timeWork at officeRemote workFlexible hours
- ...Background Specter is creating a software-defined "control plane" for... ...become the perception engine for a company's physical footprint... ...Specter is hiring an ML infrastructure engineer to build and scale... ...Developing continuous training and evaluation systems to improve model...
$100k - $300k
...Position Overview We are looking for a Software Engineer to work at the forefront of developing and optimizing the software infrastructure and tools necessary for training cutting... ..., training orchestration, and model evaluation) and frameworks for large-scale AI models...- ...Software Engineer, Agent Evaluation and Quality Engineering · Full-time · San Francisco; New York Our mission is to automate coding. The... ...build the measurement, evaluation, and feedback-loop infrastructure that makes the Cursor core agent reliably better over...Full timeWork at office
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Software Engineer, Evaluation Infrastructure. Be the first to apply!
- graduate software developer San Francisco, CA
- rust software engineer San Francisco, CA
- senior software design engineer San Francisco, CA
- software engineer student San Francisco, CA
- software engineer amazon San Francisco, CA
- software developer positions San Francisco, CA
- software engineer full time San Francisco, CA
- software qa engineer San Francisco, CA
- new graduate software engineer San Francisco, CA
- junior software developer San Francisco, CA


