Software Engineer, Model Performance Systems

BaseTen Labs, Inc.

ABOUT BASETEN

Baseten powers mission‑critical inference for the world’s most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting‑edge models into production. We’re growing quickly and recently raised our $1.5B Series F, led by Altimeter Capital, Conviction Partners, and Spark Capital. Join us and help build the platform engineers turn to to ship AI products.

THE OPPORTUNITY

We are looking for early‑career Software Engineers to join our team. This is a specialized role sitting at the intersection of high‑performance computing (HPC) and Large Language Model (LLM) engineering. You will be responsible for building the automated "speedometer and diagnostic" suite for our next‑generation AI infrastructure. In this role, you won’t just be using models; you will be tearing them apart to see how they run on the metal. You will build tools that measure GPU FLOPS, stress‑test InfiniBand clusters, and define the benchmarks that ensure our systems are production‑ready.

RESPONSIBILITIES

  • Performance Benchmarking: Run and automate standard LLM quality benchmarks (GSM8K, MMLU) alongside custom performance suites for specific workloads (e.g., long‑context window, KV cache reuse).
  • Infrastructure Validation: Create automated acceptance tests for new GPU clusters across x86 and ARM systems, measuring GPU memory bandwidth, networking throughput, and multi‑node networking performance.
  • Model Dev Experience: Develop and maintain internal GPU‑enabled development environments (similar to GitHub Codespaces). You will ensure the team has seamless, high‑performance "dev machines" optimized for model experimentation.
  • Tool Development: Build and contribute to tools such as InferenceMAX and genai-bench to automate model evaluation and optimization.
  • Deep Hardware Profiling: Use PyTorch Profiler and NVIDIA Nsight Systems to collect performance profiles, identify bottlenecks, and debug the NVIDIA compute/networking stack.
  • Monitoring & Observability: Develop real‑time dashboards and alerts to monitor system health, model startup times, and runtime performance.
  • Continuous Integration: Automate performance testing via CI/CD pipelines to catch regressions in model setups before they hit production.
  • Optimization Automation: Build tools to find the "Pareto frontier", identifying the absolute best configuration (latency vs. cost vs. quality) for a given model and workload.

WHAT WE'RE LOOKING FOR

This is a fresher‑friendly role. We care more about your trajectory, curiosity, and technical depth than your years of experience. We want to talk to you if you have:

  • A Love for Systems & Hardware: You aren’t just interested in the AI; you want to understand GPU memory subsystems, InfiniBand, and how data moves across a cluster.
  • An Automation Mindset: You believe that if a task has to be done twice, it should be scripted. You have a passion for stress‑testing and fuzz testing to find the "breaking point" of a system.
  • Mathematical Curiosity: A desire to understand the underlying math of Transformers and how it translates into FLOPs and memory requirements.
  • Interest in Optimization: You are excited to learn about (or already play with) quantization, speculative decoding, disaggregated serving, and kernel‑level optimizations.
  • Technical Toolkit: Familiarity with Python, and an eagerness to master the NVIDIA software stack. C++ familiarity is good to have.

WHY THIS ROLE

  • Direct Impact: Your tools will be the gatekeeper for what defines "good" performance for our customers.
  • Deep Learning (Literally): You will gain world‑class expertise in GPU orchestration and LLM inference that few engineers in the industry possess.
  • High Ownership: As a small team of freshers led by experts, you will have the autonomy to build tools from scratch and contribute to open‑source projects.

BENEFITS

  • Competitive compensation, including meaningful equity.
  • 100% coverage of medical, dental, and vision insurance for employees and dependents.
  • Flexible PTO policy, including a company‑wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!).
  • Paid parental leave.
  • Fertility and family‑building stipend through Carrot.
  • Company‑facilitated 401(k).
  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward‑thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status. We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law (for example, the requirements of the San Francisco Fair Chance Ordinance, where applicable).

Vacancy posted 5 days ago