Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer - Model Performance

The Consensus

ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $1.5B Series F, led by Altimeter Capital, Conviction Partners, and Spark Capital. Join us and help build the platform engineers turn to to ship AI products. THE ROLE Are you passionate about advancing the application of artificial intelligence? We are looking for a Software Engineer focused on ML performance to join our dynamic team. This role is ideal for someone who thrives in a fast-paced startup environment and is eager to make significant contributions to the exciting field of LLM Inference. If you are a backend engineer who thrives on making things faster and is excited about open-source ML models, we look forward to your application. EXAMPLE INITIATIVES You’ll get to work on these types of projects as part of our Model Performance team: Baseten Embeddings Inference: The fastest embeddings solution available The Baseten Inference Stack Driving model performance optimization RESPONSIBILITIES Implement, refine, and productionize cutting-edge techniques (quantization, speculative decoding, kv cache reuse, chunked prefill and LoRA) for ML model inference and infrastructure. Deep dive into underlying codebases of TensorRT, PyTorch, TensorRT-LLM, vllm, sglang, CUDA, and other libraries to debug ML performance issues. Apply and scale optimization techniques across a wide range of ML models, particularly large language models. Collaborate with a diverse team to design and implement innovative solutions. Own projects from idea to production. REQUIREMENTS Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field. Experience with one or more general-purpose programming languages, such as Python or C++. Familiarity with LLM optimization techniques (e.g., quantization, speculative decoding, continuous batching). Strong familiarity with ML libraries, especially PyTorch, TensorRT, or TensorRT-LLM. Demonstrated interest and experience in LLM’s. Deep understanding of GPU architecture. Bonus: Proficiency in enhancing the performance of software systems, particularly in the context of large language models (LLMs). Experience with CUDA or similar technologies. Deep understanding of software engineering principles and a proven track record of developing and deploying AI/ML inference solutions. Experience with Docker and Kubernetes. BENEFITS Competitive compensation, including meaningful equity. 100% coverage of medical, dental, and vision insurance for employee and dependents Flexible PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!) Paid parental leave Fertility and family-building stipend through Carrot Company-facilitated 401(k) Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities. Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you. At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status. We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law (by example, the requirements of the San Francisco Fair Chance Ordinance, where applicable). #J-18808-Ljbffr The Consensus

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Software Engineer - Model Performance in New York, NY vacancy
  • $405k

     ...group of committed researchers, engineers, policy experts, and business...  ...We're looking for a Staff Software Engineer to set technical direction...  ...eval frameworks that measure model capabilities across diverse...  ...initiatives in high-performance, demanding environments—trading... 
    Performance
    Visa sponsorship

    Anthropic

    New York, NY
    15 hours ago
  •  ...of inventive research, design, and engineering. Our organization is very flat, and...  ...shipping code. About the Role As a Software Engineer on the Model Routing & Inference team at Cursor,...  ...comfortable reasoning about cost/performance tradeoffs at scale (GPU utilization... 
    Performance

    Anysphere

    New York, NY
    2 days ago
  • SME Careers is seeking a remote Kotlin Engineer to review AI-generated responses and create high-quality Kotlin content. Responsibilities...  ...include developing AI prompts, optimizing AI performance, and ensuring model accuracy. The ideal candidate has a Bachelor's degree in... 
    Performance
    Remote job

    SME Careers

    New York, NY
    4 days ago
  • CellType Inc. is seeking a Founding Research Engineer to develop and optimize systems for their biological AI models. This pivotal role involves training, evaluation...  ...understanding of reinforcement learning and performance debugging in production systems. The position... 
    Performance
    Remote work

    CellType Inc.

    New York, NY
    3 days ago
  • $40 per hour

     ...the United States seeks an Application Developer to enhance AI models. The remote position allows you to choose your projects with...  ...effectively. The role includes evaluating AI chatbot logic and model performance, with competitive pay starting at $40+ per hour. Applicants... 
    Performance
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    Brooklyn, NY
    4 days ago
  • About the Job We are looking for an experienced AI Model Engineer with deep expertise in kernel development, model optimization, fine‑tuning...  ...benchmarking (e.g., perplexity testing, fine‑tuned adapter performance). Conduct GPU testing across desktop and mobile devices.... 
    Performance
    Remote job

    Framework Ventures

    New York, NY
    4 days ago
  • $40 per hour

    A leading AI training company is seeking a DevOps Engineer to join their remote team. In this role, you will provide coding challenges...  ...to AI chatbots and evaluate their outputs for correctness and performance. Candidates should be proficient in Python or JavaScript and... 
    Performance
    Remote job
    Hourly pay

    DataAnnotation

    New York, NY
    1 day ago
  • $197.3k - $225.1k

    Lead AI Engineer (Vision Model Customization, VML) Capital One is a leader in applying machine learning...  ..., test, deploy, and support AI software components including foundation model...  ...optimization techniques to improve the performance, scalability, cost, latency, and... 
    Performance
    Local area

    Capital One

    New York, NY
    15 hours ago
  •  ...architectures to the implementation of intelligent models in key business processes. If you’re...  ...data models, ensuring data quality, performance, and scalability. Translate business...  ...and mentoring to less experienced engineers. ParticipSpectrum decision‑making, planning... 
    Performance
    2 days per week
    3 days per week

    Derevo

    New York, NY
    2 days ago
  • A tech company is seeking a Postdoctoral Researcher to evaluate AI chatbots and improve their performance. This role is remote and can be part-time or full-time, allowing you to choose the projects you want to work on. Candidates must have a strong understanding of physics... 
    Performance
    Remote job
    Hourly pay
    Full time
    Part time

    DataAnnotation

    New York, NY
    2 days ago
  • $160k - $190k

    Model Risk - Investment Management Vice President Risk Management New York or Philadelphia The pay range for this position at commencement...  ...execution, as well as models used in risk management and performance reporting. Evaluate model conceptual soundness, ongoing... 
    Performance
    Relocation package

    Nomura Holdings, Inc.

    New York, NY
    15 hours ago
  •  ...Function / major duties and responsibilities of the job Strategic The Model Validator is responsible for validating CLS models, maintaining...  ...based on internal MRM policy and procedures. Conduct and perform quality assurance for model risk reporting. Communicate effectively... 
    Performance

    Sept 2017 Branding

    New York, NY
    3 days ago
  •  ...message the job poster from EDGE Engineering and Science, LLC Recruitment...  ...a Senior Air Dispersion Modeler to lead and manage complex...  ...Qualifications Experience with modeling software customization or scripting (...  ...matched 401(k) plan. Annual performance bonus program. Student loan... 
    Performance
    Full time
    For contractors
    Local area
    Remote work

    EDGE Engineering and Science, LLC

    New York, NY
    4 days ago
  • $40 per hour

    Feedinkoo is looking for a Frontend Software Engineer to join our team in the United States. This role involves training AI models and measuring their progress while solving coding...  ...evaluate AI chatbot outputs and their performance, ensuring high quality in the models... 
    Performance
    Remote job
    Hourly pay
    Contract work
    Flexible hours

    Feedinkoo

    New York, NY
    3 days ago
  • $160k - $190k

     ...seeking a Vice President in New York or Philadelphia to join their Model Validation Group. The role involves conducting independent...  ...models used in investment management. You will evaluate model performance, document findings, and present results to senior management. The... 
    Performance

    Nomura Holdings, Inc.

    New York, NY
    15 hours ago
  •  ...is looking for a Senior Research Scientist for its Foundation Model team in New York. In this hybrid role, you will conduct applied...  ...learning frameworks. Key benefits include a competitive salary range and eligibility for performance bonuses. #J-18808-Ljbffr SupportFinity™
    Performance

    SupportFinity™

    New York, NY
    4 days ago
  •  ...In this fully remote role, you'll assess technical accuracy, design quantitative problems, and provide impactful feedback on model performance. Candidates should have 2+ years in a quantitative field with strong coding and analytical skills, as well as a bachelor's degree... 
    Performance
    Remote job
    Flexible hours

    DataAnnotation

    New York, NY
    4 days ago
  • A leading global financial group in New York is seeking a Credit Risk Model Owner Associate to manage credit risk models for their Americas Division. This role involves model performance monitoring, governance, and collaboration with stakeholders, including the Tokyo Head... 
    Performance
    Work at office

    SMBC

    New York, NY
    2 days ago
  • $50 - $60 per hour

     ...seeking a qualified Law Clerk in New York to assist in training AI models by evaluating their legal reasoning and outputs. The role...  ...have expertise in various legal fields and will evaluate the performance of AI chatbots in addressing complex legal challenges. Payment... 
    Performance
    Remote job
    Hourly pay
    Work from home

    DataAnnotation

    New York, NY
    15 hours ago
  •  ...current or in-progress medical or healthcare-related degree. Responsibilities include evaluating AI logic and ensuring the medical accuracy of outputs, with compensation starting at $50+ per hour. Bonuses are available based on performance. #J-18808-Ljbffr DataAnnotation
    Performance
    Remote job
    Hourly pay

    DataAnnotation

    New York, NY
    1 day ago
  • $40 per hour

    A technology company in the United States is seeking a Chemical Engineer to help train AI models and assess their performance. This remote position offers flexibility in projects and payment starting at $40 per hour, with potential bonuses. Ideal candidates are fluent in... 
    Performance
    Remote job
    Hourly pay

    DataAnnotation

    New York, NY
    2 days ago
  • The Consensus is seeking a skilled engineer to join our Model Performance team, focusing on building and optimizing APIs used in AI applications. You will contribute to enhancing the performance and reliability of our model serving infrastructure, ensuring it meets the... 
    Performance
    Flexible hours

    The Consensus

    New York, NY
    1 day ago
  • A remote tech company is seeking a Chemical Engineer to join their team and train AI models. The role involves measuring AI chatbot progress, evaluating...  ...is hourly, starting at $40 USD, with bonuses for top performance. Applicants must be located in the United States. #J... 
    Performance
    Remote job
    Hourly pay
    Full time
    Part time

    DataAnnotation

    New York, NY
    1 day ago
  • $30 - $40 per hour

    A leading AI training company is seeking a UI Engineer to help train their AI models. This role involves evaluating the performance of AI chatbots and requires proficiency in programming languages such as Python and JavaScript. Candidates should have a strong grasp of algorithms... 
    Performance
    Remote job
    Hourly pay
    Flexible hours

    DataAnnotation

    New York, NY
    2 days ago
  • $20 per hour

     ...quality, clarity, tone, and completeness of responses. Ensure model responses align with expected conversational behavior and system...  ...and asynchronously to meet deadlines while improving AI model performance. Qualifications Must-Have Bachelor's degree Native speaker in Urdu... 
    Performance
    Remote job
    Contract work
    Part time
    Summer work

    Mercor

    New York, NY
    2 days ago
  • $66.6k - $124.2k

    Responsibilities Performs validation of models and assesses model risk to confirm model appropriateness and capability for a designated portfolio. Provides effective challenge during model development and communicates decisions regarding model use to the business to ensure... 
    Performance
    Local area

    Hispanic Alliance for Career Enhancement

    New York, NY
    3 days ago
  • $124k - $280k

    Opportunity As a Strategy& Strategy Consulting - Business Model Reinvention - Senior Manager you will provide strategic guidance and...  ...organizations, analyzing market trends and assessing business performance to develop recommendations that help clients achieve their goals... 
    Performance

    PRICE WATERHOUSE COOPERS

    New York, NY
    15 hours ago
  •  ...Quantitative Analytics in the Market Risk Model Development team, you will design and...  ...Devise statistical tests to evaluate model performance and quantify the impact of alternative...  ...portfolios Bachelor of Science degree in Engineering, Mathematics, Physics, Finance/Economics... 
    Performance

    JPMorgan Chase & Co.

    New York, NY
    1 day ago
  • $75 - $150 per hour

     ...Treliant is looking for Credit Risk Modelers for remote, project-based...  .... Responsibilities Perform thorough model validation of...  ...e. statistics, econometrics, engineering). Advanced degree a plus. 5...  ...Experience using SAS and/or Stata software packages a plus. Experience... 
    Performance
    Work experience placement
    Work at office
    Remote work
    Flexible hours

    Treliant (Acquired by Huron - 2025)

    New York, NY
    4 days ago
  •  .... We’re training and deploying frontier models for developers and enterprises who are building...  .... Cohere is a team of researchers, engineers, designers, and more, who are passionate...  ...the inference stack to improve core performance metrics by diving deep into model execution... 
    Performance
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    New York, NY
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer - Model Performance. Be the first to apply!