Software Engineer - Model Performance
The Consensus
ABOUT BASETEN

Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $1.5B Series F, led by Altimeter Capital, Conviction Partners, and Spark Capital. Join us and help build the platform engineers turn to when shipping AI products.

THE ROLE

Are you passionate about advancing the application of artificial intelligence? We are looking for a Software Engineer focused on ML performance to join our dynamic team. This role is ideal for someone who thrives in a fast-paced startup environment and is eager to make significant contributions to the exciting field of LLM inference. If you are a backend engineer who thrives on making things faster and is excited about open-source ML models, we look forward to your application.

EXAMPLE INITIATIVES

You’ll get to work on these types of projects as part of our Model Performance team:

- Baseten Embeddings Inference: The fastest embeddings solution available
- The Baseten Inference Stack
- Driving model performance optimization

RESPONSIBILITIES

- Implement, refine, and productionize cutting-edge techniques (quantization, speculative decoding, KV cache reuse, chunked prefill, and LoRA) for ML model inference and infrastructure.
- Deep dive into the underlying codebases of TensorRT, PyTorch, TensorRT-LLM, vLLM, SGLang, CUDA, and other libraries to debug ML performance issues.
- Apply and scale optimization techniques across a wide range of ML models, particularly large language models.
- Collaborate with a diverse team to design and implement innovative solutions.
- Own projects from idea to production.

REQUIREMENTS

- Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or a related field.
- Experience with one or more general-purpose programming languages, such as Python or C++.
- Familiarity with LLM optimization techniques (e.g., quantization, speculative decoding, continuous batching).
- Strong familiarity with ML libraries, especially PyTorch, TensorRT, or TensorRT-LLM.
- Demonstrated interest and experience in LLMs.
- Deep understanding of GPU architecture.

Bonus:

- Proficiency in enhancing the performance of software systems, particularly in the context of large language models (LLMs).
- Experience with CUDA or similar technologies.
- Deep understanding of software engineering principles and a proven track record of developing and deploying AI/ML inference solutions.
- Experience with Docker and Kubernetes.

BENEFITS

- Competitive compensation, including meaningful equity.
- 100% coverage of medical, dental, and vision insurance for employees and dependents.
- Flexible PTO policy, including a company-wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!).
- Paid parental leave.
- Fertility and family-building stipend through Carrot.
- Company-facilitated 401(k).
- Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status. We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law (for example, the requirements of the San Francisco Fair Chance Ordinance, where applicable).
