Applied AI Inference Engineer

Baseten

ABOUT BASETEN

Baseten powers mission‑critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting‑edge models into production. We're growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to when shipping AI products.

THE ROLE

As an Applied AI Inference Engineer at Baseten, you will partner directly with customers to architect, build, and deploy high‑scale production AI applications on Baseten’s platform. You’ll own the journey with customers from initial exploration to production deployment, translating ambiguous business goals into reliable, observable services with clear quality, latency, and cost outcomes. This role is a great fit for entrepreneurial engineers who want a front‑row view into how modern companies adopt AI at scale and who enjoy working across product, software development, performance engineering, and customer‑facing implementations. To be clear, this is an engineering role with hands‑on coding and software development; it also includes aspects of product management, technical customer success, and pre‑sales solution engineering.

EXAMPLE INITIATIVES

Take a look at these blog posts written by members of our Forward Deployed Engineering team:
Forward Deployed Engineering on the frontier of AI
The fastest, most accurate Whisper transcription
Deploy production‑ready model servers from Docker images
Deploy custom ComfyUI workflows as APIs

RESPONSIBILITIES

Develop and maintain software systems and product features using one or more general‑purpose programming languages in a production‑level environment, with a preference for Python due to its relevance in ML projects.
Drive customer impact by designing, implementing, and deploying Baseten solutions end‑to‑end (problem framing → evaluation → production deployment → monitoring). This involves working with customers’ engineering teams at every stage of the customer journey, including sales, implementation, and expansion.
Deliver with velocity: turn vague objectives into clear specs and well‑defined PoCs so we can rapidly ship well‑tested services and outcomes for our customers.
Optimize and enhance AI/ML projects, contributing to the continuous improvement of our technical stack. This includes developing features and PRDs with other engineering and product orgs.
Own products and customer projects end‑to‑end, functioning as engineer, project manager, and product manager, with a focus on user empathy, project specification, and end‑to‑end execution.
Navigate ambiguity and exercise good judgment on tradeoffs and tools needed to solve problems, avoiding unnecessary complexity.
Demonstrate pride, ownership, and accountability for your work, expecting the same from your teammates.

REQUIREMENTS

Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or a related field.
1+ years of professional work experience in a fast‑paced, high‑growth environment.
Demonstrated experience with one or more general‑purpose programming languages in a production‑level environment, with a strong preference for Python.
Familiarity with AI/ML pipelines and the lifecycle of ML model development and deployment.
Strong communication skills, particularly on complex technical topics.
Experience in building or optimizing AI/ML projects is highly valued.

BENEFITS

Competitive compensation, including meaningful equity.
100% coverage of medical, dental, and vision insurance for employee and dependents.
Flexible PTO policy, including a company‑wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!).
Paid parental leave.
Fertility and family‑building stipend through Carrot.
Company‑facilitated 401(k).
Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status. We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law (for example, the requirements of the San Francisco Fair Chance Ordinance, where applicable).

Vacancy posted 2 days ago
