Applied AI Inference Engineer
Baseten
ABOUT BASETEN
Baseten powers mission‑critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting‑edge models into production. We're growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products.THE ROLE
As an Applied AI Inference Engineer at Baseten, you will partner directly with customers to architect, build, and deploy high‑scale production AI applications on Baseten’s platform. You’ll own the journey with customers from initial exploration to production deployment, translating ambiguous business goals into reliable, observable services with clear quality, latency, and cost outcomes. This role is a great fit for entrepreneurial engineers who want a front‑row view into how modern companies adopt AI at scale and who enjoy working across product, software development, performance engineering, and customer‑facing implementations. To be clear, this is an engineering role with hands‑on coding and software development that also includes aspects of product management, technical customer success, and pre‑sales solution engineering mixed in.EXAMPLE INITIATIVES
Take a look at these blog posts written by members of our Forward Deployed Engineering team: Forward Deployed Engineering on the frontier of AI The fastest, most accurate Whisper transcription Deploy production‑ready model servers from Docker images Deploy custom ComfyUI workflows as APIsRESPONSIBILITIES
Develop and maintain software systems and product features using one or more general‑purpose programming languages in a production‑level environment, with a preference for Python due to its relevance in ML projects. Drive customer impact by designing, implementing, and deploying Baseten solutions end‑to‑end (problem framing → evaluation → production deployment → monitoring). This involves working with customers’ engineering teams at every stage of the customer journey including: sales, implementation, and expansion. Deliver with velocity: turn vague objectives into clear specs and well‑defined PoCs so we can rapidly ship well‑tested services and outcomes for our customers. Optimize and enhance AI/ML projects, contributing to the continuous improvement of our technical stack. This includes developing features and PRDs with other engineering and product orgs. Own products and customer projects end‑to‑end, functioning as both an engineer, project manager, and product manager, with a focus on user empathy, project specification, and end‑to‑end execution. Navigate ambiguity and exercise good judgment on tradeoffs and tools needed to solve problems, avoiding unnecessary complexity. Demonstrate pride, ownership, and accountability for your work, expecting the same from your teammates.REQUIREMENTS
Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field. 1+ years of professional work experience in a fast‑paced, high‑growth environment. Demonstrated experience with one or more general‑purpose programming languages in a production‑level environment, with a strong preference for Python. Familiarity with AI/ML pipelines and the lifecycle of ML model development and deployment. Strong communication skills, particularly on complex technical topics. Experience in building or optimizing AI/ML projects is highly valued.BENEFITS
Competitive compensation, including meaningful equity. 100% coverage of medical, dental, and vision insurance for employee and dependents. Flexible PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!) Paid parental leave. Fertility and family‑building stipend through Carrot. Company‑facilitated 401(k). Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities. At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status. We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law (by example, the requirements of the San Francisco Fair Chance Ordinance, where applicable). #J-18808-Ljbffr BasetenVacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Applied AI Inference Engineer in San Francisco, CA vacancy
$175k - $225k
...Our team is led by veteran operators and engineers, alumni of Sonos, Paypal, Tesla, Apple,... .... The Role We're looking for an AI Inference Engineer who lives at the boundary of high... ...prototypes from Jetpack into Yocto Apply advanced optimization techniques-including...SuggestedLocal areaRemote work- ...Who We Need As an Applied AI Engineer on Rillet's AI & ML team, you'll design and ship production AI systems that transform how finance... ...Experience with fine-tuning or training models, not just inference Familiarity with Python, Kotlin, Java, or TypeScript...SuggestedWork at officeRemote workRelocationFlexible hours
- ...Senior Software Engineer We're hiring a Senior Software Engineer onto our Applied AI team to build and extend the backend systems that power our platform. This is... ...science or research teams Familiarity with causal inference or graph-based systems Why Alembic...SuggestedWork at office
- ...speed, accuracy, and insight. Rillet is an AI-native ERP that can drive a zero-day... ...-growing companies. Who We Need As an Applied AI Engineer on Rillet's AI & ML Team, you will design... ...the infrastructure powering LLM inference, fine-tuning pipelines, and RAG systems...SuggestedWork at officeRemote workRelocationFlexible hours
- About the Role As an Applied AI Engineer, you'll work on everything from browser agent reliability to document understanding to inference optimization, building systems that make the work more accurate and faster every week. This is a role for someone who thrives on turning...SuggestedRelocation
- This is a job that Jill, our AI Recruiter, is recruiting for on behalf of one of... ...next step is speak to Jack. Job Title: Applied AI Engineer Company Description: Innovative AI simulation... ..., data-efficient training, and inference optimization, enabling new frontiers in...
- ...is building Developmental Intelligence: AI for predicting how organisms change... ...systems. The Role We are seeking an Applied AI Engineer to help build the software, infrastructure... ...Support model serving and inference infrastructure Cloud & Platform Engineering...
- ...and Hims & Hers Health). The Role We’re hiring an Applied AI Software Engineer to lead evaluations for agents in development and the post... ...agents in Canvas using state of the art foundation model inference and fine-tuning APIs along with our server-side SDK. The...Remote workHome officeFlexible hours
- A dynamic AI company in San Francisco is looking for an Applied AI Inference Engineer to develop and deploy high-scale production AI applications. You will partner with customers to transform business goals into reliable services while engaging in software development and...Flexible hours
$150k - $170k
...About the job AI Engineer $150K-$170K - San Francisco, CA | Apply Now Now Hiring: AI Engineer in San Francisco, CA!Apply now with 24/7 PT.• Responsibilities:- We're looking for an AI Engineer to help build the core agent systems that power autonomous financial audits...Immediate start$157k - $175k
...institutions. In 2025, we started Handshake AI and built the fastest-growing AI data... ...institutions Work together with engineers, scientists, operators, and more from Palantir... ...scale. About the Role As an Applied AI Engineer at Handshake AI Enterprise, you...Full timeWork at officeRemote workFlexible hours$250k - $400k
...companies. Our team is 100% remote and we work with teams across the United States to help them hire. Title of Role: Senior Applied AI Engineer Location: San Francisco, CA (On-site, 5-6 days/week) Company Stage of Funding: Series A (~$37M raised, led by...H1bWork at officeRemote workVisa sponsorship$200k - $275k
...is to create the next generation of Gen AI-driven code reviewers: a symbiotic partnership... ...significantly outperforms individual engineers. We combine language models with human ingenuity... ...and quality. Role Overview As an Applied Gen AI Engineer at CodeRabbit, you'll...Remote work- ...Applied AI Engineer Automat was founded by a diverse group of creative technologists from teams like Google Creative Lab, and Samsung's innovation Think Tank Team. We envision a future where intelligent agents streamline business operations — and one day, run their...Flexible hours
- ...About the role Slash is building an AI-native financial platform, and we're... ...stack AI features end to end, from prompt engineering and agent orchestration to React UI and... ...Improve and scale our AI infrastructure: inference routing, model tiering, prompt caching,...Work at office
$180k - $220k
...channel. Marketing services is a $1T/year industry. If AI has automated coding, marketing is next. The Engineering Challenge Fully autonomous agents are still... ...Clay, Replit, and others. Vishruth (Founding Applied AI Engineer) , previously a Data Scientist at...Work at officeRelocation packageShift work$175.82k - $263.82k
...world with simpler experiences. You in? Why would you love this job? We are seeking a highly skilled and motivated Senior Applied AI Engineer to join our small but efficient Applied AI Engineering Team. As a part of this team, you will play a crucial role in helping...Local areaWorldwide- ...Applied AI Engineer We're looking for an Applied AI Engineer with 2-5 years of experience building and shipping AI systems used by real people. You're comfortable working with modern LLM-based systems and agentic workflows, and you know how to turn powerful models into...
$185k - $275k
...About the job AI Agent Software Engineer $185K-$275K - San Francisco, CA | Apply Now Now Hiring: AI Agent Software Engineer in San Francisco, CA!Apply now with 24/7 PT.• Responsibilities:- Design and build AI-driven agents for voice and chat- Translate natural language...Immediate start- ...ripe for disruption. Zania is building agentic AI for GRC to solve this problem. We're building... ...and Instacart. Pioneering Technology: Our engineers and GRC experts work at the absolute forefront of applied AI, building agentic systems that will define the...Work at officeRelocation packageFlexible hours
- ...home for all of America. Abby Care is building the leading AI-native platform for family-led care. America is facing a... ...through robust, intelligent systems. We're looking for a Senior Applied AI Engineer to build production-grade AI systems that automate and...Full time
$200k - $350k
...This is a job that Jill, our AI Recruiter, is recruiting for on behalf of one of our customers. She will pick the best candidates... ...network The next step is to speak to Jack. Job Title: Applied AI Engineer Salary: $200K - $350K + Equity Company Description:...- ...Applied AI Engineer Everyone's talking about AI. But here's the truth: AI is trapped in a chat box. It can't take real actions in the real world. We are changing that forever. We're not just building another AI company - we're creating the infrastructure that will...Shift work
$212.63k - $381.15k
...Job Requisition ID # 25WD93825 Position Overview As the Distinguished Applied AI Engineer AI Transformation you will define and guide and accelerate the strategy for the rapid, responsible and outcome driven adoption of AI across Autodesk's go-to-market (GTM)...For contractors$150k - $300k
...The Role As an Applied AI Engineer, you'll bring the frontier of AI research and engineering to Rowspace, transforming how our platform understands and surfaces critical financial insights. In a typical week, you might build and productionize an agent in response to...Work at office$220k
Perplexity is looking for an engineer to join their team in San Francisco. You will work on building and operating the inference engine, supporting new models, migrating GPU kernels, and developing a Rust-based serving runtime. The ideal candidate has 3+ years of experience...$160k - $280k
...Edison Scientific builds and commercializes AI agents for science. Scientific discovery... ...assembling a team of top researchers and engineers across AI and biology to build an AI scientist. Role We're looking for an Applied AI Engineer to join the team responsible...Work at office$181.5k - $283.8k
...Generative AI Engineer At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare... ...change how drug discovery research is conducted. The Applied Intelligence for Discovery (AI4D) team is a newly formed group...Full timeFlexible hours- Fathom is seeking a Model Performance Engineer in San Francisco to optimize the speed, cost, and reliability of its model inference stack while building fine-tuning infrastructure. The ideal candidate will have extensive experience with LLM frameworks, quantization techniques...
- A leading cloud infrastructure company is seeking a Senior Engineer 2 to join their AI Inference Optimization team. The role involves leading the technical strategy for performance architecture and addressing complex performance issues ensuring industry-leading service....Remote job
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Applied AI Inference Engineer. Be the first to apply!
Related searches
- technical application engineer San Francisco, CA
- application operations engineer San Francisco, CA
- senior application support engineer San Francisco, CA
- application engineer San Francisco, CA
- field applications engineer San Francisco, CA
- hydraulic application engineer San Francisco, CA
- application support engineer San Francisco, CA
- application engineering manager San Francisco, CA
- cnc applications engineer San Francisco, CA
- network applications engineer San Francisco, CA


