Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Field Engineer - Microsoft Foundry

$280k - $320k
Full-time

Fireworks AI

About Us: At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry. We’ve been independently benchmarked as the leader in LLM inference speed and are driving cutting-edge innovation through projects like our own function calling and multimodal models. Fireworks is a Series C company valued at $4 billion and backed by top investors including Benchmark, Sequoia, Lightspeed, Index, and Evantic. We’re an ambitious, collaborative team of builders, founded by veterans of Meta PyTorch and Google Vertex AI. The Role As an AI Field Engineer for Microsoft Foundry, you will be one of the technical owners of Fireworks' most strategic partnership. You’ll work closely with Microsoft's field teams, Azure-aligned ISVs, and the SIs that run enterprise AI transformation programs to make Fireworks the default inference and fine-tuning layer in every Azure AI architecture your partners touch. The role sits at the intersection of engineering, partner development, and customer delivery. You build reference architectures, run benchmarks, debug production integrations, and co-develop POCs — all while holding your own in executive-level conversations about strategy, roadmap, and business outcomes. You spend most of your time building and enabling. You ship code, run joint POCs with Microsoft field teams, and architect deployments that span Azure Foundry and Fireworks. But you also lead discovery conversations, align partner stakeholders, and translate field signals into product improvements that compress the feedback loop from partner to roadmap. The Segment As a Field Engineer aligned with our Partnerships team you own the technical relationship between Fireworks and the Microsoft ecosystem, Azure field teams, ISVs building on Azure Foundry, and the SIs that deliver AI transformation programs on Azure. The Microsoft partnership is a core go-to-market bet: clients like UIPath, Stack Blitz, Motif run via Fireworks on Foundry.. Your job is to scale that pattern across the partner ecosystem. These engagements involve large, multi-stakeholder organizations, so you will need to navigate both the enterprise buyer (IT, security, compliance) and the builder (ML engineers, platform teams, app developers), while building the trusted-advisor relationships inside Microsoft's field that multiply your reach. What You'll Work On Technical Delivery and Deployment Be the technical lead on co-sell motions with Microsoft — joint reference architectures, Azure Foundry integration patterns, and shared POCs for strategic accounts. Build end-to-end POCs and MVPs alongside partner engineering teams, working inside their codebases, infrastructure, and constraints. Run load tests and establish latency, throughput, and cost baselines against realistic customer traffic profiles, and tune deployments to hit those targets. Deploy and validate new model families on inference frameworks (vLLM, SGLang), determining optimal shapes, quantization configs, and serving patterns across workloads. Model Strategy and Fine-Tuning Guide Microsoft’s customers on model selection, fine-tuning strategy (SFT, DPO, RFT), and evaluation methodology. Build and run fine-tuning pipelines directly with customers, navigating trade-offs between model families, compute cost, and quality targets. Design and implement evaluation frameworks that measure production-quality metrics, not just benchmark scores. Product Feedback and Platform Improvement Own the feedback loop — surface partner-driven product gaps to Fireworks engineering, and translate the roadmap back into partner messaging. Ship external technical content: reference architectures, integration guides, and benchmark posts that make it easy for partners to win deals with us. Track pipeline health; flag risks and opportunities to Field leadership weekly. What We're Looking For Minimum Qualifications 3+ years in a pre-sales, partner engineering, forward-deployed, or technical consulting role. Demonstrated ability to build production software with customers, not just advise on it. You have shipped code running in someone else's production environment. Strong Python skills. Comfortable reading, writing, and debugging production code. Familiarity with Kubernetes and infrastructure engineering. Hands-on fluency with LLM inference: latency/throughput tradeoffs, batching strategies, quantization, structured outputs, function calling. You can explain why 50ms p99 matters to an enterprise CTO. Real experience with fine-tuning — LoRA at minimum, RFT a strong plus. You understand when SFT is enough and when it isn't. Deep familiarity with the Azure AI stack: Azure Foundry, Azure OpenAI Service, Azure ML, AKS, Entra/RBAC for AI workloads. You know where Fireworks fits and where it doesn't. Exceptional communication: able to run a sharp discovery call, present to a VP, and debug a latency issue with an ML engineer in the same afternoon. Preferred Qualifications 5+ years in technical field or engineering roles where you've owned a technical relationship with a hyperscaler or major SI, not just supported one Experience with inference serving frameworks (vLLM, SGLang, TensorRT-LLM) and tuning deployments for real workloads. Prior role at a hyperscaler, AI-native cloud, or inference provider. Experience with agentic frameworks (LangChain, LlamaIndex, or custom tool-use pipelines) — you understand how inference latency and reliability shapes agent behavior at scale. Background in model evaluation — you understand why benchmark gaming is rampant and what rigorous evals actually look like. You've written a technical blog post or reference architecture that people actually read. Track record taking GenAI POCs from prototype to production-scale deployments. On-Target Expectations (Plus Equity)

$280,000 - $320,000 USD

Total compensation also includes meaningful equity in a fast-growing startup, along with a competitive salary and comprehensive benefits package. Base salary is determined by a range of factors including individual qualifications, experience, skills, interview performance, market data, and work location. Fireworks AI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all innovators. Total compensation for this role also includes meaningful equity in a fast-growing startup, along with a competitive salary and comprehensive benefits package. Base salary is determined by a range of factors including individual qualifications, experience, skills, interview performance, market data, and work location. The listed salary range is intended as a guideline and may be adjusted. On Target Earnings (Plus Equity)

$280,000—$320,000 USD

Why Fireworks AI? Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure, from low-latency inference to scalable model serving. Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally. Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results. Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation. Fireworks AI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all innovators.

Vacancy posted 17 hours ago
Similar jobs that could be interesting for youBased on the AI Field Engineer - Microsoft Foundry in San Mateo, CA vacancy
  • $71.7k - $88.8k

     ...The Field Service Engineer - Gas Detection will be working within the Industrial Automation business unit, specifically the Service Team...  ...years of field experience ~1-2 years of experience using Microsoft Office Suite, including Excel, Word, SharePoint, and PowerPoint... 
    Microsoft
    Temporary work
    Work experience placement
    Work at office
    Remote work
    Flexible hours

    Honeywell

    San Mateo, CA
    3 days ago
  • $127.4k - $182k

     ...opportunity in UCaaS, Contact Center and AI-powered adjacencies. We invest more than...  ...management role. We are hiring an AI Product Engineer — a hands-on technologist with enough...  ...conversational AI platforms: Google Dialogflow CX, Microsoft Copilot Studio, or equivalent.... 
    Microsoft
    Full time
    Local area
    Work from home
    Flexible hours

    RingCentral

    Belmont, CA
    2 days ago
  • $86.5k - $142.7k

     ...Solution Architect who designs, prototypes and builds modern, AI‑enabled applications and digital products. You will spend the majority...  ...and reviewing code, building proofs‑of‑concept, and guiding engineering teams through complex technical decisions. This position... 
    Suggested
    Summer holiday
    Flexible hours

    EY

    San Mateo, CA
    5 days ago
  •  ...and life sciences, is looking for a "Sr AI Fullstack Engineer" based out of South San Francisco, CA....  ..., Mathematics, or a related field. ~5+ years of full stack development...  ...LLM systems Experience in developing Microsoft Word add-ins using Office.js. Experience... 
    Microsoft
    Long term contract
    Work at office

    Dawar Consulting

    South San Francisco, CA
    3 days ago
  • $131k - $154k

     ...unlock productivity by placing AI, data and intelligence at the...  ...Description As an Applied AI Engineer you are pushing the envelope in...  ...technology partners, such as Microsoft, OpenAI and Databricks. With Celonis...  ..., mathematics or related fields, or equivalent work experience... 
    Microsoft
    Full time
    Work experience placement
    Worldwide
    Shift work

    I did my part and supported the Regular Toilet

    Redwood City, CA
    3 days ago
  •  ...AI Solutions Developer/ Architect Position: Fulltime Location: Foster City, CA Duration...  ...of experience in software development or solution engineering • Hands on experience with Microsoft Azure (APIs, Functions, Logic Apps, App Services) •... 
    Microsoft
    Full time

    Lorven Technologies

    San Mateo, CA
    5 days ago
  •  ...define and enforce the best practices for engineering across the company. Our approach involves...  ...best practices for responsible AI integration in our development pipeline...  ...Computer Science, Engineering, or a related field with 3+ years of industry experience.... 
    Temporary work
    Relocation package

    Zoox

    Foster, CA
    a month ago
  • $160k - $250k

     ...Title: Founding AI Engineer (Research & Systems) Target: PhDs & Research Masters from Stanford, MIT, Berkeley, CMU focused on AI, ML, NLP...  ...completed a PhD/MSc in Computer Science, AI, or a related field. Publication record at top conferences (NeurIPS, ICML, ICLR,... 
    H1b
    Immediate start
    Visa sponsorship

    Aimhire

    San Mateo, CA
    3 days ago
  •  ...Full-Time AI Interfaces Engineer for a fast-growing startup building a real-time 3D positioning platform used by customers in data-intensive environments. In this role, you’ll design and build AI chat interfaces, agentic workflows, and product-facing MCP servers that... 
    Full time

    VortexLink

    Belmont, CA
    4 days ago
  • $192k - $238k

     ...GTM AI Engineer Organizations everywhere struggle under the crushing costs and complexities of "solutions" that promise to simplify their lives. To create a better experience for their customers and employees. To help them grow. Software is a choice that can make or... 
    Flexible hours

    Freshworks

    San Mateo, CA
    7 hours ago
  •  ...A tech-driven AI company in San Mateo is seeking an entry-level Software Engineer. This role focuses on developing core AI technology and graphical user interfaces...  ...grounding in computer science, engineering, or related fields. You will work closely with an experienced team... 

    Monogram

    San Mateo, CA
    4 days ago
  • $110k - $270k

     ...machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code. Role The AI Kernel Engineer in Quadric plays the key role to enable a large number of AI kernels/operators to run efficiently on the Quadric platform.... 
    Work at office
    Local area
    Immediate start
    Flexible hours
    2 days per week

    quadric.io

    Burlingame, CA
    5 days ago
  • $123k - $190.9k

     ...Visa is seeking a Cyber Analytics and AI professional to innovate machine learning and AI capabilities. In this hybrid position, you'll collaborate with architects, engineers, and product managers to design secure applications that drive productivity across the organization... 

    Visa

    Foster, CA
    4 days ago
  •  .... Here in the US, we are blending German engineering with American ingenuity. As ADMT, we develop...  ...to know and understand the future of AI and to derive requirements on sensor...  ...unit tests, integration tests, in-vehicle/field tests, etc.) Analyzes and evaluates conducted... 
    Local area
    Worldwide

    Omega Solutions Inc

    Belmont, CA
    4 days ago
  • $152.7k - $249.2k

     ...re now scaling manufacturing and preparing for the launch of our commercial service. Overview We're looking for a Senior AI Engineer to help bring pragmatic, production-grade AI capabilities into the systems that power Joby's manufacturing and enterprise workflows... 
    Temporary work

    Joby Aviation

    San Carlos, CA
    3 days ago
  •  ...A leading software firm in private lending technology is seeking an AI Engineer to develop production-ready AI pipelines for their Loan Origination System. Candidates should have 1-3 years of experience in building AI systems using large language models and strong proficiency... 

    The Mortgage Office (Applied Business Software Inc.)

    San Mateo, CA
    4 days ago
  •  ...system intelligence. As a Model Optimization & Deployment Engineer, you will focus on bringing highly efficient, production-ready large...  ...kernels to minimize latency and maximize memory bandwidth on AI accelerators. Write production-level, low latency, and memory... 
    Temporary work
    Relocation package

    Zoox

    Foster, CA
    4 days ago
  •  ...machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM models and Quadric unique platforms. The AI Inference Engineer at Quadric... 
    Temporary work
    Work from home

    quadric.io

    Burlingame, CA
    5 days ago
  • $45 - $60 per hour

     ...the performance. Requirements MS student in CS/CE or related fields. Proficiency in C/C++/Python Experience in kernel implementation...  ...hands-on experience working alongside industry experts in AI and semiconductor technology, with access to mentorship and meaningful... 
    Hourly pay
    Temporary work
    Internship
    Work at office
    Relocation

    quadric, Inc

    Burlingame, CA
    22 days ago
  • $97.02k - $163.03k

     ...CorVel in San Mateo, CA seeks a motivated software engineer to develop and enhance AI-driven software solutions. You will work closely with a focused...  ...candidates should possess a Bachelor's degree in a related field and at least 5 years of experience. The role offers a... 

    Energy Jobline ZR

    San Mateo, CA
    4 days ago
  •  ...Replit is seeking a Security Engineer to join their Fraud team in Foster City, CA. You will design and implement detection systems for AI-generated code, analyze attack patterns, and work on unique security problems. This full-time role involves collaborating closely... 
    Full time
    Work at office
    3 days per week

    Jobr

    Foster, CA
    4 days ago
  •  ...A leading technology company in California is seeking an AI Inference Engineer to bridge AI models with unique platforms. Key responsibilities include model optimization, deployment, and performance profiling. Candidates should have a Bachelor’s or Master’s degree, 5+... 
    Remote work
    Work from home

    quadric.io

    Burlingame, CA
    4 days ago
  •  ...Strategic/Functional Define and drive the organization's AI strategy and roadmap aligned with business goals Educate leadership and cross-functional teams on AI capabilities, use cases, and best practices Identify high-impact AI/ML use cases across... 

    Omega Solutions Inc

    San Carlos, CA
    2 days ago
  •  ...About Obvio AI Each year, more than 40,000 people in the U.S. leave home and never make it back due to traffic crashes. At Obvio...  ...promoted and rolled back without pipeline downtime. Set the engineering standard. This is an early hire. You'll write the playbooks—... 
    Local area

    Obvio

    San Carlos, CA
    5 days ago
  • $216k - $283k

     ...employee and customer experiences. Our people-first approach to AI eliminates friction, helping businesses reduce complexity, lower...  ...us. Job Description We are looking for a Full Stack AI Engineer to build intelligent, scalable, and user-centric product experiences... 
    Work at office
    Flexible hours
    3 days per week

    Freshworks

    San Mateo, CA
    26 days ago
  • $147.7k - $215k

     ...and a $93 Billion market opportunity in UCaaS, Contact Center and AI-powered adjacencies. We invest more than $250 million annually...  ...skills come in. We’re currently looking for: an AI Forward-Deployed Engineer This role is responsible for closing the gap between solution... 
    Full time
    Local area
    Remote work
    Flexible hours

    RingCentral

    Belmont, CA
    3 days ago
  • $200k - $350k

     ...Research Engineer | San Francisco | Full-Time Brief Overview Applied AI lab building world models for 3D game environments. Early-stage, well-backed, 3 weeks from beta launch. What You'll Own You'll take large ownership of the core generation and simulation... 
    Full time
    Visa sponsorship
    Relocation package
    Flexible hours

    Harnham

    San Mateo, CA
    5 days ago
  •  ...A technology company specializing in AI solutions is seeking an AI Applications Engineer to bridge development engineering and hands-on users. This role involves integrating products into AI frameworks, developing applications, and providing technical leadership. Applicants... 
    Remote work
    Work from home

    quadric.io

    Burlingame, CA
    4 days ago
  •  ...A technology security firm is seeking a Director of Software Engineering specializing in AI to lead teams focused on delivering innovative AI solutions. This role requires extensive experience in software development and a proven track record in leadership. The ideal... 

    Synack

    San Mateo, CA
    4 days ago
  • $110k - $270k

     ...executes both NN graph code and conventional C++ DSP and control code. Role The AI Applications Engineer is the key bridge between development engineering and hands-on users in the field. The AI Application Engineer will [1] integrate Quadric product and software... 
    Work at office
    Local area
    Immediate start
    Worldwide
    Flexible hours

    quadric.io

    Burlingame, CA
    11 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Field Engineer - Microsoft Foundry. Be the first to apply!