Principal Software Engineer, CoreAI Workload Engines
$142.8k - $274.8kMicrosoft Corporation
Overview
The?CoreAI?Workloads team builds the?foundational inference engines and APIs that power largescale AI inference across Azure - from?cutting-edge?startups to Fortune 500 enterprises and Microsoft Copilots and agents. Our mission is to deliver?secure, reliable, and highly efficient GPU inference that enable multitenant AI systems at?global?scale while maximizing?utilization, performance, and developer productivity.?We own inference serving and performance of OpenAI and other state of the art large language model (LLM) models and work directly with OpenAI serving some of the largest workloads on the planet with trillions of inferences per day. Our?converged AI fabric and engines?deliver inference capabilities for all LLMs in?Microsoft catalog ( , including OpenAI,?Anthropic,?Mistral, Cohere, Llama, and more.
This role sits at the intersection of LLM inference fleets, serving efficiency, rapid experimentation, cloud infrastructure, and systems software-working closely with CoreAI data plane, compute, and partner teams to deliver end-to-end efficiencies and platform capabilities.
In this role, you will have the opportunity to work on multiple levels of the AI software stack, including the fundamental abstractions, programming models, OpenAI and OSS engines runtimes, libraries and application programming interfaces (APIs) to enable large scale inferencing of models.
You will drive production-grade inference serving improvements for OpenAI and open-source models across Azure, including benchmarking, performance measurement, and disciplined experimentation to improve latency, throughput, availability, and cost at scale. You will both (1) make hands-on engine changes and (2) contribute to the experimentation capabilities that make those changes measurable, safe to ship, and repeatable across teams.
Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.?
In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.?
Responsibilities
As?the Principal?engineer on the?team, your responsibilities include: ?
Optimize inference engines for OpenAI and open-source models by implementing and shipping performance/efficiency improvements across runtime, scheduling, and serving paths (latency, throughput, utilization, availability, and cost).
Run experiments end-to-end: formulate hypotheses, implement engine changes (including Python/PyTorch integration points where relevant), analyze results, and ship improvements behind guardrails.
Build and use experimentation capabilities for large-scale AI inference (experiment lifecycle, tracking, metric modeling, comparability standards, automated analysis) so the team can iterate quickly and safely.
Own serving availability and efficiency for Azure OpenAI Service workloads through tiered experimentation, lean segmentation, and multi-modal utilization across heterogeneous fleets-turning findings into shipped engine improvements.
Design and evolve inference serving architectures to improve utilization and latency using techniques such as disaggregated serving, multi-token prediction, KV offload/retrieval, and quantization-validated via staged rollouts and production guardrails.
Extend AI infrastructure abstractions to support elastic, heterogeneous inference engines reliably at scale (e.g., dynamic scaling across model families, modalities, and workload classes while maintaining isolation and SLOs).
Tune and scale inference engines across NVIDIA GPU generations (A100, H100, H200) for state-of-the-art OpenAI models, focusing on serving efficiency, utilization, and reliability (not hardware bring-up).
Partner with networking and storage teams to leverage high-performance interconnects (e.g., RDMA/InfiniBand-class fabrics such as RoCE over IB) for distributed inference, without owning low-level kernel/driver enablement.
Drive end-to-end features from design through production: observability, diagnostics, performance regression detection, and operational excellence for inference serving.
Influence platform architecture and technical direction across teams through design reviews, clear metrics, and technical leadership focused on experimentation velocity and production reliability.
Additional Responsibilities
Work across multiple layers of the AI software stack (abstractions, programming models, engine runtimes, libraries, and APIs) to enable large-scale model inference.
Benchmark OpenAI and other LLMs for performance across Azure OpenAI Service workload tiers and segments, and translate results into production improvements.
Debug, profile, and optimize production inference performance across the stack (abstractions, runtime, scheduling, and serving pipelines) to improve latency, throughput, and utilization.
Monitor performance regressions and drive continuous improvements to reduce time-to-deploy and hardware footprint.
Collaborate across engineering teams to deliver scalable, production-ready serving efficiency and availability improvements, using experimentation results to guide prioritization and rollout.
Build durable engine interfaces that enable fast experimentation and safe shipping of new strategies for class of service (QoS), replica load balancing, KV management (including offload/retrieval), quantization, and sampling (e.g., multi-token prediction and constrained sampling).
Out of Scope (This role does not focus on)
Novel hardware bring-up or first-party silicon enablement (e.g., Microsoft chips) or expanded support for non-NVIDIA platforms (e.g., AMD).
Low-level kernel, driver, or CUDA optimization as a primary responsibility.
Model pre-training, fine-tuning, or model architecture customization.
Qualifications
- Bachelor's Degree in Computer Science or related technical field and 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Python, or equivalent experience.
Other Requirements:
Proven ability to design and operate large-scale, production inference services with high reliability and performance requirements, and to ship performance improvements safely via disciplined experimentation.
Strong skills in performance analysis: benchmarking, profiling, diagnosing regressions, and turning results into concrete engine/runtime changes.
Strong problem-solving skills and the ability to debug complex,?cross?layer?systems issues.?
Demonstrated technical leadership, including mentoring engineers, driving cross-team architectural alignment, and leveraging AI tools and AI-assisted workflows to accelerate engineering velocity and quality.
Hands-on experience with Kubernetes (building and operating services on k8s), including debugging production issues and designing platform abstractions (e.g., custom resources/controllers) and scheduling-aware deployments (e.g., node affinity, taints/tolerations, resource requests/limits).
Strong collaboration and communication skills, with the ability to work across organizational boundaries.?
Preferred Qualifications:
Experience optimizing LLM inference in practice (e.g., PyTorch inference, serving runtimes, model execution, or inference orchestration) in production environments.
Familiarity?with?high?performance?networking?and?low?latency?communication stacks.?
Familiarity with GPU-accelerated inference stacks (e.g., CUDA at the application/runtime level, device plugins, or runtime integration).
Experience building or using experimentation systems (A/B, canarying, tiered rollout), including metric definition and comparability for performance and reliability.
Familiarity with distributed inference stacks (e.g., NCCL-style collectives, model/tensor parallelism) and performance tradeoffs in large-scale serving.
Impact & Growth:
Work on?mission?critical infrastructure?that directly powers?largescale?AI systems.?
Influence the future of?cloud GPU platforms?used by internal and external customers.?
Collaborate with experts across?OS, hardware, networking, and AI platform teams.?
Opportunity to grow as a?technical leader, shaping long?term platform strategy.?
Software Engineering IC5 - The typical base pay range for this role across the U.S. is USD $142,800 - $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
Software Engineering IC6 - The typical base pay range for this role across the U.S. is USD $165,600 - $296,400 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $220,800 - $331,200 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations. (
$142.8k - $274.8k
...Overview The CoreAI GPU Infrastructure team builds the foundational... ...infrastructure, systems software, virtualization, and... ...Responsibilities As the Principal engineer on the team, your responsibilities... ...for training and inference workloads, spanning bare metal, virtual...SuggestedOngoing contractLocal area$139.9k - $274.8k
...Overview Join our team within? CoreAI , where we are building the?AI data-plane?that powers all LLM inferencing workloads across Microsoft and Azure customers-from... ..., Cohere, Llama, and more. As a? Principal Software Engineer , you will shape the future of one...SuggestedOngoing contractLocal area$119.8k - $234.7k
...Overview CoreAI sits at the center of Microsoft's mission to redefine how software is built and experienced, providing the foundational... ...build services that empower engineers and scientists across the... ...of production workloads. · Experience designing and...SuggestedOngoing contractLocal area$119.8k - $234.7k
...the Role ~ We'rebuildingAIfirst engineering systemsthat power growth at Microsoft -... ...trustworthy AI. AboutCoreAI ~ CoreAI- Platform and ToolsisMicrosoft's AIfirst... ...What We're Looking For ~ Software engineering fundamentals with experience...SuggestedOngoing contractLocal area$163k - $296.4k
...confidence. ~ As aPrincipal Growth Engineer inCoreAI,you'lldrive the technical... ...technical judgment. AboutCoreAI ~ CoreAI- Platform and ToolsisMicrosoft's AIfirst... ...handson and detailoriented ~ Software engineering fundamentals with experience...SuggestedOngoing contractLocal area$139.9k - $274.8k
...Overview Software quality is being redefined by AI. As part of the Microsoft Playwright team , you'll build the foundation... ...workflow and serve millions worldwide. As a Principal Software Engineer - CoreAI on the Playwright engineering team, you will design and...Ongoing contractLocal areaWorldwide$142.8k - $274.8k
...agents and generative AI systems. Within CoreAI, the Foundry Agents organization is... ...evaluate and optimize agents. As a senior engineer within Foundry Agents, you will help... ...Drive technical direction across the full software development lifecycle, influencing design...Ongoing contractLocal area$165.6k - $296.4k
...Overview CoreAI is at the forefront of Microsoft's mission to redefine how software is built and experienced. We are responsible for building the foundational... ..., and multi-agent solutions. As a Principal Software Engineering Manager on this team, you will lead and grow...Ongoing contractLocal area$100.6k - $199k
...Overview CoreAI is at the forefront of Microsoft's mission to redefine how software is built and experienced. We are responsible... ...powers all of Microsoft's AI workloads, such as M365 CoPilot, Github... ...many more. As a Software Engineer on the training infrastructure...Ongoing contractLocal area$139.9k - $274.8k
...AI is at the forefront of Microsoft’s mission to redefine how software is built and experienced. We are responsible for building the... ...Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited...Ongoing contractLocal area$163k - $296.4k
...Windows, Edge, web, and mobile. We're seeking a hands-on Principal Software Engineer to define and lead the strategy for authentication, authorization... ...consumer compliance expectations for Copilot-class workloads. Leadership: Track record mentoring senior engineers...Ongoing contractWork at officeLocal areaWorldwide$142.8k - $274.8k
...the world. The Customer Experience Engineering team operates within the Office of the... ...We are looking for hands-on cloud Software Engineers and Senior Software Engineers... ...engineering, helping improve how production workloads are designed, monitored, and operated...Ongoing contractWork at officeLocal areaRemote workWorldwideHome office$142.8k - $274.8k
...multimodal experiences across Microsoft's AI ecosystem. As a Principal Software Engineer on the Image Search Experience team, you will define and... ...performance, reliability, and efficiency under real-world workloads. Define success metrics and experimentation strategies,...Ongoing contractWork at officeLocal areaWorldwide$142.8k - $274.8k
...65 Copilot inference is a high-impact engineering team advancing applied AI and large-scale... ...scalability, and efficiency. As a Principal Software Engineering Manager you will lead a... ...system-level improvements in areas such as workload execution, scheduling, batching, or...Ongoing contractWork at officeLocal area3 days per week$165k - $242k
...became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at What You'll Do: As a Senior Software Engineer II (IC4) on the AI Workload Orchestration team, you will help build and operate CoreWeave's Kubernetes-native platform for admitting,...Permanent employmentTemporary workCasual workWork at officeRemote workFlexible hours$102.1k - $202.2k
...AI is at the forefront of Microsoft's mission to redefine how software is built and experienced. We are responsible for building the foundational... ...stack. In this role, you will work as a software engineer building and operating the Azure Managed Grafana, and Azure Monitor...Ongoing contractLocal area$304k
...foundational layer that powers Snowflake's AI, Analytics and Data Engineering capabilities. We lead innovations across open table formats... ...for Analytics, Data Engineering, and high-performance AI workloads. ~ AI & Agentic Foundation: Build the foundational metadata...Flexible hours$142.8k - $274.8k
...services across Microsoft to everyone through GitHub, Entra, Azure, and beyond. We are looking to hire an experienced Principal Software Engineer who knows how to build and nurture a high performing team. The role requires experience building seamlessly integrated components...Ongoing contractLocal area$119.8k - $234.7k
...advancing Microsoft’s Cloud Solutions, AI strategy, full stack engineering, Security, Dataverse & D365? Are you interested in a... ...and at the forefront of AI-Led engineering. As a Senior/Principal Software Engineer , you will lead the end-to-end software development...Ongoing contractLocal area$155k - $215k
...Job Description Position Title: Principal Software Engineer Position Description: Protingent Staffing has an exciting direct-hire opportunity for a Principal Software Engineer. Job Responsibilities: • Elevate the technical judgment of engineers around...Permanent employmentRemote work- ...Principal Engineer, Endpoint AI Learning Framework CrowdStrike's Sensor Security Platform team builds foundational security capabilities for Crowstrike's Falcon sensor, which runs on over 50 million endpoints worldwide. We are significantly expanding our AI and machine...Work at officeWorldwide2 days per week
$71.23 - $121.29 per hour
...Description Principal Software Engineer IS - Hybrid The Principal Software Engineer takes end-to-end ownership for development and quality of solutions and services that delight caregivers and add strategic value to Providence St. Joseph Health. They evaluate requirements...Minimum wageFull timeLocal areaShift work$139.9k - $274.8k
...technology? The AI Frameworks team at Microsoft develops software that pushes the cutting edge of performance and experience... ...Service, Copilot+ PC, and many others. As the Principal Software Engineer on our team, you would have the opportunity to work on:...Ongoing contractWork at officeLocal area- ...quality, security, and performance standards. Mentor and develop engineers, fostering a culture of collaboration, accountability, and... ...experience shipping large-scale, distributed cloud-based software solutions. ~10+ years of experience working in agile, fast-paced...Full timeFlexible hours
$188k - $275k
...Staff Software Engineer- AI Workload Orchestration Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with...Permanent employmentTemporary workCasual workWork at officeRemote workFlexible hours$163k - $296.4k
...billions of lives around the world. The Identity and Access Management team within the Identity division is looking for a Principal Software Engineer - Architect to solve large scale problems and deliver the next wave of innovation in Entra ID. Our work enables...Ongoing contractLocal areaShift work$119.8k - $274.8k
Senior + Principal Software Engineers - Front End Applied AI (MTP) Design and build AI-first security systems leveraging LLMs and multimodal models. Focus on creating high-quality front-end experiences for Microsoft Threat Protection, enabling defenders to reason over...Full time$139.9k - $274.8k
...positive human impact at scale. We work with scientific, engineering, academic, and business partners to apply machine learning... ...Microsoft Research Americas Engineering team is hiring a Principal Research Software Engineer to provide technical leadership and direct...Ongoing contractLocal area$139.9k - $274.8k
...Overview Are you a Software Engineer already at the forefront of agentic AI development - someone who uses agents to build software today and wants to build the platform that brings those capabilities to every developer at Microsoft? Do you have strong opinions about...Ongoing contractLocal area- ...and infrastructure. Worldscape is looking for a seasoned Principal Engineer to join our expanding team. We are at the forefront of AI decision... ...to deliver end‑to‑end solutions. Drive best practices in software design, testing, CI/CD, observability, and operational...Full timeFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Principal Software Engineer, CoreAI Workload Engines. Be the first to apply!
- principal software engineer Redmond, WA
- principal Redmond, WA
- senior principal cloud computing engineer Redmond, WA
- principal cloud computing engineer Redmond, WA
- senior principal scientist Redmond, WA
- id software Redmond, WA
- software quality assurance Redmond, WA
- software sales Redmond, WA
- internship software Redmond, WA
- remote software sales Redmond, WA


