AI Inference Infrastructure Engineer
$350k
Thinking Machines Lab Inc.
A leading AI research organization seeks an Infrastructure Research Engineer in San Francisco to optimize and scale systems powering large AI models. This role emphasizes enhancing inference speed, reliability, and cost-effectiveness. Ideal candidates possess a Bachelor's in CS/Engineering, experience with deep learning frameworks, and collaborative skills in diverse teams. Competitive compensation between $350,000 and $475,000 USD is offered along with generous benefits including unlimited PTO and visa sponsorship.
- ...A cutting-edge AI infrastructure company in San Francisco seeks an experienced network engineer to optimize high-performance networking protocols for AI models. The ideal... ...will integrate RDMA and InfiniBand into the inference stack, ensuring efficient communication and...
$300k - $430k
...is the leading conversational AI platform empowering every... ...team. About the Team The ML Infrastructure team builds the systems that... ...the routing layer that manages inference across multiple providers. We... ...hiring a Staff ML Infrastructure Engineer to own the platforms powering...
Work at office
$179k - $248k
...Machine Learning Infrastructure Engineer Join to apply for the Machine Learning Infrastructure... ...deeper understanding in healthcare. Our AI‑powered platform was purpose‑built for... ...scalable Kubernetes clusters for AI model inference and training Develop, optimize, and...
Hourly pay | Full time | Flexible hours
- A dynamic AI company is seeking an Infrastructure Software Engineer in San Francisco to build and maintain components of an ML inference platform. The successful candidate will develop infrastructure components using Python and Go, manage Kubernetes deployments, and enhance...
- ...A healthcare technology firm in San Francisco is seeking an ML Infrastructure Engineer, Model Inference to build and optimize AI-driven solutions. You will design scalable Kubernetes clusters, enhance ML model serving infrastructure, and collaborate with cross-functional...
- ...Francisco to enable enterprises to implement AI workloads effectively. The role involves... ...deployment architectures, solving AI inference challenges, and collaborating closely... ...candidates will have 3+ years in cloud infrastructure or DevOps, strong skills in Kubernetes,...
Flexible hours
- ...An innovative AI company is seeking a Software Engineer to develop infrastructure that supports AI training and inference workflows. This role requires strong object-oriented programming skills and a solid foundation in data structures and algorithms. The ideal candidate...
- ...An innovative studio is seeking an AI Infrastructure Engineer to enhance their ML infrastructure for groundbreaking anime games. This role involves designing and implementing cutting-edge inference architectures to support various platforms. As part of a small, agile...
Worldwide
- ...Qualifications CUDA + GPU inference optimization vLLM, SGLang, or TensorRT-LLM experience KV caching, paged attention, batching, token streaming... ...plus) No degree required Company Luminal (YC S25) builds an AI compiler and serving stack that makes models 10x faster and...
$320k
...Staff + Sr. Software Engineer, Cloud Inference San Francisco, CA About Anthropic Anthropic... ...reliable, interpretable, and steerable AI systems. We want AI to be safe and... ...build, and own backend services and infrastructure that serve Claude across multiple CSPs...
Work at office | Visa sponsorship | Flexible hours
$160k - $250k
...Senior Backend Engineer, Inference Platform San Francisco About the Role Together AI is building the Inference Platform that brings the most advanced generative... ...journey in building the next generation AI infrastructure. Compensation We offer competitive...
Full time | Local area
$160k - $250k
...A pioneering AI company in San Francisco is seeking a Senior Backend Engineer for their Inference Platform. The role involves optimizing latency, developing auto-scaling systems, and collaborating with ML researchers to scale architectures. Ideal candidates will have...
$200k - $400k
A leading AI technology company located in San Francisco is seeking an infrastructure engineer to build distributed systems for their AI inference engine. The role involves designing systems that ensure minimal latency and maximum reliability. Candidates should have a...
Visa sponsorship
- A leading AI research organization in San Francisco is seeking a Technical Program Manager for Inference to bridge their systems with the broader organization. This role involves driving strategic initiatives across inference performance, coordinating launches, and ensuring...
- ...Software Engineer Intern (AI Infrastructure / Training / Inference) About the Role We are hiring Software Engineers focused on AI Infrastructure to build the systems that enable frontier multimodal AI to operate reliably at production scale. This role exists because modern...
Internship | Immediate start
- ...About Us Most AI is frozen in place - it doesn't adapt... ...into useful intelligence - the inference services that serve LLMs at scale... ...both. Researchers and ML engineers will hand you workloads that... ...Experience operating Kubernetes-based infrastructure, including custom operators...
Flexible hours
- ...Location Type Hybrid Department Inference Model Serving Who are we? Our... ...enterprises who are building AI systems to power magical... ...Cohere is a team of researchers, engineers, designers, and more, who are... ...running production infrastructure at a large scale Experience designing...
Full time | Work experience placement | Work at office | Remote work | Flexible hours
- ...A pioneering AI technology firm in San Francisco is seeking a founding member to optimize and serve models on Luminal Cloud. The role involves deploying models with advanced optimization techniques, conducting performance reviews, and enhancing scheduling processes. Ideal...
$165k - $200k
...even played with ChatGPT or AI products early on), and prefer... ...You'll Do As a member of our infrastructure team, you'll be at the heart... ...acting as an infrastructure engineer one moment, and a developer,... ...availability machine learning inference service. Collaborating with customer...
Second job | Remote work | Work from home | Relocation package | Flexible hours
$120k - $200k
...Senior Infrastructure Engineer At Bland.com, our goal is to empower enterprises to make AI-phone agents at scale. Based out of San Francisco, we're a quickly growing team... ...handle real-time voice processing, scale ML inference, and integrate with enterprise telephony...
Work at office | Night shift
- ...Senior HPC & GPU Infrastructure Engineer Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary... ...Exposure to vLLM, model serving optimizations, or inference systems. Hands-on experience with configuration...
Flexible hours
- ...Tamarind Bio We enable any scientist to access AI-powered drug discovery. Thousands of scientists... ...released daily. About the Role We're looking for two Infrastructure Engineers to lead the scaling of our machine learning inference system. You'll be responsible for architecting...
Relocation
- ...Discovery Chai Discovery builds frontier AI models to design molecules and... ...About the role We are hiring an engineer obsessed with building systems and infrastructure that are as simple as possible... ...our product surface, model inference, and evaluation suite. You’ll work...
Full time | Work at office | Flexible hours
- ...Infrastructure Engineer ENGINEERING | San Francisco, New York City | On-site | Full-time The Role We... ...the backbone of our enterprise-grade AI data platform. You’ll design systems that... ...to optimize infrastructure for LLM inference and training workloads and building agent...
Full time | Work at office
$200k - $260k
...Rebuild Matterhaul's infrastructure and core systems from zero: AWS,... ...pipeline choices that the rest of engineering will build on for years.... ...Matterhaul is building the AI-native operating system for... ...: vector stores, GPU‑backed inference, embedding pipelines, prompt...
Full time | Work at office | Local area
- ...Hyperbolic Labs is on a mission to democratize AI by breaking down the barriers to... ...an innovative GPU marketplace and AI inference service that promise affordability and... ...the Role We're seeking a Senior Infrastructure Engineer to help build and scale Hyperbolic's...
Remote work
$130k - $240k
...Maxana is seeking an experienced Infrastructure Engineer for a confidential client, a fast-growing AI company. In this role you will build and maintain the platform layer supporting large-scale ML training, inference, and deployment. This is a high-impact role at the...
Flexible hours
$150k - $200k
...Senior Infrastructure Engineer Location: On-site, San Francisco, CA (3 days/week in office) Salary: $150k – $200k + equity Industry: AI, Cloud Infrastructure What You’ll Drive Join a fast... ...impact platform reliability, ML inference performance, and the future of enterprise...
Work at office | 3 days per week
$180k - $200k
...Infrastructure Engineer (Storage) Lightning AI is the company behind PyTorch Lightning. Founded in 2019, we build an end-to-end platform for developing... ...need for experimentation, training, and production inference, with security, observability, and control built in...
Remote work | Work from home | Flexible hours
- ...Infrastructure Engineer (Observability) Lightning AI is the company behind PyTorch Lightning. Founded in 2019, we build an end-to-end platform for developing... ...need for experimentation, training, and production inference, with security, observability, and control built in...
Work from home | Flexible hours

