Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer (AI Infrastructure / Training / Inference)

SpreeAI

Software Engineer (AI Infrastructure / Training / Inference) About the Role We are hiring Software Engineers focused on AI Infrastructure to build the systems that enable frontier multimodal AI to operate reliably at production scale. This role exists because modern generative and vision models require infrastructure beyond traditional backend engineering — including GPU orchestration, large-scale inference systems, performance optimization, and developer platforms that allow applied scientists to move fast without sacrificing reliability or cost efficiency. You will work on: Scalable model serving and inference pipelines. Distributed GPU infrastructure. Performance and cost optimization. Reliability, observability, and production readiness. You will operate at the boundary between systems engineering and machine learning — building the “paved roads” that allow advanced AI systems to scale safely and efficiently. What you'll do Design and build scalable infrastructure supporting training and inference workflows. Develop high-performance APIs and backend services for AI model serving. Optimize GPU utilization, latency, and throughput for multimodal workloads. Improve observability, monitoring, and reliability of AI systems. Partner closely with Applied Science teams to productionize research systems. Drive improvements in deployment workflows, automation, and platform usability. Qualifications Degree in Computer Science, Engineering, or comparable combination of education and practical experience. Strong object-oriented programming skills (Python, C++, Java, Go, or similar). Strong data structures and algorithms foundations. Experience building production backend or distributed systems. Understanding of cloud infrastructure concepts and containerized systems. Preferred Qualifications Experience with Kubernetes, Docker, or container orchestration. Familiarity with GPU-based ML workloads or distributed training/inference systems. Experience with model serving frameworks (vLLM, Triton, Ray Serve, or similar). Experience with observability tools and performance debugging. Familiarity with PyTorch or ML workflows. Interest in optimizing systems for efficiency, scalability, and developer velocity. SPREEAI is a fast-growing, innovative AI company at the forefront of fashion and e-commerce, revolutionizing how consumers engage with fashion through lifelike photorealistic try-on technology and hyper-personalized shopping experiences. Our mission is to redefine the retail landscape with cutting-edge AI solutions that blend high fashion and technology. We thrive in a dynamic, fast-paced environment where creativity meets technology to drive real impact. If you are passionate about innovation and shaping the future of fashion, SPREEAI offers a platform to make your mark. #J-18808-Ljbffr SpreeAI

Vacancy posted 7 days ago
Similar jobs that could be interesting for youBased on the Software Engineer (AI Infrastructure / Training / Inference) in San Francisco, CA vacancy
  •  ...Type Hybrid Department Inference Model Serving Who are...  ...serve humanity. We’re training and deploying frontier...  ...enterprises who are building AI systems to power...  ...team of researchers, engineers, designers, and more,...  ...experience running production infrastructure at a large scale... 
    Training
    Full time
    Work experience placement
    Work at office
    Remote work
    Flexible hours

    Jaide Health

    San Francisco, CA
    5 days ago
  •  ...safety. We use computer vision and AI to enable existing security...  .... We're hiring a strong software engineer to own the ML Infrastructure that powers how Voxel trains and ships vision models. You'll...  ...export trained models to optimized inference formats (TensorRT, ONNX),... 
    Training
    Work at office
    Flexible hours

    Voxel Labs

    San Francisco, CA
    1 day ago
  •  ...group working across engineering, product,...  ...the boundaries of AI capabilities while...  ...for an experienced Software Engineer to help...  ...machine learning infrastructure that powers OpenAI...  ...teams to build, train, deploy, serve, monitor...  ...real-time inference and serving infrastructure... 
    Training

    AI Chopping Block, Inc.

    San Francisco, CA
    4 days ago
  •  ...Specter is creating a software-defined "control...  ...the perception engine for a company's physical...  ...world of physical AI and robotics. We...  ...is hiring an ML infrastructure engineer to build...  ...time perception and inference across our edge-...  ...This role owns the training, deployment, and... 
    Training

    Specter Services LLC

    San Francisco, CA
    5 days ago
  •  ...'re looking for a Staff Software Engineer - Computer Vision Deployment...  ...to build and scale the infrastructure that powers our AI-driven warehouse...  ...computer vision models — from training pipelines through...  ...serving for low-latency inference at scale. You\'ll work closely... 
    Training
    Work at office
    3 days per week

    Claryo

    San Francisco, CA
    3 days ago
  •  ...democratize access to cutting‑edge AI infrastructure previously reserved for...  ...layer seamlessly routes training and inference jobs across the world,...  ...an Infrastructure Product Engineer, you will play a pivotal role...  ...environments. Advanced software engineering skills;... 
    Training
    Full time
    Remote work

    Andromeda

    San Francisco, CA
    2 days ago
  •  ...powers mission-critical inference for the world’s most dynamic AI companies, like...  ...AI research, flexible infrastructure, and seamless developer...  ...help build the platform engineers turn to to ship AI...  ...THE ROLE As a Senior Software Engineer - Model Training at Baseten, you’ll be... 
    Training
    Flexible hours

    BaseTen

    San Francisco, CA
    5 days ago
  •  ...designing and running OpenAI’s LLM training and inference infrastructure that powers frontier models at massive...  ...the Role We are looking for an engineer to design and implement the dataset...  ...time data About OpenAI OpenAI is an AI research and deployment company dedicated... 
    Training

    Slope

    San Francisco, CA
    2 days ago
  • An innovative AI company is seeking a Software Engineer to develop infrastructure that supports AI training and inference workflows. This role requires strong object-oriented programming skills and a solid foundation in data structures and algorithms. The ideal candidate... 
    Training

    SpreeAI

    San Francisco, CA
    2 days ago
  •  ...is to architect AI that learns from...  ...new primitive for training efficient, large-...  ...innovation and systems engineering paired with a...  ...re looking for a Software Engineer to help...  ...data and ML data infrastructure at Cartesia. This...  ...model training, and inference — it is not a... 
    Training
    Work at office
    Visa sponsorship
    Flexible hours

    Cartesia, Inc.

    San Francisco, CA
    1 day ago
  • $212k - $318.4k

    Software Engineer, ML platform and Infrastructure San Francisco Bay Area, California, United States...  ..., and Generative AI initiatives. You will architect...  ...direction of ML/Data/Inference platform capabilities, leading...  ...processing and model training/fine‑tuning workflows.... 
    Training
    Relocation

    Apple Inc.

    San Francisco, CA
    4 days ago
  • $255k - $405k

     ...About the Team The Agent Infrastructure team at OpenAI is...  ...building systems that enable training and deployment of highly useful AI agents, both internally...  ...issues, and develop software just as human SWEs do. Our...  ...the Role As a Software Engineer on the Agent... 
    Training
    Work at office
    Worldwide
    Relocation package

    OpenAI

    San Francisco, CA
    5 days ago
  • $194k - $239k

     ...Senior Software Engineer, Infrastructure Hover helps people design, improve, and protect the properties they love. With proprietary AI built on over a decade of real property data, Hover answers...  ...help cover the cost of management training, conferences, workshops, or... 
    Training
    Full time
    For contractors
    Work at office
    Local area
    Flexible hours

    HOVER Inc.

    San Francisco, CA
    2 days ago
  • $136.3k - $187.45k

     ...Francisco, CA (Hybrid) Team: IT Infrastructure and Operations About the...  ...incredible growth. As a Senior Software Engineer (Infrastructure) , you will...  .... Tooling, Scripting & AI: Build internal CLI tools, AI...  ...relevant certifications and training, and specific work location.... 
    Training
    Worldwide

    I did my part and supported the Regular Toilet

    San Francisco, CA
    2 days ago
  •  ...Meet Eloquent AI At Eloquent AI, we’re building...  ...talent in AI, engineering, and product as we...  ...Your Role As a Senior Software Engineer, AIOps & Infrastructure at Eloquent AI, you...  ...and AI teams to train, fine-tune, and deploy...  ...model serving, and inference optimizations.... 
    Training

    Eloquent AI

    San Francisco, CA
    1 day ago
  • $255k

     ...most cutting edge model training. We take data center...  ...systems and build any software needed for running large...  ...We are looking for engineers to operate the next generation...  ...with hands-on infrastructure work on our largest datacenters...  ...OpenAI OpenAI is an AI research and... 
    Training

    Slope

    San Francisco, CA
    3 days ago
  • $230k - $405k

    About the Team Compute Infrastructure builds the platform that turns...  ...compute into a reliable engine for frontier AI. We design, provision, schedule...  ...centers, orchestration software, agent infrastructure,...  ...capacity online, optimize training workloads from profiler traces... 
    Training

    Centaur Labs

    San Francisco, CA
    5 days ago
  •  ...Every breakthrough Physical AI system — humanoid robots, autonomous...  ...video generation models — is trained on petabytes of video, lidar,...  ...to close it. Our open‑source engine, Daft, is the distributed...  ...labs and public AI infrastructure companies today. We have raised... 
    Training
    Work at office
    Flexible hours
    Night shift

    Eventual Inc.

    San Francisco, CA
    2 days ago
  •  ...the career network for the AI economy. 20 million knowledge...  ...upskilling, from freelance AI training gigs to first internships to...  ...Handshake is building the infrastructure layer that powers the next generation...  ...our platform. As a Senior Software Engineer on our Agentic... 
    Training
    Full time
    Freelance
    Internship
    Work at office
    Remote work
    Flexible hours

    AI Chopping Block, Inc.

    San Francisco, CA
    4 days ago
  • $164.2k - $205.2k

     ...At Databricks, the Compute Infrastructure organization builds and operates...  ...that runs all Data, AI and stateful workloads across...  ...cost efficiency. As a Senior Software Engineer on the Compute Infra team, you...  ...relevant certifications and training, and the specific work location... 
    Training
    Local area

    Menlo Ventures

    San Francisco, CA
    3 days ago
  • $150k - $250k

     ...Robotics is building an AI‑native robotics...  ...founding member of our engineering team, you will have a...  ...just “models.” It’s the software layer that turns: Orders...  ...the backend systems and infrastructure that power the factory...  ...relevant education or training. Foundry Robotics... 
    Training
    Full time
    Contract work

    Foundry Robotics Inc.

    San Francisco, CA
    1 day ago
  • $184k - $259.44k

    Software Engineer, Frontier AI Infrastructure San Francisco, CA; St. Louis, MO; New York, NY; Washington, DC Ready to Apply? Join the team shaping the future...  ..., interview performance, and relevant education or training. Scale employees in eligible roles are also granted... 
    Training
    Full time
    Work at office
    3 days per week

    Scale AI, Inc.

    San Francisco, CA
    3 days ago
  • $215k - $265k

     ...risks from scheming frontier AI systems. We work with and...  ...mitigations. We’re looking for a Software Engineer to build the platform that...  ...and maintain Apollo's cloud infrastructure . This means IaC, networking...  ...with GPU workloads, ML training infrastructure, or research... 
    Training
    Full time
    Work experience placement
    Work at office
    Immediate start
    Visa sponsorship
    Flexible hours

    COL Limited

    San Francisco, CA
    3 days ago
  • $166k - $225k

    Senior Software Engineer - Infrastructure and Tools P-78 While candidates in the listed locations are encouraged...  ...running the world's best data and AI infrastructure platform so our...  ...experience, relevant certifications and training, and specific work location. Local Pay... 
    Training
    Local area
    Worldwide
    Flexible hours

    Databricks Inc.

    San Francisco, CA
    5 days ago
  •  ...is evolving into an AI-first company, where...  ...The Machine Learning Infrastructure team sits at the center...  ...developers to experiment, train, deploy, and monitor...  ...frameworks and inference tooling. We are in...  ...experiences. As a Senior Software Engineer on the Machine... 
    Training
    Shift work

    Plaid

    San Francisco, CA
    more than 2 months ago
  • Software Engineer (Security/Infrastructure) — AfterQuery About the Role You’ll be responsible for protecting a system that handles highly sensitive workflows across AI training, expert networks, and enterprise integrations with leading model labs. This role is deeply... 
    Training

    AfterQuery

    San Francisco, CA
    5 days ago
  • About the Team We’re hiring software engineers to make OpenAI’s networking teams more productive...  ...systems that support OpenAI’s training and inference infrastructure at frontier scale. About the Role...  ...About OpenAI OpenAI is an AI research and deployment company dedicated... 
    Training

    OpenAI

    San Francisco, CA
    5 days ago
  • $292.5k - $405k

    Software Engineer, Infrastructure Security | OpenAI Careers Software Engineer, Infrastructure Security Security...  ...services that power our frontier AI models. Our charter spans everything...  ...compute clusters, enabling rapid model training and deployment without compromising... 
    Training
    Remote work

    OpenAI

    San Francisco, CA
    14 hours ago
  • $200k - $250k

     ...ship category-defining software that enables Ryder...  ...impact for the engine of the American economy...  ...Software Engineer - Infrastructure Team: Machine Learning...  ..., distributed training, real-time inference, and more. You’ll be...  ...offline usage. AI-Driven Optimization:... 
    Training
    Full time
    Work at office
    Immediate start
    Remote work
    Monday to Friday

    Baton (A Ryder Technology Lab)

    San Francisco, CA
    more than 2 months ago
  • $170k - $230k

     ...mission to harness transformative AI technologies to make world-...  ...to everyone. As a Senior Software Engineer at Kira, you will own core backend systems and infrastructure that power critical product workflows...  ..., demotion, compensation, training, working conditions, transfer... 
    Training
    Full time
    Live in
    Work at office
    Local area
    Flexible hours

    Kira

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer (AI Infrastructure / Training / Inference). Be the first to apply!