Software Engineer, Inference - Performance Optimization

$295k

OpenAI

Inference – San Francisco

About the Team

Our team analyzes inference stack performance across the application, model, and fleet layers to identify bottlenecks and drive faster, cheaper inference. We combine systems profiling, benchmarking, and analysis to understand where time and cost are spent, then turn that understanding into performance optimizations and models that project performance and capacity needs for future launches.

About the Role

In this role, you will model inference performance across application, model, and fleet layers with higher fidelity. You will build cost-to-serve estimates from microbenchmarks and create tools that help cross-functional teams reason about latency, capacity, utilization, and cost tradeoffs.

Responsibilities
  • Build and refine performance models that translate microbenchmark results into cost-to-serve estimates.
  • Analyze inference workloads end to end across applications, models, and fleet infrastructure.
  • Enhance tooling to identify bottlenecks across layers for latency and throughput.
  • Partner with other teams to turn performance insights into concrete improvements and project how future changes affect inference.

Qualifications
  • Enjoy reasoning from first principles about distributed systems, model inference, and hardware efficiency.
  • Are comfortable working across abstraction layers, from application behavior to kernels, accelerators, networking, and fleet scheduling.
  • Have deep expertise with performance profiling, benchmarking, analysis, and optimization.
  • Enjoy collaborating with engineering and research teams to improve real production systems.

Compensation

$295K – $555K + Offers Equity

OpenAI is an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

Vacancy posted 12 hours ago
Similar jobs that could be interesting for you
Based on the Software Engineer, Inference - Performance Optimization in Los Angeles, CA vacancy
  • $230k - $385k

    Software Engineer, Productivity - Inference Runtime Runtime - San Francisco About the Team We’re hiring a Developer...  ...compromising reliability or performance. This role sits at the...  ...support model launches, inference optimizations, cloud provider integrations, and... 
    Performance

    OpenAI

    Los Angeles, CA
    4 days ago
  •  ...AI Engineer – Routing & Network Optimization At Gallatin, we are rebuilding logistics infrastructure for the...  ...with monitoring, validation, and performance tuning. Operations Research &...  ...experimentation (A/B testing), or causal inference. Experience in data... 
    Performance
    Local area

    Gallatin AI, Inc.

    El Segundo, CA
    5 days ago
  • Inference Technical Lead, On-Device Transformers Consumer...  ...and model system performance, identifying...  ...Build and lead a team of engineers responsible for implementing...  .... Have designed or optimized high-performance...  ...performance-critical software such as CUDA kernels,... 
    Performance
    Work at office
    Relocation package

    OpenAI

    Los Angeles, CA
    1 day ago
  • $120k - $140k

     ...Software Engineer: State Estimation & Prediction Los Angeles, US About Lodestar...  ...patterns of targets Implement intent inference models to identify actions and dynamically...  ...-time systems, multi-threading, and performance optimization Strong understanding of... 
    Performance
    Permanent employment
    Full time
    Flexible hours

    Lodestar

    Los Angeles, CA
    2 days ago
  • $293k

    Software Engineer, Workload Enablement Scaling - San Francisco and Seattle About the Team...  ...architecture, fleet-level monitoring, and performance optimization. About the Role We’re hiring an SW...  ...stress benchmarks, porting existing inference and training workloads to new,... 
    Performance

    OpenAI

    Los Angeles, CA
    3 days ago
  • Senior Software Engineer, Machine Learning About us Moonware builds products...  ...to coordinate, optimize, and automate aircraft ground...  ...tasks, communications, and performance. By enhancing operational visibility...  ..., data science, multimodal inference, and ML infrastructure. You... 
    Performance
    Worldwide

    Moonware

    Los Angeles, CA
    4 days ago
  • Senior Software Engineer, Personalization Services Role Summary: The Senior...  ...high standards for performance, reliability, and scalability...  ...Performance, Reliability & Scale Optimize personalization services...  ...with ML model serving and inference pipelines. Experience supporting... 
    Performance
    Temporary work
    Flexible hours

    Internet Brands

    El Segundo, CA
    3 days ago
  • $124k - $186k

     ...Applied Intelligence Data Engineering team is seeking a Senior Software Engineer - Cloud...  ...systems. You will build high-performance streaming applications and...  ...Responsibilities Optimize Data Streaming Applications...  ...feature engineering, inference, and analytics workloads... 
    Performance
    Contract work

    Paramount Unified School District

    Burbank, CA
    4 days ago
  •  ...or denied. As an Autonomy Software Engineer, you will design, build, and...  ...navigation, planning, and embedded inference. Take systems from...  ...stability, and mission-level performance. Work closely with hardware...  ...onto real platforms. Optimize software and models for real... 
    Performance
    Work experience placement
    Local area
    Night shift

    Mach Industries

    Los Angeles, CA
    3 days ago
  •  ...leading AI research organization in San Francisco seeks an Inference Technical Lead to evaluate silicon platforms and work on...  ..., understanding transformer models, and leading teams on performance-critical software. Competitive compensation package, including equity, is... 
    Performance

    OpenAI

    Los Angeles, CA
    1 day ago
  •  ...Riot engineers bring deep knowledge of specific technical areas but...  ...of engineers, overseeing performance management, growth opportunities...  ...Manager within the Optimize team, you will report to the...  ...Manage a team of 4-10 cloud and software engineers; coaching them, overseeing... 
    Performance
    Local area
    Flexible hours

    Riot Games

    Los Angeles, CA
    1 day ago
  •  ...Full Stack Software Engineer We are seeking a versatile Full Stack Software Engineer to help us design and build the systems that...  ...Geospatial mapping with MapBox/MapLibre. WebRTC and SRT streaming. Distributed systems and performance optimization.... 
    Performance

    Splash Industries

    El Segundo, CA
    5 days ago
  • $180k - $240k

     ...Frontend Software Engineer San Francisco, Palo Alto, Los Angeles, Toronto About HeyGen At HeyGen, our mission is to make visual...  ...quality frontend features for the HeyGen platform. Ensure optimal performance, scalability, and responsiveness of the user interface.... 
    Performance
    Work experience placement

    HeyGen

    Los Angeles, CA
    2 days ago
  •  ...Senior Front-End Software Engineer Contract Location - Los Angeles, CA / New...  ...Preferred: Someone who has experience in app performance, measuring it, looking into things...  ...Software Engineer to help build and optimize a streaming application used across... 
    Performance
    Contract work
    Work at office
    Remote work

    VDart

    Los Angeles, CA
    4 days ago
  • $150k - $180k

     ...Senior Frontend Software Engineer Title of Role: Senior Frontend Software Engineer...  ...REST APIs for seamless data flow. Optimize applications for maximum speed and scalability...  ...and debug applications, ensuring high performance and responsiveness. Ideal Candidate... 
    Performance
    Work at office

    Recruiting from Scratch

    Beverly Hills, CA
    5 days ago
  • $145k - $250k

     ...translate designer UX flows into high quality components; optimize performance (SSR vs client side, code splitting, lazy loading, etc.)....  ...urgent problems. Who You Are: Have 3+ years of full stack engineering experience (or equivalent), especially in startup or fast-... 
    Performance
    Work at office
    Remote work
    Flexible hours

    Pure

    Los Angeles, CA
    3 days ago
  • $122.43k - $194.39k

     ...comprehensively uses machine learning to optimally engineer, additively manufacture, and flexibly...  ...created using DAPS are superior in performance, lower in cost, rapidly customizable to...  ...Purpose We're seeking a Full‑Stack Software Engineer with solid experience in application... 
    Performance
    Temporary work

    Divergent

    Los Angeles, CA
    22 days ago
  • $180k - $215k

     ...Fullstack Software Engineer San Francisco, Palo Alto, Los Angeles, Toronto About HeyGen...  ...test, and deploy robust, scalable, and optimized features for the HeyGen platform across...  ...& Optimization: Monitor platform performance, identify bottlenecks, and implement solutions... 
    Performance
    Work experience placement

    HeyGen

    Los Angeles, CA
    2 days ago
  • $141.9k - $190.3k

     ...Sr Product Software Engineer Disney Entertainment and ESPN Product & Technology is a global organization...  ...strategies. Production Support: Monitor and optimize production systems, ensuring platform stability, performance, and uptime. Mentorship: Provide technical... 
    Performance
    Work experience placement
    Worldwide

    Disney

    Glendale, CA
    1 day ago
  • $140k - $180k

     ...From telescopes to software architecture, Observable Space provides...  ...'s former VP of software engineering to create a developer platform...  ...scalability. Design and optimize relational database schemas,...  ...Ensure high reliability and performance in production systems. Work... 
    Performance
    Work at office
    Local area
    Remote work
    Flexible hours
    3 days per week

    Observable Space

    Los Angeles, CA
    3 days ago
  • $124k - $180k

     ...Senior Full-Stack Software Engineer 45449 Burbank, CA, US, 91505 Technology Burbank Full-Time...  ...data, and design partners to deliver performant, intuitive, and impactful user experiences...  ...Architect Scalable UIs: Create and optimize reusable components, design systems,... 
    Performance
    Full time

    Paramount Global Services

    Burbank, CA
    4 days ago
  • $180k - $200k

     ...Software Engineer, Android Title of Role: Software Engineer, Android Location: Los...  ...streamline logistics and enhance overall performance in the industry. This company is on a...  ...and debug applications, ensuring optimal performance and reliability. Participate... 
    Performance
    Work at office

    Recruiting from Scratch

    Beverly Hills, CA
    10 days ago
  • $180k - $200k

     ...Software Engineer, iOS Title of Role: Software Engineer, iOS Location: Los Angeles, onsite Company Stage of Funding: Seed —...  .... Maintain and improve existing applications, ensuring optimal performance and user experience. Implement best practices for code... 
    Performance
    Work at office

    Recruiting from Scratch

    Beverly Hills, CA
    15 days ago
  • $175k - $200k

     ...months. THE ROLE As a Senior Software Engineer on the CMS Team you will create the authoring...  ..., and review their app's business performance across key metrics. This includes a...  ...sign-up through app design, launch, optimization, and maintenance over time. You'll... 
    Performance
    Full time
    Contract work
    Remote work
    Worldwide
    Home office
    Flexible hours

    Tapcart

    Santa Monica, CA
    3 days ago
  • $388k

     ...POSITION SUMMARY: We are seeking Software Engineers to join our pioneering team at INK. INK...  ...and iterate to improve usability and performance. Establish and maintain studio standards...  ...need Working knowledge of scalable inference and training technologies . But more... 
    Performance
    Remote work

    Netflix

    Los Angeles, CA
    16 days ago
  • $175k - $220k

     ...Senior Software Engineer Los Angeles, California, United States Genius Sports is enabling...  ...driven insights about team and player performance, assists our semi-automated systems...  ...industry experience ~ Experience with optimizing and benchmarking low-latency, real-time... 
    Performance
    Work at office
    Worldwide

    Genius Sports

    Los Angeles, CA
    3 days ago
  •  ...Senior Software Engineer – Agentic AI Analytics (Rust / Data Systems) Location: Glendale...  ...concepts. If you like owning hard performance problems, designing columnar/vectorized...  ..., profiling, and hardware-aware optimization on large-scale cloud-native data systems... 
    Performance
    Work at office

    Noor Staffing Group

    Glendale, CA
    5 days ago
  • $150k - $212k

     ...Senior Software Engineer Parallel Systems is pioneering autonomous battery-electric rail vehicles designed...  ...Software Engineer to design, build, and optimize critical system applications. You will work on high-performance backend systems that empower our rail vehicles... 
    Performance
    Local area
    Shift work

    Parallel Systems Corp

    Los Angeles, CA
    4 days ago
  •  ...Job Description Job Title: Senior Software Engineer (Full-Stack) Location: Los Angeles...  ...frontend and backend environments, ensuring performance and security are prioritized....  ...application stack, focusing on database query optimization and API latency reduction.... 
    Performance
    Remote work

    8Fleet Inc.

    Los Angeles, CA
    27 days ago
  • $115k - $140k

     ...seeking a highly skilled and experienced Software Engineer to join our dynamic team. As a...  ...development, and project integration. Optimize workflow and deliver rich content across...  ...existing systems in Unreal Engine to ensure performance and functionality. Support sound... 
    Performance
    Full time
    Work experience placement

    Absurd Ventures

    Santa Monica, CA
    1 day ago
