Software Engineer, Inference - Performance Optimization
$295kOpenAI
Software Engineer, Inference - Performance Optimization Inference – San Francisco About the Team Our team analyzes inference stack performance across the application, model, and fleet layers to identify bottlenecks and drive faster, cheaper inference. We combine systems profiling, benchmarking, and analysis to understand where time and cost are spent, then turn that understanding into performance optimizations and models that project performance and capacity needs for future launches. About the Role In this role, you will model inference performance across application, model, and fleet layers with higher fidelity. You will build cost-to-serve estimates from microbenchmarks and create tools that help cross‑functional teams reason about latency, capacity, utilization, and cost tradeoffs. Responsibilities Build and refine performance models that translate microbenchmark results into cost-to-serve estimates. Analyze inference workloads end to end across applications, models, and fleet infrastructure. Enhance tooling to identify bottlenecks across layers for latency and throughput. Partner with other teams to turn performance insights into concrete improvements and project how future changes affect inference. Qualifications Enjoy reasoning from first principles about distributed systems, model inference, and hardware efficiency. Are comfortable working across abstraction layers, from application behavior to kernels, accelerators, networking, and fleet scheduling. Have deep expertise with performance profiling, benchmarking, analysis, and optimization. Enjoy collaborating with engineering and research teams to improve real production systems. Compensation $295K – $555K + Offers Equity OpenAI is an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic. #J-18808-Ljbffr OpenAI
$230k - $385k
Software Engineer, Productivity - Inference Runtime Runtime - San Francisco About the Team We’re hiring a Developer... ...compromising reliability or performance. This role sits at the... ...support model launches, inference optimizations, cloud provider integrations, and...Performance- ...AI Engineer – Routing & Network Optimization At Gallatin, we are rebuilding logistics infrastructure for the... ...with monitoring, validation, and performance tuning. Operations Research &... ...experimentation (A/B testing), or causal inference. Experience in data...PerformanceLocal area
- Inference Technical Lead, On-Device Transformers Consumer... ...and model system performance, identifying... ...Build and lead a team of engineers responsible for implementing... .... Have designed or optimized high-performance... ...performance-critical software such as CUDA kernels,...PerformanceWork at officeRelocation package
$120k - $140k
...Software Engineer: State Estimation & Prediction Los Angeles, US About Lodestar... ...patterns of targets Implement intent inference models to identify actions and dynamically... ...-time systems, multi-threading, and performance optimization Strong understanding of...PerformancePermanent employmentFull timeFlexible hours$293k
Software Engineer, Workload Enablement Scaling - San Francisco and Seattle About the Team... ...architecture, fleet-level monitoring, and performance optimization. About the Role We’re hiring an SW... ...stress benchmarks, porting existing inference and training workloads to new,...Performance- Senior Software Engineer, Machine Learning About us Moonware builds products... ...to coordinate, optimize, and automate aircraft ground... ...tasks, communications, and performance. By enhancing operational visibility... ..., data science, multimodal inference, and ML infrastructure. You...PerformanceWorldwide
- Senior Software Engineer, Personalization Services Role Summary: The Senior... ...high standards for performance, reliability, and scalability... ...Performance, Reliability & Scale Optimize personalization services... ...with ML model serving and inference pipelines. Experience supporting...PerformanceTemporary workFlexible hours
$124k - $186k
...Applied Intelligence Data Engineering team is seeking a Senior Software Engineer - Cloud... ...systems. You will build high-performance streaming applications and... ...Responsibilities Optimize Data Streaming Applications... ...feature engineering, inference, and analytics workloads...PerformanceContract work- ...or denied. As an Autonomy Software Engineer, you will design, build, and... ...navigation, planning, and embedded inference. Take systems from... ...stability, and mission-level performance. Work closely with hardware... ...onto real platforms. Optimize software and models for real...PerformanceWork experience placementLocal areaNight shift
- ...leading AI research organization in San Francisco seeks an Inference Technical Lead to evaluate silicon platforms and work on... ..., understanding transformer models, and leading teams on performance-critical software. Competitive compensation package, including equity, is...Performance
- ...Riot engineers bring deep knowledge of specific technical areas but... ...of engineers, overseeing performance management, growth opportunities... ...Manager within the Optimize team, you will report to the... ...Manage a team of 4-10 cloud and software engineers; coaching them, overseeing...PerformanceLocal areaFlexible hours
- ...Full Stack Software Engineer We are seeking a versatile Full Stack Software Engineer to help us design and build the systems that... ...Geospatial mapping with MapBox/MapLibre. WebRTC and SRT streaming. Distributed systems and performance optimization....Performance
$180k - $240k
...Frontend Software Engineer San Francisco, Palo Alto, Los Angeles, Toronto About HeyGen At HeyGen, our mission is to make visual... ...quality frontend features for the HeyGen platform. Ensure optimal performance, scalability, and responsiveness of the user interface....PerformanceWork experience placement- ...Senior Front-End Software Engineer Contract Location - Los Angeles, CA / New... ...Preferred: Someone who has experience in app performance, measuring it, looking into things... ...Software Engineer to help build and optimize a streaming application used across...PerformanceContract workWork at officeRemote work
$150k - $180k
...Senior Frontend Software Engineer Title of Role: Senior Frontend Software Engineer... ...REST APIs for seamless data flow. Optimize applications for maximum speed and scalability... ...and debug applications, ensuring high performance and responsiveness. Ideal Candidate...PerformanceWork at office$145k - $250k
...translate designer UX flows into high quality components; optimize performance (SSR vs client side, code splitting, lazy loading, etc.).... ...urgent problems. Who You Are: Have 3+ years of full stack engineering experience (or equivalent), especially in startup or fast-...PerformanceWork at officeRemote workFlexible hours$122.43k - $194.39k
...comprehensively uses machine learning to optimally engineer, additively manufacture, and flexibly... ...created using DAPS are superior in performance, lower in cost, rapidly customizable to... ...Purpose We're seeking a Full‑Stack Software Engineer with solid experience in application...PerformanceTemporary work$180k - $215k
...Fullstack Software Engineer San Francisco, Palo Alto, Los Angeles, Toronto About HeyGen... ...test, and deploy robust, scalable, and optimized features for the HeyGen platform across... ...& Optimization: Monitor platform performance, identify bottlenecks, and implement solutions...PerformanceWork experience placement$141.9k - $190.3k
...Sr Product Software Engineer Disney Entertainment and ESPN Product & Technology is a global organization... ...strategies. Production Support: Monitor and optimize production systems, ensuring platform stability, performance, and uptime. Mentorship: Provide technical...PerformanceWork experience placementWorldwide$140k - $180k
...From telescopes to software architecture, Observable Space provides... ...'s former VP of software engineering to create a developer platform... ...scalability. Design and optimize relational database schemas,... ...Ensure high reliability and performance in production systems. Work...PerformanceWork at officeLocal areaRemote workFlexible hours3 days per week$124k - $180k
...Senior Full-Stack Software Engineer 45449 Burbank, CA, US, 91505 Technology Burbank Full-Time... ...data, and design partners to deliver performant, intuitive, and impactful user experiences... ...Architect Scalable UIs: Create and optimize reusable components, design systems,...PerformanceFull time$180k - $200k
...Software Engineer, Android Title of Role: Software Engineer, Android Location: Los... ...streamline logistics and enhance overall performance in the industry. This company is on a... ...and debug applications, ensuring optimal performance and reliability. Participate...PerformanceWork at office$180k - $200k
...Software Engineer, iOS Title of Role: Software Engineer, iOS Location: Los Angeles, onsite Company Stage of Funding: Seed —... .... Maintain and improve existing applications, ensuring optimal performance and user experience. Implement best practices for code...PerformanceWork at office$175k - $200k
...months. THE ROLE As a Senior Software Engineer on the CMS Team you will create the authoring... ..., and review their app's business performance across key metrics. This includes a... ...sign-up through app design, launch, optimization, and maintenance over time. You'll...PerformanceFull timeContract workRemote workWorldwideHome officeFlexible hours$388k
...POSITION SUMMARY: We are seeking Software Engineers to join our pioneering team at INK. INK... ...and iterate to improve usability and performance. Establish and maintain studio standards... ...need Working knowledge of scalable inference and training technologies . But more...PerformanceRemote work$175k - $220k
...Senior Software Engineer Los Angeles, California, United States Genius Sports is enabling... ...driven insights about team and player performance, assists our semi-automated systems... ...industry experience ~ Experience with optimizing and benchmarking low-latency, real-time...PerformanceWork at officeWorldwide- ...Senior Software Engineer – Agentic AI Analytics (Rust / Data Systems) Location: Glendale... ...concepts. If you like owning hard performance problems, designing columnar/vectorized... ..., profiling, and hardware-aware optimization on large-scale cloud-native data systems...PerformanceWork at office
$150k - $212k
...Senior Software Engineer Parallel Systems is pioneering autonomous battery-electric rail vehicles designed... ...Software Engineer to design, build, and optimize critical system applications. You will work on high-performance backend systems that empower our rail vehicles...PerformanceLocal areaShift work- ...Job Description Job Title: Senior Software Engineer (Full-Stack) Location: Los Angeles... ...frontend and backend environments, ensuring performance and security are prioritized.... ...application stack, focusing on database query optimization and API latency reduction....PerformanceRemote work
$115k - $140k
...seeking a highly skilled and experienced Software Engineer to join our dynamic team. As a... ...development, and project integration. Optimize workflow and deliver rich content across... ...existing systems in Unreal Engine to ensure performance and functionality. Support sound...PerformanceFull timeWork experience placement
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Software Engineer, Inference - Performance Optimization. Be the first to apply!
- software developer internship no experience Los Angeles, CA
- federal - software developer Los Angeles, CA
- software engineer contract Los Angeles, CA
- software engineer healthcare Los Angeles, CA
- network software engineer Los Angeles, CA
- ngo software engineer Los Angeles, CA
- software development engineer aws Los Angeles, CA
- software developer internship Los Angeles, CA
- software developer intern Los Angeles, CA
- software developer fintech Los Angeles, CA


