Staff Software Engineer - GenAI Performance and Kernel

$190.9k - $232.8k

Cacheflow

About This Role

As a staff software engineer for GenAI Performance and Kernel, you will own the design, implementation, optimization, and correctness of the high-performance GPU kernels powering our GenAI inference stack. You will lead development of highly tuned, low-level compute paths, manage trade-offs between hardware efficiency and generality, and mentor others in kernel-level performance engineering. You will work closely with ML researchers, systems engineers, and product teams to push the state of the art in inference performance at scale.

What You Will Do

- Lead the design, implementation, benchmarking, and maintenance of core compute kernels (e.g. attention, MLP, softmax, layernorm, memory management) optimized for various hardware backends (GPU, accelerators)
- Drive the performance roadmap for kernel-level improvements: vectorization, tensorization, tiling, fusion, mixed precision, sparsity, quantization, memory reuse, scheduling, auto-tuning, etc.
- Integrate kernel optimizations with higher-level ML systems
- Build and maintain profiling, instrumentation, and verification tooling to detect correctness issues, performance regressions, numerical issues, and hardware utilization gaps
- Lead performance investigations and root-cause analysis on inference bottlenecks, e.g. memory bandwidth, cache contention, kernel launch overhead, tensor fragmentation
- Establish coding patterns, abstractions, and frameworks to modularize kernels for reuse, cross-backend portability, and maintainability
- Influence system architecture decisions to make kernel improvements more effective (e.g. memory layout, dataflow scheduling, kernel fusion boundaries)
- Mentor and guide other engineers working on lower-level performance, provide code reviews, and help set best practices
- Collaborate with infrastructure, tooling, and ML teams to roll out kernel-level optimizations into production, and monitor their impact

What We Look For

- BS/MS/PhD in Computer Science or a related field
- Deep hands-on experience writing and tuning compute kernels (CUDA, Triton, OpenCL, LLVM IR, assembly, or similar) for ML workloads
- Strong knowledge of GPU/accelerator architecture: warp structure, memory hierarchy (global, shared, register, L1/L2 caches), tensor cores, scheduling, SM occupancy, etc.
- Experience with advanced optimization techniques: tiling, blocking, software pipelining, vectorization, fusion, loop transformations, auto-tuning
- Familiarity with ML-specific kernel libraries (cuBLAS, cuDNN, CUTLASS, oneDNN, etc.) or open kernels
- Strong debugging and profiling skills (Nsight, nvprof, perf, VTune, custom instrumentation)
- Experience reasoning about numerical stability, mixed precision, quantization, and error propagation
- Experience integrating optimized kernels into real-world ML inference systems; exposure to distributed inference pipelines, memory management, and runtime systems
- Experience building high-performance products leveraging GPU acceleration
- Excellent communication and leadership skills: able to drive design discussions, mentor colleagues, and make trade-offs visible
- A track record of shipping performance-critical, high-quality production software
- Bonus: published in systems/ML performance venues (e.g. MLSys, ASPLOS, ISCA, PPoPP), experience with custom accelerators or FPGA, experience with sparsity or model compression techniques

Pay Range Transparency

Databricks is committed to fair and equitable compensation practices. The pay range for this role is listed below and represents the expected salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location. Based on the factors above, Databricks anticipates utilizing the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above. For more information regarding which range your location is in, visit our page here.

Local Pay Range

$190,900 — $232,800 USD

About Databricks

Databricks is the data and AI company. More than 10,000 organizations worldwide (including Comcast, Condé Nast, Grammarly, and over 50% of the Fortune 500) rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe, and was founded by the original creators of Lakehouse, Apache Spark™, Delta Lake and MLflow. To learn more, follow Databricks on Twitter, LinkedIn and Facebook.

Benefits

At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visit

Our Commitment to Diversity and Inclusion

At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics.

Compliance

If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.


Vacancy posted 4 days ago

