Software Engineer, LLM Compilation

$2,000 per month

OpenReq

About Etched Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning. Software, LLM Compilation Software sells chips. Etched ASICs are no exception. While our first chip, Sohu, is only able to run transformer models, we still need production-grade software to map existing LLMs onto our chip. You will help make this happen. You will write optimized kernels for the operations that make up a transformer, like attention, model parallelism, and normalization, and package them into components that developers can use (e.g. in the way that vLLM has its fused MergedColumnParallelLinear component). You will work with the hardware team to debug issues that hurt performance. You will work with the software team to build integrations with existing libraries like vLLM and HuggingFace Transformers, so that our software can be drop-in compatible. You will not build a Pytorch compiler stack - instead, we will build a few highly-optimized fused kernels that can be used to implement transformer models. Representative projects: Write an optimized kernel to compute a new attention variant on our hardware Implement HuggingFace’s CohereForCausalLM class using Etched’s transformer building blocks Implement a synchronization mechanism to coordinate between the host CPU and Etched accelerator Implement FP8 quantization for FP16 models using the same mechanism as TransformerEngine You may be a good fit if you: Have 3+ years of software engineering experience Have experience working with machine learning operators Are comfortable doing low-level embedded programming Pick up slack, even if it goes outside your job description Are results-oriented, and bias towards shipping products Want to learn more about machine learning research Strong candidates may also have experience with: Transformer optimizations, such as FlashAttention Ongoing research in machine learning How we’re different: Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs. We are a fully in-person team in Cupertino, and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed. Benefits: Full medical, dental, and vision packages, with 100% of premium covered, 90% for dependents Housing subsidy of $2,000/month for those living within walking distance of the office Daily lunch and dinner in our office Relocation support for those moving to Cupertino #J-18808-Ljbffr OpenReq

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the Software Engineer, LLM Compilation in Cupertino, CA vacancy

Software Engineer, LLM Compilation
$2,000 per month
...generation models and extremely deep chain-of-thought reasoning. Software, LLM Compilation Software sells chips. Etched ASICs are no exception. While... ...You may be a good fit if you: Have 3+ years of software engineering experience Have experience working with machine learning...
Suggested
Work at office
Relocation package
OpenReq
Cupertino, CA
5 days ago
Senior Software Engineer - TensorRT Edge-LLM
...Join NVIDIA’s TensorRT Edge‑LLM team and help shape the next... ...automotive and robotics. We build the software stack that enables Large... .... Design and implement compiler and runtime optimizations tailored... ...Science, Electrical/Computer Engineering, or a closely related field....
Suggested
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior Software Development Engineer - LLM Inference Framework
...As a senior member of the LLM inference framework team, you... ...the intersection of inference engines, distributed systems, and GPU... ...and collaborating with kernel, compiler, and networking teams to close... ...and kernel development Software Engineering ~ Expertise in Python...
Suggested
Advanced Micro Devices , Inc.
Santa Clara, CA
1 day ago
Software Engineer, GDC LLM Serving and GPU Performance
$207k - $300k
Software Engineer, GDC LLM Serving and GPU Performance Google Sunnyvale, CA, USA Qualifications Bachelor’s degree or equivalent practical experience. 8 years of experience in software development. 5 years of experience testing and launching software products, and 3...
Suggested
Full time
Google Inc.
Sunnyvale, CA
4 days ago
Senior Deep Learning Software Engineer, LLM Performance
$184k - $287.5k
Senior Deep Learning Software Engineer, LLM Performance page is loaded## Senior Deep Learning Software Engineer, LLM Performancelocations: US... ...the crowd:*** Prior experience with a LLM framework or a DL compiler in inference, deployment, algorithms, or implementation*...
Suggested
NVIDIA Corporation
Santa Clara, CA
1 day ago
Software Engineer, LLM Infrastructure
$2,000 per month
...that would be impossible with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning. Software Engineer, LLM Infrastructure Transformer ASICs, like those built by Etched, dramatically improve time-to-first-token latency. For a large...
Work at office
Relocation package
OpenReq
Cupertino, CA
5 days ago
Sr Full-stack Software Engineer, AIML Data Operations
$181.1k - $272.1k
Sr Full-stack Software Engineer, AIML Data Operations Cupertino, California, United States Software... ...deliver the highest-quality data for LLM training and evaluation, the foundation... .... Demonstrated expertise in a modern, compiled, object-oriented language such as Swift...
Relocation
Apple Inc.
Cupertino, CA
2 days ago
C++ Developer with compiler experience
...C++ Developer with compiler experience Location: Sunnyvale/Cupertino WFO all 5 days Job description: - Preferred C++ software focused candidates - Candidates in gaming industry with good experience in C++ - Experience in Rendering techniques, Visual...
Info Way Solutions
Sunnyvale, CA
4 days ago
ML Compiler Software Engineer, Tensor Processing Unit
$147k - $211k
ML Compiler Software Engineer, Tensor Processing Unit Google - Sunnyvale, CA, USA Qualifications Bachelor’s degree or equivalent practical experience. 2 years of experience with coding in C++, or 1 year of experience with an advanced degree. 1 year of experience with...
Full time
Google Inc.
Sunnyvale, CA
5 days ago
Software Engineer - Compiler
$120k - $275k
...Software Engineer - Compiler Mountain View, CA MatX is on a mission to be the compute platform for AGI. We are developing vertically integrated full-stack solutions from silicon to systems including hardware and software to train and run the largest ML workloads...
Full time
Work experience placement
Work at office
Local area
Remote work
Monday to Friday
Flexible hours
3 days per week
MatX
Mountain View, CA
4 days ago
US 2026 Software - Compiler Engineer Intern
...US 2026 Software - Compiler Engineer Intern We are seeking a Compiler Engineer intern to support the Compiler Software team in developing the compiler and runtime for in‑memory machine learning accelerators at TetraMem. Responsibilities: Explore and support techniques...
Internship
TetraMem Inc
San Jose, CA
5 days ago
Software Development Engineer, Platforms
$147.4k - $272.1k
Software Development Engineer, Platforms Cupertino, California, United States Software and Services Imagine... ...rotation schedule. Integrate GenAI/LLM capabilities to enhance developer... ...software development processes such as compilation, unit testing, code analysis, release...
Relocation
Apple Inc.
Cupertino, CA
5 days ago
Senior Software Engineer, DL Compilers
$184k - $287.5k
...infrastructure, from training large-scale models to running inference in production. That position depends on software as much as hardware, and compiler engineering is a big part of what makes it work. What you’ll be doing: Designing and implementing compiler passes, IRs,...
Work experience placement
NVIDIA Gruppe
Santa Clara, CA
5 days ago
Software Engineer, Edge TPU Compilers, Silicon
$147k - $211k
Software Engineer, Edge TPU Compilers, Silicon Location: Mountain View, CA, USA; Kirkland, WA, USA. Mid Experience driving progress, solving problems, and mentoring more junior team members; deeper expertise and applied knowledge within relevant area. Benefits In accordance...
Full time
Temporary work
Google Inc.
Mountain View, CA
3 days ago
Software Engineer: Agentic Evaluation
$181.1k - $318.4k
Senior Software Engineer: Agentic Evaluation Cupertino, California, United States — Machine Learning... ...programming skills in one or more compiled languages (Swift, C++ or Objective‑C).... ...conditions Experience evaluating ML, LLM or agent-based systems, including familiarity...
Relocation package
Shift work
Apple Inc.
Cupertino, CA
1 day ago
Applications Engineering, Sr Staff Engineer- RTL-to-GDS/Fusion Compiler
...creation of high-performance silicon chips and software content. Join us to transform the future... .... You Are: You are an accomplished engineer with a passion for physical design and a... ...‑leading EDA tools such as Fusion Compiler, PrimeTime, PrimeClosure, Formality, DSO...
Synopsys
Sunnyvale, CA
2 days ago
Software Development Engineer-AI/LLM Network-Global Frontier Tech Recruitment Start (PhD)
$212.8k
...technologies from network architecture, software defined networking (SDN), network virtualization... ...network technologies to support AI/LLM applications. - Design and development... ...Development, Computer Science, Computer Engineering, or a related technical discipline....
Temporary work
Local area
ByteDance
San Jose, CA
4 days ago
C++ Compiler & Graphics Engineer
...Title: C++ Compiler & Graphics Engineer Location: Cupertino, CA (On-site) Duration: 6 months (possibility of extension) Implementation Partner... ...C++ Developer with compiler experience Preferred C++ software-focused candidates Candidates from the gaming...
Work at office
Wise Skulls
Cupertino, CA
2 days ago
Secure AI Backend Engineer (LLM & Microservices)
...A leading cybersecurity firm is seeking a candidate to enhance LLM security by architecting monitoring and filtering systems. This role requires expertise in deploying AI systems, managing prompts, and safeguarding against emerging threats. The ideal candidate will have...
Fortinet
Sunnyvale, CA
4 days ago
AI Engineer (LLM, Agents & Data Engineering)
...Title: AI Engineer (LLM, Agents & Data Engineering) Location: Austin, TX or Sunnyvale, CA (Hybrid) Duration: 6 months (possibility... ...Required Qualifications: ~5+ years of software engineering experience with Python or similar programming languages...
Wise Skulls
Sunnyvale, CA
1 day ago
Senior Full Stack Software Engineer
$140k - $210k
...make a difference at Fiserv. Job Title Senior Full Stack Software Engineer What does a successful Fullstack Engineer do at Clover?... ...have: Familiarity with Google Cloud Pub/Sub, K8, Lambda run, LLM prompting techniques, Kotlin and Python Previous experience...
Work at office
Worldwide
Monday to Friday
Fiserv
Sunnyvale, CA
5 days ago
Principal Software Engineer - Large-Scale LLM Memory and Storage Systems
$272k - $425.5k
Principal Software Engineer – Large-Scale LLM Memory and Storage Systems page is loaded## Principal Software Engineer – Large-Scale LLM Memory and Storage Systemslocations: US, CA, Santa Clara: US, WA, Remote: US, MA, Remotetime type: Full timeposted on: Posted Todayjob...
Local area
Remote work
NVIDIA Corporation
Santa Clara, CA
1 day ago
Staff Software Development Engineer (LLM)
$196.5k - $219.3k
...implement functions to monitor and filter LLM requests/responses in real time,... ...requirements into platform features. Mentor junior engineers on secure backend development and best... ...Ensure the timely delivery of high-quality software features while adhering to project...
Full time
Worldwide
Fortinet
Sunnyvale, CA
1 day ago
Software Engineer - LLM Applications and AI Agents
$156k - $387.6k
...transform data platforms. We are seeking a highly skilled and innovative Software Engineer to join our cutting-edge Agentic Engine team. As part of a division of the data platform team which focuses on LLM adoption, you will have the opportunity to work with state-of-the-...
Temporary work
Local area
Tik Tok
San Jose, CA
1 day ago
Python Developer- LLM / AI Applications
...maintains a close and long-term relationship with our direct client. In support of their needs, we are looking for a Python Developer - LLM / AI Applications. Job Description: Job Title: Python Developer - LLM / AI Applications Job Type: Contract...
Contract work
Maxonic
Sunnyvale, CA
5 days ago
Software Engineer (Customer Success), Developer Engagement
$147.4k - $272.1k
Software Engineer (Customer Success), Developer Engagement Cupertino, California, United States... ...stakeholders Proficient in modern scripting or compiled languages (Python, Swift, C/C++, etc.)... ...incorporating AI/ML capabilities or LLM‑based workflows into developer‑facing...
Relocation
Apple Inc.
Cupertino, CA
2 days ago
Software Engineer - Insights Platform
$125k - $222k
...deploy these complex systems safely. Our software is used by top automotive OEMs, trucking... ...agentic analytics. Our platform gives autonomy engineers complete visibility and control over the... ...agents * Bonus: experience with LLM tool use, developing agentic workflows, or...
Full time
For contractors
For subcontractor
Casual work
Work at office
Remote work
Day shift
Applied Intuition
Sunnyvale, CA
2 days ago
Senior ML Software Engineer - Integration & Quality
...computation. About the Role We are looking for a Software Engineer to join the ML Integration and Quality... ...closely with engineers across runtime, compiler, kernel, and hardware teams to debug... ...ML model deployment. Familiarity with LLM or multimodal model workloads....
Work at office
Remote work
Dormont Manufacturing Company
Sunnyvale, CA
5 days ago
LLM Infra Backend Engineer: Streaming & Guardrails
...CloudAct Inc. is looking for a backend engineer to enhance the FastAPI proxy. You will handle streaming, retries, and ensure the system... ...especially in high-concurrency environments. Additional advantages include exposure to LLM APIs or streaming proxies. #J-18808-Ljbffr...
CloudAct Inc.
Sunnyvale, CA
5 days ago
Senior AI Inference Compiler Engineer
$152k - $241.5k
...computing company”. We are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers for its Deep Learning & AI Compiler (DLC)... ...frameworks, such as PyTorch, XLA etc. Understanding of LLM inference optimizations and techniques. GPU kernel...
NVIDIA Gruppe
Santa Clara, CA
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer, LLM Compilation. Be the first to apply!