Software Engineer, LLM Compilation
$2,000 per monthOpenReq
About Etched Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning. Software, LLM Compilation Software sells chips. Etched ASICs are no exception. While our first chip, Sohu, is only able to run transformer models, we still need production-grade software to map existing LLMs onto our chip. You will help make this happen. You will write optimized kernels for the operations that make up a transformer, like attention, model parallelism, and normalization, and package them into components that developers can use (e.g. in the way that vLLM has its fused MergedColumnParallelLinear component). You will work with the hardware team to debug issues that hurt performance. You will work with the software team to build integrations with existing libraries like vLLM and HuggingFace Transformers, so that our software can be drop-in compatible. You will not build a Pytorch compiler stack - instead, we will build a few highly-optimized fused kernels that can be used to implement transformer models. Representative projects: Write an optimized kernel to compute a new attention variant on our hardware Implement HuggingFace’s CohereForCausalLM class using Etched’s transformer building blocks Implement a synchronization mechanism to coordinate between the host CPU and Etched accelerator Implement FP8 quantization for FP16 models using the same mechanism as TransformerEngine You may be a good fit if you: Have 3+ years of software engineering experience Have experience working with machine learning operators Are comfortable doing low-level embedded programming Pick up slack, even if it goes outside your job description Are results-oriented, and bias towards shipping products Want to learn more about machine learning research Strong candidates may also have experience with: Transformer optimizations, such as FlashAttention Ongoing research in machine learning How we’re different: Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs. We are a fully in-person team in Cupertino, and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed. Benefits: Full medical, dental, and vision packages, with 100% of premium covered, 90% for dependents Housing subsidy of $2,000/month for those living within walking distance of the office Daily lunch and dinner in our office Relocation support for those moving to Cupertino #J-18808-Ljbffr
$152k - $241.5k
...Join NVIDIA's TensorRT Edge-LLM team and help shape the next... ...automotive and robotics. We build the software stack that enables Large... .... Design and implement compiler and runtime optimizations tailored... ...Science, Electrical/Computer Engineering, or a closely related field....SuggestedRemote work- ...As a senior member of the LLM inference framework team, you... ...the intersection of inference engines, distributed systems, and GPU... ...and collaborating with kernel, compiler, and networking teams to close... ...and kernel development Software Engineering ~ Expertise in Python...Suggested
- We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate... ...Prior experience with an LLM framework or a deep learning compiler in inference, deployment, algorithms, or implementation....Suggested
$207k - $300k
Software Engineer, GDC LLM Serving and GPU Performance Google Sunnyvale, CA, USA Qualifications Bachelor’s degree or equivalent practical experience. 8 years of experience in software development. 5 years of experience testing and launching software products, and 3...SuggestedFull time$2,000 per month
...that would be impossible with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning. Software Engineer, LLM Infrastructure Transformer ASICs, like those built by Etched, dramatically improve time-to-first-token latency. For a large...SuggestedWork at officeRelocation package$181.1k - $272.1k
...Sr Full-Stack Software Engineer, AIML Data Operations Imagine what you could do here. We're... ...help deliver the highest-quality data for LLM training and evaluation, the foundation... ...~ Demonstrated expertise in a modern, compiled, object-oriented language such as Swift...Relocation$147.4k - $272.1k
...Apple Inc. is seeking a Compiler Security Engineer in Cupertino, California, to develop secure language features for C/C++ and enhance security in the Clang compiler. Your role is crucial in eliminating vulnerabilities, impacting billions of devices globally. You will...$147.4k - $272.1k
...Compiler Security Engineer - C/C++, Languages & Runtimes Cupertino, California, United States Software and Services The Security Tools team at Apple is looking for software engineers to develop secure language features for C/C++ and enhance security features in the Clang...Work at officeRelocation$147.4k - $272.1k
...Software Development Engineer, Platforms Cupertino, California, United States Software and Services Imagine... ...rotation schedule. Integrate GenAI/LLM capabilities to enhance developer... ...software development processes such as compilation, unit testing, code analysis, release...Relocation$147k - $211k
ML Compiler Software Engineer, Tensor Processing Unit Google - Sunnyvale, CA, USA Qualifications Bachelor’s degree or equivalent practical experience. 2 years of experience with coding in C++, or 1 year of experience with an advanced degree. 1 year of experience with...Full time- ...C++ Developer with compiler experience Location: Sunnyvale/Cupertino WFO all 5 days Job description: - Preferred C++ software focused candidates - Candidates in gaming industry with good experience in C++ - Experience in Rendering techniques, Visual...
$45 - $52 per hour
...Immediate need for a talented C++ Developer with Compiler . This is a 12+months contract opportunity with long-term potential and is... ...C++ Developer with compiler experience Preferred C++ software focused candidates Experience in Rendering techniques, Visual...Contract workLocal areaImmediate start$120k - $275k
...Software Engineer - Compiler Mountain View, CA MatX is on a mission to be the compute platform for AGI. We are developing vertically integrated full-stack solutions from silicon to systems including hardware and software to train and run the largest ML workloads...Full timeWork experience placementWork at officeLocal areaRemote workMonday to FridayFlexible hours3 days per week- ...US 2026 Software - Compiler Engineer Intern We are seeking a Compiler Engineer intern to support the Compiler Software team in developing the compiler and runtime for in‑memory machine learning accelerators at TetraMem. Responsibilities: Explore and support techniques...Internship
$50 - $55 per hour
...Request ID: 89185-1 Title: Full Stack LLM Engineer Location : Sunnyvale, CA (3x/ week onsite) Duration: 6+ Months Salary... ...high-velocity product engineer who combines 8+ years of robust software engineering experience with a modern, AI-accelerated...$184k - $287.5k
...infrastructure, from training large-scale models to running inference in production. That position depends on software as much as hardware, and compiler engineering is a big part of what makes it work. What you’ll be doing: Designing and implementing compiler passes, IRs,...Work experience placement- ...creation of high-performance silicon chips and software content. Join us to transform the future... ...You Are: You are an accomplished engineer with a passion for physical design and a... ...-leading EDA tools such as Fusion Compiler, PrimeTime, PrimeClosure, Formality, DSO...
$147k - $211k
Software Engineer, Edge TPU Compilers, Silicon Location: Mountain View, CA, USA; Kirkland, WA, USA. Mid Experience driving progress, solving problems, and mentoring more junior team members; deeper expertise and applied knowledge within relevant area. Benefits In accordance...Full timeTemporary work- ...Hiring: C++ Compiler & Graphics Engineer | Cupertino, CA | On-site We're looking for an experienced C++ Compiler & Graphics Enginee r with... ...Troubleshoot performance and functionality issues across software and graphics compone nts• Participate in code reviews,...
$212.8k
...technologies from network architecture, software defined networking (SDN), network virtualization... ...network technologies to support AI/LLM applications. - Design and development... ...Development, Computer Science, Computer Engineering, or a related technical discipline....Temporary workLocal area$147.4k - $272.1k
...Software Engineer (Customer Success), Developer Engagement Cupertino, California, United States... ...stakeholders Proficient in modern scripting or compiled languages (Python, Swift, C/C++, etc.)... ...incorporating AI/ML capabilities or LLM‑based workflows into developer‑facing...Relocation- ...Title: C++ Compiler & Graphics Engineer Location: Cupertino, CA (On-site) Duration: 6 months (possibility of extension) Implementation Partner... ...C++ Developer with compiler experience Preferred C++ software-focused candidates Candidates from the gaming...
- ...Job Description: ~ We're looking for a Software Engineer to join a team working on next-generation AI and high-performance computing technologies... ...: Design, develop, and optimize software components for compiler and/or GPU technology stacks. Contribute to compiler...
- ...A leading cybersecurity firm is seeking a candidate to enhance LLM security by architecting monitoring and filtering systems. This role requires expertise in deploying AI systems, managing prompts, and safeguarding against emerging threats. The ideal candidate will have...
$100k - $150k
...be responsible for the internal working of the AI inference stack and compiler running neural networks in millions of Tesla vehicles and Optimus. You will collaborate closely with the AI Engineers and Hardware Engineers to understand the full inference stack and design...Full timeTemporary workPart timeSummer workInternshipImmediate startFlexible hours- ...visualization frameworks. You will work across the software stack, from designing scalable APIs and... ...design workflows to enable hardware engineering teams to design the world’s most... ...orchestration Experience integrating LLM APIs (e.g., OpenAI, Claude, Gemini) into...
$174k - $252k
Senior Software Engineer, ML Compilers, Edge TPU Location: Mountain View, CA, USA; Kirkland, WA, USA. Mid-level role focused on driving progress, solving problems, and mentoring more junior team members; requires deeper expertise and applied knowledge within the relevant...Full timeTemporary work$156k - $387.6k
...Software Engineer - LLM Applications and AI Agents Location: San Jose Employment Type: Regular Job Code: A00574A Responsibilities: Join us in pushing the boundaries of AI technology and creating the next generation of intelligent systems that will transform...Temporary workLocal area$196.5k - $219.3k
...implement functions to monitor and filter LLM requests/responses in real time,... ...requirements into platform features. Mentor junior engineers on secure backend development and best... ...Ensure the timely delivery of high-quality software features while adhering to project...Full timeWorldwide$160k
...What to Expect As a Software Engineer within our AI teams, you will contribute to one of the most advanced and widely deployed AI Platforms... ...Engineers to understand requirements and co-design the compiler/runtime to extract the maximum performance out of our hardware...Hourly payFull timeTemporary workImmediate startFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Software Engineer, LLM Compilation. Be the first to apply!
- software engineer internship remote Cupertino, CA
- machine learning software engineer Cupertino, CA
- senior robotics software engineer Cupertino, CA
- software development engineer aws Cupertino, CA
- startup software engineer Cupertino, CA
- rust software engineer Cupertino, CA
- part time software developer remote Cupertino, CA
- senior software engineer remote Cupertino, CA
- software engineer Cupertino, CA
- real time software engineer Cupertino, CA


