Software Engineer, LLM Compilation
$2,000 per monthOpenReq
About Etched Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning. Software, LLM Compilation Software sells chips. Etched ASICs are no exception. While our first chip, Sohu, is only able to run transformer models, we still need production-grade software to map existing LLMs onto our chip. You will help make this happen. You will write optimized kernels for the operations that make up a transformer, like attention, model parallelism, and normalization, and package them into components that developers can use (e.g. in the way that vLLM has its fused MergedColumnParallelLinear component). You will work with the hardware team to debug issues that hurt performance. You will work with the software team to build integrations with existing libraries like vLLM and HuggingFace Transformers, so that our software can be drop-in compatible. You will not build a Pytorch compiler stack - instead, we will build a few highly-optimized fused kernels that can be used to implement transformer models. Representative projects: Write an optimized kernel to compute a new attention variant on our hardware Implement HuggingFace’s CohereForCausalLM class using Etched’s transformer building blocks Implement a synchronization mechanism to coordinate between the host CPU and Etched accelerator Implement FP8 quantization for FP16 models using the same mechanism as TransformerEngine You may be a good fit if you: Have 3+ years of software engineering experience Have experience working with machine learning operators Are comfortable doing low-level embedded programming Pick up slack, even if it goes outside your job description Are results-oriented, and bias towards shipping products Want to learn more about machine learning research Strong candidates may also have experience with: Transformer optimizations, such as FlashAttention Ongoing research in machine learning How we’re different: Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs. We are a fully in-person team in Cupertino, and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed. Benefits: Full medical, dental, and vision packages, with 100% of premium covered, 90% for dependents Housing subsidy of $2,000/month for those living within walking distance of the office Daily lunch and dinner in our office Relocation support for those moving to Cupertino #J-18808-Ljbffr OpenReq
$2,000 per month
...generation models and extremely deep chain-of-thought reasoning. Software, LLM Compilation Software sells chips. Etched ASICs are no exception. While... ...You may be a good fit if you: Have 3+ years of software engineering experience Have experience working with machine learning...SuggestedWork at officeRelocation package- ...Join NVIDIA’s TensorRT Edge‑LLM team and help shape the next... ...automotive and robotics. We build the software stack that enables Large... .... Design and implement compiler and runtime optimizations tailored... ...Science, Electrical/Computer Engineering, or a closely related field....Suggested
- ...As a senior member of the LLM inference framework team, you... ...the intersection of inference engines, distributed systems, and GPU... ...and collaborating with kernel, compiler, and networking teams to close... ...and kernel development Software Engineering ~ Expertise in Python...Suggested
$207k - $300k
Software Engineer, GDC LLM Serving and GPU Performance Google Sunnyvale, CA, USA Qualifications Bachelor’s degree or equivalent practical experience. 8 years of experience in software development. 5 years of experience testing and launching software products, and 3...SuggestedFull time$184k - $287.5k
Senior Deep Learning Software Engineer, LLM Performance page is loaded## Senior Deep Learning Software Engineer, LLM Performancelocations: US... ...the crowd:*** Prior experience with a LLM framework or a DL compiler in inference, deployment, algorithms, or implementation*...Suggested$2,000 per month
...that would be impossible with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning. Software Engineer, LLM Infrastructure Transformer ASICs, like those built by Etched, dramatically improve time-to-first-token latency. For a large...Work at officeRelocation package$181.1k - $272.1k
Sr Full-stack Software Engineer, AIML Data Operations Cupertino, California, United States Software... ...deliver the highest-quality data for LLM training and evaluation, the foundation... .... Demonstrated expertise in a modern, compiled, object-oriented language such as Swift...Relocation- ...C++ Developer with compiler experience Location: Sunnyvale/Cupertino WFO all 5 days Job description: - Preferred C++ software focused candidates - Candidates in gaming industry with good experience in C++ - Experience in Rendering techniques, Visual...
$147k - $211k
ML Compiler Software Engineer, Tensor Processing Unit Google - Sunnyvale, CA, USA Qualifications Bachelor’s degree or equivalent practical experience. 2 years of experience with coding in C++, or 1 year of experience with an advanced degree. 1 year of experience with...Full time$120k - $275k
...Software Engineer - Compiler Mountain View, CA MatX is on a mission to be the compute platform for AGI. We are developing vertically integrated full-stack solutions from silicon to systems including hardware and software to train and run the largest ML workloads...Full timeWork experience placementWork at officeLocal areaRemote workMonday to FridayFlexible hours3 days per week- ...US 2026 Software - Compiler Engineer Intern We are seeking a Compiler Engineer intern to support the Compiler Software team in developing the compiler and runtime for in‑memory machine learning accelerators at TetraMem. Responsibilities: Explore and support techniques...Internship
$147.4k - $272.1k
Software Development Engineer, Platforms Cupertino, California, United States Software and Services Imagine... ...rotation schedule. Integrate GenAI/LLM capabilities to enhance developer... ...software development processes such as compilation, unit testing, code analysis, release...Relocation$184k - $287.5k
...infrastructure, from training large-scale models to running inference in production. That position depends on software as much as hardware, and compiler engineering is a big part of what makes it work. What you’ll be doing: Designing and implementing compiler passes, IRs,...Work experience placement$147k - $211k
Software Engineer, Edge TPU Compilers, Silicon Location: Mountain View, CA, USA; Kirkland, WA, USA. Mid Experience driving progress, solving problems, and mentoring more junior team members; deeper expertise and applied knowledge within relevant area. Benefits In accordance...Full timeTemporary work$181.1k - $318.4k
Senior Software Engineer: Agentic Evaluation Cupertino, California, United States — Machine Learning... ...programming skills in one or more compiled languages (Swift, C++ or Objective‑C).... ...conditions Experience evaluating ML, LLM or agent-based systems, including familiarity...Relocation packageShift work- ...creation of high-performance silicon chips and software content. Join us to transform the future... .... You Are: You are an accomplished engineer with a passion for physical design and a... ...‑leading EDA tools such as Fusion Compiler, PrimeTime, PrimeClosure, Formality, DSO...
$212.8k
...technologies from network architecture, software defined networking (SDN), network virtualization... ...network technologies to support AI/LLM applications. - Design and development... ...Development, Computer Science, Computer Engineering, or a related technical discipline....Temporary workLocal area- ...Title: C++ Compiler & Graphics Engineer Location: Cupertino, CA (On-site) Duration: 6 months (possibility of extension) Implementation Partner... ...C++ Developer with compiler experience Preferred C++ software-focused candidates Candidates from the gaming...Work at office
- ...A leading cybersecurity firm is seeking a candidate to enhance LLM security by architecting monitoring and filtering systems. This role requires expertise in deploying AI systems, managing prompts, and safeguarding against emerging threats. The ideal candidate will have...
- ...Title: AI Engineer (LLM, Agents & Data Engineering) Location: Austin, TX or Sunnyvale, CA (Hybrid) Duration: 6 months (possibility... ...Required Qualifications: ~5+ years of software engineering experience with Python or similar programming languages...
$140k - $210k
...make a difference at Fiserv. Job Title Senior Full Stack Software Engineer What does a successful Fullstack Engineer do at Clover?... ...have: Familiarity with Google Cloud Pub/Sub, K8, Lambda run, LLM prompting techniques, Kotlin and Python Previous experience...Work at officeWorldwideMonday to Friday$272k - $425.5k
Principal Software Engineer – Large-Scale LLM Memory and Storage Systems page is loaded## Principal Software Engineer – Large-Scale LLM Memory and Storage Systemslocations: US, CA, Santa Clara: US, WA, Remote: US, MA, Remotetime type: Full timeposted on: Posted Todayjob...Local areaRemote work$196.5k - $219.3k
...implement functions to monitor and filter LLM requests/responses in real time,... ...requirements into platform features. Mentor junior engineers on secure backend development and best... ...Ensure the timely delivery of high-quality software features while adhering to project...Full timeWorldwide$156k - $387.6k
...transform data platforms. We are seeking a highly skilled and innovative Software Engineer to join our cutting-edge Agentic Engine team. As part of a division of the data platform team which focuses on LLM adoption, you will have the opportunity to work with state-of-the-...Temporary workLocal area- ...maintains a close and long-term relationship with our direct client. In support of their needs, we are looking for a Python Developer - LLM / AI Applications. Job Description: Job Title: Python Developer - LLM / AI Applications Job Type: Contract...Contract work
$147.4k - $272.1k
Software Engineer (Customer Success), Developer Engagement Cupertino, California, United States... ...stakeholders Proficient in modern scripting or compiled languages (Python, Swift, C/C++, etc.)... ...incorporating AI/ML capabilities or LLM‑based workflows into developer‑facing...Relocation$125k - $222k
...deploy these complex systems safely. Our software is used by top automotive OEMs, trucking... ...agentic analytics. Our platform gives autonomy engineers complete visibility and control over the... ...agents * Bonus: experience with LLM tool use, developing agentic workflows, or...Full timeFor contractorsFor subcontractorCasual workWork at officeRemote workDay shift- ...computation. About the Role We are looking for a Software Engineer to join the ML Integration and Quality... ...closely with engineers across runtime, compiler, kernel, and hardware teams to debug... ...ML model deployment. Familiarity with LLM or multimodal model workloads....Work at officeRemote work
- ...CloudAct Inc. is looking for a backend engineer to enhance the FastAPI proxy. You will handle streaming, retries, and ensure the system... ...especially in high-concurrency environments. Additional advantages include exposure to LLM APIs or streaming proxies. #J-18808-Ljbffr...
$152k - $241.5k
...computing company”. We are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers for its Deep Learning & AI Compiler (DLC)... ...frameworks, such as PyTorch, XLA etc. Understanding of LLM inference optimizations and techniques. GPU kernel...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Software Engineer, LLM Compilation. Be the first to apply!
- federal - software developer Cupertino, CA
- software engineer contract Cupertino, CA
- software engineer healthcare Cupertino, CA
- network software engineer Cupertino, CA
- ngo software engineer Cupertino, CA
- software development engineer aws Cupertino, CA
- software developer fintech Cupertino, CA
- software data engineer Cupertino, CA
- senior software engineer remote Cupertino, CA
- intel software engineer Cupertino, CA


