Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer, LLM Compilation

$2,000 per month

OpenReq

About Etched Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning. Software, LLM Compilation Software sells chips. Etched ASICs are no exception. While our first chip, Sohu, is only able to run transformer models, we still need production-grade software to map existing LLMs onto our chip. You will help make this happen. You will write optimized kernels for the operations that make up a transformer, like attention, model parallelism, and normalization, and package them into components that developers can use (e.g. in the way that vLLM has its fused MergedColumnParallelLinear component). You will work with the hardware team to debug issues that hurt performance. You will work with the software team to build integrations with existing libraries like vLLM and HuggingFace Transformers, so that our software can be drop-in compatible. You will not build a Pytorch compiler stack - instead, we will build a few highly-optimized fused kernels that can be used to implement transformer models. Representative projects: Write an optimized kernel to compute a new attention variant on our hardware Implement HuggingFace’s CohereForCausalLM class using Etched’s transformer building blocks Implement a synchronization mechanism to coordinate between the host CPU and Etched accelerator Implement FP8 quantization for FP16 models using the same mechanism as TransformerEngine You may be a good fit if you: Have 3+ years of software engineering experience Have experience working with machine learning operators Are comfortable doing low-level embedded programming Pick up slack, even if it goes outside your job description Are results-oriented, and bias towards shipping products Want to learn more about machine learning research Strong candidates may also have experience with: Transformer optimizations, such as FlashAttention Ongoing research in machine learning How we’re different: Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs. We are a fully in-person team in Cupertino, and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed. Benefits: Full medical, dental, and vision packages, with 100% of premium covered, 90% for dependents Housing subsidy of $2,000/month for those living within walking distance of the office Daily lunch and dinner in our office Relocation support for those moving to Cupertino #J-18808-Ljbffr OpenReq

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Software Engineer, LLM Compilation in Cupertino, CA vacancy
  • $2,000 per month

     ...generation models and extremely deep chain-of-thought reasoning. Software, LLM Compilation Software sells chips. Etched ASICs are no exception. While...  ...You may be a good fit if you: Have 3+ years of software engineering experience Have experience working with machine learning... 
    Suggested
    Work at office
    Relocation package

    OpenReq

    Cupertino, CA
    5 days ago
  •  ...Join NVIDIA’s TensorRT Edge‑LLM team and help shape the next...  ...automotive and robotics. We build the software stack that enables Large...  .... Design and implement compiler and runtime optimizations tailored...  ...Science, Electrical/Computer Engineering, or a closely related field.... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...As a senior member of the LLM inference framework team, you...  ...the intersection of inference engines, distributed systems, and GPU...  ...and collaborating with kernel, compiler, and networking teams to close...  ...and kernel development Software Engineering ~ Expertise in Python... 
    Suggested

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    1 day ago
  • $207k - $300k

    Software Engineer, GDC LLM Serving and GPU Performance Google Sunnyvale, CA, USA Qualifications Bachelor’s degree or equivalent practical experience. 8 years of experience in software development. 5 years of experience testing and launching software products, and 3... 
    Suggested
    Full time

    Google Inc.

    Sunnyvale, CA
    4 days ago
  • $184k - $287.5k

    Senior Deep Learning Software Engineer, LLM Performance page is loaded## Senior Deep Learning Software Engineer, LLM Performancelocations: US...  ...the crowd:*** Prior experience with a LLM framework or a DL compiler in inference, deployment, algorithms, or implementation*... 
    Suggested

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $2,000 per month

     ...that would be impossible with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning. Software Engineer, LLM Infrastructure Transformer ASICs, like those built by Etched, dramatically improve time-to-first-token latency. For a large... 
    Work at office
    Relocation package

    OpenReq

    Cupertino, CA
    5 days ago
  • $181.1k - $272.1k

    Sr Full-stack Software Engineer, AIML Data Operations Cupertino, California, United States Software...  ...deliver the highest-quality data for LLM training and evaluation, the foundation...  .... Demonstrated expertise in a modern, compiled, object-oriented language such as Swift... 
    Relocation

    Apple Inc.

    Cupertino, CA
    2 days ago
  •  ...C++ Developer with compiler experience Location: Sunnyvale/Cupertino WFO all 5 days Job description: - Preferred C++ software focused candidates - Candidates in gaming industry with good experience in C++ - Experience in Rendering techniques, Visual... 

    Info Way Solutions

    Sunnyvale, CA
    4 days ago
  • $147k - $211k

    ML Compiler Software Engineer, Tensor Processing Unit Google - Sunnyvale, CA, USA Qualifications Bachelor’s degree or equivalent practical experience. 2 years of experience with coding in C++, or 1 year of experience with an advanced degree. 1 year of experience with... 
    Full time

    Google Inc.

    Sunnyvale, CA
    5 days ago
  • $120k - $275k

     ...Software Engineer - Compiler Mountain View, CA MatX is on a mission to be the compute platform for AGI. We are developing vertically integrated full-stack solutions from silicon to systems including hardware and software to train and run the largest ML workloads... 
    Full time
    Work experience placement
    Work at office
    Local area
    Remote work
    Monday to Friday
    Flexible hours
    3 days per week

    MatX

    Mountain View, CA
    4 days ago
  •  ...US 2026 Software - Compiler Engineer Intern We are seeking a Compiler Engineer intern to support the Compiler Software team in developing the compiler and runtime for in‑memory machine learning accelerators at TetraMem. Responsibilities: Explore and support techniques... 
    Internship

    TetraMem Inc

    San Jose, CA
    5 days ago
  • $147.4k - $272.1k

    Software Development Engineer, Platforms Cupertino, California, United States Software and Services Imagine...  ...rotation schedule. Integrate GenAI/LLM capabilities to enhance developer...  ...software development processes such as compilation, unit testing, code analysis, release... 
    Relocation

    Apple Inc.

    Cupertino, CA
    5 days ago
  • $184k - $287.5k

     ...infrastructure, from training large-scale models to running inference in production. That position depends on software as much as hardware, and compiler engineering is a big part of what makes it work. What you’ll be doing: Designing and implementing compiler passes, IRs,... 
    Work experience placement

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $147k - $211k

    Software Engineer, Edge TPU Compilers, Silicon Location: Mountain View, CA, USA; Kirkland, WA, USA. Mid Experience driving progress, solving problems, and mentoring more junior team members; deeper expertise and applied knowledge within relevant area. Benefits In accordance... 
    Full time
    Temporary work

    Google Inc.

    Mountain View, CA
    3 days ago
  • $181.1k - $318.4k

    Senior Software Engineer: Agentic Evaluation Cupertino, California, United States — Machine Learning...  ...programming skills in one or more compiled languages (Swift, C++ or Objective‑C)....  ...conditions Experience evaluating ML, LLM or agent-based systems, including familiarity... 
    Relocation package
    Shift work

    Apple Inc.

    Cupertino, CA
    1 day ago
  •  ...creation of high-performance silicon chips and software content. Join us to transform the future...  .... You Are: You are an accomplished engineer with a passion for physical design and a...  ...‑leading EDA tools such as Fusion Compiler, PrimeTime, PrimeClosure, Formality, DSO... 

    Synopsys

    Sunnyvale, CA
    2 days ago
  • $212.8k

     ...technologies from network architecture, software defined networking (SDN), network virtualization...  ...network technologies to support AI/LLM applications. - Design and development...  ...Development, Computer Science, Computer Engineering, or a related technical discipline.... 
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    4 days ago
  •  ...Title: C++ Compiler & Graphics Engineer Location: Cupertino, CA (On-site) Duration: 6 months (possibility of extension) Implementation Partner...  ...C++ Developer with compiler experience Preferred C++ software-focused candidates Candidates from the gaming... 
    Work at office

    Wise Skulls

    Cupertino, CA
    2 days ago
  •  ...A leading cybersecurity firm is seeking a candidate to enhance LLM security by architecting monitoring and filtering systems. This role requires expertise in deploying AI systems, managing prompts, and safeguarding against emerging threats. The ideal candidate will have... 

    Fortinet

    Sunnyvale, CA
    4 days ago
  •  ...Title: AI Engineer (LLM, Agents & Data Engineering) Location: Austin, TX or Sunnyvale, CA (Hybrid) Duration: 6 months (possibility...  ...Required Qualifications: ~5+ years of software engineering experience with Python or similar programming languages... 

    Wise Skulls

    Sunnyvale, CA
    1 day ago
  • $140k - $210k

     ...make a difference at Fiserv. Job Title Senior Full Stack Software Engineer What does a successful Fullstack Engineer do at Clover?...  ...have: Familiarity with Google Cloud Pub/Sub, K8, Lambda run, LLM prompting techniques, Kotlin and Python Previous experience... 
    Work at office
    Worldwide
    Monday to Friday

    Fiserv

    Sunnyvale, CA
    5 days ago
  • $272k - $425.5k

    Principal Software Engineer – Large-Scale LLM Memory and Storage Systems page is loaded## Principal Software Engineer – Large-Scale LLM Memory and Storage Systemslocations: US, CA, Santa Clara: US, WA, Remote: US, MA, Remotetime type: Full timeposted on: Posted Todayjob... 
    Local area
    Remote work

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $196.5k - $219.3k

     ...implement functions to monitor and filter LLM requests/responses in real time,...  ...requirements into platform features. Mentor junior engineers on secure backend development and best...  ...Ensure the timely delivery of high-quality software features while adhering to project... 
    Full time
    Worldwide

    Fortinet

    Sunnyvale, CA
    1 day ago
  • $156k - $387.6k

     ...transform data platforms. We are seeking a highly skilled and innovative Software Engineer to join our cutting-edge Agentic Engine team. As part of a division of the data platform team which focuses on LLM adoption, you will have the opportunity to work with state-of-the-... 
    Temporary work
    Local area

    Tik Tok

    San Jose, CA
    1 day ago
  •  ...maintains a close and long-term relationship with our direct client. In support of their needs, we are looking for a Python Developer - LLM / AI Applications. Job Description: Job Title: Python Developer - LLM / AI Applications Job Type: Contract... 
    Contract work

    Maxonic

    Sunnyvale, CA
    5 days ago
  • $147.4k - $272.1k

    Software Engineer (Customer Success), Developer Engagement Cupertino, California, United States...  ...stakeholders Proficient in modern scripting or compiled languages (Python, Swift, C/C++, etc.)...  ...incorporating AI/ML capabilities or LLM‑based workflows into developer‑facing... 
    Relocation

    Apple Inc.

    Cupertino, CA
    2 days ago
  • $125k - $222k

     ...deploy these complex systems safely. Our software is used by top automotive OEMs, trucking...  ...agentic analytics. Our platform gives autonomy engineers complete visibility and control over the...  ...agents * Bonus: experience with LLM tool use, developing agentic workflows, or... 
    Full time
    For contractors
    For subcontractor
    Casual work
    Work at office
    Remote work
    Day shift

    Applied Intuition

    Sunnyvale, CA
    2 days ago
  •  ...computation. About the Role We are looking for a Software Engineer to join the ML Integration and Quality...  ...closely with engineers across runtime, compiler, kernel, and hardware teams to debug...  ...ML model deployment. Familiarity with LLM or multimodal model workloads.... 
    Work at office
    Remote work

    Dormont Manufacturing Company

    Sunnyvale, CA
    5 days ago
  •  ...CloudAct Inc. is looking for a backend engineer to enhance the FastAPI proxy. You will handle streaming, retries, and ensure the system...  ...especially in high-concurrency environments. Additional advantages include exposure to LLM APIs or streaming proxies. #J-18808-Ljbffr... 

    CloudAct Inc.

    Sunnyvale, CA
    5 days ago
  • $152k - $241.5k

     ...computing company”. We are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers for its Deep Learning & AI Compiler (DLC)...  ...frameworks, such as PyTorch, XLA etc. Understanding of LLM inference optimizations and techniques. GPU kernel... 

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer, LLM Compilation. Be the first to apply!