Software Engineer, ML Performance

$2,000 per month

OpenReq

About Etched
Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning.

Software Engineer, ML Performance

Running millions of tokens per second for large models (e.g., Llama-3-70B) means running into new performance bottlenecks. Even with hardware optimized for the operations that usually bottleneck us (attention, kernel parallelism), we encounter novel bottlenecks and must design our own solutions.

You will work closely with our hardware and software teams to identify and mitigate performance bottlenecks, enabling our chips to achieve unprecedented throughput and efficiency. Your work will involve a blend of low-level programming, performance profiling, and hands-on debugging, all aimed at maximizing the performance of our custom-built AI hardware.

You will also play a key role in developing tools and methodologies to help our customers understand the full potential of our hardware.

Representative projects:
  • Writing new kernels to improve throughput for LLM embedding
  • Improving on PagedAttention to prevent fragmentation of the KV cache in memory
  • Debugging hardware issues on a simulated or emulated chip
  • Profiling transformers running on our hardware and fixing bottlenecks
  • Developing ways for customers to work with our chip and understand how their workloads will run on it

You may be a good fit if you:
  • Have 5+ years of low-level programming experience
  • Have a strong understanding of data flow and execution paths within embedded systems
  • Pick up slack, even if it goes outside your job description
  • Are results-oriented and biased toward shipping products
  • Understand SoC and computer system architecture, especially for CPU, interconnect, and memory subsystems
  • Want to learn more about machine learning research

We encourage you to apply even if you do not believe you meet every single qualification.

Strong candidates may also have experience with:
  • GPU kernel profiling and low-level programming
  • Transformer optimizations, such as FlashAttention
  • Ongoing research in machine learning
  • Palladium emulation

How we're different:

Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs.

We are a fully in-person team in Cupertino, and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.

Benefits:
  • Full medical, dental, and vision packages, with 100% of premiums covered for employees and 90% for dependents
  • Housing subsidy of $2,000/month for those living within walking distance of the office
  • Daily lunch and dinner in our office
  • Relocation support for those moving to Cupertino
Vacancy posted 2 days ago