Software Engineer, ML Performance
$2,000 per monthOpenReq
About Etched
Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning. Software Engineer, ML Performance Running millions of tokens per second for large models (e.g Llama-3-70B) means running into new performance bottlenecks. Even with hardware optimization for the operations that usually bottleneck us (attention, kernel parallelism), we encounter novel bottlenecks and must design our own solutions to solve them. You will work closely with our hardware and software teams to identify and mitigate performance bottlenecks, enabling our chips to achieve unprecedented throughput and efficiency. Your work will involve a blend of low-level programming, performance profiling, and hands-on debugging, all aimed at maximizing the performance of our custom-built AI hardware. You will also play a key role in developing tools and methodologies to help our customers understand the full potential of our hardware. Representative projects:
Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning. Software Engineer, ML Performance Running millions of tokens per second for large models (e.g Llama-3-70B) means running into new performance bottlenecks. Even with hardware optimization for the operations that usually bottleneck us (attention, kernel parallelism), we encounter novel bottlenecks and must design our own solutions to solve them. You will work closely with our hardware and software teams to identify and mitigate performance bottlenecks, enabling our chips to achieve unprecedented throughput and efficiency. Your work will involve a blend of low-level programming, performance profiling, and hands-on debugging, all aimed at maximizing the performance of our custom-built AI hardware. You will also play a key role in developing tools and methodologies to help our customers understand the full potential of our hardware. Representative projects:
- Writing new kernels to improve throughput for LLM embedding
- Improving on PagedAttention to prevent fragmentation of the KV cache in memory
- Debugging hardware issues on a simulated or emulated chip
- Profile transformers running on our hardware, and fix bottlenecks
- Develop ways for customers to work with our chip and understand how their workloads will run on it.
- Have 5+ years of low-level programming experience
- Have a strong understanding of data flow and execution paths within embedded systems
- Pick up slack, even if it goes outside your job description
- Are results-oriented, and bias towards shipping products
- Understand SoC and computer system architecture, especially for CPU, interconnect, and memory subsystems
- Want to learn more about machine learning research
- GPU kernel profiling and low-level programming
- Transformer optimizations, such as FlashAttention
- Ongoing research in machine learning
- Palladium emulation
- Full medical, dental, and vision packages, with 100% of premium covered, 90% for dependents
- Housing subsidy of $2,000/month for those living within walking distance of the office
- Daily lunch and dinner in our office
- Relocation support for those moving to Cupertino
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Software Engineer, ML Performance in Cupertino, CA vacancy
- ...researchers, data scientists, and engineers, tackling the most... ...MBZUAI as a global hub for high-performance computing in deep learning, driving... ...with Researchers and ML Engineers to produce AI applications... ...’s responsible for the full software development life cycle, from...PerformanceVisa sponsorship
$120k - $170k
...Full Stack Software Engineer Sunnyvale, CA The future of defense will be decided by those who field intelligent... ...services and APIs Develop intuitive, high-performance frontend applications Integrate AI/ML capabilities into real user workflows Improve...PerformanceFull timeRelocation package- ...users to effortlessly run large-scale ML applications, without the hassle of managing... ...About The Role As a New Graduate Software Engineer, you will collaborate with world-class... ...software systems that directly impact performance, scalability, reliability, and...PerformanceInternship
$189.7k - $232.93k
...Analyze user needs and software requirements, develop solutions... ...specifications and determine performance standards; Develop scalable... ...infrastructure, robotics, and graphics engineers, as well as startup veterans,... ...simulation products, such as ML Sim Agent integration into...PerformanceFor contractorsFor subcontractor- ...Software Engineer We are looking for a software engineer with expertise in perception for autonomous... ...Who Has: ~ Experience using ML for uncertainty estimation, confidence... ...attainment, skill level requirements, interview performance, and the level and scope of the...PerformanceOdd jobFor contractorsFor subcontractor
- ...Software Engineer We are looking for a software engineer excited about delivering cutting-edge... ...and KPI visualization, coverage analysis, ML-based failure finding). We are looking... ...Build features to enable customers to perform software-driven validation including a unified...Performance
- ...Software Engineer We are looking for a Software Engineer with deep experience in optimizing... ...budgets while maintaining algorithmic performance, analyzing runtime behavior, and ensuring... ...conditions Collaborate closely with ML runtime optimization engineers to ensure...PerformanceFor contractorsFor subcontractor
$160k - $200k
...Senior Software Developer Join Fortinet as a Senior Software Developer... ...maintain Fortinet's GenAI/ML software systems. Direct... ...in large-scale and high-performance software design, architecture... ...Knowledge of professional software engineering practices, including version...PerformanceFull time$153k - $222k
...research effort by building ML tools, infrastructure, managing... ...at Applied, we encourage all engineers to take ownership over... ...next generation self-driving software Help scale end-to-end training... ...level requirements, interview performance, and the level and scope of...PerformanceFull timeFor contractorsFor subcontractorCasual workWork at officeRemote workDay shift$152k - $204k
...Senior Software Engineer, Inference Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential... ...CoreWeave combines superior infrastructure performance with deep technical expertise to... ...and performance. ~ Optimize end-to-end ML system performance by developing and tuning...PerformancePermanent employmentTemporary workCasual workWork at officeFlexible hoursShift work$109k - $145k
...Software Engineer, Observability CoreWeave is The Essential Cloud for AI™. Built for pioneers... ...CoreWeave combines superior infrastructure performance with deep technical expertise to... ...Kafka, Kafka Connect) Exposure to AI/ML infrastructure, including GPU-based systems...PerformancePermanent employmentTemporary workCasual workWork at officeFlexible hours$139k - $204k
...Senior Software Engineer, Cluster Orchestration CoreWeave is The Essential Cloud for AI™. Built... ...combines superior infrastructure performance with deep technical expertise to accelerate... ...workloads, GPU-based applications, or ML pipelines. Knowledge of scheduling concepts...PerformancePermanent employmentTemporary workCasual workWork at officeFlexible hours$125k - $245k
...Software Engineer Applied Intuition, Inc. is powering the future of physical AI. Founded in... .... The modules you develop must be high performance and state-of-the-art due to critical timing... ...onroad behavior software and leverage ML components to achieve highway and city...PerformanceOdd jobFull timeFor contractorsFor subcontractorCasual workWork at officeRemote workDay shift$170.6k - $261.3k
...Job Description As a Senior Software Engineer on the SimCore team, you will build and deploy applied AI/ML solutions that directly support simulation workflows, internal... ...models, and excel at building robust, high-performance inference pipelines. This role is not...PerformanceLocal areaWork from homeFlexible hours- ...-level results into clear feedback for engineering and leadership, and help accelerate validated... ...to introspect autonomous driving software performance atinterfaces across the autonomy stack;... ...Propose and develop new statistical and ML methods to quantify performance...PerformanceLocal areaRemote workWork from homeRelocationRelocation packageFlexible hours
$147.4k - $272.1k
Full Stack Software Engineer - Camera & Photos Tools & AI Team Cupertino, California, United States... ...analyses that characterize camera performance, and surfaces the results to the engineers... .... Evaluate, integrate, and maintain AI/ML models in production: monitoring for...PerformanceRelocationShift work- ...Description We are seeking an experienced engineer to work on distributed AI/ML systems. The role focuses on... ...Linux, kernel internals, and high‑performance code is essential. Experience with... ...Qualifications 3+ years of professional software development experience (non‑...PerformanceInternship
$167k
...expanding rapidly. We're looking for engineers who are passionate about building high-... .... Develop reliable, scalable, and high-performance software solutions. Write clean, maintainable, and... ...experience is preferred, however deep ML knowledge is not required For San Francisco...Performance$189.7k - $232.93k
...role Analyze user needs and software requirements, develop solutions... ...specifications and determine performance standards; Develop scalable... ..., robotics, and graphics engineers, as well as startup veterans,... ...simulation products, such as ML Sim Agent integration into tooling...Performance$151k - $240k
About the role As a Motion Planning Engineer on the Fallback Stack team, you will design and... ...will: Design and implement classical or ML motion planners for fallback and minimal-... ...tools, and dashboards to understand planner performance at scale Collaborate closely with...PerformanceOdd jobFull timeRemote work$187.74k - $225.29k
Employer: Uber Technologies, Inc. Job Title: Software Engineer Job Location: Sunnyvale, California Job... ...learning and deep learning; Common ML frameworks including TensorFlow or... ...statistical methods for evaluating model performance. Uber's mission is to reimagine the way...PerformanceFull timeWork at officeRemote work$141k - $202k
...Implement GenAI solutions, utilize ML infrastructure, and... ...preparation, optimization, and performance enhancements. Requirements: Bachelor... .... 2 years of experience with software development in one or more... ...the job: Google's software engineers develop the next-generation technologies...PerformanceFull time$207k - $300k
Software Engineer, GDC LLM Serving and GPU Performance Google Sunnyvale, CA, USA Qualifications Bachelor’s degree or equivalent practical experience. 8 years... ...reinforcement learning (e.g., sequential decision making), ML infrastructure, or specialization in another ML field...PerformanceFull time$166k - $244k
Senior Software Engineer, Infra, Vertex Gemini API+ Serving - Sunnyvale, CA, USA. About the job... ...solutions that meet the highest standards of performance, security, and reliability.... ...architecting production‑quality Machine Learning (ML) infrastructure. Experience in AI/ML...PerformanceFull time- .../ Walmart Job Title: Senior Software Engineer (Python) Location: Sunnyvale, CA... ...on observability, reliability, and performance. 2. System Design & Scalable Engineering... ...patterns. ~ Exposure to AI/ML frameworks such as PyTorch, TensorFlow,...Performance
$147k - $211k
...Python or C++. 1 year of experience with ML infrastructure (e.g., model deployment,... ...algorithms. About the job Google's software engineers develop the next-generation technologies... ...to data preparation, optimization, and performance enhancements. Google is proud to be an...PerformanceFull time$207k - $300k
...experience. 8 years of experience in software development. 5 years of... ...making), Machine learning (ML) infrastructure, or... ...qualifications Master’s degree or PhD in Engineering, Computer Science, or a... ...Analyze petabytes of telemetry and performance data to uncover insights that...PerformanceFull timeWorldwide$150k - $250k
...founders, research scientists, and engineering leads High-impact ownership... ...for improving model performance Startup environment where your... ...you, we’d love to connect. Software Engineer (Machine Learning) Location... ...Collaborate with AI/ML teams to build and deploy applications...PerformanceFull timeWork at officeLocal areaImmediate startRelocation packageFlexible hours- Position Summary Senior Software Engineer - TV SDK at Walmart, located in Sunnyvale, CA. You will... ...phased rollouts. Deep‑dive debug and performance profiling - memory, GPU bandwidth, GC pauses... ...). Experience with GenerativeAI / ML inference on device (ONNXRuntime, TensorRT...PerformanceFull timeTemporary workPart timeRemote work
$153k - $222k
...Machine Learning Engineer Applied Intuition, Inc. is powering the... ...learning pipelines and ML engineers that want to work beyond... ...degree in Computer Science, Software Engineering, or equivalent... ...level requirements, interview performance, and the level and scope of the...PerformanceFull timeFor contractorsFor subcontractorCasual workWork at officeRemote workDay shift
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Software Engineer, ML Performance. Be the first to apply!
Related searches
- software engineer contract Cupertino, CA
- software engineer healthcare Cupertino, CA
- network software engineer Cupertino, CA
- ngo software engineer Cupertino, CA
- software development engineer aws Cupertino, CA
- software developer fintech Cupertino, CA
- software data engineer Cupertino, CA
- senior software engineer remote Cupertino, CA
- intel software engineer Cupertino, CA
- software engineer Cupertino, CA


