Acceleration Kernel Developer Intern
Tenstorrent University
Santa Clara, California, United States
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists has developed a high-performance RISC-V CPU from scratch and shares a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.
As an Acceleration Kernel Developer at Tenstorrent, you will play a crucial role in optimizing low-level workloads, kernel development, and enhancing our software's performance for machine learning applications. You will work closely with a team of highly skilled engineers to ensure that our software operates at peak efficiency, delivering high-quality results to our clients and users.
This role is hybrid, based out of Toronto, Ontario.
Responsibilities:
- Kernel Development: Participate in the design, development, and maintenance of kernel-level software components for our applications. Develop and optimize kernels and kernel libraries for efficient machine learning and HPC applications.
- Implement kernels that optimize tensor compute and tensor data movement.
- Heavy focus on optimizations.
- Low-Level Optimization: Analyze and optimize low-level code to improve the performance and efficiency of our software, with a strong emphasis on tensor optimization.
- Machine Learning Integration: Collaborate with machine learning engineers and data scientists to integrate optimized kernels and low-level routines into machine learning frameworks and pipelines.
- Performance Profiling: Identify performance bottlenecks, conduct performance profiling, and develop strategies to address and resolve them.
- Testing and Debugging: Write comprehensive unit tests, conduct thorough debugging, and ensure the stability and reliability of kernel-level code.
- Infrastructure: Develop infrastructure to analyze functional test and performance test data.
- Documentation: Create clear and concise documentation for code, APIs, and best practices to facilitate collaboration within the team.
- Research and Innovation: Stay up-to-date with the latest developments in kernel development, tensor optimization, and machine learning to propose innovative solutions and improvements.
Experience & Qualifications:
- Currently enrolled in a bachelor's, master's, or doctoral program in Computer Science, Software Engineering, Computer Engineering, Engineering Science, or a related field.
- A strong understanding of algorithm implementation and low-level algorithm optimization.
- Experience with parallel acceleration development, such as parallel algorithms on CPU, or GPU acceleration.
- High degree of proficiency in C/C++ and Python programming languages.
- Familiarity with machine learning frameworks and concepts.
- Strong problem-solving skills and the ability to analyze and debug complex issues.
- Experience with debugging, performance profiling, and optimization tools.
- Excellent communication and teamwork skills.
- Self-motivated, detail-oriented, and able to work independently as well as in a team.
- Familiarity with infrastructure built with cloud services is a plus (e.g. AWS).
Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.
$40 - $70 per hour

