Inference Intern

Etched

About Etched

Etched is building the world’s first AI inference system purpose-built for transformers – delivering over 10x higher performance and dramatically lower cost and latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents. Backed by hundreds of millions from top-tier investors and staffed by leading engineers, Etched is redefining the infrastructure layer for the fastest growing industry in history.

Job Summary

We are seeking a talented Architecture intern to join our team and contribute to the design of next-generation AI accelerators. This role focuses on developing and optimizing compute architectures that deliver exceptional performance and efficiency for transformer workloads. You will work on cutting-edge architectural problems and performance modeling over the course of your internship.

Key Responsibilities

  • Support porting state-of-the-art models to our architecture.
  • Help build programming abstractions and testing capabilities to rapidly iterate on model porting.
  • Assist in building, enhancing, and scaling Sohu’s runtime, including multi-node inference, intra-node execution, state management, and robust error handling.
  • Contribute to optimizing routing and communication layers using Sohu’s collectives.
  • Utilize performance profiling and debugging tools to identify bottlenecks and correctness issues.
  • Develop and leverage a deep understanding of Sohu to co-design both hardware instructions and model architecture operations to maximize model performance.
  • Implement high-performance software components for the Model Toolkit.

You May Be a Good Fit If You Have

  • Progress toward a Bachelor’s, Master’s, or PhD degree in computer science, computer engineering, applied mathematics, or a related field.
  • Proficiency in Python and C++.
  • Understanding of performance-sensitive or complex distributed software systems, e.g. Linux internals, accelerator architectures (GPUs, TPUs), compilers, or high-speed interconnects (NVLink, InfiniBand).
  • Ported applications to non-standard accelerator hardware or hardware platforms.
  • Deep knowledge of transformer model architectures and/or inference serving stacks (vLLM, SGLang, etc.).

Strong Candidates May Have Some Experience With

  • Proficiency in Rust.
  • Low-latency, high-performance applications using both kernel-level and user-space networking stacks.
  • Deep understanding of distributed systems concepts, algorithms, and challenges, including consensus protocols, consistency models, and communication patterns.
  • Solid grasp of Transformer architectures, particularly Mixture-of-Experts (MoE).
  • Building applications with extensive SIMD (single instruction, multiple data) optimizations for performance-critical paths.
  • Familiarity with PyTorch or JAX.
  • Math competitions (AIME, AMC, etc.).

Program Details

  • 12-week paid internship (June – August 2026).
  • Generous housing support for those relocating.
  • Daily lunch and dinner in our office.
  • Based at our office in San Jose, CA.
  • Direct mentorship from industry leaders and world-class engineers.
  • Opportunity to work on one of the most important problems of our time.

We encourage you to apply even if you do not believe you meet every qualification. For any questions, use the contact email address available through the original listing on click.appcast.io.

How We’re Different

Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs.

We are a fully in-person team in West San Jose, and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.

Vacancy posted 2 days ago