Research Intern, Inference (Fall 2026)
$58 - $63 per hourTogether AI
Research Intern, Inference (Fall 2026)
San Francisco
About The Role
The Inference Research team is dedicated to building the next generation of efficient, scalable, and reliable serving systems for large foundation models, directly contributing to the mission of advancing open and transparent AI. Our work operates at the critical intersection of cutting-edge model architectures, high-performance systems engineering, and deep hardware optimization. We focus on co-designing software, algorithms, and models to significantly lower the cost and latency of modern AI systems.
As a research intern, you will dive into the complexities of distributed inference, compiler-aware optimization, and novel inference-time computation strategies (such as speculative decoding and phase-aware execution). You will be tasked with co-designing and implementing cross-layer optimizations across models, systems, and hardware, with a focus on areas like KV cache design and large-scale serving architectures.
Projects aim to unlock unprecedented performance and scale for foundation models, enabling faster serving, larger model deployment (e.g., Mixture-of-Experts), and robust, reproducible evaluation under realistic serving workloads.
Responsibilities
- Design and conduct rigorous experiments to validate hypotheses
- Communicate the plans, progress, and results of projects to the broader team
- Document findings in scientific publications and blog posts
Requirements
- Currently pursuing a final year of Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or a related field
- Strong knowledge of Machine Learning and Deep Learning fundamentals
- Experience with deep learning frameworks (PyTorch, JAX, etc.)
- Strong programming skills in Python
- Familiarity with Transformer architectures and recent developments in foundation models
Preferred Qualifications
- Prior research experience in foundation models, efficient machine learning, or ML systems.
- Publications at leading conferences in machine learning or systems (i.e., MLSys, ICLR).
- Experience with CUDA programming (for kernel development)
- Understanding of model optimization techniques and hardware acceleration approaches
- Contributions to open-source machine learning projects
Internship Program Details
Our fall internship program spans over 12 to 16 weeks where you'll have the opportunity to work with industry-leading engineers building a cloud from the ground up and possibly contribute to influential open source projects. Our internship dates are September 14th to December 18th.
About Together AI
Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, Mamba, FlexGen, Petals, Mixture of Agents, and RedPajama.
Compensation
We offer competitive compensation, housing stipends, and other competitive benefits. The estimated US hourly rate for this role is $58-63/hr. Our hourly rates are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.
Equal Opportunity
Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.
Please see our privacy policy at
$9.7k - $19k
...for AI Safety (CAIS) is a leading research and field-building organization... .... As a research engineer intern here, you will work very closely... ...application is for the full-time fall internship position. Applications are due by May 29, 2026. You might be a good fit if you...InternshipFull timeLocal area- ...About Phonic Phonic is a product and research lab focused on powering the most realistic, human-like voice AI conversations.... ...intelligence. Our team includes top-tier AI researchers, international olympiad medalists, and former founders. Our customers include...InternshipWork at office
$62 per hour
...The Advanced Technology Group (ATG) is the research division of the company. ATG’s mission is... ...experiences. As an Acoustic Mapping Intern, you will be exposed to the technologies... ...submitting your application by June 26, 2026. Eligibility Working towards...InternshipHourly payFull timeLocal areaMonday to FridayFlexible hours$62 per hour
...design the future. The Advanced Technology Group (ATG) is the research division of the company. ATG’s mission is to look ahead, deliver... ...considered, we recommend submitting your application by June 26, 2026. Eligibility Currently enrolled in PhD program....InternshipHourly payFull timeLocal areaMonday to FridayFlexible hours$58 - $63 per hour
About The Role As a Systems Research Engineer Intern specialized in GPU Programming, you will play a crucial role in developing and optimizing GPU... ...and analytical skills Internship Program Details Our fall internship program spans over 12 to 16 weeks where you’ll have...InternshipHourly pay$62 per hour
...Advanced Technology Group (ATG) is the research division of the company. ATG’s mission is... ...Technology Group We are seeking exceptional interns to join our cutting-edge research at the... ...submitting your application by June 26, 2026. Eligibility Working towards a...InternshipHourly payFull timeLocal areaMonday to FridayFlexible hours$1,850 per week
Research Analyst Intern (Economics & Finance) - Summer 2026 San Francisco, CA. Please note that while we accept applications for our internship position starting in the fall, we will not begin actively contacting candidates for interviews until November 2025. Our Summer...InternshipSummer workCasual workSummer internshipWork at officeImmediate startRemote work$29 per hour
...operates its satellites out of its 153,000 sq. ft. headquarters in Northern California, USA. Assembly, Integration and Test Intern Engineer (Fall 2026) Internships at Astranis typically last for twelve weeks, and are hourly roles designed for students who are...InternshipHourly payPermanent employmentFull time$1,925 per week
...headquarters in Northern California, USA. FPGA Engineer Associate (Fall 2026) Associate positions at Astranis typically last for twelve... ...graduated from a four-year university, please apply to be an Intern. Role RTL Development for FPGA targeted applications...InternshipPermanent employmentFull time$1,925 per week
...California, USA. Electrical Integration Associate Engineer (Fall 2026) Associate Engineer positions typically last for twelve weeks... ...you are still a college student, please apply to join us as an Intern. Role Work on an interdisciplinary team to...InternshipPermanent employmentFull time$29 per hour
...Astranis designs, builds, and operates its satellites out of its 153,000 sq. ft. headquarters in Northern California, USA. Antenna Intern (Fall 2026) Internships at Astranis typically last for twelve weeks, and are hourly roles designed for students who are currently...InternshipHourly payPermanent employment$29 per hour
...Astranis designs, builds, and operates its satellites out of its 153,000 sq. ft. headquarters in Northern California, USA. FPGA Intern (Fall 2026) Internships at Astranis typically last for twelve weeks, and are hourly roles designed for students who are currently...InternshipHourly payPermanent employmentFull time$29 per hour
...and operates its satellites out of its 153,000 sq. ft. headquarters in Northern California, USA. Communications/DSP Engineer — Intern (Fall 2026) Internships at Astranis typically last for twelve weeks, and are hourly roles designed for students who are currently...InternshipHourly payPermanent employmentFull time$29 per hour
...more set to launch soon, the company services a backlog of more than $1 billion of commercial contracts. Mission Engineering Intern (Fall 2026) Internships at Astranis typically last twelve weeks, are hourly roles designed for students currently enrolled at a four‑year...InternshipHourly payPermanent employment$1,925 per week
...headquarters in Northern California, USA. Mission Engineering Associate (Fall 2026) Associate Engineer positions typically last for twelve weeks,... ...you are still a college student, please apply to join us as an Intern. Successful candidates will have a proven track record of...InternshipPermanent employmentFull time$24 per hour
...People Team Intern - HR Operations & AI Innovation (Fall 2026) In-Office At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world's largest networks that powers millions of websites and other Internet properties...InternshipFull timeSummer internshipWork at officeLocal area3 days per week$29 per hour
...more set to launch soon, the company is servicing a backlog of more than $1 billion of commercial contracts. Power Electronics Intern (Fall 2026) Internships at Astranis typically last for twelve weeks, and are hourly roles designed for students who are currently...InternshipHourly payPermanent employmentFull time$29 per hour
...builds, and operates its satellites out of its 153,000 sq. ft. headquarters in Northern California, USA. Production Quality Intern (Fall 2026) Four billion people do not have access to the internet. Astranis is going to change that. We are building the next...InternshipHourly payPermanent employment$29 per hour
Propulsion Manufacturing Intern (Fall 2026) Astranis builds advanced satellites for high orbits, expanding humanity’s reach into the solar system. Astranis satellites provide dedicated, secure networks to highly sophisticated customers across the globe — large enterprises...InternshipHourly payPermanent employment$29 per hour
...Software Engineer- Backend Intern (Fall 2026) Astranis builds advanced satellites for high orbits, expanding humanity's reach into the solar system. Today, Astranis satellites provide dedicated, secure networks to highly-sophisticated customers across the globe—large...InternshipHourly payPermanent employment$29 per hour
...out of its 153,000 sq. ft. headquarters in Northern California, USA. Embedded Software Developer, Network/Payload Software Intern (Fall 2026) Internships at Astranis typically last for twelve weeks, and are hourly roles designed for students who are currently enrolled...InternshipHourly payPermanent employmentFull time$29 per hour
...builds, and operates its satellites out of its 153,000 sq. ft. headquarters in Northern California, USA. Propulsion Manufacturing Intern (Fall 2026) Internships at Astranis typically last for twelve weeks, and are hourly roles designed for students who are currently...InternshipHourly payPermanent employmentFull time$29 per hour
...builds, and operates its satellites out of its 153,000 sq. ft. headquarters in Northern California, USA. CAD Engineer/Librarian Intern (Fall 2026) As an Intern, you will have an amazing opportunity to work on hard problems — we pride ourselves on giving everyone at...InternshipHourly payPermanent employmentFull time$29 per hour
...builds, and operates its satellites out of its 153,000 sq. ft. headquarters in Northern California, USA. Role Mechanical Engineer Intern (Fall 2026) Internships at Astranis typically last for twelve weeks, and are hourly roles designed for students who are currently...InternshipHourly payPermanent employment$29 per hour
...builds, and operates its satellites out of its 153,000 sq. ft. headquarters in Northern California, USA. Harness Design Engineer Intern (Fall 2026) Internships at Astranis typically last for twelve weeks, and are hourly roles designed for students who are currently...InternshipHourly payPermanent employmentFull time$1,925 per week
...headquarters in Northern California, USA. Flight Software - Associate (Fall 2026) Associate programs at Astranis typically last for twelve... ...you are still a college student, please apply to join us as an Intern. Role Work with the engineering team to design, write and...InternshipPermanent employmentFull time$29 per hour
Pow.bio is seeking a Mechanical Engineer Intern for Fall 2026 in San Francisco. This internship involves collaboration with the engineering team to design and test satellite hardware, conduct mechanical and thermal analysis, and enhance build processes. Candidates should...InternshipHourly pay$29 per hour
Astranis Space Technologies is seeking a Mechanical Engineer Intern for Fall 2026. This role offers a unique opportunity to engage in hands-on design and development of satellite hardware. The intern will work closely with the engineering team to design, evaluate, and test...InternshipHourly pay$1,925 per week
...headquarters in Northern California, USA. Avionics Associate Engineer (Fall 2026) Associate Engineer positions typically last for twelve weeks,... ...you are still a college student, please apply to join us as an Intern. Role Work with the avionics engineering team to build and...InternshipPermanent employmentFull time$29 per hour
Astranis Space Technologies Corp. is looking for an Antenna Intern for Fall 2026, ideally suited for students enrolled in a four-year university. This hourly internship involves leveraging advanced design tools to create high-performance communications antenna systems and...InternshipHourly pay
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Research Intern, Inference (Fall 2026). Be the first to apply!
- anthropology research San Francisco, CA
- research dietitian San Francisco, CA
- history research San Francisco, CA
- education policy research San Francisco, CA
- research pharmacist San Francisco, CA
- research professional San Francisco, CA
- student research intern San Francisco, CA
- research intern San Francisco, CA
- physics research San Francisco, CA
- pharmaceutical research San Francisco, CA

