ML Researcher: Agents & Efficient Inference
OpenReq
A leading AI tech company in Cupertino is seeking a research engineer to drive innovative ML projects. The ideal candidate will lead research directions that enhance compute efficiency and understanding through new search techniques. Strong skills in Python, PyTorch, and a deep knowledge of model architectures are essential. The role offers a unique opportunity to shape the future of AI technology through your contributions.
- ...large contexts much more efficiently than GPUs. Sohu enables entirely new research directions and products... ...real-time video, agents, speculative decoding,... ...substantial impact in ML and/or CV (quality > quantity... ...transformer training and inference Familiarity with...
Suggested
$147k - $211k
...practical experience. Experience in ML/AI, backed by a publication... ...writing model training and inference pipelines. Preferred... ...Google maintains a portfolio of research projects driven by fundamental... ...Our team develops resource-efficient architectures, model training...
Suggested, Full time
- ...Applied Machine Learning Research Scientist Sunnyvale CA or Toronto... ...-leading training and inference speeds and empowers machine learning... ...effortlessly run large-scale ML applications, without the... ...researchers to translate ML ideas into efficient, scalable implementation....
Suggested, Internship
$218.7k - $249.6k
...questions in real time, our applications of AI & ML are bringing humanity and simplicity to... .... Our work touches every aspect of the research life cycle, from partnering with Academia... ...scale both in terms of training data and inference volumes. Experience in delivering...
Suggested, Local area, Flexible hours
$160.36k - $240.54k
...Nuro is looking for a Software Engineer in GenAI Infrastructure & Agent Systems based in Mountain View, CA. You will design and build platforms that enhance developer productivity and accelerate ML research. Candidates must have 4+ years of experience and proficiency in...
Suggested
- A leading technology company is seeking a Machine Learning Researcher to join their team in Cupertino, California. The role involves developing cutting-edge algorithms and systems in deep learning, with a focus on real-world applications. Candidates should possess deep...
$184k - $299k
Senior Research Scientist, Efficient Deep Learning NVIDIA is searching for an outstanding Senior Researcher working on efficient deep learning to... ..., NAS), efficient architecture design, adaptive/dynamic inference, resource‑efficient training and fine‑tuning, and so forth...
$179.5k - $260k
...work closely with product, AI/ML, and design teams to deliver... ...Architect and implement scalable AI agent and backend systems for high‑... ...models, optimise inference pipelines, and collect feedback... ...with experienced engineers, researchers, and designers. Continuous learning...
Full time, Flexible hours, Night shift
- ...customer intelligence, operational efficiency, and business growth. By... ...Models (SLMs) to power AI Agents across the entire CX journey—... ...interactions Architect low-latency inference pipelines integrating LLMs,... ...closely with Applied AI / ML teams to productionize models...
Remote job, Flexible hours
- ...AI Agent Data Pipeline Intern Santa Clara, CA XPENG is a leading smart technology... ...with Machine Learning Engineers to improve efficiency, quality, and reliability of the... ...ambiguous data sources and collaborating with ML and platform engineers to clarify requirements...
Internship
$207k - $300k
Staff AI Research Scientist, Applied AI, Google Cloud corporate_fare Google place Sunnyvale... ...learning algorithms and tools, or Applied ML (e.g., LLM's, Generative AI, NLP,... ...global company and, in order to facilitate efficient collaboration and communication globally,...
Full time
$38 - $46 per hour
...are looking for an outstanding AI Robotics Research Intern to join the team at NIO. This role... ...modeling, or representation learning for agent-environment interactions. Practical experience... ...). Proficiency in Python and modern ML frameworks (e.g., PyTorch, JAX, TensorFlow...
Full time, Internship
$152k - $241.5k
...Manager focused on Foundational AI Research to engage leading academic... ...systems, training methods, inference systems, model serving, and... ...boundaries of model performance, efficiency, and scale. The ideal... ...NVIDIA participation in major AI, ML, and systems research venues...
$207k - $300k
...learning (e.g., sequential decision making), ML infrastructure, or specialization in... ...technology forward. As a Staff Applied AI Agent Developer, you will be at the forefront... ...global company and, in order to facilitate efficient collaboration and communication globally,...
Full time, Flexible hours, Shift work
- ...The mission of the Waymo Research team is to develop machine learning... ...modeling, Bayesian inference, hierarchical learning, and robust... ...'ll: Work on open-ended ML research problems for realistic... ...programming in Python with robust and efficient code It's preferred if...
Internship, Summer internship, Local area
$150k - $290k
...individual under 8 U.S.C. 1324b(a)(3). Role: Machine Learning Researcher Location: 2550 N First Street Suite 250, San Jose, California... ...0 + benefits Role Description We are seeking a talented ML Researcher / Research Engineer to advance our computer vision...
Permanent employment
$236k - $339.2k
...MISSION We are seeking a Staff Research Scientist to serve as a... ...intersection of autonomous agents and large language models (LLMs... ...optimized for performance and efficiency Arctic-Text2SQL : Post-... ...system optimization Arctic Inference : High-performance LLM...
Flexible hours, Shift work
$167.3k - $250.9k
...Clara is seeking a highly motivated PhD to join their Agentic AI research team. You will collaborate with AI researchers to shape the... ...applications. Your responsibilities include researching Multi-AI Agents systems, improving agent performance, and delivering reproducible...
- ...push notifications. Conduct applied research to improve recommender systems using traditional ML techniques, deep learning and reinforcement... ...pipelines for training, evaluation, and inference, ensuring models operate efficiently in production. Work closely with...
Work at office, Remote work, 2 days per week
- ...Senior Principal AI Agent / ML Software Engineer The Senior Principal... ...workflows, scalable inference infrastructure, and enterprise... ...latency, high throughput, GPU efficiency, reliability, cost,... ..., or emerging generative AI research. Experience building evaluation...
Flexible hours
$168k - $258.75k
...Product Manager, Local AI and Agents for Enterprise page is... ...will define how developers, researchers, and enterprise teams build,... ...their private data on-prem. Inference stacks like vLLM, SGLang, TensorRT... ..., with meaningful time on AI/ML, developer tools, or infrastructure...
Local area, Shift work
$300k
...of Foundation Models We are a dedicated research lab for building, understanding, using, and... .../NumPy/JAX or others). Convert them into efficient CUDA/Triton kernels with custom gradients... ...software engineering skills on large ML codebases. Ownership of mixed- or low-precision...
Visa sponsorship
$99.6k - $234.6k
...Principal AI Agent / ML Software Engineer The Principal AI Agent / ML Software Engineer... ..., autonomous workflows, scalable inference infrastructure, and enterprise AI applications... ...for low latency, high throughput, GPU efficiency, reliability, cost, operability, and...
Temporary work, Flexible hours
$238k - $302k
...ensure the behavior evaluation at Waymo is efficient, scientifically rigorous, and supporting... ...metrics, sampling strategy, statistical inference, etc. Become an expert in Waymo's evaluation... ...causal inference, etc); familiarity with ML systems/models Demonstrated knowledge...
Remote work
$298k - $368k
...Collaborate closely with Perception engineers, ML researchers, the Labeling organization, and... ...gaps, biases, and opportunities for more efficient data utilization. Establish and... ...design, sampling methodologies, and causal inference. ~ A track record of mentoring other...
Remote work
$163.5k - $212.4k
...About the Position We are seeking exceptional AI Robotics Researchers to join our next-generation intelligent systems team-... ...multi-sensor fusion . Proficiency in Python and modern ML frameworks (e.g., PyTorch, JAX, TensorFlow), and robotics middleware...
Full time, Temporary work, Flexible hours
$182.9k - $352.35k
...Scientists who excel at turning research breakthroughs into innovative... ...models. Familiarity with inference optimization, performance... ...world-class researchers and ML engineers to bring research ideas... ...training, and memory-efficient strategies. Experience with...
Temporary work, Local area, Worldwide
- ...Models We are a dedicated research lab for building, understanding... ...Summary As a member of the Agents team, you will tackle research... ..., reinforcement learning, LLM inference, distributed machine learning... ...published research and/or open-source ML software Visa Sponsorship...
Visa sponsorship
$152k - $218.5k
At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities... ...Human-Centered AI Division (HCAI). We are an integrated team of ML researchers, behavior scientists, and human-computer interaction...
Local area, Shift work
$136k - $219k
RippleMatch Inc. is looking for a passionate Security Researcher to enhance our Cloud-Delivered Security Services team. You will play a vital... ...include researching network countermeasures, applying ML methods for traffic analysis, and collaborating with experienced...