Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

ML Researcher: Agents & Efficient Inference

OpenReq

A leading AI tech company in Cupertino is seeking a research engineer to drive innovative ML projects. The ideal candidate will lead research directions that enhance compute efficiency and understanding through new search techniques. Strong skills in Python, PyTorch, and a deep knowledge of model architectures are essential. The role offers a unique opportunity to shape the future of AI technology through your contributions. #J-18808-Ljbffr OpenReq

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the ML Researcher: Agents & Efficient Inference in Cupertino, CA vacancy
  •  ...large contexts much more efficiently than GPUs. Sohu enables entirely new research directions and products...  ...real-time video, agents, speculative decoding,...  ...substantial impact in ML and/or CV (quality > quantity...  ...transformer training and inference Familiarity with... 
    Suggested

    OpenReq

    Cupertino, CA
    1 day ago
  • $147k - $211k

     ...practical experience. Experience in ML/AI, backed by a publication...  ...writing model training and inference pipelines. Preferred...  ...Google maintains a portfolio of research projects driven by fundamental...  ...Our team develops resource-efficient architectures, model training... 
    Suggested
    Full time

    Google Inc.

    Mountain View, CA
    4 days ago
  •  ...Applied Machine Learning Research Scientist Sunnyvale CA or Toronto...  ...-leading training and inference speeds and empowers machine learning...  ...effortlessly run large-scale ML applications, without the...  ...researchers to translate ML ideas into efficient, scalable implementation.... 
    Suggested
    Internship

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    1 day ago
  • $218.7k - $249.6k

     ...questions in real time, our applications of AI & ML are bringing humanity and simplicity to...  .... Our work touches every aspect of the research life cycle, from partnering with Academia...  ...scale both in terms of training data and inference volumes. Experience in delivering... 
    Suggested
    Local area
    Flexible hours

    Capital One National Association

    San Jose, CA
    4 days ago
  • $160.36k - $240.54k

     ...Nuro is looking for a Software Engineer in GenAI Infrastructure & Agent Systems based in Mountain View, CA. You will design and build platforms that enhance developer productivity and accelerate ML research. Candidates must have 4+ years of experience and proficiency in... 
    Suggested

    I did my part and supported the Regular Toilet

    Mountain View, CA
    4 days ago
  • A leading technology company is seeking a Machine Learning Researcher to join their team in Cupertino, California. The role involves developing cutting-edge algorithms and systems in deep learning, with a focus on real-world applications. Candidates should possess deep... 

    Apple Inc.

    Cupertino, CA
    5 days ago
  • $184k - $299k

    Senior Research Scientist, Efficient Deep Learning NVIDIA is searching for an outstanding Senior Researcher working on efficient deep learning to...  ..., NAS), efficient architecture design, adaptive/dynamic inference, resource‑efficient training and fine‑tuning, and so forth... 

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $179.5k - $260k

     ...work closely with product, AI/ML, and design teams to deliver...  ...Architect and implement scalable AI agent and backend systems for high‑...  ...models, optimise inference pipelines, and collect feedback...  ...with experienced engineers, researchers, and designers. Continuous learning... 
    Full time
    Flexible hours
    Night shift

    Fortinet, Inc.

    Sunnyvale, CA
    1 day ago
  •  ...customer intelligence, operational efficiency, and business growth. By...  ...Models (SLMs) to power AI Agents across the entire CX journey—...  ...interactions Architect low-latency inference pipelines integrating LLMs,...  ...closely with Applied AI / ML teams to productionize models... 
    Remote job
    Flexible hours

    StartX

    Mountain View, CA
    2 days ago
  •  ...AI Agent Data Pipeline Intern Santa Clara, CA XPENG is a leading smart technology...  ...with Machine Learning Engineers to improve efficiency, quality, and reliability of the...  ...ambiguous data sources and collaborating with ML and platform engineers to clarify requirements... 
    Internship

    XPENG

    Santa Clara, CA
    3 days ago
  • $207k - $300k

    Staff AI Research Scientist, Applied AI, Google Cloud corporate_fare Google place Sunnyvale...  ...learning algorithms and tools, or Applied ML (e.g., LLM's, Generative AI, NLP,...  ...global company and, in order to facilitate efficient collaboration and communication globally,... 
    Full time

    Google Inc.

    Sunnyvale, CA
    4 days ago
  • $38 - $46 per hour

     ...are looking for an outstanding AI Robotics Research Intern to join the team at NIO. This role...  ...modeling, or representation learning for agent-environment interactions. Practical experience...  ...). Proficiency in Python and modern ML frameworks (e.g., PyTorch, JAX, TensorFlow... 
    Full time
    Internship

    NIO

    San Jose, CA
    5 days ago
  • $152k - $241.5k

     ...Manager focused on Foundational AI Research to engage leading academic...  ...systems, training methods, inference systems, model serving, and...  ...boundaries of model performance, efficiency, and scale. The ideal...  ...NVIDIA participation in major AI, ML, and systems research venues... 

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $207k - $300k

     ...learning (e.g., sequential decision making), ML infrastructure, or specialization in...  ...technology forward. As a Staff Applied AI Agent Developer, you will be at the forefront...  ...global company and, in order to facilitate efficient collaboration and communication globally,... 
    Full time
    Flexible hours
    Shift work

    Google Inc.

    Sunnyvale, CA
    5 days ago
  •  ...The mission of the Waymo Research team is to develop machine learning...  ...modeling, Bayesian inference, hierarchical learning, and robust...  ...'ll: Work on open-ended ML research problems for realistic...  ...programming in Python with robust and efficient code It's preferred if... 
    Internship
    Summer internship
    Local area

    Waymo

    Mountain View, CA
    4 days ago
  • $150k - $290k

     ...individual under 8 U.S.C. 1324b(a)(3). Role: Machine Learning Researcher Location: 2550 N First Street Suite 250, San Jose, California...  ...0 + benefits Role Description We are seeking a talented ML Researcher / Research Engineer to advance our computer vision... 
    Permanent employment

    Rivet Industries

    San Jose, CA
    1 day ago
  • $236k - $339.2k

     ...MISSION We are seeking a Staff Research Scientist to serve as a...  ...intersection of autonomous agents and large language models (LLMs...  ...optimized for performance and efficiency Arctic-Text2SQL : Post-...  ...system optimization Arctic Inference : High-performance LLM... 
    Flexible hours
    Shift work

    Snowflake Computing

    Menlo Park, CA
    5 days ago
  • $167.3k - $250.9k

     ...Clara is seeking a highly motivated PhD to join their Agentic AI research team. You will collaborate with AI researchers to shape the...  ...applications. Your responsibilities include researching Multi-AI Agents systems, improving agent performance, and delivering reproducible... 

    Fujitsu

    Santa Clara, CA
    3 days ago
  •  ...push notifications. Conduct applied research to improve recommender systems using traditional ML techniques, deep learning and reinforcement...  ...pipelines for training, evaluation, and inference, ensuring models operate efficiently in production. Work closely with... 
    Work at office
    Remote work
    2 days per week

    Wayfair

    Mountain View, CA
    3 days ago
  •  ...Senior Principal AI Agent / ML Software Engineer The Senior Principal...  ...workflows, scalable inference infrastructure, and enterprise...  ...latency, high throughput, GPU efficiency, reliability, cost,...  ..., or emerging generative AI research. Experience building evaluation... 
    Flexible hours

    Oracle

    Santa Clara, CA
    4 days ago
  • $168k - $258.75k

     ...Product Manager, Local AI and Agents for Enterprise page is...  ...will define how developers, researchers, and enterprise teams build,...  ...their private data on-prem. Inference stacks like vLLM, SGLang, TensorRT...  ..., with meaningful time on AI/ML, developer tools, or infrastructure... 
    Local area
    Shift work

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $300k

     ...of Foundation Models We are a dedicated research lab for building, understanding, using, and...  .../NumPy/JAX or others). Convert them into efficient CUDA/Triton kernels with custom gradients...  ...software engineering skills on large ML codebases. Ownership of mixed- or low-precision... 
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    3 days ago
  • $99.6k - $234.6k

     ...Principal AI Agent / ML Software Engineer The Principal AI Agent / ML Software Engineer...  ..., autonomous workflows, scalable inference infrastructure, and enterprise AI applications...  ...for low latency, high throughput, GPU efficiency, reliability, cost, operability, and... 
    Temporary work
    Flexible hours

    Oracle

    Santa Clara, CA
    5 days ago
  • $238k - $302k

     ...ensure the behavior evaluation at Waymo is efficient, scientifically rigorous, and supporting...  ...metrics, sampling strategy, statistical inference, etc. Become an expert in Waymo's evaluation...  ...causal inference, etc); familiarity with ML systems/models Demonstrated knowledge... 
    Remote work

    Waymo

    Mountain View, CA
    4 days ago
  • $298k - $368k

     ...Collaborate closely with Perception engineers, ML researchers, the Labeling organization, and...  ...gaps, biases, and opportunities for more efficient data utilization. Establish and...  ...design, sampling methodologies, and causal inference. ~ A track record of mentoring other... 
    Remote work

    Waymo

    Mountain View, CA
    15 days ago
  • $163.5k - $212.4k

     ...About the Position We are seeking exceptional AI Robotics Researchers to join our next-generation intelligent systems team-...  ...multi-sensor fusion . Proficiency in Python and modern ML frameworks (e.g., PyTorch, JAX, TensorFlow), and robotics middleware... 
    Full time
    Temporary work
    Flexible hours

    NIO

    San Jose, CA
    2 days ago
  • $182.9k - $352.35k

     ...Scientists who excel at turning research breakthroughs into innovative...  ...models. Familiarity with inference optimization, performance...  ...world-class researchers and ML engineers to bring research ideas...  ...training, and memory-efficient strategies. Experience with... 
    Temporary work
    Local area
    Worldwide

    Adobe

    San Jose, CA
    1 day ago
  •  ...Models We are a dedicated research lab for building, understanding...  ...Summary As a member of the Agents team, you will tackle research...  ..., reinforcement learning, LLM inference, distributed machine learning...  ...published research and/or open-source ML software Visa Sponsorship... 
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    a month ago
  • $152k - $218.5k

    At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities...  ...Human-Centered AI Division (HCAI). We are an integrated team of ML researchers, behavior scientists, and human-computer interaction... 
    Local area
    Shift work

    Toyota Research Institute

    Los Altos, CA
    1 day ago
  • $136k - $219k

    RippleMatch Inc. is looking for a passionate Security Researcher to enhance our Cloud-Delivered Security Services team. You will play a vital...  ...include researching network countermeasures, applying ML methods for traffic analysis, and collaborating with experienced... 

    RippleMatch Inc.

    Santa Clara, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Researcher: Agents & Efficient Inference. Be the first to apply!