Hunyuan Multimodal Algorithm Researcher (Omni-Modal)
$149k - $279.8kTencent
Omni Multimodal Large Models Research Scientist
Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers. TEG provides users with a full range of customer services. As the operator of the largest networking, devices, and data center in Asia, TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation.
What the Role Entails
1. Conduct research and development of Omni multimodal large models, including the design and construction of training data, foundational model algorithm design, optimization related to pre-training/SFT/RL, model capability evaluation, and exploration of downstream application scenarios.
2. Scientifically analyze challenges in R&D, identify bottlenecks in model performance, and devise solutions based on first principles to accelerate model development and iteration, ensuring competitiveness and leading-edge performance.
3. Explore diverse paradigms for achieving Omni-modal understanding and generation capabilities, research next-generation model architectures, and push the boundaries of multimodal models.
Who We Look For
1. Bachelor's degree (full-time preferred) or higher in Computer Science, Artificial Intelligence, Mathematics, or related fields; graduate degrees are prioritized.
2. Hands-on experience in large-scale multimodal data processing and high-quality data generation is highly preferred.
3. Solid foundation in deep learning algorithms and practical experience in large model development; familiarity with Diffusion Models and Autoregressive Models is advantageous. Publication in top-tier conferences or experience in cross-modal (e.g., audio-visual) research is preferred.
4. Proficiency in underlying implementation details of deep learning networks and operators, model tuning for training/inference, CPU/GPU acceleration, and distributed training/inference optimization; practical experience is a plus.
5. Participation in ACM or NOI competitions is highly valued.
6. Strong learning agility, communication skills, teamwork, and curiosity.
Location State(s): US-California-Palo Alto
The expected base pay range for this position in the location(s) listed above is $149,000.00 to $279,800.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience. Employees hired for this position may be eligible for a sign on payment, relocation package, and restricted stock units, which will be evaluated on a case-by-case basis. Subject to the terms and conditions of the plans in effect, hired applicants are also eligible for medical, dental, vision, life and disability benefits, and participation in the Company's 401(k) plan. The Employee is also eligible for up to 15 to 25 days of vacation per year (depending on the employee's tenure), up to 13 days of holidays throughout the calendar year, and up to 10 days of paid sick leave per year. Your benefits may be adjusted to reflect your location, employment status, duration of employment with the company, and position level. Benefits may also be pro-rated for those who start working during the calendar year.
Equal Employment Opportunity at Tencent
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
- A leading technology company is seeking a Machine Learning Researcher to join their team in Cupertino, California. The role involves developing cutting-edge algorithms and systems in deep learning, with a focus on real-world applications. Candidates should possess deep...Suggested
$180k
...important. All engineers and researchers are expected to have strong... .... About the Role The multimodal team at xAI creates magical... ...of content across various modalities, including image, video, and... ...implementing effective and efficient algorithms for achieving state-of-the-...SuggestedLocal areaRelocation$176k - $253k
...Job Description Job Description At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’... ...experiments to evaluate a wide range of architectural, data, and algorithmic choices, and help shape how we apply modern machine learning...SuggestedWork experience placementInternshipLocal areaShift work$192k - $356.5k
NVIDIA is seeking a Senior Research Scientist focused on Multimodal Foundation Models in Santa Clara, California. You will design and implement AI algorithms for humanoid robots, significantly impacting the development of general-purpose embodied agents. The role requires...Suggested$192k - $304.75k
We’re now looking for a Senior Research Scientist, Multi-Modal Language Models! NVIDIA is seeking a Senior Research Scientist passionate about multi... ...Solid background in computer science fundamentals: algorithms, data structures, parallel/distributed computing, and systems...Suggested- A leading technology company is seeking a Senior Research Scientist to focus on Multimodal Foundation Models and Robotics. This position involves designing AI algorithms for humanoid robots, developing training methods for foundation models, and working with a collaborative...
$152k - $218.5k
At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities... ...as pythonocc-core for primitive fitting. Experience with multimodal generative models for boundary representations. Track record...Local areaShift work- A leading technology company based in Santa Clara is seeking a researcher with expertise in Machine Learning and AI. This role focuses on developing foundational models for image and video generation. Candidates should possess a PhD, along with experience in computer vision...
$238.9k - $292.5k
...Advanced Technology Group (ATG) is the research division of the company. ATG’s mission... ...electrical engineering, such as AI/ML, algorithms, digital signal processing, audio engineering... .... Applications in vision, audio, or multimodal domains (e.g., source separation,...Full timeLocal areaWorldwideFlexible hours$113k - $150k
Job Summary Student Researcher, PhD, Winter/Summer 2025 job at Google. Mountain View, CA. Applications will be reviewed on a rolling... ...Interactions, Computer Vision, Machine Learning, Deep Learning, Algorithmic Foundations of Optimization, Quantum Information Science,...Full timeSummer workInternshipRemote workFlexible hours$151.8k - $211.3k
We are inviting motivated researchers and engineers to join our team in Bay Area. The team works on developing cutting edge technologies... ...example areas include (but not limited to) video compression algorithms, future video coding standards, learned image and video...Work experience placementRelocation package- ...About the Role As an AI Researcher for Computer Vision & Autonomous Robots at TCS, you’... ...implement computer vision and machine learning algorithms that power autonomous robots, humanoids... .... From visual perception and SLAM to multimodal sensor fusion and reinforcement...Full time
$193.93k - $352.29k
...role, you will collaborate closely with researchers and engineers on the Learned Behavior... ...matching, energy-based models, and SoTA algorithms—in order to develop novel solutions that... ...reasoning, decision making and multi-modality generation. Optimize generative models...Immediate startFlexible hours- A leading technology research lab in California is seeking motivated researchers and engineers to join their team working on advanced... ...coding skills in C/C++. The role includes researching innovative algorithms for video compression, contributing to academic advancements,...
$200k - $300k
...Type Hybrid Department AI Perplexity is seeking top-tier AI Research Scientists and Engineers to advance our AI products and capabilities... ...to enhance SOTA models Turn research ideas into algorithms and run experiments to launch new models Infrastructure & Implementation...Full time$50 per hour
...About Position: Job Title: UX Researcher Location: Mountain View , CA - Hybrid Pay rate: $50/Hr. Duration: 12 Months with... ...with repairability in mind, augmented reality hardware, new multi-modal interaction models, and bringing personal and GenAI features to...Contract work$100k - $300k
...Job Description Job Description OPPO Research Center is seeking a passionate and... ...the design, training, and deployment of multimodal intelligence models that seamlessly integrate... ...collaborative team, you will co-design algorithms and hardware, translate research into...Full time$180k
...important. All engineers and researchers are expected to have... ...About the Role The omni team at xAI creates... ...across various modalities, including image, video... ...researcher/engineer on multimodal, you will develop and... ...Hardware / Software / Algorithm co-design. Ideal Experiences...Temporary workRelocation$140k - $195k
...Center About the Team : Our AI Research team, reporting directly to the Chief AI... ...generation of intelligent systems-integrating multimodal foundation models, generative AI,... ...deep learning pipelines that handle multi-modal sensor data (vision, force/torque,...Work at officeLocal areaWork from homeRelocation packageFlexible hours$142.8k - $274.8k
...contentquality stack, combining LLMs, multimodal models, and largescale recommender systems... ..., safety, and diversity across modalities; reduce misinformation/toxicity error rates... ....g., statistics, predictive analytics, research) OR Master's Degree in Statistics...Ongoing contractWork at officeLocal area$110.8k - $253.4k
...through our expertise, tools, and products, accelerating the advancement of gaming technologies worldwide. Responsibilities: Research and analyze the latest advancements in gaming AI technologies, including but not limited to AI-driven 2D/3D content generation,...WorldwideRelocation package- The Institute of Foundation Models in Sunnyvale, California is seeking a Research Scientist for their Vision Language Model team. This role involves advancing state-of-the-art multimodal foundation models and developing large-scale systems combining visual understanding...
$180k - $260k
...worst. There is compelling research showing that smarterdataselection... ...data curation across modalities. This role is based in Redwood... ...You’ll design and implement algorithms to generate and improve instruction... ...large vision, language, and multimodal models Post-training...Work at officeRelocation package$192k - $356.5k
NVIDIA Gruppe is looking for a Senior Research Scientist specializing in Multi-Modal Language Models to enhance our open-source multi-modal models. You will collaborate with researchers and engineers, developing and improving model capabilities, ensuring they perform effectively...- Somi AI is seeking a Machine Learning Engineer with expertise in Computer Vision to contribute to the development of advanced systems using the Waymo Driver technology. Candidates should possess a Bachelor’s degree and 3+ years of relevant experience, particularly in Python...
- ...around the car. We work jointly with downstream teams on the optimization and integration into the Waymo Driver. We conduct our own research to address real-world problems and collaborate with research teams at Alphabet. We have access to millions of miles of driving...Full timeTemporary workRemote work
- Simular Inc. is looking for a Research Scientist to pioneer new research directions in AI and execute end-to-end experiments. You will collaborate with engineers and contribute to the AI research community through publications. The ideal candidate will have a PhD in Computer...
$165k - $195k
...Description Job Description Company Description The Bosch Research and Technology Center North America with offices in Sunnyvale,... ...other Bosch products. Architect, design and validate multi-modal deep learning and Timeseries Foundation Models (TSFM) to work...Full timeWork experience placementLocal areaWorldwide$190k - $250k
...autonomous trucks. We are looking for a research scientist to lead the design and... ...approaches Build methods for joint multimodal generation that maintain cross-sensor consistency... ...driving data across multiple sensor modalities What you'll bring: PhD in Computer...Temporary workWork at officeVisa sponsorshipFlexible hours- ...contexts much more efficiently than GPUs. Sohu enables entirely new research directions and products. When our chips come out, these use... ...-time video, agents, speculative decoding, or new tree search algorithms, we must create the market for our hardware. In this role, you...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Hunyuan Multimodal Algorithm Researcher (Omni-Modal). Be the first to apply!
- court researcher Palo Alto, CA
- security researcher Palo Alto, CA
- design researcher Palo Alto, CA
- machine learning researcher Palo Alto, CA
- data collection researcher Palo Alto, CA
- music researcher Palo Alto, CA
- human factors researcher Palo Alto, CA
- researcher Palo Alto, CA
- image processing researcher
- photonics researcher



