Hunyuan Multimodal Algorithm Researcher (Omni-Modal)
$149k - $279.8kTencent
Omni Multimodal Large Models Research Scientist
Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers. TEG provides users with a full range of customer services. As the operator of the largest networking, devices, and data center in Asia, TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation.
What the Role Entails
1. Conduct research and development of Omni multimodal large models, including the design and construction of training data, foundational model algorithm design, optimization related to pre-training/SFT/RL, model capability evaluation, and exploration of downstream application scenarios.
2. Scientifically analyze challenges in R&D, identify bottlenecks in model performance, and devise solutions based on first principles to accelerate model development and iteration, ensuring competitiveness and leading-edge performance.
3. Explore diverse paradigms for achieving Omni-modal understanding and generation capabilities, research next-generation model architectures, and push the boundaries of multimodal models.
Who We Look For
1. Bachelor's degree (full-time preferred) or higher in Computer Science, Artificial Intelligence, Mathematics, or related fields; graduate degrees are prioritized.
2. Hands-on experience in large-scale multimodal data processing and high-quality data generation is highly preferred.
3. Solid foundation in deep learning algorithms and practical experience in large model development; familiarity with Diffusion Models and Autoregressive Models is advantageous. Publication in top-tier conferences or experience in cross-modal (e.g., audio-visual) research is preferred.
4. Proficiency in underlying implementation details of deep learning networks and operators, model tuning for training/inference, CPU/GPU acceleration, and distributed training/inference optimization; practical experience is a plus.
5. Participation in ACM or NOI competitions is highly valued.
6. Strong learning agility, communication skills, teamwork, and curiosity.
Location State(s): US-California-Palo Alto
The expected base pay range for this position in the location(s) listed above is $149,000.00 to $279,800.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience. Employees hired for this position may be eligible for a sign on payment, relocation package, and restricted stock units, which will be evaluated on a case-by-case basis. Subject to the terms and conditions of the plans in effect, hired applicants are also eligible for medical, dental, vision, life and disability benefits, and participation in the Company's 401(k) plan. The Employee is also eligible for up to 15 to 25 days of vacation per year (depending on the employee's tenure), up to 13 days of holidays throughout the calendar year, and up to 10 days of paid sick leave per year. Your benefits may be adjusted to reflect your location, employment status, duration of employment with the company, and position level. Benefits may also be pro-rated for those who start working during the calendar year.
Equal Employment Opportunity at Tencent
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
$180k
...important. All engineers and researchers are expected to have strong... .... About the Role The multimodal team at xAI creates magical... ...of content across various modalities, including image, video, and... ...implementing effective and efficient algorithms for achieving state-of-the-...SuggestedLocal areaRelocation$181.1k - $318.4k
...AIML - Machine Learning Researcher - Multimodal Agent The AIML Multimodal Foundation Model Team is pioneering next-generation intelligent agent technologies that combine multimodal reasoning, tool-use, and visual understanding. Our innovative features redefine how hundreds...SuggestedRelocation- A leading technology company is seeking a Machine Learning Researcher to join their team in Cupertino, California. The role involves developing cutting-edge algorithms and systems in deep learning, with a focus on real-world applications. Candidates should possess deep...Suggested
$182.5k - $343.2k
...immersive content creation. We are looking for exceptional researchers and engineers who are passionate about advancing multimodal intelligence and world models at scale. Responsibility: 1. Research and develop core algorithms for Generative Game Engine technologies, including...SuggestedFull timeRelocation package- ...advance AI/ML systems for mobility services Conducting applied research in Agentic AI, including agent design, orchestration... ...adaptability of agent-based systems Prototyping frameworks and algorithms in Python using tools such as AutoGen, LangChain, or custom orchestration...Suggested
$180k
...important. All engineers and researchers are expected to have... ...About the Role The omni team at xAI creates... ...across various modalities, including image, video... ...researcher/engineer on multimodal, you will develop and... ...Hardware / Software / Algorithm co-design. Ideal Experiences...Temporary workRelocation$200k - $287.5k
...At Toyota Research Institute (TRI), we're on a mission to improve the quality of human life. We're developing new tools and capabilities... ...architectures, and integrating visual-language-action modalities. Beyond refining models for closed-loop driving on public roads...Local areaShift work$147.4k - $272.1k
...AIML - Machine Learning Researcher, Foundation Models We are a group of engineers and... ...text, images, speech, videos, and other modalities and apply these models to Apple products... ...who are passionate about developing algorithms, techniques, and systems that push the...Relocation$181.1k - $318.4k
...Senior Applied ML Researcher - Video Apps We are seeking a Senior Applied ML Researcher... ...computer vision, audio signal processing, and multimodal learning, enabling intelligent systems... ...-visual representation learning, cross-modal alignment, and fusion. Develop solutions...Relocation- ...About the Role As an AI Researcher for Computer Vision & Autonomous Robots at TCS, you’... ...implement computer vision and machine learning algorithms that power autonomous robots, humanoids... .... From visual perception and SLAM to multimodal sensor fusion and reinforcement...Full time
$118k - $157k
Student Researcher, PhD, Winter/Summer 2026 About the job The Student Researcher Program fosters academic collaborations by hiring students... ..., Computer Vision, Machine Learning, Deep Learning, Algorithmic Foundations of Optimization, Quantum Information Science, Data...Full timeTemporary workSummer workInternship$151.8k - $211.3k
We are inviting motivated researchers and engineers to join our team in Bay Area. The team works on developing cutting edge technologies... ...example areas include (but not limited to) video compression algorithms, future video coding standards, learned image and video...Work experience placementRelocation package$193.93k - $352.29k
...Senior/Staff Machine Learning Research Scientist: Generative Modeling for Planning Mountain... ...matching, energy-based models, and SoTA algorithms—in order to develop novel solutions that... ...reasoning, decision making and multi-modality generation. Optimize generative...$50 per hour
...About Position: Job Title: UX Researcher Location: Mountain View , CA - Hybrid Pay rate: $50/Hr. Duration: 12 Months with... ...with repairability in mind, augmented reality hardware, new multi-modal interaction models, and bringing personal and GenAI features to...Contract work- ..., co-author of the Transformer (“the T” in ChatGPT) and a key researcher behind OpenAI’s reasoning models. Pathway is headquartered in... ...design, and communication. Have a good understanding of graph algorithms. Have some familiarity with model monitoring, git ,...Permanent employmentFull timeContract workImmediate startRemote workFlexible hours
- A leading technology firm in California is seeking a passionate Research Scientist to advance next-generation AI hardware platforms. The role involves developing multimodal intelligence models, benchmarking innovative LLM architectures, and collaborating across teams to...
- A leading technology research lab in California is seeking motivated researchers and engineers to join their team working on advanced... ...coding skills in C/C++. The role includes researching innovative algorithms for video compression, contributing to academic advancements,...
$180k - $250k
...excited for what’s next! We have an opening in our lab for a senior computer vision researcher, with 3D Reconstruction and Deep Learning expertise, to develop and improve the underlying algorithms powering our consumer products. We are looking for highly-motivated, creative,...Full timeRemote work$100k - $300k
Overview OPPO Research Center is seeking a passionate and innovative Research Scientist... ...the design, training, and deployment of multimodal intelligence models that seamlessly... ...collaborative team, you will co-design algorithms and hardware, translate research into real...Full time$207k - $300k
Research Scientist, Gemini Retrieval and Agera, DeepMind Mountain View, CA, USA Required... ...competitions, or developing novel algorithms. Ability to quickly prototype and iterate... ...generation retrieval architectures and natively multimodal Gemini representations to improve...Full time$158k - $304k
...the role We are looking for a passionate Research Scientist to join the Research Team at... ...and engineering teams to test and deploy algorithms to our autonomy and ADAS products We’re... ...autonomy stack Reinforcement learning Multi‑modal foundation models Multi‑view perception...Full timeFor contractorsFor subcontractorCasual workWork at officeImmediate startRemote workDay shift$204k - $259k
...around the car. We work jointly with downstream teams on the optimization and integration into the Waymo Driver. We conduct our own research to address real-world problems and collaborate with research teams at Alphabet. We have access to millions of miles of driving...Full timeTemporary workRemote work$176k - $264k
...At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life... ...capable of leveraging large-scale multi-modal (RGB, depth, flow, semantics, bounding... ...Responsibilities Develop, integrate, and deploy algorithms for Multi-Modal and 4D reasoning...Local areaShift work- ...At Toyota Research Institute (TRI), we're on a mission to improve the quality of human life. We're developing new tools and capabilities to amplify the human experience. To lead this transformative shift in mobility, we've built a world-class team advancing the state...Work experience placementInternshipLocal areaShift work
$140k - $195k
...Center About the Team : Our AI Research team, reporting directly to the Chief AI... ...generation of intelligent systems-integrating multimodal foundation models, generative AI,... ...deep learning pipelines that handle multi-modal sensor data (vision, force/torque,...Work at officeLocal areaWork from homeRelocation packageFlexible hours- ...much more efficiently than GPUs. Sohu enables entirely new research directions and products. When our chips come out, these use... ...time video, agents, speculative decoding, or new tree search algorithms, we must create the market for our hardware. In this role,...
- ...competitive salary with an employee stock option plan and the opportunity to be part of a leading AI startup focused on impactful research. The ideal candidates should have a strong background in machine learning and deep learning with the capability to conduct hands-on...Remote work
- ...ML Researcher Tilde Research is a moonshot AI lab advancing mechanistic interpretability, new architectures, and pretraining science. We build foundational understanding of models to advance the frontier of intelligence. About the role: As a ML Researcher, you...Full timeInternship
$300k - $350k
...Lightmatter is (re)inventing the future of computing with light! Lightmatter is seeking a Principal of Machine Learning Infrastructure Researcher to join our Machine Learning team. This team is focused on inventing novel hardware systems using Lightmatter technology and...Full timeTemporary workFlexible hours- ...Role Number: 200601297-3760 Summary We are hiring a researcher with a strong technical background in Image/Video generation and editing, as well as Multimodal Foundation Models. You will play a critical role in the research and development of multimodal foundation...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Hunyuan Multimodal Algorithm Researcher (Omni-Modal). Be the first to apply!
- court researcher Palo Alto, CA
- data collection researcher Palo Alto, CA
- security researcher Palo Alto, CA
- researcher Palo Alto, CA
- machine learning researcher Palo Alto, CA
- field researcher Palo Alto, CA
- design researcher Palo Alto, CA
- music researcher Palo Alto, CA
- product researcher Palo Alto, CA
- human factors researcher Palo Alto, CA


