Hunyuan Multimodal Algorithm Researcher (Omni-Modal)
$134.9k - $253.4kTencent
Business Unit Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers; TEG provides users with a full range of customer services. As the operator of the largest networking, devices, and data center in Asia, TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation. What The Role Entails Conduct research and development of Omni multimodal large models, including the design and construction of training data, foundational model algorithm design, optimization related to pre‑training/SFT/RL, model capability evaluation, and exploration of downstream application scenarios. Scientifically analyze challenges in R&D, identify bottlenecks in model performance, and devise solutions based on first principles to accelerate model development and iteration, ensuring competitiveness and leading‑edge performance. Explore diverse paradigms for achieving Omni‑modal understanding and generation capabilities, research next‑generation model architectures, and push the boundaries of multimodal models. Who We Look For Bachelor’s degree (full‑time preferred) or higher in Computer Science, Artificial Intelligence, Mathematics, or related fields; graduate degrees are prioritized. Hands‑on experience in large‑scale multimodal data processing and high‑quality data generation is highly preferred. Solid foundation in deep learning algorithms and practical experience in large model development; familiarity with Diffusion Models and Autoregressive Models is advantageous. Publication in top‑tier conferences or experience in cross‑modal (e.g., audio‑visual) research is preferred. Proficiency in underlying implementation details of deep learning networks and operators, model tuning for training/inference, CPU/GPU acceleration, and distributed training/inference optimization; practical experience is a plus. Participation in ACM or NOI competitions is highly valued. Strong learning agility, communication skills, teamwork, and curiosity. Location: US-California-Palo Alto The expected base pay range for this position in the location(s) listed above is $134,900.00 to $253,400.00 per year. Actual pay may vary depending on job‑related knowledge, skills, and experience. Employees hired for this position may be eligible for a sign‑on payment, relocation package, and restricted stock units, which will be evaluated on a case‑by‑case basis. Subject to the terms and conditions of the plans in effect, hired applicants are also eligible for medical, dental, vision, life and disability benefits, and participation in the Company’s 401(k) plan. The Employee is also eligible for up to 15 to 25 days of vacation per year (depending on the employee’s tenure), up to 13 days of holidays throughout the calendar year, and up to 10 days of paid sick leave per year. Your benefits may be adjusted to reflect your location, employment status, duration of employment with the company, and position level. Benefits may also be pro‑rated for those who start working during the calendar year. Equal Employment Opportunity at Tencent As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals. #J-18808-Ljbffr
$163.8k - $307.6k
...What the Role Entails Conduct research and development of Omni multimodal large models, including the design and... ...training data, foundational model algorithm design, optimization related to pre... ...diverse paradigms for achieving Omni‑modal understanding and generation capabilities...SuggestedFull timeRelocation package- Multimodal LLM Researcher (MLLM) About the Role At Pika, we are pioneering next-generation creative... ...infrastructure Design and prototype novel algorithms and architectures for high-fidelity,... ...model inference and synthesis across modalities Work on diffusion model distillation...SuggestedRemote workFlexible hours
- ...in Palo Alto, California. This role involves advancing multimodal large models through research and development. The ideal candidate will have a... ...hands-on experience in multimodal data and deep learning algorithms. Competitive salary and comprehensive benefits are offered...SuggestedRelocation package
- ...Neura Market is searching for a Multimodal LLM Researcher to innovate in real-time generative AI technologies. The position involves leading research efforts in multimodal generation and collaborating closely with engineering teams to transform insights into groundbreaking...SuggestedFlexible hours
- ...Pika is seeking a Multimodal LLM Researcher to innovate in real-time multimodal generation across text, image, video, and audio. You will be a core member of the research team, collaborating closely with engineering and product teams to develop technologies that empower...SuggestedFlexible hours
$40 - $48 per hour
...Carlsbad Tech is seeking a Research Assistant V to support innovative AI projects based in Menlo Park, CA. The role focuses on cutting-edge research in data curation algorithms, requiring 8+ years of relevant experience and a Bachelor's degree in a STEM field. The position...Remote work$180k
...important. All engineers and researchers are expected to have strong... .... About the Role The multimodal team at xAI creates magical... ...of content across various modalities, including image, video, and... ...implementing effective and efficient algorithms for achieving state-of-the-...Local areaRelocation- ...Senior Applied AI Researcher Articul8 AI is seeking a Senior Applied AI Researcher to solve... ...training, reinforcement learning, multimodal understanding, and knowledge representation... ...in parallel Go broad: span modalities, methods, and domains simultaneously —...
- Real-time Video Researcher About the Role At Pika, we are pioneering next-generation creative... ...Design and prototype novel algorithms and architectures for real-time, high-fidelity... ...developments in real-time video, generative AI, multimodal systems, and agentic orchestration...Remote workFlexible hours
- A leading technology company is seeking a Machine Learning Researcher to join their team in Cupertino, California. The role involves developing cutting-edge algorithms and systems in deep learning, with a focus on real-world applications. Candidates should possess deep...
$180k
...important. All engineers and researchers are expected to have... ...About the Role The omni team at xAI creates... ...across various modalities, including image, video... ...researcher/engineer on multimodal, you will develop and... ...Hardware / Software / Algorithm co-design. Ideal Experiences...Temporary workRelocation- A leading technology company in Santa Clara seeks a Machine Learning Researcher for its AIML Multimodal Foundation Model Team. You will develop advanced multimodal foundation models and agent capabilities for Apple's products. Ideal candidates possess a PhD or MS in a...
$192k - $304.75k
...We’re now looking for a Senior Research Scientist, Multi-Modal Language Models! NVIDIA is seeking a Senior Research Scientist passionate about... ...PyTorch Solid background in computer science fundamentals: algorithms, data structures, parallel/distributed computing, and...$152k - $218.5k
...At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities... ...such as pythonocc-core for primitive fitting. Experience with multimodal generative models for boundary representations. Track record...Local areaShift work$192k - $304.75k
...We are now looking for a Senior Research Scientist focused on Multimodal Foundation Models and Robotics! NVIDIA is searching for an outstanding research... ...What you will be doing: Design and implement novel AI algorithms and models for general-purpose humanoid robots and...$270.6k - $330k
...our progress in developing unified, multi-modal generative foundation models for... ...focus on the synthesis of unstructured multimodal signals with the structured relational data... ...foundation models, at scale, to benefit the research. Pre-train and fine-tune large-scale machine...Contract workWork experience placementLocal area- ...A leading technology firm in California is seeking a passionate Research Scientist to advance next-generation AI hardware platforms. The role involves developing multimodal intelligence models, benchmarking innovative LLM architectures, and collaborating across teams...
$176k - $253k
At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities... ...to evaluate a wide range of architectural, data, and algorithmic choices, and help shape how we apply modern machine learning to...Work experience placementInternshipLocal areaShift work$192k - $356.5k
NVIDIA is seeking a Senior Research Scientist focused on Multimodal Foundation Models in Santa Clara, California. You will design and implement AI algorithms for humanoid robots, significantly impacting the development of general-purpose embodied agents. The role requires...- A leading technology company is seeking a Senior Research Scientist to focus on Multimodal Foundation Models and Robotics. This position involves designing AI algorithms for humanoid robots, developing training methods for foundation models, and working with a collaborative...
- ...advance AI/ML systems for mobility services Conducting applied research in Agentic AI, including agent design, orchestration... ...adaptability of agent-based systems Prototyping frameworks and algorithms in Python using tools such as AutoGen, LangChain, or custom orchestration...
- A leading technology company based in Santa Clara is seeking a researcher with expertise in Machine Learning and AI. This role focuses on developing foundational models for image and video generation. Candidates should possess a PhD, along with experience in computer vision...
$151.8k - $211.3k
We are inviting motivated researchers and engineers to join our team in Bay Area. The team works on developing cutting edge technologies... ...example areas include (but not limited to) video compression algorithms, future video coding standards, learned image and video...Work experience placementRelocation package$147.4k - $272.1k
AIML - Machine Learning Researcher, Foundation Models Cupertino, California, United States... ..., images, speech, videos, and other modalities and apply these models to Apple products... ...researchers who are passionate about developing algorithms, techniques, and systems that push the...Relocation$238.9k - $292.5k
...Advanced Technology Group (ATG) is the research division of the company. ATG’s mission... ...electrical engineering, such as AI/ML, algorithms, digital signal processing, audio engineering... .... Applications in vision, audio, or multimodal domains (e.g., source separation,...Full timeLocal areaWorldwideFlexible hours$118k - $157k
Student Researcher, PhD, Winter/Summer 2026 About the job The Student Researcher Program fosters academic collaborations by hiring students... ..., Computer Vision, Machine Learning, Deep Learning, Algorithmic Foundations of Optimization, Quantum Information Science, Data...Full timeTemporary workSummer workInternship- ...ModelMesh(TM), reliable model evaluation through LLM-IQ(TM), and multimodal understanding to serve regulated industries including energy,... ...chain. Trusted by Fortune 500 enterprises, we bring together research, engineering, product, and domain expertise to deliver AI that...Shift work
$113k - $150k
Job Summary Student Researcher, PhD, Winter/Summer 2025 job at Google. Mountain View, CA. Applications will be reviewed on a rolling... ...Interactions, Computer Vision, Machine Learning, Deep Learning, Algorithmic Foundations of Optimization, Quantum Information Science,...Full timeSummer workInternshipRemote workFlexible hours$180k - $250k
...excited for what’s next! We have an opening in our lab for a senior computer vision researcher, with 3D Reconstruction and Deep Learning expertise, to develop and improve the underlying algorithms powering our consumer products. We are looking for highly-motivated, creative,...Full timeRemote work- ...Engineers with a strong track record in machine learning models research . This is an extremely ambitious foundational project.... ...design, and communication. Have a good understanding of graph algorithms. Have some familiarity with model monitoring, git ,...Permanent employmentFull timeContract workImmediate startRemote workFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Hunyuan Multimodal Algorithm Researcher (Omni-Modal). Be the first to apply!
- data collection researcher Palo Alto, CA
- design researcher Palo Alto, CA
- product researcher Palo Alto, CA
- machine learning researcher Palo Alto, CA
- music researcher Palo Alto, CA
- court researcher Palo Alto, CA
- security researcher Palo Alto, CA
- researcher Palo Alto, CA
- human factors researcher Palo Alto, CA
- field researcher Palo Alto, CA

