AI Research Engineer (Multi-Modal & Vision) [Remote]

Full-time

jobgether

Remote job

This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for an AI Research Engineer (Multi-Modal & Vision) based in United States.

This is an exciting opportunity for a research-focused AI engineer to contribute to the development of advanced multimodal systems that combine vision and language capabilities. The role covers the full AI model lifecycle, from dataset creation and training pipeline development to model evaluation, optimization, and deployment. Working within a highly skilled and collaborative team, you will help build scalable AI solutions designed for real-world production environments. The position offers significant ownership, direct impact on cutting-edge research initiatives, and the opportunity to apply state-of-the-art techniques to solve complex challenges. Ideal candidates are passionate about advancing multimodal AI while maintaining a strong engineering mindset focused on measurable outcomes and practical deployment.

Accountabilities

Conduct end-to-end research and development of vision-language models, including training, evaluation, optimization, and deployment activities.
Design and implement advanced post-training methodologies such as supervised fine-tuning, knowledge distillation, and reinforcement learning from human feedback.
Build, curate, filter, and maintain high-quality multimodal datasets tailored to domain-specific applications.
Improve model efficiency and scalability through optimization, compression, and adaptation techniques suitable for resource-constrained environments.
Develop benchmarking systems and evaluation frameworks to assess model quality, robustness, and real-world performance.
Build and maintain distributed training workflows across GPU infrastructure while identifying and resolving performance bottlenecks.
Contribute to open-source AI ecosystems by leveraging and enhancing models, datasets, and development tools.
Monitor emerging research in multimodal learning and vision-language systems, translating relevant advancements into practical improvements.
Collaborate on research publications and contribute to scientific advancements through conference or journal submissions when appropriate.

Requirements

Bachelor's degree in Computer Science, Machine Learning, Artificial Intelligence, or a related field; Master's or PhD preferred.
Strong hands-on experience working with multimodal AI systems, particularly vision-language models.
Proven expertise in supervised fine-tuning, knowledge distillation, reinforcement learning from feedback, and other post-training optimization techniques.
Experience with parameter-efficient fine-tuning approaches and distributed training frameworks.
Demonstrated success improving model performance on industry-standard benchmarks or production use cases.
Strong understanding of model optimization techniques for deployment in resource-constrained environments.
Experience building scalable machine learning pipelines and training workflows on GPU infrastructure.
Proven contributions to open-source multimodal AI projects through platforms such as GitHub or Hugging Face.
Research background supported by publications in leading AI conferences or journals is highly desirable.
Strong analytical thinking, problem-solving skills, and the ability to balance research innovation with production-oriented engineering practices.
Excellent communication skills and the ability to collaborate effectively within distributed, cross-functional teams.

Benefits

Competitive salary package aligned with experience and expertise.
Opportunity to work on cutting-edge multimodal AI and vision-language research projects.
Fully remote work environment with global collaboration opportunities.
Exposure to large-scale AI infrastructure and advanced machine learning technologies.
High degree of autonomy, ownership, and impact on product and research outcomes.
Collaboration with experienced researchers, engineers, and AI specialists.
Professional growth opportunities through research, innovation, and publication support.
Flexible working arrangements that support work-life balance.
Dynamic, fast-paced environment focused on innovation and continuous learning.

How Jobgether works:

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.

We appreciate your interest and wish you the best!

Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

#LI-CL1

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses and identifying potential inconsistencies or verification signals in application materials based on available information. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the AI Research Engineer (Multi-Modal & Vision) [Remote] in United States vacancy

Remote AI Research Engineer - RL & Multi-Modal
A leading fintech company is looking for an AI model team member to innovate in reinforcement learning approaches. You will optimize decision-making, work with advanced models, and push the limits of AI performance. Candidates should have a PhD in NLP or Machine Learning...
Suggested
Remote job
Remotely
New York, NY
1 day ago
AI Engineer & Researcher - Pre-training (Multi-modal)
...xAI’s mission is to create AI systems that can accurately... ...highly motivated, and focused on engineering excellence. This... ...important. All engineers and researchers are expected to have strong... ...researchers and engineers in multi-modal pre-training. Tech Stack...
Suggested
Local area
Relocation
xAI
San Francisco, CA
more than 2 months ago
Remote Multi-Modal AI Systems Engineer
...seeking an experienced Artificial Intelligence Engineer to develop machine learning systems integrating various sensor modalities for maritime intelligence. The ideal... ...learning, preferably with 7+ years of experience in multi-modal applications. This position is remote-friendly...
Suggested
Remote job
Flexible hours
Quartermaster
Washington DC
4 days ago
AI Research Engineer
...Driving innovation in multi-modal reinforcement learning, the full-time AI Research Engineer will optimize decision-making across integrated data modalities, develop... ...required; a PhD in Machine Learning, NLP, or Computer Vision is preferred Proven experience running large-...
Suggested
Full time
Remote work
Virtual Vocations Inc
United States
5 days ago
AI Research Engineering Intern (Translational Genomics & Multi-Omic Data Platforms)
...Title: AI Research Engineering Intern – Translational Genomics & Multi-Omic Data Platforms Reports to : Director of Translational Research Department: Translational Research Location: Remote MMRF OVERVIEW: The Multiple Myeloma Research Foundation (MMRF...
Suggested
Internship
Local area
Remote work
Multiple Myeloma Research Foundation - MMRF
Norwalk, CT
3 days ago
AI Research Engineer
...Health is the leading provider of multi-modal RCM agents empowering... ...healthcare providers and payers. Our vision is to equip healthcare providers with an AI workforce that reduces... ...revenue. About the role As an AI Engineer on our research team, you’ll work on our hardest...
Work at office
Work from home
Flexible hours
Amperos Health
New York, NY
1 day ago
Research Engineer- 3D Vision and Generation, Self-Driving
$126k - $423k
...powering the future of physical AI. Founded in 2017 and now... ...are looking for a passionate Research Engineer to join the Research Group at... ...will: Conduct research on 3D vision related topics including 3D foundation model, multi‑modal pretraining, Gaussian splatting...
Full time
For contractors
For subcontractor
Casual work
Work at office
Immediate start
Remote work
Day shift
Decisive Point
Sunnyvale, CA
4 days ago
Multimodal AI Research Engineer — Data & Vision Innovator
Orbifold AI in Palo Alto is seeking a Research Engineer - Multimodal AI to develop advanced AI models and optimize data platforms. The ideal candidate will have a strong background in computer vision and a passion for multimodal advancements. This position also offers...
Bonfirevc
Palo Alto, CA
1 day ago
Senior AI/ML Research Engineer (Computer Vision)
...Position We are building advanced augmented dexterity capabilities for next-generation robotic platforms. As a Senior AI/ML Research Engineer (Computer Vision), you will develop the perception models that let our Embodied-AI system understand the surgical scene. Working...
Intuitive
Sunnyvale, CA
3 days ago
Image & Computer Vision AI Engineer
$140k - $170k
...risk operations. We deliver advanced AI and data analytics solutions... ...ROLE SUMMARY: As an Engineer on the Image & Computer Vision AI team, you will play a hands-on role... ...in intelligence workflows. Multi-Modal AI & Image Search You will...
Work at office
Flexible hours
Babel Street
Reston, VA
5 days ago
AI Computer Vision Engineer Jobs
Software Engineer, Computer Vision and Deep Learning Developing new computer vision algorithms... ...dramatically improve existing methods, researching and maintaining state-of-the-art... .... Explore and integrate novel multi-modal foundational AI models, including large Vision‑Language...
Shift work
AI Chopping Block, Inc.
Palo Alto, CA
5 days ago
AI & Computer Vision Scientist - Real-Time Multi-Modal
...learning expert to join our team focused on developing advanced AI solutions. You will be responsible for optimizing neural networks... ...cases. Ideal candidates will have strong experience in computer vision technologies, a master's or Ph.D., and the ambition to grow within...
Full time
shefsolutionsllc
San Francisco, CA
2 days ago
AI Research Engineer
...future? About the job As a member of the AI model team, you will drive innovation in... ...limited hardware environments and complex multi-modal architectures that integrate data such as... ...architectures. You will adopt a hands‑on, research‑driven approach to developing, testing,...
Immediate start
Remote work
Remotely
New York, NY
1 day ago
AI Research Engineer
$200k - $350k
...Research Engineer | San Francisco | Full-Time Brief Overview Applied AI lab building world models for 3D game environments. Early... .... Your focus: distributed multi-agent orchestration for coordinated... ...large models (diffusion, vision-language, RL agents). Hands-...
Full time
Visa sponsorship
Relocation package
Flexible hours
Harnham
Santa Clara, CA
3 days ago
AI Research Engineer (Applied AI)
Bright Vision Technologies is a forward-thinking software development company dedicated... ...to grow, we’re looking for a skilled AI Research Engineer (Applied AI) to join our dynamic team and... ...third-party) Engagement: Long-term, multi-year, aligned to the Bright Vision SOW...
Full time
H1b
Local area
Immediate start
Remote work
Visa sponsorship
Work visa
Bright Vision Technologies
Missouri City, TX
1 day ago
Founding AI Research Engineer - Robot Learning
...Origin Origin is building physical AI for the built world. Our... ...Backed by Accel. Our system runs a Multi Agent Action Expert... ...deployment on Jetson AGX Orin. Every research project will have a deployment... ...two of: imitation learning, RL, vision-language models, robot...
Origin
San Francisco, CA
3 days ago
Python & AI Agent Engineer (RAG, Multi-Modal)
A technology company focused on AI is seeking an individual to design and develop AI-driven agents that enhance user experiences... ...ideal candidate will possess a strong understanding of software engineering practices and hands-on experience with RAG technologies like LangChain...
Ethereum Technologies LLC
Palo Alto, CA
5 days ago
Sr. Applied AI Software Engineer- Vision Products Group & Siri
$181.1k - $318.4k
Sr. Applied AI Software Engineer- Vision Products Group & Siri San Francisco Bay Area, California, United States Software and Services Apple builds... ...race conditions, deadlocks, synchronization issues in multi-threaded environments etc. Systems thinking, including ability...
Relocation
Apple Inc.
San Francisco, CA
2 days ago
Senior AI Research Quantization Engineer
$140.8k - $211.2k
General Summary Qualcomm AI Research is looking for world-class algorithm engineers in general domain machine learning, especially... ...devices. You will be part of a multi-disciplinary talented team... ...efficient generative AI, LLM, LVM, Multi-modal, VLA Efficient inference...
Work experience placement
Worldwide
Qualcomm
San Diego, CA
5 days ago
AI Research Engineer - Agentic AI
$165k - $180k
The Bosch Research and Technology Center North America... ...research organization, our AI research in Silicon... ...Autonomous Systems, AI Systems Engineering, and Industry AI. We... ...in Silicon Valley, our Vision and Language AI Group... ...including single- and multi-agent setups for...
Temporary work
Work experience placement
Worldwide
Ultimate.ai
Sunnyvale, CA
2 days ago
Research - engineering
$190k - $320k
Research Engineer - Computer Vision & Machine Learning Want to build vision systems that let machines understand... ..., including gaze tracking, SLAM, multi‑camera geometry, and systems that explicitly... ...‑aware vision systems that connect AI to the physical world in meaningful...
Trades Workforce Solutions
San Francisco, CA
1 day ago
AI Engineer Computer Vision LLMs ML
$12 per hour
...Founding CTO (Unicorn, $1B+). 6 AI patents. Enterprise AI... ...infrastructure. Founding AI Engineer (Applied ML / Vision + LLM) Engineer the AI... ...cutting‑edge LLM and vision research into tools that run on dusty... ...OpenAI, Anthropic, Gemini—multi‑model approach Orchestration...
Full time
For contractors
Remote work
Flexible hours
Intellus Build
New York, NY
1 day ago
Lead Voice AI Research Engineer (Production)
$200k - $400k
...Francisco, is seeking a Voice Research Engineer to lead the development of... ...algorithms. The role involves owning multi-quarter initiatives, focusing... ...role in enhancing Decagon's AI capabilities. The position... ...including medical, dental, vision, and unique vacation policies...
Decagon
San Francisco, CA
1 day ago
Staff AI Research Engineer
$240k - $280k
About Sybill: At Sybill.ai, we're building the... ...Opportunity: As a Staff AI Researcher, you'll architect and... ...systems Strong software engineering foundation with... ...autonomously plan and orchestrate multi-tool workflows... ...comprehensive Health/Dental/Vision insurance Our commitment...
Sybill AI
Mountain View, CA
4 days ago
AI/ML Engineer - Computer Vision
$110k
...AI/ML Computer Vision Engineer Prime Solutions Group (PSG), Inc. is seeking an experienced AI/ML Computer... ...datasets, and turning cutting-edge research into production-ready solutions.... ...Experience with object tracking, multi-object tracking, or video analytics....
Remote work
Flexible hours
Prime Solutions Group, Inc.
Phoenix, AZ
1 day ago
Senior AI Research Engineer
$160k - $180k
...What You’ll Do: As a strong AI and ML fundamentalist who is... .... Collaborate with other researcher engineers to prototype and validate... ...architectures, attention mechanisms, multi-modal foundation models, diffusion... ...Experience developing VLA (Vision Language Action) models for...
Full time
Local area
Archer
California
19 days ago
Senior AI/ML Research Engineer — Edge Vision & Autonomy
Lockheed Martin is hiring an AI/ML Research Engineer Sr in Orlando, Florida. The role involves designing, implementing, and validating advanced AI/ML algorithms, with a focus on deep learning object-detection and classification. Candidates should have a bachelor's degree...
Full time
Lockheed Martin
Orlando, FL
5 days ago
ML/AI Research Engineer — Agentic AI Lab (Founding Team)
ML/AI Research Engineer — Agentic AI Lab (Founding Team) Location: San Francisco Bay Area Type: Full... ...(RAG), knowledge graphs, and multi‑tenant governance. We’re looking for an... ...Kubernetes, TGI, Sagemaker, LambdaLabs, Modal Languages : Python (core), optionally...
Full time
Fabrion
San Francisco, CA
4 days ago
AI Behavior Simulation Research Engineer
$200k - $400k
...that. We have built the first AI simulation of society, populated... ...based on real humans. Our research pioneered the field of AI-based... ...and optimizing inference for multi-agent environments. Compensation... ...medical, dental, and vision coverage. Time Off: Flexible...
Flexible hours
Simile
San Francisco, CA
17 hours ago
AI Engineer, Geometric Vision, Tesla AI
$176k - $420k
...What To Expect Join the Tesla AI team to help shape one of the most advanced and... ...experience Expertise in key areas of computer vision and robotics, such as feedforward 3D... ...autoregressive models, video generation, multi-modal generation Practical experience with PyTorch...
Hourly pay
Full time
Temporary work
Worldwide
Flexible hours
Tesla
California
10 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Research Engineer (Multi-Modal & Vision) [Remote]. Be the first to apply!