AI Research Engineer (Multi-Modal & Vision) [Remote]
jobgether
- Remote job
This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for an AI Research Engineer (Multi-Modal & Vision) based in United States.
This is an exciting opportunity for a research-focused AI engineer to contribute to the development of advanced multimodal systems that combine vision and language capabilities. The role covers the full AI model lifecycle, from dataset creation and training pipeline development to model evaluation, optimization, and deployment. Working within a highly skilled and collaborative team, you will help build scalable AI solutions designed for real-world production environments. The position offers significant ownership, direct impact on cutting-edge research initiatives, and the opportunity to apply state-of-the-art techniques to solve complex challenges. Ideal candidates are passionate about advancing multimodal AI while maintaining a strong engineering mindset focused on measurable outcomes and practical deployment.
Accountabilities
- Conduct end-to-end research and development of vision-language models, including training, evaluation, optimization, and deployment activities.
- Design and implement advanced post-training methodologies such as supervised fine-tuning, knowledge distillation, and reinforcement learning from human feedback.
- Build, curate, filter, and maintain high-quality multimodal datasets tailored to domain-specific applications.
- Improve model efficiency and scalability through optimization, compression, and adaptation techniques suitable for resource-constrained environments.
- Develop benchmarking systems and evaluation frameworks to assess model quality, robustness, and real-world performance.
- Build and maintain distributed training workflows across GPU infrastructure while identifying and resolving performance bottlenecks.
- Contribute to open-source AI ecosystems by leveraging and enhancing models, datasets, and development tools.
- Monitor emerging research in multimodal learning and vision-language systems, translating relevant advancements into practical improvements.
- Collaborate on research publications and contribute to scientific advancements through conference or journal submissions when appropriate.
Requirements
- Bachelor's degree in Computer Science, Machine Learning, Artificial Intelligence, or a related field; Master's or PhD preferred.
- Strong hands-on experience working with multimodal AI systems, particularly vision-language models.
- Proven expertise in supervised fine-tuning, knowledge distillation, reinforcement learning from feedback, and other post-training optimization techniques.
- Experience with parameter-efficient fine-tuning approaches and distributed training frameworks.
- Demonstrated success improving model performance on industry-standard benchmarks or production use cases.
- Strong understanding of model optimization techniques for deployment in resource-constrained environments.
- Experience building scalable machine learning pipelines and training workflows on GPU infrastructure.
- Proven contributions to open-source multimodal AI projects through platforms such as GitHub or Hugging Face.
- Research background supported by publications in leading AI conferences or journals is highly desirable.
- Strong analytical thinking, problem-solving skills, and the ability to balance research innovation with production-oriented engineering practices.
- Excellent communication skills and the ability to collaborate effectively within distributed, cross-functional teams.
Benefits
- Competitive salary package aligned with experience and expertise.
- Opportunity to work on cutting-edge multimodal AI and vision-language research projects.
- Fully remote work environment with global collaboration opportunities.
- Exposure to large-scale AI infrastructure and advanced machine learning technologies.
- High degree of autonomy, ownership, and impact on product and research outcomes.
- Collaboration with experienced researchers, engineers, and AI specialists.
- Professional growth opportunities through research, innovation, and publication support.
- Flexible working arrangements that support work-life balance.
- Dynamic, fast-paced environment focused on innovation and continuous learning.
How Jobgether works:
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
#LI-CL1
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses and identifying potential inconsistencies or verification signals in application materials based on available information. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
- A leading fintech company is looking for an AI model team member to innovate in reinforcement learning approaches. You will optimize decision-making, work with advanced models, and push the limits of AI performance. Candidates should have a PhD in NLP or Machine Learning...SuggestedRemote job
- ...xAI’s mission is to create AI systems that can accurately... ...highly motivated, and focused on engineering excellence. This... ...important. All engineers and researchers are expected to have strong... ...researchers and engineers in multi-modal pre-training. Tech Stack...SuggestedLocal areaRelocation
- ...seeking an experienced Artificial Intelligence Engineer to develop machine learning systems integrating various sensor modalities for maritime intelligence. The ideal... ...learning, preferably with 7+ years of experience in multi-modal applications. This position is remote-friendly...SuggestedRemote jobFlexible hours
- ...Driving innovation in multi-modal reinforcement learning, the full-time AI Research Engineer will optimize decision-making across integrated data modalities, develop... ...required; a PhD in Machine Learning, NLP, or Computer Vision is preferred Proven experience running large-...SuggestedFull timeRemote work
- ...Title: AI Research Engineering Intern – Translational Genomics & Multi-Omic Data Platforms Reports to : Director of Translational Research Department: Translational Research Location: Remote MMRF OVERVIEW: The Multiple Myeloma Research Foundation (MMRF...SuggestedInternshipLocal areaRemote work
- ...Health is the leading provider of multi-modal RCM agents empowering... ...healthcare providers and payers. Our vision is to equip healthcare providers with an AI workforce that reduces... ...revenue. About the role As an AI Engineer on our research team, you’ll work on our hardest...Work at officeWork from homeFlexible hours
$126k - $423k
...powering the future of physical AI. Founded in 2017 and now... ...are looking for a passionate Research Engineer to join the Research Group at... ...will: Conduct research on 3D vision related topics including 3D foundation model, multi‑modal pretraining, Gaussian splatting...Full timeFor contractorsFor subcontractorCasual workWork at officeImmediate startRemote workDay shift- Orbifold AI in Palo Alto is seeking a Research Engineer - Multimodal AI to develop advanced AI models and optimize data platforms. The ideal candidate will have a strong background in computer vision and a passion for multimodal advancements. This position also offers...
- ...Position We are building advanced augmented dexterity capabilities for next-generation robotic platforms. As a Senior AI/ML Research Engineer (Computer Vision), you will develop the perception models that let our Embodied-AI system understand the surgical scene. Working...
$140k - $170k
...risk operations. We deliver advanced AI and data analytics solutions... ...ROLE SUMMARY: As an Engineer on the Image & Computer Vision AI team, you will play a hands-on role... ...in intelligence workflows. Multi-Modal AI & Image Search You will...Work at officeFlexible hours- Software Engineer, Computer Vision and Deep Learning Developing new computer vision algorithms... ...dramatically improve existing methods, researching and maintaining state-of-the-art... .... Explore and integrate novel multi-modal foundational AI models, including large Vision‑Language...Shift work
- ...learning expert to join our team focused on developing advanced AI solutions. You will be responsible for optimizing neural networks... ...cases. Ideal candidates will have strong experience in computer vision technologies, a master's or Ph.D., and the ambition to grow within...Full time
- ...future? About the job As a member of the AI model team, you will drive innovation in... ...limited hardware environments and complex multi-modal architectures that integrate data such as... ...architectures. You will adopt a hands‑on, research‑driven approach to developing, testing,...Immediate startRemote work
$200k - $350k
...Research Engineer | San Francisco | Full-Time Brief Overview Applied AI lab building world models for 3D game environments. Early... .... Your focus: distributed multi-agent orchestration for coordinated... ...large models (diffusion, vision-language, RL agents). Hands-...Full timeVisa sponsorshipRelocation packageFlexible hours- Bright Vision Technologies is a forward-thinking software development company dedicated... ...to grow, we’re looking for a skilled AI Research Engineer (Applied AI) to join our dynamic team and... ...third-party) Engagement: Long-term, multi-year, aligned to the Bright Vision SOW...Full timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa
- ...Origin Origin is building physical AI for the built world. Our... ...Backed by Accel. Our system runs a Multi Agent Action Expert... ...deployment on Jetson AGX Orin. Every research project will have a deployment... ...two of: imitation learning, RL, vision-language models, robot...
- A technology company focused on AI is seeking an individual to design and develop AI-driven agents that enhance user experiences... ...ideal candidate will possess a strong understanding of software engineering practices and hands-on experience with RAG technologies like LangChain...
$181.1k - $318.4k
Sr. Applied AI Software Engineer- Vision Products Group & Siri San Francisco Bay Area, California, United States Software and Services Apple builds... ...race conditions, deadlocks, synchronization issues in multi-threaded environments etc. Systems thinking, including ability...Relocation$140.8k - $211.2k
General Summary Qualcomm AI Research is looking for world-class algorithm engineers in general domain machine learning, especially... ...devices. You will be part of a multi-disciplinary talented team... ...efficient generative AI, LLM, LVM, Multi-modal, VLA Efficient inference...Work experience placementWorldwide$165k - $180k
The Bosch Research and Technology Center North America... ...research organization, our AI research in Silicon... ...Autonomous Systems, AI Systems Engineering, and Industry AI. We... ...in Silicon Valley, our Vision and Language AI Group... ...including single- and multi-agent setups for...Temporary workWork experience placementWorldwide$190k - $320k
Research Engineer - Computer Vision & Machine Learning Want to build vision systems that let machines understand... ..., including gaze tracking, SLAM, multi‑camera geometry, and systems that explicitly... ...‑aware vision systems that connect AI to the physical world in meaningful...$12 per hour
...Founding CTO (Unicorn, $1B+). 6 AI patents. Enterprise AI... ...infrastructure. Founding AI Engineer (Applied ML / Vision + LLM) Engineer the AI... ...cutting‑edge LLM and vision research into tools that run on dusty... ...OpenAI, Anthropic, Gemini—multi‑model approach Orchestration...Full timeFor contractorsRemote workFlexible hours$200k - $400k
...Francisco, is seeking a Voice Research Engineer to lead the development of... ...algorithms. The role involves owning multi-quarter initiatives, focusing... ...role in enhancing Decagon's AI capabilities. The position... ...including medical, dental, vision, and unique vacation policies...$240k - $280k
About Sybill: At Sybill.ai, we're building the... ...Opportunity: As a Staff AI Researcher, you'll architect and... ...systems Strong software engineering foundation with... ...autonomously plan and orchestrate multi-tool workflows... ...comprehensive Health/Dental/Vision insurance Our commitment...$110k
...AI/ML Computer Vision Engineer Prime Solutions Group (PSG), Inc. is seeking an experienced AI/ML Computer... ...datasets, and turning cutting-edge research into production-ready solutions.... ...Experience with object tracking, multi-object tracking, or video analytics....Remote workFlexible hours$160k - $180k
...What You’ll Do: As a strong AI and ML fundamentalist who is... .... Collaborate with other researcher engineers to prototype and validate... ...architectures, attention mechanisms, multi-modal foundation models, diffusion... ...Experience developing VLA (Vision Language Action) models for...Full timeLocal area- Lockheed Martin is hiring an AI/ML Research Engineer Sr in Orlando, Florida. The role involves designing, implementing, and validating advanced AI/ML algorithms, with a focus on deep learning object-detection and classification. Candidates should have a bachelor's degree...Full time
- ML/AI Research Engineer — Agentic AI Lab (Founding Team) Location: San Francisco Bay Area Type: Full... ...(RAG), knowledge graphs, and multi‑tenant governance. We’re looking for an... ...Kubernetes, TGI, Sagemaker, LambdaLabs, Modal Languages : Python (core), optionally...Full time
$200k - $400k
...that. We have built the first AI simulation of society, populated... ...based on real humans. Our research pioneered the field of AI-based... ...and optimizing inference for multi-agent environments. Compensation... ...medical, dental, and vision coverage. Time Off: Flexible...Flexible hours$176k - $420k
...What To Expect Join the Tesla AI team to help shape one of the most advanced and... ...experience Expertise in key areas of computer vision and robotics, such as feedforward 3D... ...autoregressive models, video generation, multi-modal generation Practical experience with PyTorch...Hourly payFull timeTemporary workWorldwideFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Research Engineer (Multi-Modal & Vision) [Remote]. Be the first to apply!
- ai research engineer United States
- ai developer United States
- ai prompt engineer United States
- ai engineer United States
- senior ai engineer United States
- ai ml engineer United States
- ai engineer remote United States
- machine learning ai engineer United States
- research assistant engineering United States
- transportation research engineer United States


