Multimodal AI Engineer - Image/Video & Audio
$180kxAI
xAI in Seattle is seeking a Member of Technical Staff for the Imagine Model team. This role focuses on developing advanced AI experiences beyond text, emphasizing image and video modalities. Responsibilities include enhancing multimodal capabilities, improving data quality, and designing evaluation frameworks. Candidates should have a strong background in data-driven experimentation and machine learning systems. The compensation package ranges from $180,000 to $440,000, including equity and comprehensive benefits. #J-18808-Ljbffr xAI
- ...We are seeking a talented Multimodal AI Systems Architect to develop and optimize AI systems that seamlessly integrate vision and audio models. This role focuses on enhancing our voice-... ...systems capable of retrieving insights from videos and PDFs. Qualifications:...VideoAudio
- ...Intelligent Creation Team is the AI, special effects, and audio-video creation technology team,... ..., client and server engineering, and provides cutting‑... ...limited to: Experience in multimodal understanding, such as... ...vision and language, such as image/video captioning,...VideoAudio
- ...grow, we’re looking for a skilled AI Data Infrastructure Engineer to join our dynamic team and contribute... ...modalities including text, image, audio, video, and structured signals. Implement... ...Preferred Qualifications Experience with multimodal datasets at large scale....VideoAudioFull timeH1bLocal areaImmediate startRemote workVisa sponsorship
$171.6k - $302.2k
...Sr. Applied AI Software Engineer- Vision Products Group & Siri Apple builds products that are loved by people around the world—products... ...with algorithm teams Processing sensor data, (e.g. image/video, audio, motion), such as using computer vision / signal processing...VideoAudioRelocation$171.6k - $302.2k
Sr. Applied AI Software Engineer- Vision Products Group & Siri Seattle, Washington, United States Software and Services Apple builds products... ...with algorithm teams Processing sensor data, (e.g. image/video, audio, motion), such as using computer vision / signal...VideoAudioRelocation$171.6k - $230.1k
...Lead Machine Learning Engineer Technology is at the heart of Disney's... ...decisions daily across Disney's video-on-demand and live TV properties... ...ViT) for NLP and/or vision ~ Multimodal embedding techniques across text, image, audio, or structured data ~ Large language...VideoAudio$171.6k - $230.1k
...Lead Machine Learning Engineer Technology is at the heart... ...in generative AI applications, including generative... ...models, and other agentic multimodal technologies. Areas of work... ...may include generative video, generative image, generative audio, chatbots, LLM applications...VideoAudio$201.3k - $367.4k
...SIML - LLM & Generative AI The System... ...generative AI models for image and video generation. You will work... ...including researchers, engineers, and product leaders,... ...transfer learning, and multimodal alignment. Collaboration... ...(e.g., image, video, audio, or motion modalities)...VideoAudioRelocation- ...Model, Generative AI TikTok is the leading... ...short-form mobile video. Our mission is to... ...effects and audio-video creation technology... ...client and server engineering, and provides... ...generative AI (e.g., image, video, or 3D... ...but not limited to multimodal generation, image...VideoAudio
$180k
...s mission is to create AI systems that can accurately... ..., and focused on engineering excellence. This organization... ...ABOUT THE ROLE: As a multimodal engineer on the Imagine... ...and generation across image and video modalities, while also incorporating audio where it enhances...VideoAudioTemporary work- ...Centific**Centific is a frontier AI data foundry that curates... ...4,000 AI practitioners and engineers. We harness the power of an... ...evaluation frameworks for LLM and multimodal systems, covering benchmark... ...multimodal evaluation (text-image, audio, video) and long-context...VideoAudioFull timeRemote work
$142.7k - $270.95k
...Scientists in Generative AI to our world‑class AI... ...all modalities: images, video, 3D, LLMs and cross‑modal... ...creative workflows, and multimodal priors. What You’ll... ...visual (image/video/3D), audio, and multi‑modal... ...class researchers and ML engineers to bring research ideas...VideoAudioTemporary workLocal area- ...than ever before. Generative AI platforms represent a major... ...the Year. As a Principal Engineer (IC7+), you will: Solve... ...harness the power of LLMs and multimodal AI for scalable, enterprise-... ...) and multimodal AI (text, image, video, audio) Impact & Outcomes In...VideoAudioFull timeWork at officeFlexible hours
- ...TensorRT, ONNX, vLLM) to accelerate multimodal models including video diffusion, LLMs, and speech models... ...compute or memory bottlenecks. ~ Data Engineering: Experience building scalable data... ...~ Experience with video or audio models in research or production settings...VideoAudio
$320k - $405k
...Staff Software Engineer, iOS San Francisco, CA, New York City... ...interpretable, and steerable AI systems. We want AI to be safe... ..., visual effects, and audio and video streaming on mobile ~ A vision... ...Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI &...VideoAudioWork at officeVisa sponsorshipFlexible hours- ...emotionally expressive, real-time AI. This is a critical role to... ..., and ensure our real-time video AI runs reliably with low... ...maintain the serving stack for multimodal AI workloads. Optimize for latency... ..., ensuring smooth video and audio delivery at scale. ~ Orchestrate...VideoAudioWorldwide
$115k - $145k
...experiences. From pioneering digital audio and lighting systems to... ...high-performance embedded audio, video, and data distribution systems... ...We're looking for a DevOps Engineer to improve and maintain the developer... ..., binary assets, and firmware images. Lead the migration from...VideoAudioFull timeImmediate start- ...are seeking a hands-on Machine Learning Engineer to drive the data & evaluation lifecycle... ...crafting creative techniques to analyze audio & video datasets, designing metrics to understand... .... Background in Computer Vision (image augmentation), Audio and Natural Language...VideoAudio
- ...speech, and vision — to make AI socially and emotionally intelligent... ...-to-speech, etc.), training audio generation models, or related... ...fragmented. Real-time, multimodal interaction — where voice, facial... ...research with the ruthless engineering needed for consumer-grade,...VideoAudioShift work
- A leading technology company in Seattle seeks a PhD-level professional to lead AI research in multimodal machine learning and video understanding. The ideal candidate will have a strong background in applied research with a focus on developing scalable models and understanding...Video
$135.9k - $265.88k
...our Data Driven Transformation (DDT) team makes AI work at scale by combining data science and engineering with business consulting. We build context-driven... ...Please be aware that Capgemini may capture your image (video or screenshot) during the interview process and that...VideoFull timeLocal area- ...focused on public safety is seeking a Senior AI Research Scientist in Seattle. The role... ...include leading research on multimodal reasoning systems, collaborating across teams... ...competitive salary, 401k, and various benefits. #J-18808-Ljbffr National Society for Black EngineersVideo
$120k - $180k
...team of best-in-class machine learning engineers. We are looking for developers who are... ...a deep learning machine model (image, NLP, video, or audio) into production, with measurably improved... ...passionate about creating a revolutionary AI company. At Hive, you will have a...VideoAudio$106.9k - $176.5k
...world. Technology – Data and Decision Science – AI Native Engineering AI/Machine Learning Engineer, Senior... ...fine-tuning Generative AI models Experience with image processing techniques and/or speech and audio processing and analysis What we look for...AudioFull timeWork experience placementSummer holidayFlexible hours$60 per hour
...management workflows Coordinate onsite event operations including stage management, speaker readiness, run-of-show execution, audio checks, video framing, and production timelines Partner closely with recruiters, marketing teams, event vendors, production partners,...VideoAudioContract workLive in$142.8k - $261.8k
...and help to build a better working world. AI & Data - Physical AI Engineering Consultant - Manager The opportunity Our Artificial... ...Processing and Generation. Knowledge in Image Processing and Analysis. Skills in Speech and Audio Processing and Analysis. Ability to scale...AudioFull timeWork experience placementSummer holidayFlexible hours$139.5k - $258.1k
Software Development Engineer in Test (SDET), Video Apps Seattle, Washington, United States Software and Services Imagine what you... ...in Test that has experience in macOS apps, still image file formats, video codecs, audio, and metadata. Ideally, we are looking for...VideoAudioRelocation- ...research accelerator for frontier AI labs and a trusted partner for... ..., STEM, multilinguality, multimodality, and agents; and second, by applying... ...engaging way. As a City Guide Video Creator, you’ll act as a local... ...or camera with clear audio. Choose safe, public locations...VideoAudioContract workFor contractorsFreelanceLocal areaRemote workFlexible hours
$181.1k - $318.4k
...California, United States - Applied AI Scientist - Machine Learning... ...to work on generative AI and multimodal foundation models. This role... ..., and collaborating with engineering teams to transform promising... ...systems (e.g., vision, language, video, etc.). Proficiency in...VideoRelocation package$125.5k - $230.2k
...world. Technology – Data and Decision Science – AI Native Engineering AI/Machine Learning Engineer, Manager... ...visualization and storytelling with data Experience with image processing techniques and/or speech and audio processing and analysis What we look for We...AudioFull timeWork experience placementSummer holidayFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Multimodal AI Engineer - Image/Video & Audio. Be the first to apply!


