AI Tutor - Image & Video [Remote]
$90k - $200kxAI
- Remote job
About xAI
xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All engineers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.
About the Role
As an AI Tutor specialized in image and video, you will contribute to xAI's mission by training and refining Grok to interpret and produce visual content with precision and a keen artistic insight. Key to this role is expertise in cinematography or videography, a track record of producing and sharing visual works (including via online platforms), experience creating short films, advertisements, or marketing materials, and a refined aesthetic judgment in visual composition and storytelling.
Responsibilities
You will use proprietary software to provide labels, annotations, and inputs on projects involving images, videos, and multimedia elements. You must support the delivery of high-quality curated data that captures visual nuances and enhances Grok’s understanding of cinematographic techniques. To this end, you will collaborate with technical staff to develop tasks that improve Grok’s ability to interpret and generate visual narratives effectively. You’ll also work with technical staff to improve annotation tools for efficient workflows.
Required Qualifications
- Demonstrated high proficiency in Film, Media Studies, Visual Arts, or a related field.
- Demonstrated expertise in cinematography or videography, including hands-on production and a keen eye for high-quality visual aesthetics.
- Proficiency in analyzing and critiquing visual media, with strong skills in composition, lighting, editing, and narrative flow.
- Ability to reference industry standards, trends, and tools for evaluating and annotating visual content.
- Strong communication, interpersonal, analytical, and creative skills.
- Commitment to developing AI that excels in visual interpretation and creation.
Preferred Qualifications
- Portfolio of published visual content, such as films, videos, or online posts, including short movies, ads, or marketing campaigns.
- Experience in film production, video editing, content creation, or roles involving visual critique and feedback.
Location & Other Expectations
- This position is based in Palo Alto, CA, or fully remote.
- The Palo Alto option is an in-office role requiring 5 days per week; remote positions require strong self-motivation.
- If you are based in the US, please note we are unable to hire in the states of Wyoming and Illinois at this time.
- We are unable to provide visa sponsorship.
- Team members are expected to work from 9:00am - 5:30pm PST for the first two weeks of training and 9:00am - 5:30pm in their own timezone thereafter.
- For those who will be working from a personal device, please note your computer must be a Chromebook, Mac with MacOS 11.0 or later, or Windows 10 or later.
- You must own and have reliable access to a smartphone.
Compensation
$45/hour - $100/hour
Benefits:
Hourly pay is just one part of our total rewards package at xAI. Specific benefits vary by country, depending on your country of residence you may have access to medical benefits. We do not offer benefits for part-time roles.
xAI is an equal opportunity employer.
- ...Manager, Software Image Quality, Creativity Apps The Creativity Apps team is looking for a Software Image Quality... ...maintaining, and overseeing the evaluation and validation of AI and ML models used in photo and video editing features, ensuring their accuracy, performance,...Video
- ...local candidates About EPATT East Palo Alto Tennis and Tutoring (EPATT) is a nonprofit serving K-12 students since 1988 through... ...Apply Now! Step 1: Complete the application Step 2: Submit your video interview EPATT is an equal opportunity employer...VideoSummer workLocal areaRemote workMonday to Friday
$35 - $45 per hour
...xAI is seeking an AI Tutor specialized in multilingual audio to enhance Grok's voice interactions. The role involves curating and annotating audio data to improve speech recognition globally. Candidates should have native proficiency in Italian and be proficient in English...SuggestedHourly payRemote workFlexible hours$90k - $200k
...About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge... ...or compliance practices. Comfort with recording audio or video sessions for data collection. Familiarity with AI or data...VideoRemote jobHourly payContract workPart timeWork at office$45 - $100 per hour
...Description About xAI xAI's mission is to create AI systems that can accurately understand the... .... Comfort with recording audio or video sessions for data collection.... ...setting. LOCATION AND OTHER EXPECTATIONS: Tutor roles may be offered as full-time, part-time...VideoFull timePart timeFor contractorsRemote workWorldwide10 hours per week$90k - $200k
...About xAI xAI’s mission is to create AI systems that can accurately understand the... ...teammates. About the Role As an AI Tutor - Quantitative Finance, you will be instrumental... ...providing data, such as text, voice, and video data, sometimes providing annotations,...VideoRemote jobHourly payPart timeWork at office$180k
...Mid-training Palo Alto, CA xAI's mission is to create AI systems that can accurately understand the universe and aid... ...-the-art techniques for curating AI training data for text, image, audio, and video modalities. Strong engineering abilities in Spark, Ray, and...VideoTemporary work- ...a Senior Agent Engineer to push the boundaries of what AI agents can do at Pika. You'll work on the systems that give... ...and response generation • Build agent capabilities — image generation, voice synthesis, video creation, web browsing, code execution, and other skills...VideoFull timeRemote work
- ...first culture. Position Overview: We are looking for an AI Storyboard & Video Artist to support short-form narrative projects by... ...and creative direction into consistent, usable AI-generated images and video clips. This role operates in a fast-paced, iterative...VideoFor contractorsLocal area
$10,000 per month
...ll build A multimodal ingestion pipeline for source text, images, video, transcripts, or mixed-media assets. A generation system... ...automation creates leverage versus cleanup work. Work with AI Fund's build team on the sharpest wedge, technical architecture...VideoFull timeContract workVisa sponsorshipFlexible hours$180k
...Alto, CA; Seattle, WA About XAI XAI's mission is to create AI systems that can accurately understand the universe and aid... ...on enabling high-fidelity understanding and generation across image and video modalities, while also incorporating audio where it enhances visual...VideoTemporary work- ...Staff Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning LLM... ...500 companies. Diffusion is the technology behind today's image and video AI, and we're making it the standard for LLMs as well. Our team...VideoImmediate startFlexible hours
$180k
...Understanding Palo Alto, CA About xAI xAI's mission is to create AI systems that can accurately understand the universe and aid... .... Advance understanding and generation across modalities—image, video, audio, and text—spanning the full stack: data curation/...VideoTemporary work$139.9k - $274.8k
...build the world's most advanced multimodal dataset at Microsoft AI We are on a mission to create the largest and most advanced... ...ingest enormous amounts of multi-modal training data (text, audio, images, video). Own and maintain critical data infrastructures, including...VideoOngoing contractWork experience placementWork at officeLocal areaShift work$181.1k - $318.4k
...Camera Imaging Software Engineer, Camera & Photos iPhone is the most popular camera in the world, with billions of photos taken every... ...in the Camera & Photos org delivers amazing quality photos and videos by combining state of the art computer vision, image processing...VideoRelocation$61.79 - $81.71 per hour
...is an essential tenet to building our IT image as not only a service provider but an enabler... ...standards, and approval workflows Use AI tools and automation to scale recurring... ...multi-modal content (written, visual, audio/video) Hands-on experience with Google Workspace...VideoLocal areaImmediate start$115.5k - $189.75k
...vehicles; Woven City, a test course for mobility; and Cloud & AI, the digital infrastructure powering our collaborative foundation... ...interpret complex signals from logs, metrics, test results, images, and video, helping QA and development engineers quickly understand...VideoTemporary workWork at officeFlexible hours- ...Engineer Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning LLM... ...500 companies. Diffusion is the technology behind today's image and video AI, and we're making it the standard for LLMs as well. Our team...VideoImmediate startFlexible hours
$139.9k - $274.8k
...Core team at Microsoft, you will design, build, and maintain AI systems that power some of the largest workloads on the... ...multimodal verticals, including real-time audio interaction, image generation, video generation and safety. Our work spans the entire stack—from...VideoOngoing contractLocal area$247.5k - $267k
...category-defining technology at the intersection of creativity and AI with real impact. Join us to help shape the future of... ...engineering, fine-tuning, inference) and multimodal AI (text, image, video, audio) In your first 3 months, you will: Shape the...VideoWork at officeFlexible hours3 days per week$92k - $138k
...What to Expect The Data Labeling team is responsible for annotating images, videos, and other data for our Tesla AI software. Accurate data is the foundation for training our neural networks and serves as the ground truth for Tesla's artificial intelligence. The team...VideoHourly payFull timeTemporary workFlexible hours- ...Engineer Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning LLM... ...500 companies. Diffusion is the technology behind today's image and video AI, and we're making it the standard for LLMs as well. Our team...VideoImmediate startFlexible hours
$139.9k - $274.8k
...The CoreAI organization at Microsoft builds the end-to-end AI stack and is core to Azure AI innovation and differentiation,... ...risks in AI-generated and human-generated content spanning text, image, audio, video, and multimodal content. We are looking for a Principal...VideoOngoing contractWork at officeLocal area$140k - $360k
...intersection of embedded systems, computer vision, and AI to drive the development of Tesla's imaging pipeline. Your primary focus will be to optimize... ...development, system level performance profiling, and video codec optimization for large scale data collection...VideoHourly payFull timeTemporary workFlexible hours$170k - $277k
...latency reduction. Multi-Modal Strategy: Video and Audio Models Post Training strategy... ...Management: Lead the development of the "Golden Image" environment. Maintain and distribute... ...training on our clusters. Responsible AI & Compliance Partnership: Serve as the bridge...VideoFor contractorsWork at officeFlexible hours$190k - $234k
...category-defining technology at the intersection of creativity and AI with real impact. Join us to help shape the future of... ...roadmap for generation of various content types including text, image, audio and video enabling us to generate brand-affinized, hyper-personalized,...VideoWork at officeLocal areaFlexible hours3 days per week$140.7k - $223.4k
...stability. * Creative GenAI infrastructure: Lead the backend integration of Generative AI models (Diffusion, LLMs) to automate the creation of high-performing image and video assets tailored to specific campaign goals and formats. * Marketplace experimentation engine...VideoWork at officeWorldwideRelocation package- ...AI Researcher – Video World Generation San Francisco (Bay Area) Help build the next generation of AI video systems that can create rich, interactive worlds from text or images. What you’ll work on: Foundational diffusion models and world models for high-quality...Video
$2,790 per month
...Name Mountain View Maternal and Fetal Medicine Job Type Travel Offering Allied Profession Imaging & Radiology Specialty Ultrasound Technician Job ID 33809635 Job Title Imaging & Radiology - Ultrasound...Weekly payShift work- ...Ultrasound Technician Job Type: Travel Profession: Imaging & Radiology Specialty: Ultrasound Technician Shift Details: 8 Hours Days Job Order Details: Start Date 10/13/2025 End Date 01/12/2026 Duration 13 Week(s) Client Details: City Mountain View State...Shift work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Tutor - Image & Video [Remote]. Be the first to apply!
- video Palo Alto, CA
- video engineer Palo Alto, CA
- video assistant Palo Alto, CA
- remote coding part time Palo Alto, CA
- franchise development manager (remote) Palo Alto, CA
- junior devops remote Palo Alto, CA
- telecommute Palo Alto, CA
- call center remote Palo Alto, CA
- remote ruby on rails developer Palo Alto, CA
- remote contract Palo Alto, CA


