AI Applied Scientist
$225k - $280k
Wizard
Wizard is the top-performing AI Shopping Agent, delivering the best products from across the web with unmatched accuracy, quality, and trust.

Role
We’re looking for an Applied Scientist to own how we measure, understand, and improve the accuracy of our AI agent. This role sits at the intersection of applied ML, evaluation science, and product. You’ll define what "good" looks like for our agent, build the systems to measure it, and lead the science work to improve it, including fine‑tuning the LLM judges that power our evaluation pipeline. You’ll partner with ML Engineering and AI Engineering, and you’ll bring scientific rigor to the most important question at Wizard: is our agent getting better, and how do we know?

This is a foundational hire on our science team. Evaluation is the starting point, and the role is scoped to grow into broader applied science work as the surface area of the agent expands (recommendations, personalization, ranking, multimodal, conversational understanding).
What You’ll Do
- Define and evolve accuracy metrics across the full shopping experience (retrieval, ranking, recommendations, outcomes)
- Design and run experiments to measure improvements and regressions
- Build and maintain evaluation datasets, benchmarks, and scoring frameworks
- Improve the LLM judges that power our evaluation pipeline: prompting, calibration, and fine‑tuning where it matters
- Translate ambiguous product questions into clear, measurable hypotheses and analysis
- Partner with ML Engineers to validate model changes and guide iteration
- Identify failure modes and edge cases, and drive improvements through data
- Make agent performance visible, trusted, and actionable across product and engineering

First 3 months
- Go deep on the agent, the current eval pipeline, and the metrics we use today
- Audit existing accuracy metrics and benchmarks; identify gaps, blind spots, and signals that aren’t trustworthy
- Build relationships with ML, AI Engineering, and Product
- Ship one quick win: a missing benchmark, an improved metric, or a fix to a misleading signal
- Establish a baseline view of agent performance the team can rally around

Months 3 to 6
- Own the evaluation framework: datasets, metrics, scoring, reporting, both offline and online
- Drive measurable improvements to LLM judge quality (calibration, fine‑tuning where appropriate)
- Run experiments that influence at least one significant model or product change
- Stand up automated evaluation the team trusts before and after every launch
- Build dashboards and reporting that make agent performance legible to leadership

Beyond 6 months
- Lead applied science work on the next frontier as the agent grows: multi‑turn evaluation, multimodal, personalization, ranking quality, conversational understanding
- Influence team‑level strategy on what we measure, what we improve, and why
- Mentor and help grow the science function as it expands

What Success Looks Like
- Clear, trusted accuracy metrics are consistently used across product and engineering
- A robust automated evaluation framework for both offline and live experiments
- Model and product changes are consistently measured before and after launch
- Demonstrable improvements in LLM judge quality and eval coverage
- Science leadership that informs what we build, not just whether it works

- Depth track: become the org’s authority on AI evaluation: eval strategy, judge models, agent benchmarking
- Breadth track: expand into other applied science problems (recommendations, personalization, ranking, multimodal, conversational understanding) as those areas come online
- Leadership track: Senior / Staff Applied Scientist, with technical leadership across the science function

As the agent gets more capable, the science problems get richer.

Ideal Background
- 5+ years in Applied ML, AI Research, or Applied Science (PhD or equivalent depth strongly preferred)
- Hands‑on experience evaluating modern AI/ML systems: LLMs, agents, ranking, or recommendations
- Direct experience with LLM‑based systems: judge models, RAG, prompt engineering, fine‑tuning, RLHF, or similar
- Strong experimentation foundations: A/B testing, causal inference, statistical rigor
- Proven ability to operate in ambiguity: defining problems, not just solving pre‑defined ones
- Clear, structured communication that influences across ML, engineering, and product

The expected base salary range for this role is $225,000 - $280,000 USD, and will vary based on skills, experience, role level, and geographic location. Final compensation will be determined by considering these factors alongside overall role scope and responsibilities.

In addition to base salary, Wizard offers:
- Equity in the form of stock options
- Medical, dental, and vision coverage
- 401(k) plan
- Flexible PTO and company holidays
- Fully remote work within the United States
- Periodic company offsites and team gatherings

Wizard is committed to fair, transparent, and competitive compensation practices.
As set forth in Wizard’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.

