Senior+ Data Scientist - ML & Image Generation
$150k - $250k
Hapiko
Data Scientist
Build and optimize the ML pipeline behind Stickerbox, an AI-powered voice-to-sticker printer for kids.
Hapiko is a Brooklyn-based company building the future of play. Founded by Arun Gupta (former CEO of Grailed, which sold to GOAT Group in 2022) and Bob Whitney (Anthropic, NYT Games), we're on a mission to create safe, hands-on AI experiences that fuel kids' imaginations rather than replace them.
Our first product, Stickerbox, is the world's first voice-to-sticker printer: a device that instantly transforms a child's spoken ideas into printable, colorable stickers. We sold out our first run, which shipped for the holidays, and it's already being called "one of the first products to make AI feel magical for kids and grounded for parents."
We are backed by a $7M funding round led by Maveron (backers of Lovevery), Serena Ventures, and Ai2 (The Allen Institute). Stickerbox is bringing imagination to life for kids nationwide!
The technical challenge is real. We're running real-time audio transcription, proprietary content safety systems, and custom image generation, all serving thousands of concurrent users with sub-second latency. We're training our own models from scratch, optimizing for kid-friendly aesthetics, and building safety guardrails that actually work. We need a Data Scientist to own data quality, evaluation, and ML optimization across this entire pipeline. You'll work with the team to define what to train on, how to measure success, and how to make our models better every day.
Model Training & Data
- Build and curate large-scale image datasets for training custom models
- Design annotation pipelines and data quality processes
- Analyze training runs and model outputs to guide iteration
- Work with our team to define what to train on and how to evaluate it
ML Pipeline Optimization
- Optimize our transcription pipeline for accuracy and latency
- Improve image generation quality, prompt adherence, and consistency
- Identify bottlenecks and failure modes across the pipeline
- Run experiments and A/B tests to measure improvements
Safety & Content Moderation
- Refine content safety systems for child-appropriate outputs, and develop new ones
- Build on our evaluation datasets for safety edge cases
- Analyze moderation performance and reduce false positives/negatives
- Stay current on best practices for AI safety in generative systems
Evaluation & Metrics
- Build evaluation frameworks to measure model performance at scale
- Define metrics that correlate with user satisfaction (aesthetic quality, relevance, safety)
- Develop automated evaluation pipelines (LLM-as-judge, CLIP scores, human eval)
- Track experiments and communicate findings to the team
Prompt Engineering
- Optimize prompts for transcription accuracy and image generation quality
- Develop systematic approaches to prompt testing and iteration
- Build prompt templates and guidelines for different use cases
Location: NYC only, on-site in our Brooklyn-based office, close to most major train lines. We're flexible on WFH but like to be in the office the majority of the week.
Salary Range: $150k - $250k base + equity and benefits

