Human Data Architect, Quality
Mecka AI
About Mecka AI Mecka AI is building the data and deployment infrastructure for embodied intelligence. We collect, curate, and license the world's most useful robotics training data to leading AI labs, and we deploy real robotic systems with enterprise customers across hospitality, retail, QSR, pharmacy, logistics, and healthcare. We work with the foundation model teams shaping the next decade of robotics, and with the operators running real businesses today. Quality, trust, and execution are core to our partnerships.
The Role We're hiring a Human Data Architect, Quality to be the person with taste for what robotics training data should look like at Mecka. You will define what good data is - the labeling rubrics, ontologies, schemas, sampling philosophy, and acceptance criteria that every dataset we ship is measured against. You decide what goes in or out of a dataset and why. This is a standards-and-methodology architecture role, not a QA-management role . You set the quality bar; data operations and QA teams enforce it. Your output is the spec the entire data org and our customers run on. You will work shoulder-to-shoulder with foundation-model researchers at our customers to translate model behavior into data structure - what to label, how to label it, how to organize it, how to compose a training set, what the edge cases are, and what makes a dataset trainable versus merely large.
What You'll Own
Labeling Rubrics & Quality Criteria (per customer)
Required Background
The Role We're hiring a Human Data Architect, Quality to be the person with taste for what robotics training data should look like at Mecka. You will define what good data is - the labeling rubrics, ontologies, schemas, sampling philosophy, and acceptance criteria that every dataset we ship is measured against. You decide what goes in or out of a dataset and why. This is a standards-and-methodology architecture role, not a QA-management role . You set the quality bar; data operations and QA teams enforce it. Your output is the spec the entire data org and our customers run on. You will work shoulder-to-shoulder with foundation-model researchers at our customers to translate model behavior into data structure - what to label, how to label it, how to organize it, how to compose a training set, what the edge cases are, and what makes a dataset trainable versus merely large.
What You'll Own
Labeling Rubrics & Quality Criteria (per customer)
- Define the labeling rubrics, severity levels, rejection taxonomies, and acceptance criteria for each customer program across video, sensor streams, trajectories, action labels, task outcomes, language grounding, and metadata.
- Translate ambiguous customer requirements ("we want a model that can do X") into precise, measurable, executable data specifications.
- Maintain customer-specific quality criteria and the canonical data dictionary every program references.
- Build golden datasets, reference examples, and calibration tasks that define "correct" by demonstration, not just description.
- Own the taxonomy, schema, and class hierarchies for robotics datasets - how attributes are structured, how temporal segmentation works, how event boundaries are defined, how ambiguity is handled, how edge cases are categorized.
- Decide how data is organized end-to-end so it is trainable, queryable, and composable across customers and modalities.
- Set dataset versioning conventions, schema evolution rules, and the data-organization philosophy the org runs on.
- Own the philosophy for what goes into a dataset and what gets cut: distribution, diversity, edge-case representation, redundancy, license/provenance constraints.
- Decide sampling strategies, balancing rules, and curation principles for each program.
- Make taste-driven calls on what data is worth collecting at all - and push back when collection plans won't produce trainable data.
- Define the acceptance bar that says "this dataset is ready to ship" - and hold it under deadline pressure.
- Iterate rubrics and ontology based on model-failure signal from customers - your standards evolve with what models actually struggle to learn.
- Run cross-customer reviews of recurring quality misses and translate them into standards improvements.
- Partner with engineering on automated validation (schema completeness, duplicates, time sync, metadata coverage, model-assisted review) so the standard is enforceable at scale.
Required Background
- 5+ years working at the intersection of ML and data - annotation methodology, dataset curation, data-centric ML, ground truth design, or labeling-specifications work for autonomy, vision, or multimodal teams.
- Hands-on experience designing taxonomies, ontologies, or labeling schemas that fed production model training (not just internal analytics).
- Strong data instincts: you can open a dataset in SQL, a notebook, or Python and tell us what's wrong with it within an hour.
- Comfortable reading ML papers and translating model-architecture needs into data-structure choices.
- Built a labeling rubric, ontology, or ground-truth spec that a large annotation org executed against in production.
- Worked directly with research scientists at frontier AI labs or autonomy companies on what training data should contain.
- Background in computer vision, robotics, cognitive science, linguistics, or a related field where taxonomy design is craft.
- Have strong opinions about data quality you can defend with concrete examples.
- A taste-maker. You believe data quality is a design problem, not a process problem.
- Precise about definitions and obsessive about edge cases.
- Confident saying "this dataset isn't useful and here's why" - to customers, to leadership, to research teams.
- Energized by deciding the standard, not by managing the team that enforces it.
- Define the data standards the foundation-model teams shaping the next decade of robotics will train on.
- Be the person with the pen on what good robotics data looks like - across video, sensors, trajectories, and language.
- Work directly with researchers at frontier AI labs, not through a sales or PM layer.
- Build the methodology backbone of a data company at the moment the field is still deciding what "good" means.
- Every major customer program has a clear, documented quality standard, ontology, and acceptance criteria authored by you.
- The data organization runs against a canonical schema and rubric set - not ad-hoc per-project decisions.
- Customer rejection rates fall and dataset usefulness rises because the right data is being collected and labeled the right way the first time.
- Researchers at customer labs treat you as the technical counterpart they want to talk to about what they're actually buying.
- Standards evolve continuously from model-failure signal, not in annual rewrites.
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Human Data Architect, Quality in New York, NY vacancy
- ...Brightside is seeking a Principal Architect, Data & AI Platforms, to own and evolve the technical... ...Engineering leadership to ensure speed, quality, and long-term integrity. This is not a... ...blend of products, technology and true human care. Our goal is to make it easier for...Quality
- ...Senior Data Architect New York, New York, United States About the Job Experience Level... ...to deliver the results on time and with quality. Responsibilities Looking for a... ...Information We are the global leader in the human resources consulting. We offer top-...QualityRelocation packageFlexible hours
- ...transformation specialists, uniting human expertise with AI to create... ...: Managing the complete Data Lifecycle, from where data is... ..., to develop and deliver high-quality data products. Attracting, retaining... ...: Solid experience as a Data Architect, leading teams Experience in...Quality
- ...Job Title: Strong Data Architect Location: New York Position Type: Contract... ...Solid understanding of data governance quality lineage and compliance eg GDPR SOC2 HIPAA... ...enterprise that leverages a unique blend of human talent, machine learning algorithms, and...QualityContract workWork experience placement
- ...lot about tech experience, the attitude, human approach, and what we could call “... ...the ride. About the role As the Senior Data Architect , you'll be part of the center data function... ...data practices, and maintaining a high quality bar. Operate as a self-starter with an ownership...QualityContract workWork at officeRemote workFlexible hoursAfternoon shift
$200k - $220k
...AI & Data Architect The AI & Data Architect sits within Carlyle's Enterprise Technology & Data... ...capabilities. Data Governance, Quality & AI Trust (≈10%) Partner with Data... ...consume that data with the same rigor as human users. Technical Leadership & Mentorship...QualityWork at office$160k - $200k
...innovation should be meaningful, beautiful and human. We craft practical, powerful digital... ...We are seeking an experienced Principal Data Architect to join our Data & AI team. In this... ...and help teams deliver practical, high-quality solutions that support long-term value creation...QualityWork experience placementLocal areaRemote workWork from homeFlexible hours- ...business unit or location. Position: Senior Data Architect Location: REMOTE Remote Status: Remote... ...with business semantics. Establish data quality rules, validation frameworks, and... ...probabilistic matching, deduplication, and human-in-the-loop review processes. Partner with...QualityWeekly payFull timeTemporary workWork at officeLocal areaImmediate startRemote work
$147.5k - $211k
...seeking an energetic and passionate Senior Data Architect to design and execute our company's... ...and standards to ensure data integrity, quality, and security. Works closely with... ...foundation is rooted in our core values of humanity and integrity, ensuring that every...QualityLocal area3 days per week- ...Title: Data Solution Architect (fin services) Location: New York, NY Duration: 6+ months Interviews... ...by formulating an intuitive and human-centric design approach. This... ...business transformations, establishing data quality approaches and rules, create a comprehensive...QualityFlexible hours
- Samaritan Daytop Village is hiring a Quality Assurance Analyst in New York to ensure compliance with regulatory requirements across agency... ...care for clients. The role requires a Bachelor’s Degree in Human Services and at least two years of industry-related experience....Quality
- ...Nimble Gravity is seeking an AI Data Quality Analyst (Human-in-the-Loop) to ensure data quality in AI-powered document processing. This role involves reviewing AI-extracted data from insurance submissions for accuracy and correcting discrepancies. The ideal candidate will...QualityRemote work
- ...leader responsible for end-to-end ownership of a Snowflake-based data platform architecture. The ideal candidate will have hands-on... ...will collaborate with teams across various time zones, ensuring quality and consistency while managing architectural standards. A proactive...Quality
- ...An established industry player is seeking a skilled Data Quality Specialist with 8 to 10 years of experience in resolving data quality issues across various data domains. In this role, you will leverage advanced SQL skills to write data quality checks and profile large...Quality
- ...global education technology company is looking for a Senior Data Architect to define and govern enterprise data architecture. This role... ...and design scalable data products while advocating for data quality. Excellent communication skills and experience navigating complex...QualityRemote workFlexible hours
$140k - $155k
...consulting and technology firm is seeking an experienced ETL Data Architect to develop scalable ETL solutions for analytics and reporting... ...successful candidate will collaborate across teams, ensuring data quality and governance while mentoring junior engineers. This remote...QualityRemote work- ...leading consulting firm in the United States is seeking an experienced Data Governance & Collibra Solutions Architect. You will have key responsibilities in configuring and managing the Collibra Data Quality platform while enhancing data governance initiatives. The ideal...QualityRemote workFlexible hours
- ...An innovative firm is seeking a skilled Data Architect to lead the design and implementation of data architecture solutions. In this role... ...will drive the development of data integrations and ensure high-quality data management practices are followed. This position offers...Quality
- ...Prove Identity, Inc. is seeking a full-time remote Data Science Lead to architect the organization's data ecosystem. The role involves designing... ...establishing rigorous analytical methodologies for high-quality data integration into production. The ideal candidate will...QualityFull timeRemote work
- ...Data Architect Duration: 0-6 month(s) Location: Jersey City NJ 07310 High Must Have Skills: 1. Data Architecture Principles 2. Data Lineage... ...and easily. - Knowledge of data governance, including quality management, stewardship, and compliance. - Enables effective...QualityWork at officeLocal areaRemote work
- ...~12+ years of experience is required. Experience in data profiling, Strong experience in SQL analytic functions. Hands... ...on ETL concepts and Source to Target Mappings (STTM's) Data Quality Rules and Validations - Compare the Business Rules to what is actually...Quality
- ...Software Technology, Inc. is seeking a Data Architect with expertise in Population Health and EMR systems to work remotely. The role requires... ...include optimizing data models and ensuring high data quality. Ideal candidates should have proficiency with ETL tools and...QualityRemote work
- ...Data Governance SME (Data Architecture Focus) We are seeking an experienced Data Governance SME with strong expertise in data architecture... ...and applications Identify gaps impacting data lineage, data quality, and governance Evaluate ETL/ELT pipelines (SQL, Spark,...QualityFull time
- A leading technology firm in New York seeks an experienced Technical Architect/Data Engineer to design and implement large-scale data solutions focusing on data quality and governance. The ideal candidate will possess over 15 years of software engineering experience, with...Quality
- ...A leading consultancy is seeking an experienced Data Architect / Data Analyst to assist in migrating HR IT functions for a government client... ...with legacy systems. Responsibilities include assessing data quality, developing migration plans, and working with stakeholders to...QualityRemote work
$3,200 per month
...A healthcare organization is seeking a Senior Data Professional to design and implement a comprehensive data governance framework. The role will focus on creating policies for data quality, security, and access control, along with ensuring compliance with regulations...Quality- ...A leading educational content provider is seeking a Data Science and Analytics Subject Matter Expert to create high-quality course materials. The role involves developing content focused on data analysis, machine learning, and data visualization techniques. Candidates...QualityFreelanceRemote work
- ...Enterprise Data Architect We are seeking a skilled Enterprise Data Architect to join our team in the NY/NJ area. The ideal candidate... .... Define data standards and best practices to ensure data quality and integrity. Lead data governance initiatives and ensure...Quality
- ...potential, leveraging our Disruptive Talent Solution. Role: Data Architect / Data Modeler Location: Midtown New York City, NY... ...data architectures that supportperformance, governance, data quality, and enterprise reporting needs. Develop and implement...QualityContract work
- ...Data Modeler / Informaitional Architect Head count: 1 at Jersey City (Primary) Jersey City NJ - 07310, Columbus office (Secondary) - Columbus OH 4... ...solutions or break down problems Creates secure and high-quality production code and maintains algorithms that run...QualityWork at office
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Human Data Architect, Quality. Be the first to apply!
Related searches
- azure data architect New York, NY
- sr data modeler New York, NY
- data architect New York, NY
- remote data architect New York, NY
- big data architect New York, NY
- hadoop big data architect New York, NY
- data center architect New York, NY
- database designer New York, NY
- enterprise data architect New York, NY
- data integration architect New York, NY

