Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Member of Technical Staff - Data Quality Engineer (Pre-training)

Reflection AI

Data Team Engineer

Data is playing an increasingly crucial role at the frontier of AI innovation. Many of the most meaningful advances in recent years have come not from new architectures, but from better data.

As a member of the Data Team, your mission is to ensure that the data used to train our models meets a high bar for quality, reliability, and downstream impact. You will directly shape how our models perform on critical capabilities.

Working with world-class researchers on our pre-training teams, you'll help turn fuzzy notions of "good data" into concrete, measurable standards that scale across large data campaigns. We're looking for engineers who combine strong engineering fundamentals with a deep curiosity about data quality and its impact on model performance.

Working closely with our pre-training teams you will:

  • Own upstream data quality for LLM pre-training; as a specialist or generalist across languages and modalities
  • Partner closely with research and pre-training teams to translate requirements into measurable quality signals, and provide actionable feedback to external data vendors
  • In addition to human-in-the-loop processes, you will design, validate, and scale automated QA methods to reliably measure data quality across large campaigns
  • Build reusable QA pipelines that reliably deliver high-quality data to pre-training teams for model training
  • Monitor and report on data quality over time, driving continuous iteration on quality standards, processes, and acceptance criteria

About You:

  • Strong engineering fundamentals with experience building data pipelines, QA systems, or evaluation workflows for pre-training data
  • Detail-oriented with an analytical mindset, able to identify failure modes, inconsistencies, and subtle issues that affect data quality
  • Solid understanding of how data quality impacts pre-training, with the ability to translate quality concerns into concrete signals, decisions, and feedback
  • Experience designing and validating automated quality checks, including rule-based systems, statistical methods, or model-assisted approaches such as LLM-as-a-Judge
  • Comfortable working autonomously, owning problems end-to-end, and collaborating effectively with researchers, engineers, and operations partners

Skills and Qualifications:

  • Proficiency in Python and building ML / LLM workflows. Must be comfortable debugging and writing scalable code
  • Experience working with large datasets and automated evaluation or quality-checking systems
  • Familiarity with how LLMs work and can describe how models are trained and evaluated
  • Excellent communication skills with the ability to clearly articulate complex technical concepts across teams

What We Offer:

We believe that to build superintelligence that is truly open, you need to start at the foundation. Joining Reflection means building from the ground up as part of a small talent-dense team. You will help define our future as a company, and help define the frontier of open foundational models.

We want you to do the most impactful work of your career with the confidence that you and the people you care about most are supported.

  • Top-tier compensation: Salary and equity structured to recognize and retain the best talent globally.
  • Health & wellness: Comprehensive medical, dental, vision, life, and disability insurance.
  • Life & family: Fully paid parental leave for all new parents, including adoptive and surrogate journeys. Financial support for family planning.
  • Benefits & balance: Paid time off when you need it, relocation support, and more perks that optimize your time.
  • Opportunities to connect with teammates: Lunch and dinner are provided daily. We have regular off-sites and team celebrations.
Vacancy posted 8 hours ago
Similar jobs that could be interesting for youBased on the Member of Technical Staff - Data Quality Engineer (Pre-training) in New York, NY vacancy
  • $134.64k - $176k

     ..., Inc. Position Title: Member of Technical Staff Quality Engineer Salary: $134,638-$176,0...  ...Compile and revie audit output data to identify improvement...  ...successful completion of pre-employment conditions, as...  ...including recruitment, selection, training, utilization, promotion,... 
    Training
    Local area
    Monday to Friday

    GLOBALFOUNDRIES

    New York, NY
    9 hours ago
  •  .... About the Role Build and scale distributed training systems that power frontier model pre-training. Work closely with research teams to design...  ...with large-scale model parallelism strategies (data, tensor, pipeline, or expert parallelism). Experience... 
    Training
    Relocation package

    Reflection AI

    New York, NY
    8 hours ago
  •  ...solutions across algorithms, scaling laws, data processing, optimizers, and model...  ...collaborating on larger initiatives Optimize the training infrastructure for efficient scaling....  ...related discipline. Solid software engineering capabilities with experience building... 
    Training
    Relocation package

    Reflection AI, Inc

    New York, NY
    5 days ago
  •  ...users daily with reliable, high-quality answers grounded in an LLM-first search engine and specialized data sources. The Answer Quality...  ...Search, Product, and model training teams Communicate findings...  ...Qualifications ~ MS in a technical field or equivalent... 
    Training

    Perplexity

    New York, NY
    5 days ago
  •  ...humanity. We're training and deploying frontier...  ...of researchers, engineers, designers, and...  ...developing data-generation techniques...  ...hybrid works! As a Member of Technical Staff for Agents...  ..., Post-training, Pre-training, etc.) to...  ...and well-being, quality time, and workspace... 
    Training
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    New York, NY
    2 days ago
  • $139.9k - $274.8k

     ....???? Microsoft AI (MS AI) is seeking a experienced Member of Technical Staff - Data Engineer - Microsoft AI - Copilot to help build mission critical...  ...data platform products and services.? Ship high-quality, well-tested, secure, and maintainable code.?? Find a... 
    Ongoing contract
    Work at office
    Local area

    Microsoft Corporation

    New York, NY
    5 days ago
  • $160k - $320k

     ...efficient, motivated, and focused on engineering excellence. We cultivate individuals...  ...the Role We're seeking a remarkable Member of Technical Staff to join our team in creating a...  ...highly performant trading systems Training custom models, and harnessing information... 
    Training
    Work at office

    Liquid

    New York, NY
    3 days ago
  •  ...systems and processes that create tight feedback loops between data, evals, and model behavior Develop generalizable evaluation...  ...reasoning, alignment, and usefulness. Collaborate closely with pre-training, post-training, and applied teams to translate insights into... 
    Training
    Relocation package

    Reflection AI, Inc

    New York, NY
    5 days ago
  •  ...serve humanity. We're training and deploying...  ...team of researchers, engineers, designers, and more,...  ...Engineer" role. As a Member of Technical Staff, Applied ML, you will...  ...including RLVR), and data assets. Develop SOTA...  ...fitness and well-being, quality time, and workspace improvement... 
    Training
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    New York, NY
    7 hours ago
  • $175k - $220k

     ...Member Of Technical Staff, Cloud Infrastructure New York, NY; San Mateo...  ...platform delivers the highest-quality models with the fastest...  ...Role: As a Software Engineer on our Cloud...  ...to support distributed training, inference, and data processing pipelines.... 
    Training

    Fireworks AI

    New York, NY
    5 days ago
  •  ...announced soon. Our Technical Staff develops the...  ...apply! As a Member of Technical...  ...combining large-scale engineering with rigorous...  ...performance, distributed training code running on...  ...way; Build data pipelines to support...  ..., Write high-quality software in Rust... 
    Training
    Live in
    Work at office
    Relocation
    Visa sponsorship

    Adaptive ML

    New York, NY
    3 days ago
  • $119.8k - $234.7k

     ...impactful publications or technical leadership on high-...  ...to detail, and a data-driven approach to decision...  ...pipelines. Improve training and deployment...  ...infrastructure, data engineering, pre-training, post-training...  ...annotation pipelines, quality evaluation, bias detection... 
    Training
    Ongoing contract
    Work at office
    Local area

    Microsoft Corporation

    New York, NY
    3 days ago
  •  ...We are looking for a Member of Technical Staff, Research to investigate...  ...into the broader AI engine FirstPrinciples is...  ...Design and automate data ingestion pipelines in...  ...internal tests are stable. Training, Testing & Safety:...  ...tests that flag poor quality model output.... 
    Training
    Remote work

    Firstprinciples

    New York, NY
    2 days ago
  •  ...announced soon. Our Technical Staff develops the...  ...from combining strong engineering with careful experimentation...  ...performance, distributed training code running on...  ...systematic way; Build data pipelines to support reinforcement...  ...Nearly all members of our Technical Staff... 
    Training
    Internship
    Live in
    Work at office

    Adaptive ML

    New York, NY
    5 days ago
  • $120.7k - $142k

     ...shaping how data flows across...  ...Senior Data Engineer, you'll design...  ...needs Create training resources to...  ...Partnership & Technical Leadership...  ...Mentor team members and support adoption...  ...Governance, Quality & Security...  ...finances of our staff and their...  ...benefits ~ Pre-tax flexible... 
    Training
    Summer holiday
    Work at office
    Local area
    Flexible hours
    3 days per week

    Uncommon Schools

    New York, NY
    12 hours ago
  •  ...experienced GPU Performance Engineer with a strong background in Python and large-scale model training. In this role, you will design...  ...infrastructure and directly contribute to technical decisions that optimize...  ...along with many of our team members, has contributed to numerous... 
    Training
    H1b
    Remote work
    Visa sponsorship

    Reka

    New York, NY
    2 days ago
  •  ...serve humanity. We're training and deploying...  ...team of researchers, engineers, designers, and more,...  ...have all the compute, data, and talent available...  ...for this role. As a Member of Technical Staff, you will: Design...  ...fitness and well-being, quality time, and workspace... 
    Training
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    New York, NY
    7 hours ago
  • $125k - $185k

     ...About the job Founding AI Engineer / Member of Technical Staff YC - Startup Role: Founding AI Engineer...  ...for structured and unstructured data, ensuring investigators have quick and...  ...requires (from debugging to running training sessions). Mission-Motivated - Energized... 
    Training
    Temporary work
    Work at office

    Butterfly Recruitment

    New York, NY
    3 days ago
  •  ...serve humanity. We're training and deploying...  ...team of researchers, engineers, designers, and more,...  ...to enhance the global quality of the post-training...  ...and UTC+01:00. As a Member of Technical Staff, you will: Design...  ...ideas on our cluster and data infrastructure.... 
    Training
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    New York, NY
    5 days ago
  •  ...Applied AI Engineer Valthos Inc. Valthos...  ...seeking a highly skilled, data-centric AI Engineer...  ...including adapting and post-training biological frontier...  ...Embrace learning about areas-technical and non-technical-...  ...ML Experience with pre- or post-training language... 
    Training
    Work at office

    Valthos

    New York, NY
    2 days ago
  • $139k

     ...through rigorous tech training. By teaming up...  ..., Cybersecurity, Data Engineering, IT Support,...  ...higher than their pre-training earnings...  ...role focuses on the technical execution of our...  ...Perform regular data quality checks to ensure...  .... Assist team members in building complex... 
    Training
    Remote work

    Per Scholas

    New York, NY
    2 days ago
  •  ...experienced Senior Data Engineer to join our...  ...stack Implement data quality checks, monitoring...  ...platform Mentor team members and promote data engineering...  ...communicate technical concepts Strong...  ...requirements for training and deploying models...  ...and we cover pre‑existing conditions... 
    Training
    Contract work
    Temporary work
    Work at office
    Work from home
    Worldwide
    Home office
    Flexible hours

    Lodgify

    New York, NY
    2 days ago
  • $150k - $300k

     ...About Deeptune Deeptune builds training gyms for AI agents: high-fidelity simulation...  ...completion. We're a ~20-person team of engineers and operators from Anthropic, Scale AI,...  ...said human, and founding recruiter) # Technical / Culture call with a founding engineer... 
    Training
    Work at office

    Deeptune

    New York, NY
    2 days ago
  •  ...minds. Agents have already reshaped software engineering. The same shift is coming for financial...  ...hardest applied AI problems out there. Our technical staff works across the stack: agent architecture, evals, model post-training, and new product surfaces. You'll ship to... 
    Training
    Full time
    Shift work

    Endex Inc

    New York, NY
    5 days ago
  •  ...Modal Backend Engineer Modal provides the infrastructure foundation for AI teams. With...  ...native storage, Modal makes it simple to train models, run batch jobs, and serve low-latency...  ...on the backend, and ClickHouse for data and analytics. Deep knowledge of observability... 
    Training
    Work at office

    Modal

    New York, NY
    2 days ago
  •  ...you will work closely with Reflection's training teams to co-design fault tolerance, node...  ...and rapid hardware debugging. Platform Engineering: Design and iterate on our cluster...  ...own multi-cloud storage, petabyte-scale data replication, and GPU-to-GPU network performance... 
    Training
    Relocation package

    Reflection AI

    New York, NY
    7 hours ago
  •  ...Design, build, and operate large-scale GPU infrastructure for high-throughput model inference and mid-training workloads. Develop systems that power synthetic data generation and reinforcement learning pipelines at scale. Build high-performance inference platforms... 
    Training
    Relocation package

    Reflection AI

    New York, NY
    6 hours ago
  •  ...ll work closely with model researchers, data infrastructure engineers, and cross-functional partners to make sure our data is high quality and can be produced at petabyte scale in...  ...them, you’ll help ensure our models are trained on the best data we can get. What you’ll... 
    Training

    Reka

    New York, NY
    2 days ago
  • $165k - $300k

     ...provinces. As a member of our team,...  ...relevant education or training. In addition to...  ...and implement data streaming...  ...advancements in data engineering, data science...  ...to learn new technical concepts and adapt...  ..., and high‑quality care network options...  ...require pre‑employment background... 
    Training
    H1b
    Remote work

    BNSF

    New York, NY
    2 days ago
  • $207k - $276k

     ...Technical Leader At Early Warning, we...  ...excellence in quality of outputs across...  ...into the engineering organization....  ...of expertise on Data Engineering, Data...  ...and mentor team members. Takes ownership...  ..., experience, training, and specialized...  ...(HSA) or pre-tax savings through... 
    Training
    Hourly pay
    Work at office
    Immediate start
    Visa sponsorship
    Work visa
    Flexible hours

    Early Warning Services

    New York, NY
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff - Data Quality Engineer (Pre-training). Be the first to apply!