Member of Technical Staff - Data Quality Engineer (Pre-training)
Reflection AI
Data Team Engineer
Data is playing an increasingly crucial role at the frontier of AI innovation. Many of the most meaningful advances in recent years have come not from new architectures, but from better data.
As a member of the Data Team, your mission is to ensure that the data used to train our models meets a high bar for quality, reliability, and downstream impact. You will directly shape how our models perform on critical capabilities.
Working with world-class researchers on our pre-training teams, you'll help turn fuzzy notions of "good data" into concrete, measurable standards that scale across large data campaigns. We're looking for engineers who combine strong engineering fundamentals with a deep curiosity about data quality and its impact on model performance.
Working closely with our pre-training teams, you will:
- Own upstream data quality for LLM pre-training, as a specialist or generalist across languages and modalities
- Partner closely with research and pre-training teams to translate requirements into measurable quality signals, and provide actionable feedback to external data vendors
- Design, validate, and scale automated QA methods, alongside human-in-the-loop processes, to reliably measure data quality across large campaigns
- Build reusable QA pipelines that reliably deliver high-quality data to pre-training teams for model training
- Monitor and report on data quality over time, driving continuous iteration on quality standards, processes, and acceptance criteria
About You:
- Strong engineering fundamentals with experience building data pipelines, QA systems, or evaluation workflows for pre-training data
- Detail-oriented with an analytical mindset, able to identify failure modes, inconsistencies, and subtle issues that affect data quality
- Solid understanding of how data quality impacts pre-training, with the ability to translate quality concerns into concrete signals, decisions, and feedback
- Experience designing and validating automated quality checks, including rule-based systems, statistical methods, or model-assisted approaches such as LLM-as-a-Judge
- Comfortable working autonomously, owning problems end-to-end, and collaborating effectively with researchers, engineers, and operations partners
Skills and Qualifications:
- Proficiency in Python and in building ML/LLM workflows, including debugging and writing scalable code
- Experience working with large datasets and automated evaluation or quality-checking systems
- Familiarity with how LLMs work, including the ability to describe how models are trained and evaluated
- Excellent communication skills with the ability to clearly articulate complex technical concepts across teams
What We Offer:
We believe that to build superintelligence that is truly open, you need to start at the foundation. Joining Reflection means building from the ground up as part of a small, talent-dense team. You will help define our future as a company and the frontier of open foundational models.
We want you to do the most impactful work of your career with the confidence that you and the people you care about most are supported.
- Top-tier compensation: Salary and equity structured to recognize and retain the best talent globally.
- Health & wellness: Comprehensive medical, dental, vision, life, and disability insurance.
- Life & family: Fully paid parental leave for all new parents, including adoptive and surrogate journeys. Financial support for family planning.
- Benefits & balance: Paid time off when you need it, relocation support, and more perks that optimize your time.
- Opportunities to connect with teammates: Lunch and dinner are provided daily. We have regular off-sites and team celebrations.


