Synthetic Data Engineer (AI Data/Training)
Hyphen Connect Limited
We are seeking a talented and innovative Synthetic Data Engineer. In this role, you will design and implement domain-specific synthetic data generation pipelines, ensuring high-quality data management for training loops. Your expertise will drive the success of data processing and model training within the organization.
Responsibilities:
- Design domain-specific synthetic data generation (SDG) pipelines via self-instruct and constitutional prompting.
- Implement automated quality scoring and de-duplication systems.
- Manage data pipelines that feed directly into SFT and DPO training loops.
Requirements:
- Proven experience building large-scale data pipelines (Airflow, Spark, Ray).
- Deep knowledge of prompt engineering for data generation.
- Familiarity with dataset distillation and bias mitigation.
Vacancy posted 4 days ago

