Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Synthetic Data Engineer (AI Data/Training)

Hyphen Connect Limited

We are seeking a talented and innovative Synthetic Data Engineer. In this role, you will design and implement domain-specific synthetic data generation pipelines, ensuring high-quality data management for training loops. Your expertise will drive the success of data processing and model training within the organization.

Responsibilities:
  • Design domain-specific synthetic data generation (SDG) pipelines via self-instruct and constitutional prompting.
  • Implement automated quality scoring and de-duplication systems.
  • Manage data pipelines that feed directly into SFT and DPO training loops.
Qualifications:
  • Proven experience building large-scale data pipelines (Airflow, Spark, Ray).
  • Deep knowledge of prompt engineering for data generation.
  • Familiarity with dataset distillation and bias mitigation.
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Synthetic Data Engineer (AI Data/Training) in San Francisco, CA vacancy
  • $181.1k - $318.4k

     ...Senior ML Data Engineer, MLO Do you believe Machine Learning and AI can change the world? We truly believe it can! We are...  ...produces datasets used in the training of ML and AI-centric features for...  ...; and production-ize synthetic data workflows including orchestration... 
    Training
    Temporary work
    Relocation

    Apple

    San Francisco, CA
    2 days ago
  •  ...partners use real‑time data to transform their business...  ...The Senior Data Engineer is responsible for building...  ...RxSense’s analytics, AI systems, and business intelligence...  ...data subsetting, synthetic data generation, and...  ...feature stores, training data pipelines, and governed... 
    Training

    Rxsense

    San Francisco, CA
    2 days ago
  • $172.5k - $260.1k

     ...**Salesforce is the #1 AI CRM, where humans with...  ...it all.The **Enterprise Data & AI Solutions** group...  ...to build the autonomous engines that power executive decision...  ...specialized in ETL, synthetic data generation,...  ..., promotion, benefits, training, assessment of job performance... 
    Training
    Shift work

    Salesforce, Inc.

    San Francisco, CA
    3 days ago
  •  ...Data/ETL Engineer (Founding Team) Location: San Francisco Bay Area Type: Full-Time Compensation...  ...Role We're building a multi-tenant, AI-native platform where enterprise data...  ...a connected ontology ready for model training, vector search, and insight-to-action workflows... 
    Training
    Full time

    Fabrion

    San Francisco, CA
    3 days ago
  •  ...Data Engineer Location: San Francisco, CA Required Clearance: Secret Salary: Competitive...  ...Data Engineer with a strong focus on AI and machine learning to join our dynamic...  ...quality, clean, and usable data for model training and evaluation. Optimize data storage... 
    Training

    Fullscope

    San Francisco, CA
    3 days ago
  • £75k - £95k per year

     ...Join to apply for the Data Engineer role at Surecall Tech Join to apply for the Data Engineer role at Surecall Tech Get AI-powered advice on this job and more exclusive features...  ...: You’ll have a budget for tools, training, and conferences ~ Culture-led: Empowered... 
    Training
    Full time
    Freelance
    Remote work
    Flexible hours

    Surecall Tech

    San Francisco, CA
    3 days ago
  • $275k - $370k

     ...interpretable, and steerable AI systems. We want AI to...  ...committed researchers, engineers, policy experts, and...  ...As part of our growing Data Science and Analytics...  ...controlled experiments, synthetic controls - to make...  ...combination of education, training, and/or experience Required... 
    Training
    Work at office
    Visa sponsorship
    Flexible hours

    anthropic

    San Francisco, CA
    1 day ago
  • $138.9k - $186.2k

     ...Senior Data Engineer Disney Entertainment and ESPN Product & Technology is a global organization...  ...video through the power of data and AI. We design and build innovative...  ...Required Education, Experience/Skills/Training: ~5+ years of data engineering experience... 
    Training

    Disney France

    San Francisco, CA
    3 days ago
  •  ...Description: Role: Data Engineer - Artificial Intelligence & Machine Learning Location Options: Bay Area - CA Responsibilities: - 1. Develop AI/ML Models: •Design, build, and train machine learning models using appropriate algorithms (e.g., supervised... 
    Training

    TEPHRA

    San Francisco, CA
    2 days ago
  •  ...machine learning models Architecting ML training, validation and inference pipelines...  ...approaches to maximizing the potential of data in AI models Defining creative solutions...  ...of study Strong ML research and engineering utilizing established and emerging NLP... 
    Training

    NovumTech Partners

    San Francisco, CA
    3 days ago
  • $180k - $220k

     ...insights, and a host of business-critical KPIs. As a Data Engineer in the Data Engineering team, you will own the...  ...related skills, experience, and relevant education or training. We may use artificial intelligence (AI) tools to support parts of the hiring process, such... 
    Training
    Flexible hours

    Finix

    San Francisco, CA
    3 days ago
  • $165k - $175k

     ...ARE Zeta Global (NYSE: ZETA) is the AI-Powered Marketing Cloud that leverages...  ...Role We're looking for a Senior Data Engineer to design, build, and operate the data...  ...offline/online feature generation, or model training datasets. ~ Experience with real-... 
    Training

    Zeta Global

    San Francisco, CA
    5 hours ago
  •  ...Responsibilities: 1.Design and Build Data Pipelines: •Develop,...  ...preparing datasets for model training and deployment. •...  ..., and best practices in data engineering and big data systems....  ...learning and preparing data for AI/ML model training. -Familiarity... 
    Training

    TEPHRA

    San Francisco, CA
    5 hours ago
  • $99k - $149k

     ...insights about companies and AI-driven personalization to help...  ...responsibility is to integrate data from a variety of sources into...  ...You will provide documentation, training, and consultation for users of...  ...experience in software engineering fundamentals and coding Salary... 
    Training
    Work experience placement
    Local area

    Indeed

    San Francisco, CA
    2 days ago
  • $25 - $30 per hour

     ...technically advanced energy and data center infrastructure projects...  ...our projects, including our AI-based digital platform, to deliver...  ...motivated undergraduate Data Engineering & AI Enablement Intern to lead...  ...programs and workforce training. When you join SB Energy, you... 
    Training
    Hourly pay
    Internship
    Summer internship
    Work at office
    Local area

    SB Energy

    San Francisco, CA
    4 days ago
  • $167.2k - $209k

     ...Senior Forward Deployed Data Scientist/Engineer San Francisco, CA; New York, NY At Scale AI, we help leading enterprises turn AI from a promising capability into...  ...performance, and relevant education or training. Scale employees in eligible roles are also granted... 
    Training
    Full time

    Scale AI

    San Francisco, CA
    5 hours ago
  •  ...Data Science & ML Ops Engineer We are seeking a hybrid Data Science & ML Ops Engineer to drive the full...  ...Leverage AutoML tools (e.g., Vertex AI AutoML, H2O Driverless AI) for low-code...  ..., or Vertex AI. Automate model training, testing, deployment, and monitoring... 
    Training

    Apolis

    San Francisco, CA
    3 days ago
  • $172.5k - $260.1k

     ...Job Category Software Engineering Job Details About Salesforce Salesforce is the #1 AI CRM, where humans with agents...  ...About the Team At Slack, data isn't just infrastructure - it...  ...compensation, promotion, benefits, training, assessment of job performance... 
    Training
    Permanent employment

    Salesforce.Com Inc

    San Francisco, CA
    4 days ago
  • $176k - $238k

     ...Senior Data Engineer, Knowledge & Information United States We Breathe Life Into Data...  ...Map, analytics products, and downstream AI/ML-enabled use cases. This is a hands-on...  ...experience, geographic work location, relevant training and certifications, business needs and... 
    Training
    For contractors
    Work experience placement
    Work at office
    Local area
    Remote work
    Flexible hours

    Komodo Health

    San Francisco, CA
    4 days ago
  • $160k - $190k

     ...Senior Data Engineer Los Angeles; New York; Remote; San Francisco EDO is the TV outcomes...  ...world-class decision science and vertical AI, EDO equips industry leaders with...  ..., relevant work experience, key skills, training, and business considerations. EDO is... 
    Training
    Full time
    Work experience placement
    Work at office
    Immediate start
    Remote work
    Flexible hours

    EDO

    San Francisco, CA
    3 days ago
  • $176k - $210k

     ...Senior Data Engineer, Sentinel United States We Breathe Life Into Data At Komodo Health...  ...rotation Spearheaded the adoption of AI-driven development workflows,...  ...experience, geographic work location, relevant training and certifications, business needs and market... 
    Training
    For contractors
    Work experience placement
    Work at office
    Local area
    Remote work
    Flexible hours

    Komodo Health

    San Francisco, CA
    3 days ago
  • $139.44k - $174.31k

     ...Senior Scientific Data Engineer Berkeley Lab's Joint Genome Institute (JGI) has an opening...  ...data foundation for an emerging era of AI-enabled scientific discovery in support...  ...Bachelor's Degree (or equivalent knowledge/training) in Computer Science or a related field... 
    Training
    Full time
    Work at office
    Remote work
    Relocation package

    Berkely Lab

    San Francisco, CA
    45 minutes ago
  •  ...Join our San Francisco office as an ML Engineer focused on Data Engineering. Visa sponsorship...  ...provide seamless data access across all training machines. Develop strategies to reduce...  ...world class developing the future of AI tooling Significant impact on our market... 
    Training
    Full time
    H1b
    Work at office
    Immediate start
    Visa sponsorship

    Dfbooking Recruitment Services

    San Francisco, CA
    3 days ago
  • $110k - $145k

     ...Is We are seeking a talented Senior Data Engineer to design, build, and maintain our data...  ...encryption, and role-based access management AI & LLM Proficiency: Practical experience...  ...and demonstrated experience, education, training and certifications, and other factors... 
    Training
    H1b
    Work at office
    Local area
    Relocation
    Visa sponsorship
    Flexible hours

    Clearway Energy, Inc.

    San Francisco, CA
    1 day ago
  •  ...experienced and motivated Senior Staff Data Engineer to be the technical leader of our Data Engineering...  ...and architecture of our next gen AI powered SoFi Data Platform(SDP), and...  ...Development: Contribute to hiring and training efforts to build a skilled and motivated... 
    Training
    Remote work

    SoFi

    San Francisco, CA
    5 hours ago
  • $208k - $282k

     ...Staff Data Engineer At Komodo Health, our mission is to reduce the global burden of disease...  ...Python, Spark, Rust, C++, and emerging AI-enabled engineering workflows. Foundational...  ..., or infrastructure that supports AI/ML training, inference, experimentation, and... 
    Training
    Work experience placement
    Local area
    Flexible hours

    Komodo Health

    San Francisco, CA
    5 hours ago
  • $227.33k - $312.58k

     ...We're looking for a Staff ML Data Engineer to join Procore's AI & Frontier Models organization. In this role, you'll be responsible for designing...  ...building scalable data pipelines that support machine learning training, evaluation, or inference workflows. ~ Solid... 
    Training
    Work at office
    Local area
    Immediate start
    3 days per week

    ProCore CPA

    San Francisco, CA
    2 days ago
  •  ...Lead Data Engineer The Office of Information Technology (IT) is responsible for enabling State...  ...solutions that support analytics, AI, compliance, and operational efficiency....  ...supervision, leadership, project management, and training. Project management and information... 
    Training
    Work at office

    State Bar CA

    San Francisco, CA
    3 days ago
  • $197.3k - $313.7k

     ...efforts. Job Category Data Job Details About Salesforce Salesforce is the #1 AI CRM, where humans with agents...  ...workloads, including feature engineering for ML models and real-time scoring...  ...feature freshness, model training pipelines, and real-time inference... 
    Training
    Work at office

    Salesforce.Com Inc

    San Francisco, CA
    1 day ago
  • $191.52k - $212.8k

     ...Ready for a career glow up? As a Lead Engineer you will design and implement innovative...  ...Reporting to the Director, Engineering, Data & AI you will work closely with other team members...  ...built into every role, with access to training, development, and tuition reimbursement.... 
    Training
    Full time

    Sephora

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Synthetic Data Engineer (AI Data/Training). Be the first to apply!