Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Data Infrastructure Engineer

$100k - $150k

Bright Vision Technologies

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications.
As we continue to grow, we're looking for a skilled AI Data Infrastructure Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology.
This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.

Job Title: AI Data Infrastructure Engineer
Location: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Salary: $100K - $150K

Experience: 6+ years
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits.

Employment Terms & Visa Policy
This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies.
This role is part of Bright Vision Technologies' in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies - there is no third-party client, vendor, or implementation partner involved.
We do not engage in C2C, 1099, or third-party arrangements for this role.

BUT STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2 AND NO 3RD PARTY BROKERING PLEASE.
Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.
No new H1B sponsorship is available for this role.

However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.
For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.

Job Summary
We are seeking an AI Data Infrastructure Engineer to build and operate the large-scale data systems that power modern AI training and evaluation pipelines. The role combines deep data engineering expertise with a strong understanding of AI workloads, focusing on ingestion, transformation, quality assurance, lineage, and high-throughput delivery of data to training jobs across diverse modalities. The ideal candidate has experience operating petabyte-scale data systems, strong software engineering fundamentals, and clear understanding of how data infrastructure choices propagate into model quality and training efficiency.

Key Responsibilities
  • Design and operate large-scale data pipelines supporting AI training, evaluation, and continual improvement workflows.
  • Build ingestion systems for diverse modalities including text, image, audio, video, and structured signals.
  • Implement data cleaning, deduplication, filtering, and quality assurance at petabyte scale.
  • Develop dataset versioning, lineage, and provenance tracking systems suitable for reproducible training.
  • Build high-throughput data loading systems that maximize GPU utilization during training.
  • Implement labeling workflows, active learning pipelines, and human-in-the-loop data improvement systems.
  • Design storage architectures balancing cost, throughput, and latency across data tiers.
  • Build evaluation dataset construction pipelines with strict integrity and contamination controls.
  • Implement data privacy, redaction, and consent enforcement throughout the pipeline.
  • Collaborate with ML researchers and engineers to align data systems with model development needs.
  • Drive observability of data quality, drift, and pipeline health across the AI data estate.
  • Optimize cost and performance through compression, format selection, and caching strategies.
  • Document data systems, schemas, and operational procedures for broad internal use.
  • Stay current with AI data infrastructure research and emerging open-source tools.
Required Qualifications
  • Bachelor's or Master's degree in Computer Science or a related field.
  • Six or more years of data engineering experience, with significant work supporting ML or AI workloads.
  • Strong proficiency in Python and at least one JVM or systems language.
  • Deep experience with modern data processing frameworks such as Spark, Ray, or Beam.
  • Hands-on experience operating petabyte-scale storage and pipeline systems.
  • Strong understanding of distributed systems, data modeling, and storage formats.
  • Experience with dataset versioning, lineage, and reproducibility for ML workflows.
  • Familiarity with high-throughput data loading for accelerator-based training.
  • Strong software engineering practices including testing, CI/CD, and code review.
  • Excellent communication and cross-functional collaboration skills.
Preferred Qualifications
  • Experience with multimodal datasets at large scale.
  • Familiarity with data quality tooling and dataset evaluation methodology.
  • Exposure to privacy-preserving data systems and regulated data handling.
  • Open-source contributions to data infrastructure projects.
  • Experience supporting frontier model training pipelines.

How to Apply
Would you like to know more about this opportunity?
For immediate consideration, please send your resume to [email protected].
Learn more about Bright Vision Technologies at
We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company.
We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs.
Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.
Position offered by "No Fee Agency."


Equal Employment Opportunity (EEO) Statement

Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall.

BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the AI Data Infrastructure Engineer in Dublin, CA vacancy
  •  ...Ross Stores in Dublin, CA is seeking a Data Engineer to develop data pipelines supporting analytics. The ideal candidate will have 5-8 years of data engineering experience, proficiency in modern data architecture, and be familiar with tools like Snowflake and Airflow.... 
    Suggested

    Ross Stores, Inc.

    Dublin, CA
    8 hours ago
  •  ...Armanino Advisory LLC is seeking a professional to support cloud data platforms. This role involves day-to-day operations, including...  ...data governance. Ideal candidates will have experience in data engineering, familiarity with platforms like Snowflake or PowerBI, and a... 
    Suggested

    Armanino Advisory LLC

    San Ramon, CA
    1 day ago
  • $102.6k - $120.7k

     ...Support day-to-day operations for cloud data platforms, including workspace/project setup...  ...Minimum 2years of experience in data engineering, platform engineering, analytics engineering...  ..., etc.). Demonstrated experience with AI/ML/GenAI enablement (model lifecycle, AI... 
    Suggested
    Internship
    Local area
    Flexible hours

    Armanino Advisory LLC

    San Ramon, CA
    8 hours ago
  •  ...Pacific Gas and Electric Company is seeking a Senior Manager, Workplan Data Platforms & Engineering, responsible for leading the strategy and execution of data management across their electric operations. This role focuses on the architecture and reliability of Workplan... 
    Suggested

    Pacific Gas And Electric Company

    Pleasanton, CA
    7 hours ago
  • $144k

     ...distribution and transmission operations, including planning, engineering, maintenance and construction, asset management, business planning...  ...Distribution, Substation & Transmission work. Workplan and Data Management is Portfolio Operations’ centralized data insights and... 
    Suggested
    Work at office
    Remote work
    2 days per week

    PG&E Corporation

    Pleasanton, CA
    1 day ago
  • Workday in Pleasanton, CA, is seeking a Full Stack Engineer to enhance our data engineering team. In this role, you'll develop user-facing solutions and integrate AI capabilities into our HR and Finance workflows. This is an opportunity to make a real impact as Workday... 
    Remote job
    Flexible hours

    HR Tech Job

    Pleasanton, CA
    1 day ago
  •  ...A forward-thinking technology company is seeking an Analytics and AI Integration Engineer to enhance AI capabilities within their systems. This role involves collaborating with data scientists to integrate AI models, optimizing performance, and deploying AI services on... 

    CXApp

    San Ramon, CA
    1 day ago
  • $120k - $135k

     ...About the Role   We are seeking a Data Acquisition Engineer to design, build, and scale systems that...  ...practices  Data Engineering & Infrastructure  Build ETL/ELT pipelines for ingestion...  ...and crawler scaling  Exposure to AI/ML tools in data workflows  What... 
    Monday to Friday

    Vagaro Inc

    Pleasanton, CA
    4 days ago
  • $88.1k - $141k

     ...We're looking for a Data Engineer - United States This role is Hybrid, Dublin Office...  .... • Maintain and optimize the data infrastructure required for accurate extraction,...  ...world. Cornerstone Galaxy, the complete AI-powered workforce agility platform, meets... 
    Full time
    Work at office
    Local area

    Cornerstone OnDemand

    Dublin, CA
    1 day ago
  •  ...Senior Data Engineer Location: Pleasanton, California (hybrid work) Role Overview...  ...development, and ownership of core data infrastructure—from pipelines to storage to data products...  ...Evaluate and experiment with emerging AI and data technologies, providing feedback... 

    SnapCode Inc

    Pleasanton, CA
    3 days ago
  •  ...Data Analytics Engineer VentureSoft Global, Inc. is a technology services company specializing in software development and IT consulting. We...  ...development, mobile app development, cloud solutions, Data, AI/ML Engineering and digital transformation services. The company... 
    Remote work
    Flexible hours

    Venturesoft

    Pleasanton, CA
    4 days ago
  • Oracle is seeking a Senior Software Engineer to assist in developing software applications tied to AI in Supply Chain Management. In this role, you will define specifications for new projects, develop software solutions, and collaborate with cross-functional teams. The... 

    Oracle

    Pleasanton, CA
    5 days ago
  • $125.8k - $229.4k

     ...A leading retail company is seeking an MDM Engineer III to focus on the development and integration within their Master Data Management platform. Candidates should have 8+ years of experience in MDM solutions, strong skills in Java, and a solid grasp of software development... 
    Remote work

    Ross Stores, Inc.

    Dublin, CA
    4 days ago
  •  ...while being proficient in Python and Azure Databricks. Familiarity with Agile methodologies is required. This position offers a dynamic work environment with opportunities for growth and collaboration with data professionals. #J-18808-Ljbffr Cloud Hybrid Technologies, LLC

    Cloud Hybrid Technologies, LLC

    San Ramon, CA
    1 day ago
  •  ...skilled Senior Azure Databricks (ADB) Developer to join our Data Engineering team. This role involves developing large-scale batch and...  ...teams to ensure secure, scalable, and maintainable cloud data infrastructure. CI/CD Support: Implement CI/CD for Databricks pipelines... 

    InterSources

    Pleasanton, CA
    3 days ago
  • $146.34k - $222.56k

     ...Description Wehave multiple openings for a Data Science Engineer with a background in applied machine...  ...adversarial resilience of critical infrastructure. You will write code, create...  ...algorithms, including deep learning and modern AI techniques such as neural networks,... 
    Minimum wage
    For contractors
    Local area
    Work from home
    Relocation package
    Flexible hours
    1 day per week

    Lawrence Livermore National Laboratory

    Livermore, CA
    1 day ago
  • $120k - $150k

     ...Data Engineer Veev is leading the transformation of the construction industry with an innovative...  ..., building, and maintaining the data infrastructure that powers organizational insights....  .... Leverage machine learning and AI models to forecast business scenarios,... 
    Local area

    VEEV

    Hayward, CA
    1 day ago
  •  ...Data Engineer (Hybrid) Dublin, CA / Houston, TX (onsite 1-2 days a week) Duration: 6 month CTH Top skill sets: Snowflake Must have Experience in Azure cloud is first preference, open to the candidate that has experience in GCP or AWS Must have a minimum... 
    2 days per week
    1 day per week

    Staffing the Universe

    Dublin, CA
    2 days ago
  • $88.1k - $141k

     ...Cornerstone Research in Dublin, CA is seeking a Data Engineer to design and maintain data pipelines in a hybrid work environment. Candidates should have strong SQL skills and experience with cloud-based data solutions. The role involves working with data ingestion tools... 

    Cornerstone Research

    Dublin, CA
    4 days ago
  • $93k - $124k

     ...Job Description Summary As the Sr. Data Engineer, you will play a critical role in building...  ..., Elasticsearch/OpenSearch stacks. Infrastructure: AMI creation, deployment, CI/CD...  ...OpenMetadata experience. Machine learning or AI-driven automation techniques.... 
    Permanent employment
    Contract work
    Remote work
    Visa sponsorship
    Work visa
    Relocation package

    GE Aerospace

    San Ramon, CA
    2 days ago
  •  ...Bachelor's degree or equivalent experience in computer science, applied math, physics, engineering, statistics, economics or related field. 3+ years of industry experience in Data Engineering 3+ years of work experience including hands-on technical... 
    Work experience placement

    Apex Informatics

    Pleasanton, CA
    2 days ago
  •  ...Job Title : Data Engineer Location : Dublin, CA Hybrid Onsite 2-3 days Roles & Responsibilities Provide L3 escalation support for existing production systems and assist with related enhancement activities. Ensure proper testing and adherence to business... 

    Perfict Global, Inc.

    Dublin, CA
    5 days ago
  •  ...Overview: Job Summary: The Senior Data Engineer will be responsible for designing, building, and maintaining robust data pipelines...  ...: Experience with Terraform or CloudFormation for infrastructure automation. Familiarity with Apache Airflow or... 

    Purple Drive

    Pleasanton, CA
    1 day ago
  •  ...Vagaro in Pleasanton, CA is seeking a Data Acquisition Engineer to design and build scalable systems for collecting and processing high-quality external data. Candidates should have 3+ years of experience in web crawling and scraping, along with strong programming skills... 

    Vagaro Inc

    Pleasanton, CA
    8 hours ago
  • $146.34k - $222.56k

     ...We have multiple openings for a Data Science Engineer with a background in applied machine...  ...and adversarial resilience of critical infrastructure. You will write code, create analytical...  ...algorithms, including deep learning and modern AI techniques such as neural networks,... 
    Minimum wage
    For contractors
    Local area
    Work from home
    Relocation package
    Flexible hours
    1 day per week

    LLNL

    Livermore, CA
    5 days ago
  • $100k - $150k

     ...applications. As we continue to grow, we're looking for a skilled AI Infrastructure Engineer to join our dynamic team and contribute to our mission of...  ...offering. Operate high-performance storage systems and data pipelines that keep accelerators fed with training data at... 
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Dublin, CA
    1 day ago
  •  ...Azure Data Protection Engineer/Consultant Duration - 4-5 Months Remote Qualifications The ideal candidate should have 3-5 years' experience with Azure or Azure Security functions and a strong understanding of Azure Information Protection and experience... 
    Remote work

    RIT Solutions Inc/ Tech Dev IT/ Texperts Inc/ConceptsIT, Inc...

    San Ramon, CA
    3 days ago
  • $129.2k - $167.2k

     ...standards that others follow. You will partner closely with Data Engineering, Platform Engineering, Data Science, Architecture, and...  ...and appropriate IT teams (for example, Solutions Delivery, Infrastructure, Enterprise Architecture) and leading junior team members in... 
    Full time
    Temporary work
    Work experience placement
    Work from home
    Flexible hours
    Shift work

    Kaiser Permanente

    Pleasanton, CA
    1 day ago
  • $150k - $175k

     ...Data Engineer Location: San Francisco, CA or New York, NY Work Model: Onsite Compensation...  ...combining engineering, analytics, and AI to build scalable internal systems that...  ...across structured data, APIs, cloud infrastructure, and emerging AI workflows. You will... 

    Harnham

    Hayward, CA
    2 days ago
  • A fast-growing organization in life sciences is seeking a Senior Data Engineer to lead the design and implementation of a scalable Data Lakehouse. You will build real-time and batch data ingestion pipelines and develop complex ETL workflows to optimize large-scale data... 

    Veeva Systems

    Pleasanton, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Data Infrastructure Engineer. Be the first to apply!