Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Data Infrastructure Engineer

$100k - $150k

Bright Vision Technologies

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications.
As we continue to grow, we're looking for a skilled AI Data Infrastructure Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology.
This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.

Job Title: AI Data Infrastructure Engineer
Location: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Salary: $100K - $150K

Experience: 6+ years
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits.

Employment Terms & Visa Policy
This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies.
This role is part of Bright Vision Technologies' in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies - there is no third-party client, vendor, or implementation partner involved.
We do not engage in C2C, 1099, or third-party arrangements for this role.

BUT STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2 AND NO 3RD PARTY BROKERING PLEASE.
Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.
No new H1B sponsorship is available for this role.

However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.
For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.

Job Summary
We are seeking an AI Data Infrastructure Engineer to build and operate the large-scale data systems that power modern AI training and evaluation pipelines. The role combines deep data engineering expertise with a strong understanding of AI workloads, focusing on ingestion, transformation, quality assurance, lineage, and high-throughput delivery of data to training jobs across diverse modalities. The ideal candidate has experience operating petabyte-scale data systems, strong software engineering fundamentals, and clear understanding of how data infrastructure choices propagate into model quality and training efficiency.

Key Responsibilities
  • Design and operate large-scale data pipelines supporting AI training, evaluation, and continual improvement workflows.
  • Build ingestion systems for diverse modalities including text, image, audio, video, and structured signals.
  • Implement data cleaning, deduplication, filtering, and quality assurance at petabyte scale.
  • Develop dataset versioning, lineage, and provenance tracking systems suitable for reproducible training.
  • Build high-throughput data loading systems that maximize GPU utilization during training.
  • Implement labeling workflows, active learning pipelines, and human-in-the-loop data improvement systems.
  • Design storage architectures balancing cost, throughput, and latency across data tiers.
  • Build evaluation dataset construction pipelines with strict integrity and contamination controls.
  • Implement data privacy, redaction, and consent enforcement throughout the pipeline.
  • Collaborate with ML researchers and engineers to align data systems with model development needs.
  • Drive observability of data quality, drift, and pipeline health across the AI data estate.
  • Optimize cost and performance through compression, format selection, and caching strategies.
  • Document data systems, schemas, and operational procedures for broad internal use.
  • Stay current with AI data infrastructure research and emerging open-source tools.
Required Qualifications
  • Bachelor's or Master's degree in Computer Science or a related field.
  • Six or more years of data engineering experience, with significant work supporting ML or AI workloads.
  • Strong proficiency in Python and at least one JVM or systems language.
  • Deep experience with modern data processing frameworks such as Spark, Ray, or Beam.
  • Hands-on experience operating petabyte-scale storage and pipeline systems.
  • Strong understanding of distributed systems, data modeling, and storage formats.
  • Experience with dataset versioning, lineage, and reproducibility for ML workflows.
  • Familiarity with high-throughput data loading for accelerator-based training.
  • Strong software engineering practices including testing, CI/CD, and code review.
  • Excellent communication and cross-functional collaboration skills.
Preferred Qualifications
  • Experience with multimodal datasets at large scale.
  • Familiarity with data quality tooling and dataset evaluation methodology.
  • Exposure to privacy-preserving data systems and regulated data handling.
  • Open-source contributions to data infrastructure projects.
  • Experience supporting frontier model training pipelines.

How to Apply
Would you like to know more about this opportunity?
For immediate consideration, please send your resume to [email protected].
Learn more about Bright Vision Technologies at
We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company.
We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs.
Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.
Position offered by "No Fee Agency."


Equal Employment Opportunity (EEO) Statement

Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall.

BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.
Vacancy posted 9 hours ago
Similar jobs that could be interesting for youBased on the AI Data Infrastructure Engineer in Newark, CA vacancy
  •  ...A pioneering AI company in California is seeking a Data Infrastructure Engineer to build and operate large-scale data systems. The role involves architecting multi-cluster systems for optimized performance and maintaining modern storage solutions. Ideal candidates have... 
    Suggested

    Mistral AI

    Palo Alto, CA
    1 day ago
  •  ...Data Infrastructure Engineer Los Angeles, Palo Alto, San Francisco, Toronto About HeyGen At HeyGen, our mission is to make visual storytelling...  ...of developing applications powered by our cutting-edge AI research. As a Data Infrastructure Engineer, you will lead... 
    Suggested

    HeyGen

    Palo Alto, CA
    4 days ago
  • Rhoda AI is looking for Data Infrastructure MLEs in Palo Alto to develop systems that manage immense data volumes essential for robotics. This role requires expertise in designing large-scale data infrastructure to optimize the processing of billions of video clips, ensuring... 
    Suggested

    Rhoda AI

    Palo Alto, CA
    1 day ago
  •  ...NimbleRx in Redwood City is seeking a Senior Data Engineer to own and evolve the data platform crucial for driving business growth and operational efficiency. As part of a fast-growing healthtech startup, you will build and optimize data pipelines, partner with teams across... 
    Suggested

    NimbleRx

    Redwood City, CA
    1 day ago
  • $144k - $224k

     ...Software Engineer III, Data Platform Hybrid- Any Office (Fremont, CA, Salem, OR, or Pittsburgh...  ..., you will implement and scale the infrastructure required to ingest, process, and synchronize...  ...Collaboration: Work closely with AI and Hardware teams to ensure data tools... 
    Suggested
    Full time
    Temporary work
    Work at office
    Relocation package
    Flexible hours

    Agility Robotics

    Fremont, CA
    3 days ago
  • $100k - $180k

     ...What To Expect As a Sr. Data Engineer in the global EHS&S data team, you'll build and scale...  ...orchestration, and overall end-to-end infrastructure Create and refine dimensional data models...  ...and help raise the bar on engineering/AI best practices across the team What You... 
    Hourly pay
    Temporary work
    Flexible hours

    Tesla

    Fremont, CA
    1 day ago
  • $154.4k - $212.3k

    Uniphore Technologies North America Inc in Palo Alto is looking for a candidate to join their Data Layer and Marketing AI platform. The role focuses on building scalable distributed data systems that power both batch and real-time data processing. The ideal candidate will... 

    Uniphore Technologies North America Inc

    Palo Alto, CA
    4 days ago
  • Micro1 is seeking a Software Engineer to assist in training and evaluating AI systems through real-world tasks. This contractor role offers 10-15 hours per week of work in a remote capacity. Ideal candidates will have 5+ years of experience in software engineering, with... 
    Remote job
    For contractors
    10 hours per week

    Micro1

    Fremont, CA
    2 days ago
  • Relanto is looking for a skilled Data Engineer based in Fremont, CA, who has proven experience in building and maintaining scalable data pipelines on Google Cloud Platform (GCP). The ideal candidate should be proficient in Python, SQL, and PySpark, as well as familiar with... 

    Relanto

    Fremont, CA
    20 hours ago
  • $240k - $280k

    Pantera Capital is looking for a Software Engineer for their Data team. In this role, you will develop a highly reliable and scalable data platform while collaborating with various teams. The ideal candidate holds a Bachelor's degree in a related field or has equivalent... 

    Pantera Capital

    Palo Alto, CA
    3 days ago
  •  ...KPI Partners is a 5 times Gartner recognized data, analytics, and AI consulting company. We are leaders in data engineering on Azure, AWS, Google, Snowflake and Databricks. Founded in 2006, KPI has over 400 consultants and has successfully delivered over 1,000 projects... 
    Contract work
    3 days per week

    KPI Partners

    Fremont, CA
    2 days ago
  • $100k - $180k

     ...strategic decision-making through data. In this role, you will serve as the primary Data Engineer supporting Tesla’s Inbound...  ...go‑to expert for logistics data infrastructure, carrier performance KPIs, and...  ...highly preferred. Familiarity with AI tools and AI‑assisted... 
    Hourly pay
    Full time
    Temporary work
    Work experience placement
    Flexible hours

    Tesla

    Fremont, CA
    1 day ago
  • $141k - $307k

     ...role, you will directly contribute to Lam’s Enterprise AI strategy by building the scalable, AI-ready data foundation that powers generative AI, machine...  ...and semantic data models. Partner closely with AI/ML engineers, data scientists, business teams, and platform teams... 
    Local area
    Remote work
    Flexible hours
    2 days per week
    3 days per week
    1 day per week

    Lam Research

    Fremont, CA
    2 days ago
  • Consolidate the Customer data to create a Unified Customer profile Design and implement data ingestion pipelines into Salesforce Data Cloud from internal and third-party systems . Work with stakeholders to define Customer data model requirements, identity resolution rules... 
    Permanent employment
    Contract work
    Local area

    Blockchain Technologies. LLC

    Fremont, CA
    1 day ago
  • $168.1k - $243.7k

     ...multi-team technical projects that directly impact business outcomes. The role involves managing large-scale projects and mentoring engineers in a dynamic environment. Candidates should have significant experience in programming and cloud-native technologies. The... 
    Full time

    PassFort

    Newark, CA
    2 days ago
  • A technology company in Fremont, California is seeking a specialist to consolidate customer data into unified profiles and design data ingestion pipelines into Salesforce Data Cloud. The role involves ensuring data quality and governance, collaborating with stakeholders... 

    Ethereum Technologies LLC

    Fremont, CA
    1 day ago
  • $225k - $275k

    EarnIn is looking for a highly skilled professional for their AI & Data team. This role involves developing automated reporting systems and driving data-driven decisions in Palo Alto. The salary range for this full-time position is $225,000 - $275,000 plus equity and benefits... 
    Full time
    Work at office
    2 days per week

    EarnIn

    Palo Alto, CA
    1 day ago
  • $219k - $240k

    Menlo Ventures is seeking a Lead Analytics Engineer to manage Obsidian's data warehouse and analytics foundation. This role requires ownership of the dbt project and close collaboration across teams to ensure data integrity and reporting accuracy. The ideal candidate will... 

    Menlo Ventures

    Palo Alto, CA
    1 day ago
  • A leading AI technology firm in California is seeking an experienced backend engineer to focus on data systems within their Data Layer and Marketing AI platform. You will work on building and optimizing distributed systems that handle extensive real-time and batch data... 

    Uniphore Technologies Inc.

    Palo Alto, CA
    1 day ago
  • $215k - $265k

     ...About The Team Financial Systems owns the data and reporting foundation for Accounting, and...  ...org. We’re looking for a Staff Analytics Engineer to build and own our Financial Subledger...  ...support close and audit readiness. Embed AI‑assisted reconciliation capabilities into... 
    Work at office
    Remote work
    Flexible hours

    Affirm

    Palo Alto, CA
    2 days ago
  • $105k - $125k

    Peterson Holding Company is seeking a Data & Analytics Engineer to work onsite in San Leandro, CA. This role supports the evolution of the enterprise analytics platform, focusing on data engineering and reporting using modern cloud data platforms such as Snowflake and Microsoft... 

    Peterson Holding Company

    San Leandro, CA
    20 hours ago
  • $100k - $180k

    Tesla Motors, Inc. is looking for a Data Engineer in Fremont, CA to build and optimize data solutions supporting its Environmental, Health, Safety, and Security goals. The role involves developing data pipelines, collaborating with stakeholders on business challenges, and... 

    Tesla Motors, Inc.

    Fremont, CA
    2 days ago
  • Clutch Canada is seeking a skilled Software Engineer for their Data team at Speechify, responsible for aspects of data collection to support model...  ...bonuses and equity. Join a fast-growing company that shapes AI products impacting millions of users with learning... 
    Remote job

    Clutch Canada

    Fremont, CA
    1 day ago
  • Tesla is seeking a Sr. Data Engineer to join its global EHS&S data team in Fremont, California. In this role, you'll own the data pipelines that enhance Tesla's Environmental, Health, Safety, and Security initiatives, collaborate with stakeholders, and refine data models... 

    Tesla

    Fremont, CA
    20 hours ago
  •  ...motivated individual to join us as a Data Science Engineer, Manufacturing Sciences and Technology...  ...data, including ensuring robust data infrastructure, data collection and analysis, and establishing...  ...quality Contribute to company‑wide AI initiatives as a thought leader for... 
    Local area

    Allogene Therapeutics

    Newark, CA
    2 days ago
  • $100k - $216k

    What To Expect Tesla is seeking a Data Engineer to join our Optimus production engineering team...  ...in scaling our production analytics infrastructure. In this role, you will design, build,...  ...tuning, and scalability Integrate factory AI resources into streamlined data... 
    Temporary work
    Flexible hours

    Tesla

    Fremont, CA
    3 days ago
  • We’re building a modern data + AI platform where ontologies and semantic models create a consistent understanding of entities, relationships, and meaning across systems. We are seeking a Sr. Data Engineer (Ontology & Semantic Modeling) to design scalable data pipelines... 
    Work at office
    Local area
    3 days per week

    SoundThinking

    Fremont, CA
    4 days ago
  • $80 per hour

     ...Job Title: Agentic Analytics Engineer (contract) PR: $80/hr Contract Length: 12 months...  ...for integrating scientific and business data from multiple sources to generate agentic...  ...molecule") into a series of logical steps an AI can follow. RAG Pipeline Maintenance:... 
    Contract work
    Work experience placement
    Immediate start

    Medasource

    Fremont, CA
    20 hours ago
  • TechDigital Group is seeking a Senior Data Engineer with over 12 years of experience in designing and modernizing enterprise data platforms. You will architect scalable ETL/ELT pipelines using Python, PySpark, and Azure Data Factory, and lead enterprise-scale Data Architecture... 

    TechDigital Group

    Fremont, CA
    2 days ago
  • GXL is seeking a Lead Data Engineer in Palo Alto to own the data infrastructure for their AI products. The role involves designing and maintaining scalable ETL/ELT pipelines, enhancing database performance, and collaborating with product teams. The ideal candidate has... 
    Visa sponsorship

    GXL

    Palo Alto, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Data Infrastructure Engineer. Be the first to apply!