AI Data Infrastructure Engineer
Bright Vision Technologies
Job Title: AI Data Infrastructure Engineer
Location: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Salary: 100K - 150K
Experience: 6+ years
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits.
Employment Terms & Visa Policy
This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies.
This role is part of Bright Vision Technologies’ in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies — there is no third-party client, vendor, or implementation partner involved.
We do not engage in C2C, 1099, or third-party arrangements for this role. All of our roles are W2 only; third-party companies and brokers should not apply.
Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.
No new H1B sponsorship is available for this role. However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.
For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.
Job Summary
We are seeking an AI Data Infrastructure Engineer to build and operate the large-scale data systems that power modern AI training and evaluation pipelines. The role combines deep data engineering expertise with a strong understanding of AI workloads, focusing on ingestion, transformation, quality assurance, lineage, and high-throughput delivery of data to training jobs across diverse modalities. The ideal candidate has experience operating petabyte-scale data systems, strong software engineering fundamentals, and a clear understanding of how data infrastructure choices propagate into model quality and training efficiency.
Key Responsibilities
- Design and operate large-scale data pipelines supporting AI training, evaluation, and continual improvement workflows.
- Build ingestion systems for diverse modalities including text, image, audio, video, and structured signals.
- Implement data cleaning, deduplication, filtering, and quality assurance at petabyte scale.
- Develop dataset versioning, lineage, and provenance tracking systems suitable for reproducible training.
- Build high-throughput data loading systems that maximize GPU utilization during training.
- Implement labeling workflows, active learning pipelines, and human-in-the-loop data improvement systems.
- Design storage architectures balancing cost, throughput, and latency across data tiers.
- Build evaluation dataset construction pipelines with strict integrity and contamination controls.
- Implement data privacy, redaction, and consent enforcement throughout the pipeline.
- Collaborate with ML researchers and engineers to align data systems with model development needs.
- Drive observability of data quality, drift, and pipeline health across the AI data estate.
- Optimize cost and performance through compression, format selection, and caching strategies.
- Document data systems, schemas, and operational procedures for broad internal use.
- Stay current with AI data infrastructure research and emerging open-source tools.
Qualifications
- Bachelor’s or Master’s degree in Computer Science or a related field.
- Six or more years of data engineering experience, with significant work supporting ML or AI workloads.
- Strong proficiency in Python and at least one JVM or systems language.
- Deep experience with modern data processing frameworks such as Spark, Ray, or Beam.
- Hands-on experience operating petabyte-scale storage and pipeline systems.
- Strong understanding of distributed systems, data modeling, and storage formats.
- Experience with dataset versioning, lineage, and reproducibility for ML workflows.
- Familiarity with high-throughput data loading for accelerator-based training.
- Strong software engineering practices including testing, CI/CD, and code review.
- Excellent communication and cross-functional collaboration skills.
Preferred Qualifications
- Experience with multimodal datasets at large scale.
- Familiarity with data quality tooling and dataset evaluation methodology.
- Exposure to privacy-preserving data systems and regulated data handling.
- Open-source contributions to data infrastructure projects.
- Experience supporting frontier model training pipelines.
Would you like to know more about this opportunity?
For immediate consideration, please send your resume using the contact details listed on brightvisiontechnologies.applytojob.com.
We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company.
We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs.
Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.
Position offered by “No Fee Agency.”
Equal Employment Opportunity (EEO) Statement
Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall.
BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.


