AI Infrastructure Engineer
$100k - $150k
Bright Vision Technologies
Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications. As we continue to grow, we're looking for a skilled AI Infrastructure Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology. This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.
Job Title: AI Infrastructure Engineer
Location: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Experience: 6+ years
Salary Range: $100k to $150k per annum
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits.
This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies. The role is part of Bright Vision Technologies' in-house Statement of Work (SOW) engagement. Bright Vision Technologies is the client, the end customer, and the employer for this position; no third-party client, vendor, or implementation partner is involved. We do not engage in C2C, 1099, or third-party arrangements for this role, and we do not accept third-party brokering: all our roles are W2. Candidates must be willing to work directly as full-time W2 employees of Bright Vision Technologies and contribute to our in-house SOW deliverables.
No new H1B sponsorship is available for this role. Candidates who currently hold a valid H1B visa and require a transfer are welcome to apply, and we will support H1B transfers for qualified candidates.
A technical coding assessment is mandatory for every role. Please apply only if you are confident in your technical abilities and hands-on experience.
Job Summary
We are seeking an AI Infrastructure Engineer to design, build, and operate the platform layer that powers large-scale AI training and inference workloads. The role focuses on GPU clusters, distributed training frameworks, scheduling, storage performance, and developer experience for ML engineers and researchers, with strong emphasis on reliability, efficiency, and cost control. The ideal candidate has built or operated production AI infrastructure at scale, understands the interaction between hardware, kernel, scheduler, and ML framework, and brings strong software engineering discipline to platform work.
Key Responsibilities
- Design and operate GPU and accelerator infrastructure for training and inference, spanning on-prem clusters, cloud-managed services, and hybrid configurations.
- Build scheduling, queueing, and resource-sharing systems that maximize accelerator utilization across many teams.
- Integrate frameworks such as PyTorch, JAX, DeepSpeed, FSDP, Megatron-LM, and Ray Train into a unified platform offering.
- Operate high-performance storage systems and data pipelines that keep accelerators fed with training data at close to line rate.
- Design networking architectures supporting RDMA, InfiniBand, NCCL, and high-bandwidth collective communication.
- Build observability for AI workloads including utilization, throughput, training stability, and failure-mode analytics.
- Implement checkpointing, restart, and fault-tolerance patterns for long-running training jobs at scale.
- Drive cost optimization across compute, storage, and networking through scheduling, spot capacity, and right-sizing.
- Develop developer tooling and paved-road workflows that let researchers launch experiments safely and efficiently.
- Partner with research and applied ML teams to plan capacity for upcoming training runs.
- Implement security controls, isolation, and access management for multi-tenant AI infrastructure.
- Drive automation across cluster provisioning, lifecycle management, and configuration enforcement.
- Maintain runbooks, capacity dashboards, and operational documentation for the AI platform.
- Stay current with AI infrastructure research, accelerator hardware, and emerging open-source AI tooling.
Required Qualifications
- Bachelor's or Master's degree in Computer Science or a related field.
- Six or more years of experience in infrastructure, platform, or HPC engineering.
- Hands-on experience operating GPU clusters or large-scale ML training infrastructure.
- Strong proficiency in Python and at least one systems language such as Go or C++.
- Deep understanding of distributed training, accelerator architectures, and collective communication.
- Experience with Kubernetes, Slurm, Ray, or similar scheduling systems for ML workloads.
- Strong understanding of Linux internals, networking, and high-performance storage.
- Experience with at least one major cloud provider's ML infrastructure offerings.
- Strong software engineering practices including testing, CI/CD, and code review.
- Excellent communication and cross-functional collaboration skills.
Preferred Qualifications
- Experience operating InfiniBand or RDMA networking at scale.
- Contributions to open-source ML infrastructure projects.
- Familiarity with custom orchestrators or research-grade training stacks.
- Exposure to frontier model training operations.
- Experience with FinOps for AI workloads.

