AI Infrastructure Engineer
$100k - $150kBright Vision Technologies
AI Infrastructure Engineer
Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications. As we continue to grow, we're looking for a skilled AI Infrastructure Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology. This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.
Location: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Salary: $100K - $150K
Experience: 6+ years
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits.
This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies. This role is part of Bright Vision Technologies' in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies — there is no third-party client, vendor, or implementation partner involved. We do not engage in C2C, 1099, or third-party arrangements for this role. BUT STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2 AND NO 3RD PARTY BROKERING PLEASE. Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables. No new H1B sponsorship is available for this role. However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates. For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.
Job Summary
We are seeking an AI Infrastructure Engineer to design, build, and operate the platform layer that powers large-scale AI training and inference workloads. The role focuses on GPU clusters, distributed training frameworks, scheduling, storage performance, and developer experience for ML engineers and researchers, with strong emphasis on reliability, efficiency, and cost control. The ideal candidate has built or operated production AI infrastructure at scale, understands the interaction between hardware, kernel, scheduler, and ML framework, and brings strong software engineering discipline to platform work.
Key Responsibilities
- Design and operate GPU and accelerator infrastructure for training and inference, spanning on-prem clusters, cloud-managed services, and hybrid configurations.
- Build scheduling, queueing, and resource-sharing systems that maximize accelerator utilization across many teams.
- Integrate frameworks such as PyTorch, JAX, DeepSpeed, FSDP, Megatron-LM, and Ray Train into a unified platform offering.
- Operate high-performance storage systems and data pipelines that keep accelerators fed with training data at near-line-rate.
- Design networking architectures supporting RDMA, InfiniBand, NCCL, and high-bandwidth collective communication.
- Build observability for AI workloads including utilization, throughput, training stability, and failure-mode analytics.
- Implement checkpointing, restart, and fault-tolerance patterns for long-running training jobs at scale.
- Drive cost optimization across compute, storage, and networking through scheduling, spot capacity, and right-sizing.
- Develop developer tooling and paved-road workflows that let researchers launch experiments safely and efficiently.
- Partner with research and applied ML teams to plan capacity for upcoming training runs.
- Implement security controls, isolation, and access management for multi-tenant AI infrastructure.
- Drive automation across cluster provisioning, lifecycle management, and configuration enforcement.
- Maintain runbooks, capacity dashboards, and operational documentation for the AI platform.
- Stay current with AI infrastructure research, accelerator hardware, and emerging open-source AI tooling.
Required Qualifications
- Bachelor's or Master's degree in Computer Science or a related field.
- Six or more years of experience in infrastructure, platform, or HPC engineering.
- Hands-on experience operating GPU clusters or large-scale ML training infrastructure.
- Strong proficiency in Python and at least one systems language such as Go or C++.
- Deep understanding of distributed training, accelerator architectures, and collective communication.
- Experience with Kubernetes, Slurm, Ray, or similar scheduling systems for ML workloads.
- Strong understanding of Linux internals, networking, and high-performance storage.
- Experience with at least one major cloud provider's ML infrastructure offerings.
- Strong software engineering practices including testing, CI/CD, and code review.
- Excellent communication and cross-functional collaboration skills.
Preferred Qualifications
- Experience operating InfiniBand or RDMA networking at scale.
- Contributions to open-source ML infrastructure projects.
- Familiarity with custom orchestrators or research-grade training stacks.
- Exposure to frontier model training operations.
- Experience with FinOps for AI workloads.
- ...transform critical institutions with applied AI. We care that industries that power the... ...We bring: Forward-deployed expertise in engineering, product, and research Mosaic, our in-... .... About the role We're hiring an AI Infrastructure Engineer to own the infrastructure, deployment...SuggestedContract work
- ...A leading autonomous robotics firm is seeking an AI Infrastructure Engineer IV to design and maintain systems for AI and machine learning capabilities. You'll collaborate with various engineering teams to optimize cloud infrastructure for high-performance workloads. Ideal...Suggested
- ...A leading AI research firm in San Francisco seeks a Staff Infrastructure Engineer to identify and resolve infrastructure bottlenecks and design large-scale systems for AI training. The ideal candidate has over 3 years of experience in infrastructure engineering and strong...Suggested
- ...automated, and intelligent, the internal tools that power our engineering and business operations must embody those same principles to... ...stay focused on the mission. We are looking for a Senior AI Infrastructure Engineer to design, build, and deploy AI‑powered tooling across...SuggestedPermanent employmentFull time
- ...AI Infrastructure Engineer At BNY, our culture allows us to run our company better and enables employees’ growth and success. As a leading global financial services company at the heart of the global financial system, we influence nearly 20% of the world’s investible...SuggestedWork experience placementWorldwideFlexible hours
$170k - $210k
...Utilidata is a fast‑growing NVIDIA‑backed AI company enabling AI data centers to... ...more compute capacity from existing energy infrastructure. For over a decade, we have applied AI to... ...to them. The AI Infrastructure Engineer is responsible for designing, building,...Local areaRemote workFlexible hours- ...Founders Fund–backed NVIDIA cloud partner building the infrastructure platform that powers AI at scale. We connect AI Factories—high-performance GPU... ...onboarding. Your job is to change that. As an AI Infrastructure Engineer, you'll work directly with AI platform customers to get...Remote work
$190k - $270k
...About the Role As an AI Infrastructure Engineer at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering principles, operational...Full timeWork experience placement$80 - $90 per hour
...AI Infrastructure Engineer (Cloud-Native AI Platform | AWS/Terraform) South San Francisco, CA (3 days/week onsite preferred) – Remote possible (West Coast Time-Zone) | 6-month initial contract (potential long-term) Pay: $80-$90/hour, based on experience Overview Help...Contract workRemote work3 days per week- ...ABOUT YOU We are seeking a hands‑on and forward‑thinking AI Infrastructure Engineer to help build and operate the intelligent systems that power Xsolla's infrastructure. As part of our Infrastructure Team, you will implement AI‑driven solutions across cloud optimization...Shift work
$287.8k - $328.5k
...Distinguished AI Engineer (Agentic AI Platform Infrastructure) At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized...Local area- ...next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded... ...We are seeking a DevOps / Platform Engineer to join our team building and operating large-scale GPU compute infrastructure that powers AI and ML workloads. The ideal candidate...
- ...technologies to create scalable, secure, and user-friendly applications. As we continue to grow, we're looking for a skilled AI Infrastructure Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology. This is...Full timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa
$105.9k - $180k
...into R&D. Our expert teams of physicists, engineers, data scientists and problem-solvers... ...Qualifications Lead the Future of AI in Semiconductor Manufacturing Join... ...orchestration (Kubernetes, Docker), and infrastructure automation (Terraform). *Experience deploying...Minimum wageWork experience placementFlexible hours- ...Job Title: AI Infrastructure Engineer Location: Remote, USA Job Description This role focuses on managing and optimizing our AI infrastructure, ensuring seamless operations, and providing guidance and training to our team members. The ideal candidate will...Remote work
$180k - $240k
...facilitating effortless integration into customers' logistics operations. About the role We are seeking a Senior AI Infrastructure Engineer to design, build, and scale the high-performance AI platform powering our autonomous driving models. While researchers focus...Odd jobWork at office$157.49k - $174.71k
...AI Infrastructure Engineer Intelligent Data Management: Use AI tools to analyze, map, and automate the data migration from the existing workflows and system Design modern, flexible data architectures, not locked to legacy patterns Leverage AI to detect...Remote workFlexible hours$190k - $270k
...AI Infrastructure Engineer As an AI Infrastructure Engineer at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering...Full timeWork experience placement$144k - $198k
...ADI ensures today's innovators stay Ahead of What's Possible™. Learn more at and on LinkedIn and Twitter (X). Senior AI Infrastructure Engineer, Developer Experience Analog Devices, Inc. (NASDAQ: ADI) is a global semiconductor leader that bridges the physical...Permanent employmentWork at officeShift workDay shift- ...AI Infrastructure Engineer Spellbrush, the world's leading generative AI studio behind nijijourney, is looking for an AI Infrastructure Engineer to join us in building out end-to-end ML infrastructure to run our models on all platforms. What You'll Do Design...Work at officeVisa sponsorship
- ...Bandwidth Recruitment is looking for a Sr. Software Developer (Infrastructure) to help build the platform and tools that enable engineers to ship better software faster. You will design and operate Bandwidth's AI infrastructure layer while helping build systems that...
$1,000 per month
...Join Elliptic's Ai Platform Team This is an opportunity to join Elliptic's AI Platform... ...to help build the foundational infrastructure that will power how Elliptic's products... ...and act. You will be one of the first engineers working on a centralised AI platform whose...Remote workHome office$60 per hour
...A technology company working on AI is seeking proficient programmers. Work from anywhere with a flexible schedule and earn up to $60/hour. Responsibilities include designing coding problems, writing quality code, evaluating AI-generated code, and contributing feedback...Remote workFlexible hours- ...A leading tech firm in Austin, Texas, is seeking an AI Engineer to develop and deploy AI/ML solutions integrating with operational workflows. The role involves collaborating with cross-functional teams to enhance roadway safety and operational efficiency. Candidates should...
- ...A technology company is seeking proficient programmers to contribute to AI development remotely. You will design coding tasks, evaluate AI code, and help shape future technologies while enjoying a flexible schedule. Ideal candidates possess fluency in English and are...Hourly payRemote workFlexible hours
- ...NOVA Corporation is seeking a Low Code AI Engineer to support the International Trade Administration by developing and maintaining AI solutions. This role requires expertise in machine learning and cloud environments, with a focus on automating and deploying AI models....Remote work
- ...A leading technology organization is seeking a Software Engineer 3 - Contingent to support AI and cloud platform initiatives. This role involves consulting on software engineering projects, developing Python microservices, and deploying solutions on platforms like GCP....Contract work
$190k - $270k
AI Chopping Block, Inc. in San Francisco is seeking an AI Infrastructure Engineer to maintain user-facing services and production systems. The role involves building and managing infrastructure with tools like Ansible and Kubernetes, ensuring reliability and scalability...- ...skilled Unix System Administrator to enhance the performance of AI systems built on the NVIDIA AI Enterprise platform. In this... ..., and develop automation scripts, playing a crucial part in AI infrastructure. The position offers a hybrid schedule: 3 days in office and 2...Work at officeRemote work
$126k - $423k
Decisive Point is seeking a Research Engineer (AI/RL Infrastructure) in Sunnyvale, California to design and operate large-scale ML systems. You will collaborate with leading experts and contribute to next-generation physical AI, impacting self-driving technologies. This...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Infrastructure Engineer. Be the first to apply!
- ai research engineer United States
- ai developer United States
- ai prompt engineer United States
- ai engineer United States
- senior ai engineer United States
- ai ml engineer United States
- ai engineer remote United States
- machine learning ai engineer United States
- entry level infrastructure engineer United States
- security infrastructure engineer United States


