Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Infrastructure Engineer

$100k - $150k

Bright Vision Technologies

AI Infrastructure Engineer

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications. As we continue to grow, we're looking for a skilled AI Infrastructure Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology. This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.

Job Title: AI Infrastructure Engineer

Location: 100% Remote (Continental United States)

Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)

Experience: 6+ years

Salary Range: $100k to $150k per annum

Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.

Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)

Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap

Compensation: Competitive base salary commensurate with experience, plus benefits.

This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies. This role is part of Bright Vision Technologies' in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies — there is no third-party client, vendor, or implementation partner involved. We do not engage in C2C, 1099, or third-party arrangements for this role. BUT STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2 AND NO 3RD PARTY BROKERING PLEASE. Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables. No new H1B sponsorship is available for this role. However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates. For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.

Job Summary

We are seeking an AI Infrastructure Engineer to design, build, and operate the platform layer that powers large-scale AI training and inference workloads. The role focuses on GPU clusters, distributed training frameworks, scheduling, storage performance, and developer experience for ML engineers and researchers, with strong emphasis on reliability, efficiency, and cost control. The ideal candidate has built or operated production AI infrastructure at scale, understands the interaction between hardware, kernel, scheduler, and ML framework, and brings strong software engineering discipline to platform work.

Key Responsibilities
  • Design and operate GPU and accelerator infrastructure for training and inference, spanning on-prem clusters, cloud-managed services, and hybrid configurations.
  • Build scheduling, queueing, and resource-sharing systems that maximize accelerator utilization across many teams.
  • Integrate frameworks such as PyTorch, JAX, DeepSpeed, FSDP, Megatron-LM, and Ray Train into a unified platform offering.
  • Operate high-performance storage systems and data pipelines that keep accelerators fed with training data at near-line-rate.
  • Design networking architectures supporting RDMA, InfiniBand, NCCL, and high-bandwidth collective communication.
  • Build observability for AI workloads including utilization, throughput, training stability, and failure-mode analytics.
  • Implement checkpointing, restart, and fault-tolerance patterns for long-running training jobs at scale.
  • Drive cost optimization across compute, storage, and networking through scheduling, spot capacity, and right-sizing.
  • Develop developer tooling and paved-road workflows that let researchers launch experiments safely and efficiently.
  • Partner with research and applied ML teams to plan capacity for upcoming training runs.
  • Implement security controls, isolation, and access management for multi-tenant AI infrastructure.
  • Drive automation across cluster provisioning, lifecycle management, and configuration enforcement.
  • Maintain runbooks, capacity dashboards, and operational documentation for the AI platform.
  • Stay current with AI infrastructure research, accelerator hardware, and emerging open-source AI tooling.

Required Qualifications

  • Bachelor's or Master's degree in Computer Science or a related field.
  • Six or more years of experience in infrastructure, platform, or HPC engineering.
  • Hands-on experience operating GPU clusters or large-scale ML training infrastructure.
  • Strong proficiency in Python and at least one systems language such as Go or C++.
  • Deep understanding of distributed training, accelerator architectures, and collective communication.
  • Experience with Kubernetes, Slurm, Ray, or similar scheduling systems for ML workloads.
  • Strong understanding of Linux internals, networking, and high-performance storage.
  • Experience with at least one major cloud provider's ML infrastructure offerings.
  • Strong software engineering practices including testing, CI/CD, and code review.
  • Excellent communication and cross-functional collaboration skills.

Preferred Qualifications

  • Experience operating InfiniBand or RDMA networking at scale.
  • Contributions to open-source ML infrastructure projects.
  • Familiarity with custom orchestrators or research-grade training stacks.
  • Exposure to frontier model training operations.
  • Experience with FinOps for AI workloads.
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the AI Infrastructure Engineer in United States vacancy
  • $157.49k - $174.71k

     ...AI Infrastructure Engineer Intelligent Data Management: Use AI tools to analyze, map, and automate the data migration from the existing workflows and system Design modern, flexible data architectures, not locked to legacy patterns Leverage AI to detect... 
    Suggested
    Remote work
    Flexible hours

    General Dynamics

    United States
    23 hours ago
  • $170k - $210k

     ...AI Infrastructure Engineer Utilidata is a fast-growing AI company enabling AI data centers to dynamically orchestrate power and unlock more compute capacity from existing energy infrastructure. For over a decade, we have applied AI to the electric grid — bringing real... 
    Suggested
    Local area
    Remote work
    Flexible hours

    Utilidata

    United States
    2 days ago
  • $200k - $300k

     ...AI Training Infrastructure Engineer – Humanoid Whole Body Control San Jose, CA Figure is an AI Robotics company developing autonomous general-purpose humanoid robots. The goal of the company is to ship humanoid robots with human level intelligence. Its robots are... 
    Suggested
    Full time
    Work at office

    Figure

    San Jose, CA
    21 hours ago
  • $1,000 per month

     ...Join Elliptic's Ai Platform Team This is an opportunity to join Elliptic's AI Platform...  ...to help build the foundational infrastructure that will power how Elliptic's products...  ...and act. You will be one of the first engineers working on a centralised AI platform whose... 
    Suggested
    Remote work
    Home office

    Elliptic

    United States
    21 hours ago
  •  ...AI Infrastructure Engineer At 42dot, our AI Infrastructure Engineer manages the high-performance AI infrastructure orchestrating thousands of GPUs across multiple data centers. You will contribute to the scaling, monitoring, and operational optimization required to... 
    Suggested

    42dot

    United States
    1 day ago
  • Mercor is seeking talented Performance Engineers in Beaumont, Texas, to join their advanced AI Lab's GenAI team. This position requires deep expertise in low-level systems optimization, particularly in C++, Python, and Rust, with a focus on enhancing AI training and inference... 

    Mercor Inc

    Beaumont, TX
    3 days ago
  •  ...we partner with global logistics company leveraging AI, Machine Learning, and Data Engineering to optimize warehouse operations, predictive maintenance...  .... Role: Build and maintain scalable AI infrastructure, enabling teams to run ML experiments, deploy machine... 
    Long term contract
    Remote work

    Sphere Partners LLC

    United States
    18 hours ago
  •  ...Tribe is seeking an experienced engineer to deploy AI systems in Fortune 500 enterprises. You will work hands-on with cloud platforms such as AWS, GCP, or Azure and have strong Kubernetes experience. This role demands deep production debugging skills and the ability to... 
    Remote work

    Tribe

    United States
    12 hours ago
  • $150k - $200k

     ...AI Infrastructure Specialist As vCluster’s AI Infrastructure Specialist, you will work directly with customers at the earliest and most...  ...next customer’s head start. Feedback Loop: Collaborate with Engineering and Product to surface recurring infrastructure challenges,... 
    Remote work
    Flexible hours

    vCluster

    San Francisco, CA
    4 days ago
  • $60 per hour

     ...A leading AI development company is looking for proficient programmers to join their remote team. You will work on challenging coding tasks to train AI systems, with responsibilities including designing solutions, writing quality code, and evaluating AI-generated outputs... 
    Remote work

    DataAnnotation

    Wausau, WI
    2 days ago
  •  ...next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded...  ...We are seeking a DevOps / Platform Engineer to join our team building and operating large-scale GPU compute infrastructure that powers AI and ML workloads. The ideal candidate... 

    Advanced Micro Devices , Inc.

    San Jose, CA
    4 days ago
  • $140k - $252k

     ...screenshot-based VLM agents, with the larger goal of integrating with Tesla's broader AI ecosystem. We're seeking an ML/RL Infra Engineer to build scalable, reliable infrastructure that powers these agents and enables seamless, high-volume rollouts for model evaluation... 
    Hourly pay
    Full time
    Temporary work
    Flexible hours

    Tesla

    Palo Alto, CA
    3 days ago
  •  ...AI Infrastructure Engineer IV At ASI, we are revolutionizing industries with state-of-the-art autonomous robotics solutions. Within the fields of agriculture, construction, landscaping, and logistics, we deliver technologies that enhance safety, productivity, and efficiency... 
    Local area

    Autonomous Solutions

    Lehi, UT
    4 days ago
  • $60 per hour

     ...A leading AI development company seeks proficient programmers to engage in innovative tasks involving state-of-the-art AI models. Responsibilities include designing coding problems, writing high-quality code, and evaluating AI-generated outputs. This fully remote role... 
    Remote work
    Flexible hours

    DataAnnotation

    Lincoln, NE
    2 days ago
  • $60 per hour

    A leading AI development firm is seeking proficient programmers to join their team. This remote role allows for flexible scheduling, letting you choose your projects and work when it suits you. Responsibilities include solving coding challenges for AI training and providing... 
    Remote work
    Flexible hours

    DataAnnotation

    Wyoming, OH
    2 days ago
  • $163.5k - $212.4k

     ...flagship sedan, and the ET5, a mid-size smart electric sedan. About the Position We are looking for a senior AI Inference Infrastructure Software Engineer with strong hands-on experience building, optimizing, and deploying high-performance, scalable inference systems... 
    Full time
    Temporary work
    Immediate start
    Flexible hours

    NIO

    San Jose, CA
    3 days ago
  • $60 per hour

    A technology company is looking for proficient programmers to contribute to the development of AI systems. This remote position allows for a flexible schedule and offers competitive pay up to $60 per hour. Responsibilities include solving coding problems, writing code,... 
    Hourly pay
    Remote work
    Flexible hours

    DataAnnotation

    Rockwell, NC
    2 days ago
  •  ...Founders Fund–backed NVIDIA cloud partner building the infrastructure platform that powers AI at scale. We connect AI Factories—high-performance GPU...  ...onboarding. Your job is to change that. As an AI Infrastructure Engineer, you'll work directly with AI platform customers to get... 
    Remote work

    Slope

    New York, NY
    7 days ago
  • $60 per hour

    A growing AI development company is seeking proficient programmers to contribute to cutting-edge AI systems. This fully remote role allows flexibility in choosing projects and working hours, with competitive pay up to $60 per hour based on performance. Responsibilities... 
    Hourly pay
    Remote work

    DataAnnotation

    Boston, MA
    4 days ago
  •  ...AI Infrastructure Engineer At BNY, our culture allows us to run our company better and enables employees' growth and success. As a leading global financial services company at the heart of the global financial system, we influence nearly 20% of the world's investible... 
    Work experience placement
    Worldwide
    Flexible hours

    BNY

    Lake Mary, FL
    1 day ago
  • $100k - $150k

     ...technologies to create scalable, secure, and user-friendly applications. As we continue to grow, we’re looking for a skilled AI Infrastructure Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology. This is... 
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Edison, NJ
    4 days ago
  •  ...HTEC Group is hiring for a software development position focused on next-generation AI compute platforms. You will design and implement software components across various stacks while collaborating with compiler developers and ML scientists. Candidates should have at... 

    HTEC Group Inc

    New York, NY
    2 days ago
  •  ...transform critical institutions with applied AI. We care that industries that power the...  ...bring: Forward-deployed expertise in engineering, product, and research Mosaic, our in...  ...About the role We're hiring an AI Infrastructure Engineer to own the infrastructure,... 
    Contract work

    Percepta

    New York, NY
    2 days ago
  •  ...AI Engineer The AI Engineer will design, develop, and deploy scalable machine learning and AI-driven analytics capabilities. Responsibilities include multi-source data fusion, entity resolution and behavioral modeling, predictive and prescriptive intelligence analytics... 
    Remote work

    Navstar

    United States
    1 day ago
  • $124k - $420k

     ...What to Expect As a Software Engineer for the Optimus team, you will build the tools and infrastructure to make and measure improvements to neural network architecture, visualize data, assist with exporting and deploying neural networks to the bot, and evaluate experimental... 
    Hourly pay
    Full time
    Temporary work
    Flexible hours

    Tesla

    Palo Alto, CA
    2 days ago
  • €66.5k - €104.5k per year

     ...Sword Health is shifting healthcare from human-first to AI-first through its AI Care platform, making world-class healthcare...  ...Capital, and Founders Fund. As a Senior AI Infrastructure Engineer at Sword Health, you will own the infrastructure that brings... 
    Remote work
    Worldwide
    Flexible hours
    Shift work

    Phoenix Court Group

    New Bremen, OH
    4 days ago
  •  ...TetraScience is the Scientific Data and AI company. We are catalyzing the Scientific...  ...players in compute, cloud, data, and AI infrastructure have converged on TetraScience as the de...  ...We’re looking for a Senior AI Platform Engineer to help design, build, and scale our AI... 
    Immediate start
    Remote work
    Flexible hours

    TetraScience

    New York, NY
    2 days ago
  • $190k - $270k

     ...AI Infrastructure Engineer As an AI Infrastructure Engineer at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering... 
    Full time
    Work experience placement

    Together AI

    San Francisco, CA
    2 days ago
  •  ...Job Description A healthcare client is looking for a AI Infrastructure Engineer to sit fully remote. This person is going to be supporting a large scale initiative for an AI-Powered consumer health platform that is designed to give people a more connected and personalized... 
    Remote work

    Insight Global

    Hartford, CT
    5 days ago
  • $151.8k

     ...AI Infrastructure Engineer We are seeking an experienced AI Infrastructure Engineer to join our AI Incubation team. You will be focused on building and optimizing large-scale training infrastructure for Large Language Models (LLMs). The ideal candidate will combine... 
    Work at office
    Remote work

    Zoom Video Communications

    Seattle, WA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Infrastructure Engineer. Be the first to apply!