AI Infrastructure Engineer

$100k - $150k

Bright Vision Technologies

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications. As we continue to grow, we're looking for a skilled AI Infrastructure Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology. This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.

Location: 100% Remote (Continental United States)

Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)

Salary: $100K - $150K

Experience: 6+ years

Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.

Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)

Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap

Compensation: Competitive base salary commensurate with experience, plus benefits.

This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies, and it is part of the company's in-house Statement of Work (SOW) engagement. Bright Vision Technologies is the client, end customer, and employer for this position; no third-party client, vendor, or implementation partner is involved. We do not accept C2C, 1099, or third-party arrangements for this role, and we do not work with third-party brokers. Candidates must be willing to work directly as full-time W2 employees of Bright Vision Technologies and contribute to our in-house SOW deliverables.

No new H1B sponsorship is available for this role. Candidates who currently hold a valid H1B visa and require a transfer are welcome to apply, and we will support H1B transfers for qualified candidates.

A technical coding assessment is mandatory for every role. Please apply only if you are confident in your technical abilities and hands-on experience.

Job Summary

We are seeking an AI Infrastructure Engineer to design, build, and operate the platform layer that powers large-scale AI training and inference workloads. The role focuses on GPU clusters, distributed training frameworks, scheduling, storage performance, and developer experience for ML engineers and researchers, with strong emphasis on reliability, efficiency, and cost control. The ideal candidate has built or operated production AI infrastructure at scale, understands the interaction between hardware, kernel, scheduler, and ML framework, and brings strong software engineering discipline to platform work.

Key Responsibilities
  • Design and operate GPU and accelerator infrastructure for training and inference, spanning on-prem clusters, cloud-managed services, and hybrid configurations.
  • Build scheduling, queueing, and resource-sharing systems that maximize accelerator utilization across many teams.
  • Integrate frameworks such as PyTorch, JAX, DeepSpeed, FSDP, Megatron-LM, and Ray Train into a unified platform offering.
  • Operate high-performance storage systems and data pipelines that keep accelerators fed with training data at near line rate.
  • Design networking architectures supporting RDMA, InfiniBand, NCCL, and high-bandwidth collective communication.
  • Build observability for AI workloads including utilization, throughput, training stability, and failure-mode analytics.
  • Implement checkpointing, restart, and fault-tolerance patterns for long-running training jobs at scale.
  • Drive cost optimization across compute, storage, and networking through scheduling, spot capacity, and right-sizing.
  • Develop developer tooling and paved-road workflows that let researchers launch experiments safely and efficiently.
  • Partner with research and applied ML teams to plan capacity for upcoming training runs.
  • Implement security controls, isolation, and access management for multi-tenant AI infrastructure.
  • Drive automation across cluster provisioning, lifecycle management, and configuration enforcement.
  • Maintain runbooks, capacity dashboards, and operational documentation for the AI platform.
  • Stay current with AI infrastructure research, accelerator hardware, and emerging open-source AI tooling.

Required Qualifications

  • Bachelor's or Master's degree in Computer Science or a related field.
  • Six or more years of experience in infrastructure, platform, or HPC engineering.
  • Hands-on experience operating GPU clusters or large-scale ML training infrastructure.
  • Strong proficiency in Python and at least one systems language such as Go or C++.
  • Deep understanding of distributed training, accelerator architectures, and collective communication.
  • Experience with Kubernetes, Slurm, Ray, or similar scheduling systems for ML workloads.
  • Strong understanding of Linux internals, networking, and high-performance storage.
  • Experience with at least one major cloud provider's ML infrastructure offerings.
  • Strong software engineering practices including testing, CI/CD, and code review.
  • Excellent communication and cross-functional collaboration skills.

Preferred Qualifications

  • Experience operating InfiniBand or RDMA networking at scale.
  • Contributions to open-source ML infrastructure projects.
  • Familiarity with custom orchestrators or research-grade training stacks.
  • Exposure to frontier model training operations.
  • Experience with FinOps for AI workloads.
Vacancy posted 4 days ago