Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Infrastructure Engineer

$100k - $150k

Bright Vision Technologies

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications.

As we continue to grow, we’re looking for a skilled AI Infrastructure Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology.

This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.

 
AI Infrastructure Engineer
Job Title: AI Infrastructure Engineer
Location: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Experience: 6+ years
Salary Range: 100k$/Annum-150k$/Annum
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits.
Employment Terms & Visa Policy
This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies.
This role is part of Bright Vision Technologies’ in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies — there is no third-party client, vendor, or implementation partner involved.
We do not engage in C2C, 1099, or third-party arrangements for this role.
BUT STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2 AND NO 3RD PARTY BROKERING PLEASE.
Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.
No new H1B sponsorship is available for this role.
However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.
For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.
Job Summary
We are seeking an AI Infrastructure Engineer to design, build, and operate the platform layer that powers large-scale AI training and inference workloads. The role focuses on GPU clusters, distributed training frameworks, scheduling, storage performance, and developer experience for ML engineers and researchers, with strong emphasis on reliability, efficiency, and cost control. The ideal candidate has built or operated production AI infrastructure at scale, understands the interaction between hardware, kernel, scheduler, and ML framework, and brings strong software engineering discipline to platform work.
Key Responsibilities
  • Design and operate GPU and accelerator infrastructure for training and inference, spanning on-prem clusters, cloud-managed services, and hybrid configurations.
  • Build scheduling, queueing, and resource-sharing systems that maximize accelerator utilization across many teams.
  • Integrate frameworks such as PyTorch, JAX, DeepSpeed, FSDP, Megatron-LM, and Ray Train into a unified platform offering.
  • Operate high-performance storage systems and data pipelines that keep accelerators fed with training data at near-line-rate.
  • Design networking architectures supporting RDMA, InfiniBand, NCCL, and high-bandwidth collective communication.
  • Build observability for AI workloads including utilization, throughput, training stability, and failure-mode analytics.
  • Implement checkpointing, restart, and fault-tolerance patterns for long-running training jobs at scale.
  • Drive cost optimization across compute, storage, and networking through scheduling, spot capacity, and right-sizing.
  • Develop developer tooling and paved-road workflows that let researchers launch experiments safely and efficiently.
  • Partner with research and applied ML teams to plan capacity for upcoming training runs.
  • Implement security controls, isolation, and access management for multi-tenant AI infrastructure.
  • Drive automation across cluster provisioning, lifecycle management, and configuration enforcement.
  • Maintain runbooks, capacity dashboards, and operational documentation for the AI platform.
  • Stay current with AI infrastructure research, accelerator hardware, and emerging open-source AI tooling.
Required Qualifications
  • Bachelor’s or Master’s degree in Computer Science or a related field.
  • Six or more years of experience in infrastructure, platform, or HPC engineering.
  • Hands-on experience operating GPU clusters or large-scale ML training infrastructure.
  • Strong proficiency in Python and at least one systems language such as Go or C++.
  • Deep understanding of distributed training, accelerator architectures, and collective communication.
  • Experience with Kubernetes, Slurm, Ray, or similar scheduling systems for ML workloads.
  • Strong understanding of Linux internals, networking, and high-performance storage.
  • Experience with at least one major cloud provider’s ML infrastructure offerings.
  • Strong software engineering practices including testing, CI/CD, and code review.
  • Excellent communication and cross-functional collaboration skills.
Preferred Qualifications
  • Experience operating InfiniBand or RDMA networking at scale.
  • Contributions to open-source ML infrastructure projects.
  • Familiarity with custom orchestrators or research-grade training stacks.
  • Exposure to frontier model training operations.
  • Experience with FinOps for AI workloads.
How to Apply
Would you like to know more about this opportunity?
For immediate consideration, please send your resume to View email address on click.appcast.io or contact us at View phone number on click.appcast.io. Learn more about Bright Vision Technologies at
We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company.
We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs.
Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.
Position offered by “No Fee Agency.”

 

Equal Employment Opportunity (EEO) Statement

Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall.

BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the AI Infrastructure Engineer in Gaithersburg, MD vacancy
  •  ...Job Title: Generative AI Cloud Operations Engineer - Evinova Location: Gaithersburg, MD At AstraZeneca, we pride ourselves on crafting...  ...broader cloud resources, automating operations through infrastructure-as-code and CI/CD pipelines, and ensure best-in-class operations... 
    Suggested
    Hourly pay
    Temporary work
    Work at office
    Relocation
    3 days per week

    AstraZeneca

    Gaithersburg, MD
    3 days ago
  • $100k - $150k

     ...technologies to create scalable, secure, and user-friendly applications. As we continue to grow, we’re looking for a skilled AI Data Infrastructure Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology. This... 
    Suggested
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Rockville, MD
    1 day ago
  • $150k - $183k

     ...DescriptionThis role is responsible for designing, implementing, and maintaining the cloud infrastructure and CI/CD systems that power X-energy's AI-native application platform (APEX). The DevOps Engineer will work with the AI and Application Development team to accelerate... 
    Suggested
    Work at office

    X energy LLC

    Rockville, MD
    2 days ago
  • $150k - $183k

    Alumni Ventures in Rockville, MD is seeking a highly skilled DevOps Engineer to design and maintain cloud infrastructure and CI/CD systems for their AI-native application platform. The role requires expertise in Docker, AWS, and GitLab CI/CD, with responsibilities including... 
    Suggested
    Work at office
    3 days per week

    Alumni Ventures

    Rockville, MD
    1 day ago
  • $150k - $183k

    X energy LLC is hiring a DevOps Engineer in Rockville, MD, to manage cloud infrastructure and CI/CD systems for its AI-native platform. This position requires a strong background in Docker, AWS, and CI/CD practices to support deployment efforts and operational excellence... 
    Suggested
    3 days per week

    X energy LLC

    Rockville, MD
    2 days ago
  • $100k - $150k

     ...applications. As we continue to grow, we’re looking for a skilled AI Security Engineer to join our dynamic team and contribute to our mission of...  ...defenses for both AI-powered applications and the AI infrastructure that supports them. Key Responsibilities Define and... 
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Gaithersburg, MD
    5 days ago
  •  ...candidate to join our talented Team. Job Title: AI Security Engineer. Location: Gaithersburg, MD. Job Responsibilities:...  ...deployment) Maintaining AZ's internal model scanning pipeline infrastructure and ensuring continued appropriate ingestion in compliance... 
    Remote work

    Ampcus

    Gaithersburg, MD
    4 days ago
  • A technology and engineering company is seeking an AI Software Developer to join its Air Traffic Business Area in Gaithersburg, MD. Responsible for designing and implementing microservices-based applications, the candidate should have at least 2 years of experience, a... 

    Leidos

    Gaithersburg, MD
    4 days ago
  •  ...Analytica is seeking a mid-Level AI-ML Engineer to join our growing data and AI team and help design, build, and deploy production-grade AI solutions that make a real-world impact. In this role, you will work hands-on with modern Microsoft Azure services and AI/ML technologies... 
    Full time
    For contractors
    Local area

    Analytica

    Rockville, MD
    13 hours ago
  •  ...Analytica is seeking a  Senior AI-ML Engineer (Azure) with deep Microsoft Azure ecosystem experience combining a blend of high-level architectural design skills, deep proficiency in Azure’s specialized AI toolset, and a mastery of the end-to-end Machine Learning Lifecycle... 
    Full time
    For contractors
    Work at office
    Local area

    Analytica

    Rockville, MD
    13 hours ago
  • $107.9k - $195.05k

     ...Digital Modernization sector is seeking an experienced Senior AI/ML Engineer to support the delivery, enhancement, and adoption of...  ...pipelines. Support development and maintenance of model serving infrastructure and scalable inference capabilities. Implement model... 
    Local area
    Immediate start

    Leidos

    Gaithersburg, MD
    2 days ago
  •  ...C2C details Position: Backend SW Engineer - Microservices (Java/Python, Cloud, APIs) Location: Gaithersburg, MD - Hybrid...  ...applications within an Agile/SAFe environment. Experience with AI-enabled or data-driven systems is advantageous. This position supports... 
    Contract work

    3B Staffing LLC

    Gaithersburg, MD
    3 days ago
  • A defense technology company is seeking a Senior AI Integration Engineer in Gaithersburg, Maryland, to enhance enterprise data and analytics products within the Department of War. This role involves implementing AI solutions, designing APIs, and ensuring integration across... 

    Leidos

    Gaithersburg, MD
    1 day ago
  •  ...continue to grow, we’re looking for a skilled Cloud Networking Engineer to join our dynamic team and contribute to our mission of...  ...Networking Engineer to design, deploy, and operate cloud networking infrastructure across one or more major cloud providers. The role covers VPC... 
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Gaithersburg, MD
    4 days ago
  • $100k - $150k

     ...and cross-cloud authorization patterns that let workloads and engineers move between providers without compromising on least-...  ...connectivity, and zero-trust patterns. Establish reusable infrastructure-as-code patterns that abstract cloud-specific implementations... 
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Gaithersburg, MD
    4 days ago
  • $131.3k - $237.35k

     ...sector is seeking an experienced Platform Engineer to support the delivery, enhancement,...  ...intersection of data, analytics, and emerging AI technologies. Ideal candidates are...  ...Assurance Builds and manages the underlying infrastructure, platforms, and tooling that support... 
    Local area
    Immediate start

    Leidos

    Gaithersburg, MD
    13 hours ago
  • $100k - $150k

     ...we continue to grow, we’re looking for a skilled Azure Cloud Engineer to join our dynamic team and contribute to our mission of transforming...  ...-to-end cloud engineering lifecycle, including architecture, infrastructure-as-code, automation, security hardening, cost optimization,... 
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Gaithersburg, MD
    13 hours ago
  • $100k - $150k

     ...we continue to grow, we’re looking for a skilled PLM Platform Engineer (Windchill / Teamcenter) to join our dynamic team and...  ...operating PLM on cloud platforms (AWS, Azure, OCI). Exposure to infrastructure-as-code for PLM environments. Familiarity with CI/CD patterns... 
    Full time
    Fixed term contract
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Gaithersburg, MD
    5 days ago
  • $155.66k - $225.16k

     ...with one place to chat, explore and build with a wide variety of AI language models (bots), including o3, o4-mini, Claude 3.7 Sonnet...  ...the Team and Role: We’re hiring our first AI Automation Engineer to lead how we apply AI internally across the company. This is... 
    Remote job
    Full time
    Shift work

    Quora

    Gaithersburg, MD
    13 hours ago
  •  ...leading technology firm in Maryland is seeking a Senior Systems Engineer for the Chinook Program. The ideal candidate will excel in...  ...Responsibilities include managing configurations, supporting AWS infrastructure, and participating in Agile processes. The position requires... 

    Leidos

    Gaithersburg, MD
    1 day ago
  • A leading government solutions provider is seeking a Senior Software Engineer in Gaithersburg, MD. The role involves designing and developing high-quality software solutions, working with cross-functional teams, and implementing scalable applications. Candidates should... 

    Chenega Corporation

    Gaithersburg, MD
    3 days ago
  • Koitecc Solutions is seeking a Senior Infrastructure Engineer to support mission-critical data and analytics programs. This role involves designing...  ...infrastructure necessary for delivering reliable, scalable AI and analytics capabilities. The ideal candidate has a strong... 

    Koitecc Solutions

    Gaithersburg, MD
    4 days ago
  • $87.1k - $157.45k

     ...Modernization sector is seeking an experienced Journeyman Infrastructure Engineer to support the delivery, enhancement, and adoption of enterprise...  ...at the intersection of data, analytics, and emerging AI technologies. Ideal candidates are motivated by mission impact... 
    Local area
    Immediate start

    Leidos

    Gaithersburg, MD
    5 days ago
  • $107.9k - $195.05k

     ...Digital Modernization sector is seeking an experienced Senior Infrastructure Engineer to support the delivery, enhancement, and adoption of...  ...program at the intersection of data, analytics, and emerging AI technologies. Ideal candidates are motivated by mission impact... 
    Local area
    Immediate start

    Leidos

    Gaithersburg, MD
    13 hours ago
  • $131.3k - $237.35k

     ...seeking an experienced SME Cloud Operations Engineer to support the delivery, enhancement,...  ...of data, analytics, and emerging AI technologies. Ideal candidates are motivated...  ...services. Design and optimize cloud infrastructure architectures to ensure scalability, resilience... 
    Local area
    Immediate start

    Leidos

    Gaithersburg, MD
    3 days ago
  • $87.1k - $157.45k

     ...experienced Journeyman Cloud Operations Engineer to support the delivery, enhancement,...  ...intersection of data, analytics, and emerging AI technologies. Ideal candidates are...  ...Support operation and maintenance of cloud infrastructure environments (e.g., AWS, Azure, or GCP)... 
    Local area
    Immediate start

    Leidos

    Gaithersburg, MD
    1 day ago
  •  ...Redport IA, LLC is seeking a Senior System Security Engineer to take ownership of securing critical systems in a high-impact environment. This hands-on role allows you to thrive independently while driving meaningful security initiatives. The position offers top-tier... 
    Remote work

    Redport IA, LLC

    Gaithersburg, MD
    1 day ago
  • $90k - $138k

    Enroute Computer Solutions in Gaithersburg, Maryland is looking for a Software Engineer to support the FAA/Leidos Common Automation Platform. The role involves designing and implementing microservices-based applications in an Agile environment. Candidates should have proficiency... 
    Full time

    Enroute Computer Solutions

    Gaithersburg, MD
    4 days ago
  • $100k - $150k

     ...continue to grow, we’re looking for a skilled Senior Backend Engineer (High-Throughput Platforms) to join our dynamic team and contribute...  ...Design and build internal platform services and shared infrastructure that hundreds of engineers depend on every day, with explicit... 
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Gaithersburg, MD
    13 hours ago
  • A leading IT services firm is seeking an experienced DevOps Engineer to assist with the development and deployment of mission-critical systems. The role includes supporting microservices deployment via a DevSecOps pipeline and developing CI/CD processes. Candidates should... 

    Bailey Information Technology Consultants, LLC

    Gaithersburg, MD
    13 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Infrastructure Engineer. Be the first to apply!