Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

GPU Systems Engineer (CUDA)

Bright Vision Technologies

GPU Systems Engineer (CUDA)

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications. As we continue to grow, we're looking for a skilled GPU Systems Engineer (CUDA) to join our dynamic team and contribute to our mission of transforming business processes through technology. This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.

Job Title: GPU Systems Engineer (CUDA) Location: 100% Remote (Continental United States) Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor) Salary: 100k to 150k Experience: 6+ years Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates. Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party) Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap Compensation: Competitive base salary commensurate with experience, plus benefits.

Job Summary: We are seeking a GPU Systems Engineer with deep expertise in CUDA programming, GPU architecture, and high-performance computing to design and optimize compute-intensive workloads on modern accelerator hardware. This role focuses on extracting maximum performance from GPU platforms for AI training, inference, scientific computing, and high-throughput data processing workloads. The ideal candidate combines low-level systems mastery with strong software engineering practices, and has a track record of delivering measurable performance improvements on production GPU systems. In this role you will work closely with cross-functional partners — product, design, engineering, operations, and business stakeholders — to translate ambiguous requirements into well-engineered solutions, and will be expected to raise the bar through code review, design review, and mentorship of more junior engineers. The successful candidate brings strong engineering discipline, a clear communication style, and a track record of shipping meaningful work that holds up well in production.

  • Design and implement high-performance CUDA kernels for compute-intensive workloads across AI and HPC use cases.
  • Profile and optimize GPU code using tools such as Nsight Systems, Nsight Compute, and CUDA profilers.
  • Tune memory access patterns, occupancy, register usage, and shared memory utilization for peak performance.
  • Develop highly optimized libraries for linear algebra, attention, and other ML primitives.
  • Optimize multi-GPU and multi-node training using NCCL, RDMA, and high-performance networking.
  • Implement custom operators and fused kernels in PyTorch, JAX, or Triton.
  • Collaborate with ML engineers to identify performance bottlenecks in training and inference pipelines.
  • Develop benchmarks and regression tests to safeguard performance over time.
  • Evaluate new GPU architectures and feature sets, and advise on adoption strategy.
  • Contribute to compiler-level optimizations for tensor programs where appropriate, working at the boundary between ML frameworks and underlying accelerator codegen to unlock performance not reachable through framework-level tuning alone.
  • Optimize memory hierarchy usage across HBM, L2, shared memory, and registers.
  • Implement mixed-precision and quantized compute paths that maximize accelerator throughput while preserving numerical fidelity within bounds acceptable for the target workloads.
  • Document performance characteristics, design decisions, and tuning playbooks for internal teams.
  • Stay current with GPU architecture, CUDA evolution, and emerging accelerator technologies.

Required Qualifications:

  • Bachelor's or Master's degree in Computer Science, Computer Engineering, or a related field.
  • Six or more years of experience in GPU programming and performance engineering.
  • Deep expertise in CUDA C/C++ and GPU programming models.
  • Strong understanding of modern GPU architectures, memory hierarchies, and execution models.
  • Hands-on experience profiling and optimizing GPU workloads in production.
  • Familiarity with NCCL, MPI, and high-performance interconnect technologies.
  • Experience integrating custom kernels into ML frameworks.
  • Strong C++ skills and familiarity with modern systems programming practices.
  • Solid grounding in linear algebra and numerical methods.
  • Strong communication and collaboration skills with research and engineering teams.

Preferred Qualifications:

  • Experience with Triton, CUTLASS, or other GPU kernel authoring frameworks.
  • Familiarity with TensorRT, FasterTransformer, or vLLM internals.
  • Exposure to compiler infrastructure such as LLVM or MLIR.
  • Open-source contributions to GPU or ML performance libraries.
  • Experience with large-scale distributed training infrastructure.

How to Apply: Would you like to know more about this opportunity? For immediate consideration, please send your resume to View email address on click.appcast.io or contact us at View phone number on click.appcast.io. Learn more about Bright Vision Technologies at

Bright Vision Technologies is an equal opportunity employer and places a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs. Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.

Vacancy posted 11 hours ago
Similar jobs that could be interesting for youBased on the GPU Systems Engineer (CUDA) in United States vacancy
  • $100k - $150k

     ...technologies to create scalable, secure, and user-friendly applications. As we continue to grow, we’re looking for a skilled GPU Systems Engineer (CUDA) to join our dynamic team and contribute to our mission of transforming business processes through technology. This is... 
    Suggested
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Duluth, GA
    5 days ago
  • $200k - $300k

     ...outcomes as our own, taking pride in the systems we build and the trust we earn. If you’...  ...what's next. About the Role As a System Engineer, GPU Fleet, you will manage, operate, and...  ...workloads Deep knowledge of GPU architecture, CUDA toolkit, GPU drivers, monitoring tools (... 
    Suggested
    Local area

    Fluidstack

    Seattle, WA
    11 hours ago
  • $181k - $248.5k

     ...exploration across our solar system. Its mission is to make planetary...  ...About the Role: Own the GPU compute environment for a space...  ...Computer Science or Electrical Engineering and 5+ years of relevant...  ...programming and compute frameworks — CUDA, ROCm, or OpenCL — with real... 
    Suggested
    Shift work

    Relativity Space

    Long Beach, CA
    26 days ago
  • $160k - $230k

     ...Systems Research Engineer, GPU Programming San Francisco About the Role As a Systems Research Engineer specialized in GPU Programming, you...  ...background in GPU programming and parallel computing, such as CUDA and/or Triton. Knowledge of ML/AI applications and... 
    Suggested
    Full time
    Remote work

    Together AI

    San Francisco, CA
    2 days ago
  •  ...Requisition ID: 2684 Standard Title: Senior GPU Systems Engineer Required Security Clearance: Top Secret/SCI Location: Bethesda, MD Work Type: On-Site Shift: First Referral Eligibility: Eligible U.S. Citizenship Required? Yes Position... 
    Suggested
    Hourly pay
    Contract work
    Temporary work
    Immediate start
    Flexible hours
    Shift work

    Base2 Solutions

    Bethesda, MD
    11 hours ago
  •  ...Position: Software Engineer - GPU, C++, OpenCL, CUDA Location: Waukesha, WI (Onsite) Exp: 5 - 9 yrs Key Skills: GPU, C++, OpenCL, CUDA, OneAPI, Matlab Only USC / GC Job Requirements The CT Program is working on upgrading CT scanners used worldwide. The... 
    Work experience placement
    Worldwide

    Hudson Manpower

    Waukesha, WI
    2 days ago
  • $160k - $320k

     ...deliver excellence.  We seek engineers/researchers with strong...  ...the Role We’re looking for a systems engineer with HPC or parallel...  ...performance systems to optimize GPU performance at the bleeding edge...  ...or LA offices Tech Stack CUDA/C++, GPGPU, Python, Linux Key... 
    Full time
    Work at office

    Vast.ai Inc.

    San Francisco, CA
    more than 2 months ago
  • $200k - $300k

     ...Hudson River Trading (HRT) is looking for GPU Systems Engineers to help scale and evolve our exceptionally sophisticated HPC/AI research environment...  ...in Python scripting and automation frameworks  ~ CUDA or C/C++ experience is a plus Experience with NVIDIA technologies... 
    Work at office

    Hudson River Trading

    Washington DC
    more than 2 months ago
  • $195k - $255k

     ...Senior Systems Engineer (Virtualization / GPU Infrastructure) Laurel, MD We're seeking a Senior Systems Engineer to support our U.S. Government program(s). The Senior Systems Engineer will be responsible for designing, configuring, maintaining, and troubleshooting... 
    Immediate start
    Remote work
    Flexible hours

    EnDepth Solutions LLC

    Laurel, MD
    3 days ago
  • $195.2k - $292.8k

     ...Company: Qualcomm Technologies, Inc. Job Area: Engineering Group, Engineering Group GPU ASICS Engineering General Summary: GPU System Driver Team are looking for talented software engineers to develop in-house GPU drivers to verify GPU function... 
    Work experience placement

    Qualcomm

    Nacogdoches, TX
    1 day ago
  • $170k - $300k

     ...building large in-house AI/ML infrastructure. Built by engineers, for engineers. From large-scale GPU orchestration to inference optimization, we own the...  ...and AI R&D. We are looking for a Lead Software Systems Engineer - GPU Performance to play a key role in building... 
    Temporary work
    Immediate start
    Remote work

    Nebius

    United States
    3 days ago
  • $87.1k - $157.45k

    Leidos is seeking a mid-career Systems Engineer specializing in HPC & GPU Infrastructure in Bethesda. This role focuses on optimizing GPU clusters for the Intelligence Community, requiring extensive Linux integration and hardware knowledge. Active TS/SCI clearance is essential... 

    Leidos

    Bethesda, MD
    1 day ago
  • Krämer IT Solutions GmbH sucht einen AI Engineer / DevOps für unsere Saar-Cloud in Deutschland. Du baust den Maschinenraum für die KI von morgen und optimierst unsere GPU-Cluster für bestmögliche Performance. Du hast Erfahrung mit Docker und Kubernetes, und deine Aufgaben... 
    Remote job
    Flexible hours

    Server Eye

    New Bremen, OH
    2 days ago
  • $82.3k - $220k

    Draper in Boston, MA is seeking an AI Systems Administrator, responsible for overseeing a closed GPT environment and ensuring system health...  ...and configuring servers, maintaining observability for GPU workloads, and implementing automation solutions. The salary range... 
    Full time

    Draper

    Boston, MA
    1 day ago
  • $140k - $225k

     ...Systems Engineer - Graphics Processing Unit (GPU) Absolute Business Solutions Corp (ABSC) is not just another tech company. We're a community of innovators, engineers, analysts and business professionals working together with our customers to tackle the most complex... 
    Contract work

    Absolute Business Solutions Corp

    Bethesda, MD
    3 days ago
  • $161.8k - $242.6k

     ...Technologies, Inc. Job Area: Engineering Group, Engineering Group GPU ASICS Engineering General Summary...  ..., DirectX3D, OpenGL, OpenCL, or Cuda development ~ Experience with GPU...  ...Engineering, Hardware Engineering, Systems Engineering, or related work experience... 
    Work experience placement
    Work from home

    Qualcomm

    San Diego, CA
    11 hours ago
  • $165k - $242k

     ...Systems Engineer, Kernel Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA CoreWeave...  ...(networking, storage, virtualization, GPU/DPU enablement). Stack-Wide Support –...  ...containerd, nydus, kubelet) HPC/AI workloads (CUDA, GPUDirect, RoCE/InfiniBand)... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Bellevue, WA
    5 days ago
  •  ...Senior Software Engineer Focused On Edge Operating Systems About the Company: We are a stealth-mode Aerospace & Defense startup headquartered in Gardena...  ...Integrate low-level software components (GStreamer, CUDA, GPU drivers, telemetry, real-time threads) into a cohesive... 

    Singularity Defense

    Gardena, CA
    11 hours ago
  • $105.9k - $180k

     ...hands without us. KLA invents systems and solutions for the manufacturing...  ...expert teams of physicists, engineers, data scientists and problem-...  ...changes, drivers (including GPU drivers where applicable), and...  ...GPU-accelerated workloads (CUDA, drivers, GPU scheduling concepts... 
    Minimum wage
    Work experience placement
    Flexible hours

    KLA

    Ann Arbor, MI
    3 days ago
  • $155k - $205k

     ...general intelligence for autonomous systems and robotics. We aspire to...  ...We are seeking a Systems Engineer to join our team in Redmond, WA...  ...spanning edge devices to cloud GPU clusters - with a focus on low...  ...environments. Optimize GPU/CUDA workloads and accelerate ML frameworks... 

    General Robotics

    Redmond, WA
    3 days ago
  • Lenovo is seeking a highly skilled Unix System Administrator in Morrisville, NC, to manage and optimize infrastructure supporting advanced...  ...with modern AI operations, requiring expertise in GPU environments and automation. Candidates should have a Bachelor's... 
    Work at office
    Remote work
    Flexible hours

    Lenovo

    Raleigh, NC
    1 day ago
  •  ...Senior GPU Systems / AI Infrastructure Engineer (NYC) Location: New York City (Hybrid / On-site preferred) Comp: Competitive + equity (Series A-C...  ...What You’ll Work On Design and optimise GPU kernels (CUDA / Triton / HIP) for large-scale AI workloads Build and... 
    Permanent employment
    New York, NY
    a month ago
  • $102k - $170k

     ...currently seeking a Staff Edge Compute Systems Engineer to join our SpaceWrx Engineering team in...  ...architecture and development of Vantor’s GPU-accelerated onboard compute platform through...  ...NVIDIA GPUs or GPU-accelerated systems (CUDA a plus). · Familiarity with AI/ML... 
    Permanent employment

    Vantor

    Westminster, CO
    11 hours ago
  • $145k - $170k

     ...with camera architects and other teams such as systems, optics, algorithm development and IQ team, to...  ...such as MATLAB, Python, C/C++, preferably CUDA, design code for efficiency, and target different platforms such as GPU, Cloud, and other Experience coding in MATLAB... 
    Remote work
    Flexible hours

    DigitalFish Inc

    Sunnyvale, CA
    4 days ago
  • $500 per month

     ...Who We Are: Aurelius Systems is a VC backed defense tech startup...  .... We're a small team of engineers, former US military operators,...  ...latency accumulates across CPU, GPU, memory, and I/O; how bandwidth...  ...Develop and optimize CUDA kernels for high-throughput, low... 
    Permanent employment
    Work at office
    Monday to Friday
    Flexible hours
    Night shift
    Weekend work

    Aurelius Systems

    San Francisco, CA
    3 days ago
  • $155k - $200k

     ...ML Engineer General Robotics is an AI research and deployment company...  ...intelligence for autonomous systems and robotics. We aspire to...  ...with experience in areas like CUDA kernel programming; distributed...  ...robotics workloads. Optimize GPU workloads and accelerate ML... 

    General Robotics

    Redmond, WA
    1 day ago
  • $224k - $356.5k

     ...next era of computing. An era in which our GPU acts as the brains of computers, robots,...  ...We are seeking a Senior HPC & Quantum Systems Engineer to help architect, deploy, and operate a...  ..., and orchestration frameworks such as CUDA‑Q, cuQuantum, NVQlink, and related toolchains... 
    Work at office

    NVIDIA

    Westford, MA
    1 day ago
  •  ...Description Job Description Salary: Job Title: Linux Systems Engineer (HPC) Location: Charlottesville, VA (Onsite)...  ...of cluster communication issues. Support GPU-enabled compute environments and CUDA-based workloads, ensuring proper resource allocation... 
    Full time
    Temporary work
    Remote work
    Monday to Friday
    Flexible hours

    Plus3 IT Systems

    Charlottesville, VA
    4 days ago
  • $150k - $230k

     ...About Clockwork Systems Clockwork.io - Software Driven Fabrics to increase GPU cluster utilization Clockwork Systems was...  ...researchers and veteran systems engineers who share a vision for...  ...frameworks like PyTorch, NCCL, CUDA runtime-not as a user, but modifying... 

    Clockwork Inc

    Palo Alto, CA
    2 days ago
  • $144k - $192k

     ...Machine Learning Systems Engineer We are looking for a Machine Learning Systems Engineer to join our ML Acceleration team....  ...Kernel Development: Design and maintain high-performance GPU kernels in Triton or CUDA for state-of-the-art ML workloads. Data Pipeline Engineering... 
    Work at office
    Remote work

    Venturefizz Product Management Community

    Boston, MA
    11 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to GPU Systems Engineer (CUDA). Be the first to apply!