GPU Software Engineer (CUDA)

Full-time

Bright Vision Technologies

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications. As we continue to grow, we’re looking for a skilled GPU Software Engineer (CUDA) to join our dynamic team and contribute to our mission of transforming business processes through technology. This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.

  • Job Title: GPU Software Engineer (CUDA)
  • Location: 100% Remote (Continental United States)
  • Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
  • Experience: 6+ years
  • Salary: 100k - 150k
  • Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
  • Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
  • Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
  • Compensation: Competitive base salary commensurate with experience, plus benefits.

Employment Terms & Visa Policy

This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies. This role is part of Bright Vision Technologies’ in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies; there is no third-party client, vendor, or implementation partner involved. We do not engage in C2C, 1099, or third-party arrangements for this role: all our roles are W2, with strictly no third-party companies or brokering. Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.

No new H1B sponsorship is available for this role. However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.

For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.

Job Summary

We are seeking a GPU Software Engineer with deep expertise in CUDA programming, GPU architecture, and high-performance computing to design and optimize compute-intensive workloads on modern accelerator hardware. This role focuses on extracting maximum performance from GPU platforms for AI training, inference, scientific computing, and high-throughput data processing workloads. The ideal candidate combines low-level systems mastery with strong software engineering practices, and has a track record of delivering measurable performance improvements on production GPU systems.

In this role you will work closely with cross-functional partners (product, design, engineering, operations, and business stakeholders) to translate ambiguous requirements into well-engineered solutions, and will be expected to raise the bar through code review, design review, and mentorship of more junior engineers. The successful candidate brings strong engineering discipline, a clear communication style, and a track record of shipping meaningful work that holds up well in production.

Key Responsibilities

  • Design and implement high-performance CUDA kernels for compute-intensive workloads across AI and HPC use cases.
  • Profile and optimize GPU code using tools such as Nsight Systems, Nsight Compute, and CUDA profilers.
  • Tune memory access patterns, occupancy, register usage, and shared memory utilization for peak performance.
  • Develop highly optimized libraries for linear algebra, attention, and other ML primitives.
  • Optimize multi-GPU and multi-node training using NCCL, RDMA, and high-performance networking.
  • Implement custom operators and fused kernels in PyTorch, JAX, or Triton.
  • Collaborate with ML engineers to identify performance bottlenecks in training and inference pipelines.
  • Develop benchmarks and regression tests to safeguard performance over time.
  • Evaluate new GPU architectures and feature sets, and advise on adoption strategy.
  • Contribute to compiler-level optimizations for tensor programs where appropriate, working at the boundary between ML frameworks and underlying accelerator codegen to unlock performance not reachable through framework-level tuning alone.
  • Optimize memory hierarchy usage across HBM, L2, shared memory, and registers.
  • Implement mixed-precision and quantized compute paths that maximize accelerator throughput while preserving numerical fidelity within bounds acceptable for the target workloads.
  • Document performance characteristics, design decisions, and tuning playbooks for internal teams.
  • Stay current with GPU architecture, CUDA evolution, and emerging accelerator technologies.

Required Qualifications

  • Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related field.
  • Six or more years of experience in GPU programming and performance engineering.
  • Deep expertise in CUDA C/C++ and GPU programming models.
  • Strong understanding of modern GPU architectures, memory hierarchies, and execution models.
  • Hands-on experience profiling and optimizing GPU workloads in production.
  • Familiarity with NCCL, MPI, and high-performance interconnect technologies.
  • Experience integrating custom kernels into ML frameworks.
  • Strong C++ skills and familiarity with modern systems programming practices.
  • Solid grounding in linear algebra and numerical methods.
  • Strong communication and collaboration skills with research and engineering teams.

Preferred Qualifications

  • Experience with Triton, CUTLASS, or other GPU kernel authoring frameworks.
  • Familiarity with TensorRT, FasterTransformer, or vLLM internals.
  • Exposure to compiler infrastructure such as LLVM or MLIR.
  • Open-source contributions to GPU or ML performance libraries.
  • Experience with large-scale distributed training infrastructure.

How to Apply

Would you like to know more about this opportunity? For immediate consideration, please send your resume to View email address on click.appcast.io or contact us at View phone number on click.appcast.io. Learn more about Bright Vision Technologies at

We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs.

Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans. Position offered by “No Fee Agency.”

Equal Employment Opportunity (EEO) Statement

Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall. BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.

Vacancy posted 2 days ago
