Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Systems Research Engineer, GPU Programming

$160k - $230k

Together AI

Systems Research Engineer, GPU Programming

San Francisco

About the Role

As a Systems Research Engineer specialized in GPU Programming, you will play a crucial role in developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications. Working closely with the modeling and algorithm team, you will co-design GPU kernels and model architecture to enhance the performance and efficiency of our AI systems. Collaborating with the hardware and software teams, you will contribute to the co-design of efficient GPU architectures and programming models, leveraging your expertise in GPU programming and parallel computing. Your research skills will be vital in staying up-to-date with the latest advancements in GPU programming techniques, ensuring that our AI infrastructure remains at the forefront of innovation.

Requirements
  • Strong background in GPU programming and parallel computing, such as CUDA and/or Triton.
  • Knowledge of ML/AI applications and models
  • Knowledge of performance profiling and optimization tools for GPU programming
  • Excellent problem-solving and analytical skills
  • Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or equivalent practical experiences
Responsibilities
  • Optimize and fine-tune GPU code to achieve better performance and scalability
  • Collaborate with cross-functional teams to integrate GPU-accelerated solutions into existing software systems
  • Stay up-to-date with the latest advancements in GPU programming techniques and technologies
About Together AI

Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure.

Compensation

We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

Equal Opportunity

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

Please see our privacy policy at

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Systems Research Engineer, GPU Programming in San Francisco, CA vacancy
  • $160k - $320k

     ...deliver excellence. We seek engineers/researchers with strong intrinsic drive, a true passion...  ...Angeles. About the Role As a systems/GPU engineer, you will play a crucial...  ...in AI model inference and GPU programming techniques. Full-Time On-site... 
    Suggested
    Full time
    Work at office

    Vast

    San Francisco, CA
    3 days ago
  • $160k - $320k

     ...deliver excellence.  We seek engineers/researchers with strong intrinsic drive, a true...  ...the Role We’re looking for a systems engineer with HPC or parallel programming experience to help scale AI...  ...performance systems to optimize GPU performance at the bleeding edge... 
    Suggested
    Full time
    Work at office

    Vast

    San Francisco, CA
    3 days ago
  •  ...Description We are Genmo, a research lab dedicated to building open...  ...an exceptional Software Engineer to join our research team in...  ...generative models ~ Solid programming skills in Python and PyTorch/...  ...with distributed computing and GPU optimization Contributions... 
    Suggested
    Work at office

    Genmo

    San Francisco, CA
    19 hours ago
  • $250k - $350k

     ...The Enterprise ML Research Lab works on the front...  ...As an ML Sys Research Engineer, you'll work on building...  ...technologies to optimize our ML system. Your customer will be...  ...of the modern GPU cluster Experience with...  ...internal policies and programs designed to protect personal... 
    Suggested
    Full time

    Scale AI

    San Francisco, CA
    23 hours ago
  •  ...nation states. Our team of AI researchers and company builders come...  ...algorithms into scalable training systems. You will design and optimize...  ...loops and distributed GPU training to massive-scale data...  ...The goal is to build the engineering foundation that allows researchers... 
    Suggested
    Relocation package

    Reflection AI

    San Francisco, CA
    3 hours ago
  •  ...E2B Infrastructure Engineer E2B is a fast-growing Series A startup...  ...# Building a distributed system for millions and billions of...  ...virtualization trade-offs ~ Systems programming skills - Strong in at least...  ..., or lazy loading GPU passthrough or PCIe device virtualization... 
    Work from home
    Relocation

    E2B

    San Francisco, CA
    3 days ago
  • $200k - $300k

     ...for AI. We hire people who care deeply about this problem space. If that is you, please apply! About the Role As a System Engineer, GPU Fleet, you will manage, operate, and optimize hyperscale GPU compute infrastructure supporting AI/ML training and inference... 
    Local area

    Fluidstack

    San Francisco, CA
    5 hours ago
  • $137k - $161k

     ...Systems Engineer II, Compute Specializing in Systems Applications The...  ...distributed systems, object oriented programming, and low-level systems...  ...adapting quickly, eager to research new technology and not get...  ...AI/ML workloads, including GPU virtualization. Previous... 
    Full time
    Temporary work

    Crusoe

    San Francisco, CA
    6 hours ago
  • $122k - $215k

     ...positive way. To learn more visit: As a Research Engineer, you will be at the forefront of...  ...deploying solutions to our production systems, collaborating closely with platform teams...  ...efficiency. - Proficient in Python programming with a focus on writing high-quality,... 
    Full time
    Work at office
    Work from home
    Flexible hours

    Waabi

    San Francisco, CA
    a month ago
  •  ...ML/AI Research Engineer — Agentic AI Lab (Founding Team) Location: San Francisco Bay Area Type...  ...alignment Optimize inference latency and GPU resource utilization across cloud and on...  ..., RDF, OWL, or other semantic modeling systems Agent Intelligence:... 
    Full time

    Fabrion

    San Francisco, CA
    23 hours ago
  • $147.4k - $220.9k

     ...AI/ML - Machine Learning Research Engineer, Machine Translation Work Locations (3) Submit...  ...app, Safari web page translation, and system-wide translation. We are seeking research...  ..., and implement solutions in programming languages such as C/C++, Java, Python,... 
    Relocation

    Apple

    San Francisco, CA
    4 days ago
  • $248.8k - $311k

     ...contributor in conducting applied research in Physical AI and...  ...AI. The Role As an ML Systems Engineer on the Physical AI team, you...  ...cloud environments, including GPU-level algorithm optimizations...  ...., CUDA, kernel tuning). Programming: Strong skills in one or more... 
    Full time

    Scale AI

    San Francisco, CA
    6 hours ago
  • $216k - $270k

     ...contributor in conducting applied research in Physical AI and...  .... The Role As an ML Systems Engineer on the Physical AI team, you...  ...cloud environments, including GPU-level algorithm optimizations...  ...., CUDA, kernel tuning). Programming: Strong skills in one or more... 
    Full time

    Scale AI

    San Francisco, CA
    1 day ago
  •  ...looking for talented and motivated data scientists, data engineers, and machine learning researchers to help us capture all of the data in the world and...  ...quantitative research engineer is to work on all of the systems that support the Numerai data, research, and trading... 

    Numerai

    San Francisco, CA
    3 hours ago
  •  ...Salesforce, etc. We are a small team of engineers wrangling problems from context to search...  ...and plans. train large agentic harness systems to improve session accuracy with millions...  ...are very good, nothing is a must per-se research you can independently execute against... 

    Composio

    San Francisco, CA
    3 days ago
  • $142.6k - $261.5k

     ...data scientists, designers, and software engineers enable our clients to solve their most...  ...and tablets. You will engage in coding, programming, and creating specifications to deliver...  ...testing practices. Knowledgeable in system development lifecycle and technology integration... 
    Summer holiday
    Flexible hours

    EY

    San Francisco, CA
    23 hours ago
  • $124k - $280k

     ...At PwC, our people in data and analytics engineering focus on leveraging advanced...  ...and optimising algorithms, models, and systems to enable intelligent decision-making and...  ...context Understanding of population health program design and how AI accelerates outcomes... 
    Full time
    H1b

    PwC

    San Francisco, CA
    4 days ago
  •  ...Research Engineer, Foundation Models About the Opportunity We are seeking...  ...generation of large-scale AI systems. This role sits at the...  ...models across large distributed GPU environments Build and...  ...and vision coverage ~401(k) program with employer matching ~ Flexible... 
    Visa sponsorship
    Relocation package
    Flexible hours

    Acceler8 Talent

    San Francisco, CA
    7 hours ago
  •  ...company based in San Francisco, California. The Role: As a Research Engineer - Audio & Speech Models , you will be a core contributor on...  ...GANs Experience with training on large-scale (multi-node) GPU clusters Strong grasp of proper experimental methodology for... 
    Work at office
    Relocation package

    Zyphra

    San Francisco, CA
    19 hours ago
  • $230k - $385k

     ...AI Systems Engineer - Codex Core Agents About The Team The Codex Core Agents team builds the...  ...behavior. This team sits close to research and works across the stack: harness, model...  ...model behavior, inference/runtime stack, GPU fleet, and product surface. You'll... 

    OpenAI

    San Francisco, CA
    2 days ago
  • $205k - $265k

     ...reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial...  ...a quickly growing group of committed researchers, engineers, policy experts, and business leaders...  ...Have object-oriented programming experience in Java or similar languages... 
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    1 day ago
  •  ...interpretable, and steerable AI systems. We want AI to be safe and...  ...growing group of committed researchers, engineers, policy experts, and...  ...experiment management across GPU clusters. Help scale our systems...  ...and async/concurrent programming with frameworks like Trio... 
    Work at office
    Visa sponsorship
    Flexible hours
    San Francisco, CA
    more than 2 months ago
  • $315k

     ...interpretable, and steerable AI systems. We want AI to be safe and...  ...growing group of committed researchers, engineers, policy experts, and...  ...neural networks like binary programs. More resources to learn...  ...training jobs, including multi-GPU parallelism and memory optimization... 
    Work at office
    Remote work
    Visa sponsorship
    Flexible hours
    San Francisco, CA
    more than 2 months ago
  •  ...into action. We're looking for engineers who want to build the operating system for AI Data Applications and...  ...application runtime. This is a systems programming role-you'll be writing the...  ...development process, from customer research to implementation This role is... 

    Tensorlake, Inc.

    San Francisco, CA
    2 days ago
  •  ...state-of-the-art AI compute engines capable of reconfiguring themselves...  ...technical boundaries A systems-level hardware engineer who...  ...accelerator boards or systems (GPU servers, TPU pods, custom...  ...~ Experience with systems programming (Linux drivers, low-level board... 

    Zettascale Computing Corp.

    San Francisco, CA
    5 hours ago
  • $180k - $250k

     ...About Unto Labs Unto Labs is a team of low-level engineers pushing distributed systems to the physical limits of modern hardware. We're reimagining...  ...Qualifications Deep expertise in systems programming languages (C, C++, Rust) with a focus on performance optimization... 
    Flexible hours

    Unto Labs

    San Francisco, CA
    4 days ago
  •  ...company based in San Francisco, California. The Role: As a Research Engineer - Brain Computer Interface Models , you will be a core...  ...series analysis Experience training on large-scale (multi-node) GPU clusters Experience contributing to large existing codebases... 
    Full time
    Work at office
    Relocation package

    Zyphra

    San Francisco, CA
    a month ago
  • $134k - $235k

     ...positive way. To learn more visit: As a Research Engineer in Neural Rendering, you will create the...  ...generation of multi-sensor rendering systems for autonomous driving. You will collaborate...  ...designing, launching, and debugging GPU machine learning jobs in the cloud (PyTorch... 
    Full time
    Work at office
    Work from home
    Flexible hours

    Waabi

    San Francisco, CA
    a month ago
  • $120k - $150k

     ...mental illness.   The Role As a Research Engineer at Paradromics, you’ll work closely with...  ...clinical studies of the Connexus System.   Responsibilities Develop data...  ...professional experience with multiple programming languages   Preferred Qualifications... 
    Full time

    Paradromics, Inc.

    Oakland, CA
    a month ago
  • $60 - $100 per hour

     ...techniques and tools, including Python and machine learning frameworks. The role offers flexibility in hours with a pay range of $60 to $100 per hour, depending on experience. Ideal for data scientists thriving in research-driven, technical environments. #J-18808-Ljbffr
    Hourly pay
    Part time
    Remote work

    Call For Referral

    San Francisco, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Systems Research Engineer, GPU Programming. Be the first to apply!