
Member of Technical Staff (GPU Performance Engineer)

Reka

We are seeking an experienced GPU Performance Engineer with a strong background in Python and large-scale model training. In this role, you will design and implement improvements to our training infrastructure and directly contribute to technical decisions that optimize performance of our models. You will also work on post-training processes, including reinforcement learning and fine-tuning. Furthermore, you will contribute to improving the efficiency and scalability of our model serving infrastructure.

Ideal Experience
  • Strong engineering skills with fluency in Python and PyTorch (or other frameworks).
  • Proven experience implementing and training large deep learning models.
  • Experience writing and debugging low-level GPU code (CUDA, C++).
  • Experience scaling up GPU jobs using large-scale compute clusters (e.g., Slurm or Kubernetes).
  • Demonstrated ability to analyze and optimize the performance of GPU-accelerated workloads, including profiling, identifying bottlenecks, and implementing performance tuning techniques.

Reka's Mission

Reka's mission is to build useful multimodal artificial intelligence and use it to empower organizations and businesses. We are a globally distributed foundation model startup, headquartered in the San Francisco Bay Area, California. Embracing a remote-first approach, our team brings together top talent from around the world. Our founding team, along with many of our team members, has contributed to numerous breakthroughs in AI over the past decade.

Why Reka?
  • An Elite Team: Collaborate with top-tier engineers, researchers, and operators from renowned organizations like Google DeepMind, Facebook AI Research (FAIR), and successful startups, driving innovation in AI technology.
  • Cutting-edge Infrastructure: Train state-of-the-art models leveraging the latest software and hardware, expanding the frontier of innovation in AI infrastructure development.
  • Inclusive and Open Culture: Thrive in an open and inclusive work environment that values diverse perspectives and fosters creativity.
  • Generous Benefits: Enjoy five weeks of paid leave to recharge, comprehensive healthcare benefits (including vision and dental), and additional perks that support your well-being.
  • Visa Support: We provide visa assistance, including H1B and OPT transfers, for US employees to ensure a smooth transition and support your career with us.

Vacancy posted 3 days ago
Similar jobs that could be interesting for you
Based on the Member of Technical Staff (GPU Performance Engineer) in New York, NY vacancy
  •  ...our customers. Cohere is a team of researchers, engineers, designers, and more, who are all passionate...  ...anywhere between UTC-06:00 and UTC+01:00. As a Member of Technical Staff, you will: Design and write high-performing and scalable software for training models.... 
    Performance
    Work at office
    Remote work

    Cohere

    New York, NY
    3 hours ago
  • $134.64k - $176k

     ...Company Name: GlobalFoundries U.S., Inc. Position Title: Member of Technical Staff Quality Engineer Salary: $134,638-$176,000/year Hours: Monday –...  ...in the development of audit plans and schedules. Perform internal audits to verify the quality management system... 
    Performance
    Local area
    Monday to Friday

    GLOBALFOUNDRIES

    New York, NY
    2 days ago
  •  ...Data Team Engineer Data is playing an increasingly crucial role...  ...but from better data. As a member of the Data Team, your mission...  ...shape how our models perform on critical capabilities....  ...to clearly articulate complex technical concepts across teams What... 
    Performance
    Relocation package

    Reflection AI

    New York, NY
    2 days ago
  •  ...Founders: Early GPU cloud (9 figure exit)....  ...looking for a software engineer to help develop, launch...  ...experiences. As a key member of our team, you'll push...  ...development, combining technical excellence with design...  ...features with fluid UI, high performance, and scalable client... 
    Performance
    Shift work

    ATG intelligence

    New York, NY
    2 days ago
  •  ...Technical Intern Opportunity Adaptive ML is...  ...for both cost and performance across distributed...  ...Our Technical Staff develops the foundational...  ...combining strong engineering with careful...  ...Profile and iterate GPU inference kernels...  ...findings Nearly all members of our Technical... 
    Performance
    Internship
    Live in
    Work at office

    Adaptive ML

    New York, NY
    1 day ago
  •  ...optimizing for both cost and performance across distributed...  ...soon. Our Technical Staff develops the foundational...  ...apply! As a Member of Technical Staff, you...  ...combining large-scale engineering with rigorous empirical...  ...Profile and iterate GPU inference kernels in... 
    Performance
    Live in
    Work at office
    Relocation
    Visa sponsorship

    Adaptive ML

    New York, NY
    4 days ago
  •  ...as the world's AI inference engine and accelerate AI progress by...  ...entire vLLM stack: from low-level GPU kernels to high-level...  ...hundreds of accelerator types. Performance & Scale: Build the distributed...  ...high-impact work in complex technical environments Deep expertise in... 
    Performance
    Local area
    Remote work
    Worldwide
    Visa sponsorship
    Flexible hours

    Inferact

    New York, NY
    3 days ago
  •  ...customers. Cohere is a team of researchers, engineers, designers, and more, who are all...  ..." or "ML Engineer" role. As a Member of Technical Staff, Applied ML, you will: # Work directly...  ...that directly enhance model performance for customer use-cases. Contribute... 
    Performance
    Work at office
    Remote work

    Cohere

    New York, NY
    2 days ago
  •  ...and Suno. They rely on Modal for instant GPU access, sub-second container starts, and...  ...olympiad medalists, and experienced engineering and product leaders with decades of experience...  ...with experience in making ML systems performant at scale. If you are interested in contributing... 
    Performance

    Modal

    New York, NY
    5 days ago
  •  ...experimental ideas into scalable, production-ready training systems. Improve performance of distributed training workloads through optimization of communication, memory usage, and GPU utilization. Build and maintain training pipelines that support large-scale datasets... 
    Performance
    Relocation package

    Reflection AI

    New York, NY
    2 days ago
  •  ...multi-cloud scheduling, node health, and performance debugging at this scale presents...  ...and rapid hardware debugging. Platform Engineering: Design and iterate on our cluster management...  ...stack for workloads across large, multi-GPU fleets Monitoring & Observability:... 
    Performance
    Relocation package

    Reflection AI

    New York, NY
    2 days ago
  •  ...About the Role Design, build, and operate large-scale GPU infrastructure for high-throughput model inference and mid-training...  ...and reinforcement learning pipelines at scale. Build high-performance inference platforms capable of serving and evaluating models across... 
    Performance
    Relocation package

    Reflection AI

    New York, NY
    2 days ago
  •  ...customers. Cohere is a team of researchers, engineers, designers, and more, who are all...  ...and Paris. Join us! As a Senior Member of Technical Staff specializing in web data for pre-...  ...study its impact on downstream model performance, and collaborate closely with the broader... 
    Performance
    Work at office
    Remote work

    Cohere

    New York, NY
    5 days ago
  • $200k - $270k

     ...both clients and candidates. Member of Technical Staff Location: New York City Company...  ...growth and recent funding. Their engineering organization is still small and highly...  ...Experience in fast-moving startups or high-performance engineering organizations... 
    Performance
    Work at office
    Visa sponsorship

    Recruiting from Scratch

    New York, NY
    2 days ago
  • $175k - $220k

     ...Member Of Technical Staff, Cloud Infrastructure At Fireworks, we're building the future of generative...  ...AI. The Role: As a Software Engineer on our Cloud Infrastructure team, you...  ...to design solutions that balance performance, cost-efficiency, and operational simplicity... 
    Performance

    Fireworks AI

    New York, NY
    1 day ago
  • $142.8k - $274.8k

     ...that scale. We are looking for a Member of Technical Staff who is truly AI‑native—someone who experiments...  ...role, you'll shape and ship high‑performance, highly available AI services that...  ...models), including prompt engineering, evaluation, or fine‑tuning. Hands... 
    Performance
    Ongoing contract
    Local area

    Microsoft Corporation

    New York, NY
    1 day ago
  •  ...One. The Role We're hiring a Member of Technical Staff - Applied AI, Fullstack to design,...  ...customers to deliver seamless, high-performance experiences across the entire stack....  ...a critical role in shaping Stuut's engineering culture and product experience, ensuring... 
    Performance
    Full time
    Flexible hours

    Stuut

    New York, NY
    3 days ago
  • $185k - $200k

     ...We're looking for a Software Engineer to join our Storage team. Storage...  ...you're here. Improve the performance of CockroachDB. Work...  ...knowledge with a highly experienced technical organization. Ensure that...  ...will become an integrated member of our engineering team. You'... 
    Performance
    Local area
    Remote work
    Flexible hours

    Cockroach Labs

    New York, NY
    2 days ago
  •  ...A global AI foundation model startup is seeking an experienced GPU Performance Engineer to enhance training infrastructure and optimize model performance. The ideal candidate will have strong skills in Python and experience with large-scale model training, including GPU... 
    Performance
    Remote work
    Visa sponsorship
    Free visa

    Reka

    New York, NY
    3 days ago
  •  ...A leading automotive company in the United States is seeking an experienced GPU Software Engineer to design and implement high-performance GPU kernels for autonomous driving technologies. The position requires strong programming skills in CUDA and C++, and the ability... 
    Performance

    General Motors

    New York, NY
    3 days ago
  •  ...customers. Cohere is a team of researchers, engineers, designers, and more, who are all passionate...  ...where you can be located for this role. As a Member of Technical Staff, you will: Design and write high-performant and scalable software for training.... 
    Performance
    Work at office
    Remote work

    Cohere

    New York, NY
    2 days ago
  •  ...cryptographic concepts to both technical and non-technical audiences....  ...technology stack and help engineering teams resolve issues related...  ...oversight. Coordinate team members across engineering boundaries...  ...targets that affect team performance. Organizational Knowledge Understand... 
    Performance

    Anchorage

    New York, NY
    3 days ago
  • $180k - $238.1k

     ...we all win. The Role We are hiring performance engineers to expand our ongoing investment in performance...  ...30 days, you will become an integrated member of our performance engineering team....  ...experience level ranges from mid to staff level. At a minimum, this role requires... 
    Performance
    Local area
    Remote work
    Worldwide
    Flexible hours

    Cockroach Labs

    New York, NY
    3 days ago
  •  ...Anchorage, and on LinkedIn. As a member of the Token Vesting team,...  ...Digital. We define performance as acquiring, possessing, and...  ...outside of the Fullstack software Engineering role. You will be coached,...  ...capabilities in 4 major areas: Technical Skills Progress from... 
    Performance

    Anchorage

    New York, NY
    3 days ago
  •  ...financial markets. Founders: Early GPU cloud (9 figure exit). Investors:...  ...Role You'll bridge research and engineering-rapidly implementing, experimenting with...  ...and founders to translate ideas into high-performance systems, and will operate across the stack... 
    Performance

    ATG intelligence

    New York, NY
    2 days ago
  •  ...training loops and distributed GPU training to massive-scale...  ...pipelines The goal is to build the engineering foundation that allows...  ...algorithms Distributed systems High-performance computing You care deeply...  ...environments and enjoy solving hard technical problems. What We Offer: We... 
    Performance
    Relocation package

    Reflection

    New York, NY
    5 days ago
  • $100k - $300k

     ...AI Cyber Taskforce Engineer Cogent is an Applied AI Lab building the next generation...  ...Onboard, support and uplevel future team members Mentor and grow future junior team...  ...apply these technologies to build scalable, performant systems in production environments... 
    Performance

    Cogent Security, Inc.

    New York, NY
    2 days ago
  •  ...Inference Engine Engineer We build and run the inference engine behind every Perplexity...  ...ingress through continuous batching and GPU kernel interleaving. Build dashboards...  ...Deep experience with GPU programming and performance work (CUDA, Triton, CUTLASS, or similar).... 
    Performance

    Perplexity AI

    New York, NY
    2 days ago
  •  ...Agents have already reshaped software engineering. The same shift is coming for financial...  ...AI work. Building agents with the performance and reliability Wall Street demands is...  ...applied AI problems out there. Our technical staff works across the stack: agent architecture... 
    Performance
    Full time
    Shift work

    Endex Inc

    New York, NY
    1 day ago
  • $175k - $220k

     ...training, inference, and data processing pipelines. Lead technical design discussions, mentor engineers, and establish best practices for large-scale...  ...compute cost, storage lifecycle management, and network performance. Collaborate with machine learning, DevOps, and... 
    Performance

    Fireworks AI

    New York, NY
    5 days ago
