Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Machine Learning Performance Engineer

Jane Street

We are looking for an engineer with experience in low-level systems programming and optimisation to join our growing ML team.


Machine learning is a critical pillar of Jane Street's global business. Our ever-evolving trading environment serves as a unique, rapid-feedback platform for ML experimentation, allowing us to incorporate new ideas with relatively little friction.

Your part here is optimising the performance of our models - both training and inference. We care about efficient large-scale training, low-latency inference in real-time systems and high-throughput inference in research. Part of this is improving straightforward CUDA, but the interesting part needs a whole-systems approach, including storage systems, networking and host- and GPU-level considerations. Zooming in, we also want to ensure our platform makes sense even at the lowest level - is all that throughput actually goodput? Does loading that vector from the L2 cache really take that long?

If you've never thought about a career in finance, you're in good company. Many of us were in the same position before working here. If you have a curious mind and a passion for solving interesting problems, we have a feeling you'll fit right in.


There's no fixed set of skills, but here are some of the things we're looking for:
  • An understanding of modern ML techniques and toolsets
  • The experience and systems knowledge required to debug a training run's performance end to end
  • Low-level GPU knowledge of PTX, SASS, warps, cooperative groups, Tensor Cores and the memory hierarchy
  • Debugging and optimisation experience using tools like CUDA GDB, NSight Systems, NSight Computesight-systems and nsight-compute
  • Library knowledge of Triton, CUTLASS, CUB, Thrust, cuDNN and cuBLAS
  • Intuition about the latency and throughput characteristics of CUDA graph launch, tensor core arithmetic, warp-level synchronization and asynchronous memory loads
  • Background in Infiniband, RoCE, GPUDirect, PXN, rail optimisation and NVLink, and how to use these networking technologies to link up GPU clusters
  • An understanding of the collective algorithms supporting distributed GPU training in NCCL or MPI
  • An inventive approach and the willingness to ask hard questions about whether we're taking the right approaches and using the right tools
  • Fluent in English

If you're a recruiting agency and want to partner with us, please reach out to View email address on click.appcast.io.
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Machine Learning Performance Engineer in New York, NY vacancy
  •  ...About the job Machine Learning Engineer (GCP) Role: Machine Learning Engineer- 2 Positions Overall experience of minimum 7 years...  ...optimize machine learning models in production, ensuring high performance and uptime. Collaboration: Work with cross-functional... 
    Performance
    Remote work

    Inizio Partners

    New York, NY
    2 days ago
  •  ...Senior Machine Learning Engineer (AI Integration) New York, New York, United States $ 5,000.00 - 8,000.00 (US Dollar) About the job Senior...  ...and deploy machine learning models, ensuring they meet performance and scalability standards. Stay updated on the latest... 
    Performance

    Recroot

    New York, NY
    4 days ago
  •  ...Remote Machine Learning Engineer Jersey City, NJ, United States About the Job We're seeking an outstanding ML Engineer to join our data...  ...systems. Experience monitoring and optimizing model performance. Experience with Linux, Docker and AWS, and basic development... 
    Performance
    Remote work

    Angenex

    Jersey City, NJ
    3 days ago
  •  ...Machine Learning Engineer Location: Long Island City, NY 11101 (Onsite 4 Days/week) Type: Permanent Full Time About the Role: In...  ...Implement rigorous evaluation methodologies to assess model performance, making necessary adjustments for continuous improvement.... 
    Performance
    Permanent employment
    Full time

    InstantServe LLC

    New York, NY
    5 days ago
  •  ...publishers. Our service portfolio enables high-performance campaigns while using highly...  ...cutting-edge mobile programmatic solutions, machine-learned bidding algorithms and demand-side...  ...growth last year. Machine Learning Engineer New York, NY | Full Time... 
    Performance
    Full time
    Summer work

    Entravision

    New York, NY
    3 days ago
  • $220k

     ...Senior Machine Learning Engineer Location: Remote (with optional hybrid New York) Salary: Salary up to $220,000 THE COMPANY Our...  ...network architectures, and MLOps best practices in a high-performance environment. RESPONSIBILITIES You will... 
    Performance
    Remote work
    Flexible hours

    Harnham

    New York, NY
    5 days ago
  • $175k - $215k

     ...Machine Learning Engineer, Prediction & Planning Waymo is an autonomous driving technology company with the mission to be the world's most...  ...unprecedented scale of driving data into robust, generalizable, and performant deep neural networks. These models enable the autonomous... 
    Performance
    Full time
    Remote work

    Waymo

    New York, NY
    3 days ago
  • $214k - $335k

     ...Machine Learning Engineer, AI New York, NY (Hybrid) Biohub is the first large-scale initiative bringing frontier AI models, massive compute...  ...to backend compatibility, format evaluation, and I/O performance benchmarking at petabyte scale. Define and shape the engineering... 
    Performance
    Relocation package

    Biohub

    New York, NY
    3 days ago
  • $200k

     ...Optiver is a seeking a Machine Learning Performance Engineer to join our team, focusing on a pivotal AI initiative. This role would offer the opportunity to have significant impact across Machine Learning infrastructure, training, and inference challenges to advance our... 
    Performance
    Work at office

    Optiver

    New York, NY
    4 days ago
  •  ...their systems faster. Role Description: The Machine Learning Engineer will be responsible for designing and developing machine learning...  ...the product. They work with data to create models, perform statistical analysis, and train and retrain systems to optimize... 
    Performance
    Full time

    Root Access Inc

    New York, NY
    1 day ago
  •  ...Team The Decisioning & Optimization engineering team owns the systems that determine...  ...model outputs into auction Build model performance monitoring in production: inference latency...  ...is a unique culture and environment. Learn more here. Inclusion is a Netflix value... 
    Performance
    Hourly pay
    Full time
    Immediate start
    Flexible hours
    Shift work

    Netflix Inc

    New York, NY
    3 days ago
  •  ...About the job Machine Learning Engineer Job Title: Machine Learning Engineer Company: HeyMilo AI Location: New York, NY, USA...  ...processing and model deployment - Conduct experiments and perform data analysis to evaluate model performance and identify areas... 
    Performance
    Full time
    Contract work

    HeyMilo AI

    New York, NY
    1 day ago
  •  ...Machine Learning Engineer Menlo Park, California, United States; New York, New York, United States; Seattle, Washington, United States...  ...delivers real business outcomes for mobile app marketers through performance-based user acquisition. Moloco Commerce Media enables... 
    Performance
    Temporary work
    Immediate start
    Shift work

    Moloco

    New York, NY
    1 day ago
  • $170k - $212k

     ...Machine Learning Engineer The Music Promotion team is building products that allow creators to promote their work to reach new audiences and...  ...help us build systems that more accurately understand the performance that promotion can have, giving customers actionable... 
    Performance
    Flexible hours

    Spotify

    New York, NY
    3 days ago
  • $140k - $180k

     ...Machine Learning Engineer St. Louis, MO; Boston, MA; New York, NY Position Summary We are seeking a skilled Machine Learning Engineer...  ...predictive models and ensure scalable, maintainable, and performant production deployments. Design and implement data pipelines... 
    Performance

    Focus Financial Partners Inc.

    New York, NY
    21 hours ago
  • $184.05k - $262.93k

     ...Design and ship production-grade machine learning systems powering conversational and agentic...  ...iteration Partner closely with product, engineering, and design to deliver seamless, user-...  ...with production rigor, ensuring performance, latency, and reliability at Spotify scale... 
    Performance
    Work from home
    Flexible hours

    Spotify

    New York, NY
    20 hours ago
  •  ...end ML pipelines: data ingestion, feature engineering, model training, deployment, and...  ...robust monitoring and alerting for model performance, latency, data quality, and drift Lead...  ...frameworks (e.g., PyTorch, TensorFlow, scikit-learn) ~ Strong understanding of supervised/... 
    Performance
    Work at office
    Work from home
    Flexible hours

    Rain

    New York, NY
    2 days ago
  • $180k - $220k

     ...COMPANY: Canoe Intelligence WEBSITE : TITLE: Sr. Machine Learning Engineer LOCATION: New York City or London (hybrid) / Fully Remote in the...  ...language models for domain specific use cases, optimizing their performance for accuracy, efficiency, and scalability. Work closely... 
    Performance
    Local area
    Remote work
    Work from home
    Home office
    Flexible hours

    Canoe Intelligence

    New York, NY
    6 days ago
  • $210k - $260k

     ...systems. Who You Are You have a strong background in machine learning and enjoy applying theory to real-world applications, with...  ...in a level depends on relevant work history and interview performance. These ranges may be modified in the future. Spotify is an... 
    Performance
    Flexible hours

    Spotify

    New York, NY
    3 days ago
  • $150k - $180k

     ...Description ChasmTeam Senior Machine Learning Engineer at JudiHealth Location: Remote (For Non-Local) or Hybrid (Local to NYC area...  ...environment and enjoy creating practical, efficient, and high-performance software that leverages Large Language Models (LLM),... 
    Performance
    Local area
    Remote work
    Flexible hours

    Chasm Select

    New York, NY
    3 days ago
  • $145k - $180k

     .... Why this role matters: The ML Engineer is a new role within the AP Engineering...  ...for shaping how we build and scale machine learning systems at AP, helping to lay the foundation...  ...cloud environment, focusing on model performance, integration patterns, and inference... 
    Performance

    Associated Press

    New York, NY
    2 days ago
  •  ...have partnered with our client in their search for a Senior Machine Learning Engineer to work remotely. Responsibilities...  ...turn validated models and prototypes into reliable, high-performance, customer-facing systems. Key Responsibilities... 
    Performance
    Work experience placement
    Remote work

    Korn Ferry

    New York, NY
    3 days ago
  • $99.4k - $184.6k

     ...matters to them. Your New Role... CNN is seeking a Machine Learning Engineer I to build and deploy ML systems that power personalization...  ...and A/B testing frameworks to validate model performance and product impact Optimize ML systems for real-time, web... 
    Performance
    Temporary work
    Local area

    Warner Bros. Discovery

    New York, NY
    1 day ago
  • $180k - $220k

     ...Yourself from Impersonation and Fraudulent Activity The Sr. Machine Learning Engineer will join our Applied Data Science group, part of Nexxen...  ...of applied data science infrastructure for real-time performance optimization and machine learning solutions. The right person... 
    Performance
    Full time
    Work at office
    Remote work
    Flexible hours
    3 days per week

    Nexxen International Ltd

    New York, NY
    1 day ago
  • $148.7k - $199.4k

     ...Machine Learning Engineer ESPN is investing in large-scale data infrastructure and real-time processing platforms that power next-generation...  ...with platform/SRE teams to improve system availability, performance, and cost efficiency. Instrument data and ML infrastructure... 
    Performance

    The Walt Disney Studios

    New York, NY
    4 days ago
  • $153k - $198k

     ...Senior Machine Learning Engineer Button's mission is to empower the companies shaping the creator and affiliate economy - fueling mobile growth...  ...pipelines and online inference services with clear performance, reliability, and latency standards. Partner with data... 
    Performance
    Local area

    Button

    New York, NY
    1 day ago
  • $170k - $240k

     ...commercial intent beyond legacy search engines. Today, our Native Search solutions...  ...We are seeking an experienced Senior Machine Learning Engineer passionate about building impactful...  ...modeling, and improved bidding performance. You will have a clear career progression... 
    Performance
    Summer work
    Work at office

    adMarketplace

    New York, NY
    4 days ago
  • $210k - $250k

     ...Machine Learning Engineer Boston or NYC About Us: Layer Health was founded in 2023 by leading machine learning researchers from MIT...  ...results for our production models (methods to detect drift/performance degradation; develop observability tooling for performance... 
    Performance
    Work at office

    Layer Health

    New York, NY
    5 days ago
  • $140k - $210k

     ...seeking a highly skilled and motivated engineer to join our team. You will play a...  ...developing and deploying state-of-the-art machine learning solutions to advance our mission. We are...  ...Treeswift's ability to assess model performance and adapt to new operating conditions... 
    Performance
    Full time
    Work experience placement
    Work at office
    2 days per week

    Treeswift Inc

    New York, NY
    5 days ago
  •  ...Senior Machine Learning Engineer Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive...  ...the model. Debug models when observability shows performance gaps, and iterate on models. What You'll Need... 
    Performance
    Work at office
    Local area
    Remote work
    Worldwide
    Flexible hours
    2 days per week

    Patreon

    New York, NY
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Machine Learning Performance Engineer. Be the first to apply!