Low-Latency ML Inference Engineer (GPU Systems)
SIG Susquehanna
SIG Susquehanna is seeking a Machine Learning Engineer focused on optimizing low-latency inference systems. This role bridges machine learning and systems engineering to enhance model serving efficiency. Ideal candidates will have experience in deploying inference workloads, strong programming skills in multiple languages, and a solid grasp of ML frameworks like PyTorch. The position emphasizes collaboration with quantitative researchers to transform models into production-ready systems while addressing latency and performance. #J-18808-Ljbffr SIG Susquehanna
- ...the leading livestream shopping platform, is seeking an AI/ML Platform Engineer in the Seattle area to develop core infrastructure for machine... ...applications. This role involves designing scalable systems, productionizing ML architectures, and enhancing marketplace...SuggestedFlexible hours
- ...looking for a Machine Learning Engineer focused on low-latency inference optimization to help build... ...performance model serving systems. This role sits at the... ...systems engineering, and GPU performance. You will work... ...Solid understanding of modern ML frameworks such as PyTorch...Suggested
- Reddit, Inc. is seeking a Staff Machine Learning Engineer to lead the development of a large-scale ML Inference Platform. Responsibilities include designing cloud-based ML systems on Kubernetes and ensuring reliable, low-latency performance. Candidates should have 7+ years...Suggested
$113.9k - $189.9k
...with product managers, and mentoring team members. Ideal candidates will have strong skills in Java and Rust, experience with low-latency systems, and an understanding of FIX lifecycle. Competitive salary range is $113,900 - $189,900, alongside comprehensive benefits...Suggested- ...mission-critical inference for the world's... ...build the platform engineers turn to to ship... ...operating system for distributed,... ...engineers to lead our GPU Networking... ...in bandwidth and latency. Optimize Distributed... ...running low-level benchmarks... ...to a variety of ML startups, offering...SuggestedFlexible hours
$250k
...trading firm that is seeking a talented C++ Engineer to join their team. As a C++ Engineer,... ...implementing high-performance trading systems that utilize cutting-edge technology to... ...team of engineers to design and implement low-latency systems Optimize and maintain existing...$200k
Optiver is looking for a Hardware Engineer to build hardware-based trading systems that enhance market efficiency. This role involves close collaboration... ...solutions using your expertise in FPGA/ASIC design and low-latency hardware. Ideal candidates will possess a Bachelor's...- ...is seeking a highly skilled C++ Software Engineer to join our Chicago team. As a Software... ...hardware-specific needs of our global trading system with a focus on optimizing performance.... ...Core Responsibilities: Design and build low latency, high-performance trading systems...
$152k - $228k
...Job Description Senior ML Engineer About Invoca... ...and fine-tuning through inference optimization and production... ..., and Kubernetes-based GPU infrastructure. Profile and tune for low latency and high throughput, and... ...foundations to keep the systems powering our models reliable...Currently hiringRemote workFlexible hours$160k - $240k
...Bloomberg is seeking an experienced engineer to help reimagine its Enterprise Trading Network (ETN)... ...dynamic team focused on developing cutting-edge low-latency applications that connect execution and order management systems to Bloomberg’s trading networks. Candidates should...- Bank of America is looking for a C++ Senior Engineer in New York to work on low‑latency market data for Electronic Trading. This role involves architecture... ...experience in C++, comprehensive skills in low-level systems, and familiarity with Electronic Trading applications....
- A global digital assets firm is seeking an HFT System and Network Engineer to join their trading infrastructure team. This role focuses on architecting low-latency stacks and optimizing AWS environments alongside physical hardware. Candidates should have experience in...
- ...skilled professional in New York to design and operate large-scale GPU infrastructure for model inference and reinforcement learning. The role demands several years of experience in deploying GPU systems, optimizing model performance, and working with frameworks like...
- ...company is seeking a Machine Learning Engineer to design and optimize systems for bringing their models to life. The role involves ensuring ML models are efficient and reliable, requiring... .... Candidates should be familiar with low-latency techniques and operational maturity in...Remote work
- Machine Learning Engineer - Inference / Serving Join to apply for... ...intelligence layer for any system that makes a... ...dental, vision plans with low out‑of‑pocket costs.... .... This is an applied ML systems role—equal parts... ...experimentation. Ensure low‑latency inference by...Full timeRemote work
$175k - $250k
...Senior Machine Learning Engineer (ML Infrastructure & Data Systems) Our client is an early-stage robotics and... ...Have built and scaled ML training and inference systems in production environments... ...training at scale (e.g., large GPU workloads) Familiarity with video...- ...unmatched accuracy, low latency, and cost efficiency... ...problems in AI. As an ML Ops Infrastructure Engineer at Deepgram, you... ...pipelines, deployment systems, and testing infrastructure... ...as NVIDIA Triton Inference Server, TensorRT, or... ...) Familiarity with GPU‑accelerated...Home officeFlexible hours
- ...LSEG is seeking a Senior Software Engineer to join their News Engineering group in New York. This position focuses on evolving the... .... The role offers the chance to deepen expertise in low-latency systems and lead a scrum team while working with modern tools and practices...
- ...DV Equities desk in Chicago. This role involves enhancing our low-latency trading platform, requiring strong Python skills and a... ...algorithmic trading. Candidates should have 2-5 years of software engineering experience, a detail-oriented work ethic, and the ability to...
- ...Role Mirage is seeking an ML Engineer to build and scale the systems powering our video... ...scaling strategies, and inference optimization and efficiency... ...efficient, and capable of ultra-low latency, real-time generation.... ...training systems Optimize GPU utilization, parallelism,...Full timeLocal areaNight shift
- .... Our Technology Infrastructure team engineers and operates the foundational technology... ...looking for a detail-oriented and dedicated Low-Latency Market Data Engineer with at least 10... ..., and support of Linux/Unix operating systems. What you'll do In this role, you...
- ...Senior Bare Metal Linux Engineer (Low Latency) - Financial Services Career Developers Inc., a distinguished staffing and consulting firm,... ...Infrastructure Engineer with expertise in Linux, compute, and storage systems to join our team in Manhattan. In this role, you will design...Contract workTemporary workLocal area
$150k
...Low Latency Quantitative Researcher Tudor's Macro Pipeline team seeks a Quantitative Researcher to work within a low latency trading... ...(e.g. Statistics, Machine Learning, Physics, Mathematics, or Engineering) ~ Excellent understanding of probabilities, statistics and...Work experience placement$150k - $250k
Tudor’s Macro Pipeline team is looking for a Quantitative Researcher to join their low latency trading team. The successful candidate will focus on researching and automating systematic futures signals and strategies, requiring a strong quantitative background with at least...$200.8k - $251k
...member to build and optimize a machine learning framework for large language models. Candidates should have system optimization experience and solid software engineering skills, particularly in tools like CUDA and Pytorch. This full-time position offers a competitive salary...Full time- ...leading financial firm in New York is seeking a Lead Market Data Engineer to architect and optimize platforms delivering high-quality... ...extensive knowledge of programming, market protocols, and low-latency systems. The position offers a full-time contract within the finance...Full timeContract work
- ...are looking for an engineer with experience in low-level systems programming and optimisation... ...join our growing ML team.... ...both training and inference. We care about efficient... ...scale training, low-latency inference in real-time... ...and host- and GPU-level considerations...
$110k - $130k
...the Role: Machine Learning (ML) at the New York Times... ...The New York Times real-time ML inference models, including both data and... ...the other end, our partners are engineering systems that call these hosted models at scale with low-latency and Service Level Agreements guaranteed...Local areaFlexible hours- An established industry player is seeking an Algo Trading Systems Engineer to enhance their Trading and Research Infrastructure team. In this... ...of critical systems, ensuring optimal performance for latency-sensitive applications. You will design innovative scheduling...
- ...the intersection of foundational AI and systems design for real-world impact. Our mission... ...seeking a highly experienced Principal ML Engineer (Applied / Systems) to join our... ...approaches. Build, optimize, and scale inference pipelines and model serving infrastructure...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Low-Latency ML Inference Engineer (GPU Systems). Be the first to apply!
- machine learning software engineer New York, NY
- ai ml engineer New York, NY
- graduate machine learning engineer New York, NY
- computer vision machine learning engineer New York, NY
- machine learning engineer New York, NY
- senior ml engineer New York, NY
- junior machine learning research engineer New York, NY
- machine learning ai engineer New York, NY
- data scientist machine learning engineer New York, NY
- healthcare systems engineer New York, NY


