Machine Learning Engineer- Inference Optimization | Experienced Hire
Susquehanna International Group LLP
Overview We are looking for a Machine Learning Engineer focused on low-latency inference optimization to help build, tune, and productionize high-performance model serving systems. This role sits at the intersection of machine learning, systems engineering, and GPU performance. You will work on inference workloads where latency, throughput, reliability, and hardware efficiency all matter, and where a deep understanding of modern inference runtimes can meaningfully improve production outcomes. You will work closely with quantitative researchers and engineers to understand model structure, identify inference bottlenecks, and turn research ideas into efficient production systems. The work may involve other types of models, but focuses on transformer-style architectures, and structured inference workloads. You will evaluate and tune frameworks and related serving or compilation systems, while also reasoning about GPU execution, memory layout, batching strategies, precision tradeoffs, and end-to-end latency. What you'll do Design, build, and optimize low-latency inference systems for production machine learning workloads. Profile model inference pipelines across model execution, runtime configuration, batching, memory movement, serialization, networking, and I/O. Evaluate, integrate, and tune inference runtime systems. Improve latency, throughput, GPU utilization, for production inference workloads. Build and support benchmarking and profiling tools to compare model variants, hardware targets, runtime configurations, and deployment strategies. Debug performance issues involving GPU memory, compute saturation, kernel behavior, CPU/GPU coordination, data movement, and serving-layer overhead. Help shape model and system design choices so that research models are efficient to deploy under real latency constraints. Where necessary, collaborate with lower-level systems or GPU specialists on custom operators, kernel-level optimization, or hardware-specific performance work. What we’re looking for Experience deploying, optimizing, or operating machine learning inference workloads in production or production-like environments. Programming experience in Python, Java, C# etc. and at least one systems language such as C, C++, Rust, or Go Solid understanding of modern ML frameworks such as PyTorch, including model execution, export, tracing, compilation, and performance profiling. Ability to reason about latency, throughput, batching, memory use, GPU utilization, and reliability under real workloads. Strong practical judgment around tradeoffs between model quality, latency, throughput, implementation complexity, and maintainability. Preferred qualifications Experience optimizing inference for latency-sensitive or high-throughput applications. Experience with model optimization techniques such as quantization, pruning, distillation, operator fusion, graph lowering, custom operators, or model compilation. Exposure to CUDA, Triton language, ROCm, PTX, CuTe, CUTLASS, FlashInfer, or similar low-level GPU programming tools. Experience running inference workloads on Kubernetes or GPU clusters, including scheduling, autoscaling, observability, and resource management. Background in mathematics, physics, computer science, engineering, statistics, quantitative finance, or another technical field. Demonstrated ability to improve real-world inference performance beyond a baseline framework implementation. #J-18808-Ljbffr Susquehanna International Group, LLP
- Overview We’re looking for a versatile Machine Learning Engineer to help build and optimize the core infrastructure that supports our AI research. This role involves working across the stack—from data processing and training efficiency to low-level GPU programming and...Suggested
- Susquehanna International Group, LLP is seeking a Machine Learning Engineer in Bala Cynwyd, PA. This role focuses on low-latency inference optimization for high-performance model serving systems. You will collaborate with researchers to optimize performance, evaluate frameworks...Suggested
- ...Overview Susquehanna is hiring experienced professionals as... ...physics, statistics, electrical engineering, computer science, operations... ...desire to continuously innovate, learn, and grow What we offer... ...markets, Susquehanna applies machine learning and advanced quantitative...Suggested
$65.5k - $134k
...work closely with our experienced client-serving... ...decisions, and continuously learn from real-world... ...for autonomous systems Optimize inference costs while maintaining... ...Knowledge of prompt engineering and in-context learning... ...advisory services, we hire and develop the most...SuggestedSummer holidayFlexible hours$76.2k - $174.1k
...work closely with our experienced client-serving professionals... ...a focus on continuous learning and skills development... ...ML and DL models, machine learning workflows,... ...work experience in AI/ML engineering or MLE/ML Ops... ...advisory services, we hire and develop the most passionate...SuggestedWork experience placementSummer holidayFlexible hours$118k - $176k
...March 2025) Day to Day The Machine Learning Engineer I role partners closely with... ...prepare datasets, train and optimize models, and maintain and improve model inference services. You will learn and... ...belonging are fundamental to our hiring practices and company culture...Work experience placementLocal area- ...and fulfillment. We use machine learning and Internet-scale data to... ...modeling, and general causal inference. Search & Discovery ML... ...works alongside world-class engineers, data scientists, and product... ...willingness-to-pay, and optimizing revenue and user experience...Remote jobPermanent employmentWork experience placementInternshipWork at officeWork from homeFlexible hours
- ...greenfield development opportunities to optimize existing tooling from scratch What... ...'s degree in Computer Science, Engineering, Mathematics or related discipline or... ...of global markets, Susquehanna applies machine learning and advanced quantitative research to...
$163k - $245k
...through innovative technology. We integrate machine learning at every step to create consistent,... ...users. Our teams consist of Software Engineers, UX Designers, Product Managers, and Machine... ...and belonging are fundamental to our hiring practices and company culture, forming...Work experience placementLocal area- ...communicate complex technical concepts to non-engineering stakeholders in an accessible, outcome... ...An MS or PhD in Computer Science, Machine Learning, Statistics, or a related field, or... ...operating large-scale ad delivery and optimization systems. Experience working with...Immediate startWorldwide
$163k - $245k
...Visits, March 2025) Day to Day As a Machine Learning Engineer III, you will be a team lead. You will... ...cross-functional teams to enhance and optimize search algorithms for improved... ...and belonging are fundamental to our hiring practices and company culture, forming...Work experience placementLocal area- ...Machine Learning / Data Science Engineer CapTech is an award-winning consulting firm that collaborates with clients to achieve what's possible through... ...). Productionizing ML systems with a focus on optimization and scalability to satisfy clients' requirements. Growing...Work at officeRemote workVisa sponsorshipWork visaFlexible hours
- ...regular in-person events. Learn more about our flexible approach... ...About the Role: As a Machine Learning Engineer, you will have the... ...crucial areas such as routing optimization, pricing, dispatch, and mapping... ...role is eligible for a new hire equity grant as well as annual...Permanent employmentFull timeWork at officeRemote workWork from homeFlexible hours
- ...Position Overview: We are looking for a Principal Machine Learning Engineer to serve as a hands-on technical leader for machine learning... ...model deployment, batch scoring, real-time or near-real-time inference, model versioning, monitoring, retraining, and performance...Temporary workRemote work
- ...driven and highly collaborative, bringing together researchers, engineers, and traders to design and deploy impactful strategies in... ...unique challenges of global markets, Susquehanna applies machine learning and advanced quantitative research to vast datasets in order...
$106.9k - $176.5k
...Data and Decision Science – AI Native Engineering AI/Machine Learning Engineer, Senior Consultant... ...pipelines to ensure data integrity and optimize learning processes, all while collaborating... ...Augmented Generation (RAG)) and experienced with ML packages like scikit-learn...Full timeWork experience placementSummer holidayFlexible hours$125.5k - $230.2k
...Data and Decision Science – AI Native Engineering AI/Machine Learning Engineer, Manager Consultant The... ...data is clean, accurate, and optimized for XOps processes. Additionally, you... ...Retrieval Augmented Generation (RAG)) and experienced with ML packages like scikit-learn...Full timeWork experience placementSummer holidayFlexible hours$101.84k - $165.49k
...AI/ML Engineer 2 Job Summary: Responsible for the... ...of advanced AI and machine learning solutions in support... ...feature engineering, model optimization, deployment, and... ...technical stakeholders. Experienced in gathering... ...depending on date of hire, schedule type, and the...Work at officeFlexible hours$95.3k - $158.8k
...Are you a collaborative Machine Learning Ops Engineer looking to work for a mission driven global organization... ..., grounding), and A/B testing. Optimize infrastructure costs through... ...committed to providing a fair and accessible hiring process. If you have a disability or...Local area- ...Overview: Machine Learning Engineer Philadelphia, PA OR Washington, DC | Hybrid: 3-4 days/week 9 + Months Role: Design and validate ML models that support engineering tooling teams. Enhance existing AIML automation tools (e.g., Speech data), implement...3 days per week
$107.66k - $161.7k
...topics and providing a unique platform to learn and connect with others. Poe : a... ...About the Team and Role: Our small engineering team works on challenging problems every... ...product and our company. At Poe, we use Machine Learning in various parts of the product...Remote jobFull timeWork experience placementInternship- ...networking, and middleware layers integrate with software components What we're looking for Bachelor's degree in Computer Science, Engineering, Mathematics or related discipline or its foreign equivalent. Relevant technical experience may substitute for education. The...
- ...Senior Data Engineer URBN is seeking a Senior Data Engineer to join our Enterprise Data... .... You will collaborate with a deeply experienced team of engineers and managers to deliver... .... Comfortable using AI assistants to optimize workflows and generate boilerplate code;...
- ...Services (AWS). Our AWS Data Engineer will play a key role in... ...for business insights and machine learning models.The ideal candidate... ...seeking a highly skilled and experienced AWS Data Engineer who will... ...data solutions. Experience in optimizing Snowflake performance and managing...Permanent employmentFull timeContract workRemote workFlexible hours
- ...Senior Data Science Software Engineer At T-Mobile Advertising... ...powered by advanced machine learning, large-scale data processing... ...scalable data, training, and inference pipelines using distributed... ...structures, statistical modeling, optimization algorithms, big data, and...Full timeTemporary workPart timeWork experience placementLocal areaFlexible hours
- ...at Nuuly: Inventory Science, aimed at optimizing the acquisition and utilization of inventory... ...involve utilizing data science and machine learning best practices to achieve business... ...troubleshooting ML model behavior. Conduct causal inference studies and exploratory analyses to...
$116.5k - $210.1k
...powered by advanced machine learning, large-scale data processing... ...Science Software Engineer to join our team. In... ...data, training, and inference pipelines using distributed... ...modeling, optimization algorithms, big data,... ...for this role based on hiring location, At T-Mobile...Full timeTemporary workPart timeWork experience placementLocal areaFlexible hours$150k - $300k
...work in person, remotely or in a hybrid work environment. Primary Responsibilities: Design and implement multi-period portfolio optimization frameworks incorporating transaction costs, slippage, and other market frictions Leverage MOSEK and other optimization solvers...Remote work- ...Description Job Description Senior AI / ML Engineer – Enterprise Data & GenAI Solutions... .../ Full-Time Overview: We are hiring a Senior AI/ML Engineer to lead development... ...frameworks for automation and governance ~ Optimize models and pipelines for performance and...Full timeContract workRemote work
- ...Staff Data Engineer URBN is hiring a Staff Data Engineer to join the development... .... We are looking for an experienced data engineer to join our... ...that enable training and inference of ML models in... ...Experience deploying machine learning models in production....
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Machine Learning Engineer- Inference Optimization | Experienced Hire. Be the first to apply!
- machine learning Bala Cynwyd, PA
- hire power Bala Cynwyd, PA
- recruitment team leader Bala Cynwyd, PA
- job hiring Bala Cynwyd, PA
- recruiting lead Bala Cynwyd, PA
- international hiring Bala Cynwyd, PA
- driver hiring event Bala Cynwyd, PA
- recruiting associate Bala Cynwyd, PA
- recruiting analyst Bala Cynwyd, PA
- recruiting specialist Bala Cynwyd, PA



