Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Machine Learning Engineer- Inference Optimization | Experienced Hire

Susquehanna International Group LLP

Overview We are looking for a Machine Learning Engineer focused on low-latency inference optimization to help build, tune, and productionize high-performance model serving systems. This role sits at the intersection of machine learning, systems engineering, and GPU performance. You will work on inference workloads where latency, throughput, reliability, and hardware efficiency all matter, and where a deep understanding of modern inference runtimes can meaningfully improve production outcomes. You will work closely with quantitative researchers and engineers to understand model structure, identify inference bottlenecks, and turn research ideas into efficient production systems. The work may involve other types of models, but focuses on transformer-style architectures, and structured inference workloads. You will evaluate and tune frameworks and related serving or compilation systems, while also reasoning about GPU execution, memory layout, batching strategies, precision tradeoffs, and end-to-end latency. What you'll do Design, build, and optimize low-latency inference systems for production machine learning workloads. Profile model inference pipelines across model execution, runtime configuration, batching, memory movement, serialization, networking, and I/O. Evaluate, integrate, and tune inference runtime systems. Improve latency, throughput, GPU utilization, for production inference workloads. Build and support benchmarking and profiling tools to compare model variants, hardware targets, runtime configurations, and deployment strategies. Debug performance issues involving GPU memory, compute saturation, kernel behavior, CPU/GPU coordination, data movement, and serving-layer overhead. Help shape model and system design choices so that research models are efficient to deploy under real latency constraints. Where necessary, collaborate with lower-level systems or GPU specialists on custom operators, kernel-level optimization, or hardware-specific performance work. What we’re looking for Experience deploying, optimizing, or operating machine learning inference workloads in production or production-like environments. Programming experience in Python, Java, C# etc. and at least one systems language such as C, C++, Rust, or Go Solid understanding of modern ML frameworks such as PyTorch, including model execution, export, tracing, compilation, and performance profiling. Ability to reason about latency, throughput, batching, memory use, GPU utilization, and reliability under real workloads. Strong practical judgment around tradeoffs between model quality, latency, throughput, implementation complexity, and maintainability. Preferred qualifications Experience optimizing inference for latency-sensitive or high-throughput applications. Experience with model optimization techniques such as quantization, pruning, distillation, operator fusion, graph lowering, custom operators, or model compilation. Exposure to CUDA, Triton language, ROCm, PTX, CuTe, CUTLASS, FlashInfer, or similar low-level GPU programming tools. Experience running inference workloads on Kubernetes or GPU clusters, including scheduling, autoscaling, observability, and resource management. Background in mathematics, physics, computer science, engineering, statistics, quantitative finance, or another technical field. Demonstrated ability to improve real-world inference performance beyond a baseline framework implementation. #J-18808-Ljbffr Susquehanna International Group, LLP

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Machine Learning Engineer- Inference Optimization | Experienced Hire in Bala Cynwyd, PA vacancy
  • Overview We’re looking for a versatile Machine Learning Engineer to help build and optimize the core infrastructure that supports our AI research. This role involves working across the stack—from data processing and training efficiency to low-level GPU programming and... 
    Suggested

    SIG Susquehanna

    Bala Cynwyd, PA
    1 day ago
  • Susquehanna International Group, LLP is seeking a Machine Learning Engineer in Bala Cynwyd, PA. This role focuses on low-latency inference optimization for high-performance model serving systems. You will collaborate with researchers to optimize performance, evaluate frameworks... 
    Suggested

    Susquehanna International Group, LLP

    Bala Cynwyd, PA
    1 day ago
  •  ...Overview Susquehanna is hiring experienced professionals as...  ...physics, statistics, electrical engineering, computer science, operations...  ...desire to continuously innovate, learn, and grow What we offer...  ...markets, Susquehanna applies machine learning and advanced quantitative... 
    Suggested

    Susquehanna International Group

    Bala Cynwyd, PA
    1 day ago
  • $65.5k - $134k

     ...work closely with our experienced client-serving...  ...decisions, and continuously learn from real-world...  ...for autonomous systems Optimize inference costs while maintaining...  ...Knowledge of prompt engineering and in-context learning...  ...advisory services, we hire and develop the most... 
    Suggested
    Summer holiday
    Flexible hours

    EY

    Philadelphia, PA
    12 hours ago
  • $76.2k - $174.1k

     ...work closely with our experienced client-serving professionals...  ...a focus on continuous learning and skills development...  ...ML and DL models, machine learning workflows,...  ...work experience in AI/ML engineering or MLE/ML Ops...  ...advisory services, we hire and develop the most passionate... 
    Suggested
    Work experience placement
    Summer holiday
    Flexible hours

    EY

    Philadelphia, PA
    2 days ago
  • $118k - $176k

     ...March 2025) Day to Day The Machine Learning Engineer I role partners closely with...  ...prepare datasets, train and optimize models, and maintain and improve model inference services. You will learn and...  ...belonging are fundamental to our hiring practices and company culture... 
    Work experience placement
    Local area

    Indeed

    Philadelphia, PA
    3 days ago
  •  ...and fulfillment. We use machine learning and Internet-scale data to...  ...modeling, and general causal inference. Search & Discovery ML...  ...works alongside world-class engineers, data scientists, and product...  ...willingness-to-pay, and optimizing revenue and user experience... 
    Remote job
    Permanent employment
    Work experience placement
    Internship
    Work at office
    Work from home
    Flexible hours

    Instacart

    Philadelphia, PA
    3 days ago
  •  ...greenfield development opportunities to optimize existing tooling from scratch What...  ...'s degree in Computer Science, Engineering, Mathematics or related discipline or...  ...of global markets, Susquehanna applies machine learning and advanced quantitative research to... 

    Susquehanna International Group

    Bala Cynwyd, PA
    1 day ago
  • $163k - $245k

     ...through innovative technology. We integrate machine learning at every step to create consistent,...  ...users. Our teams consist of Software Engineers, UX Designers, Product Managers, and Machine...  ...and belonging are fundamental to our hiring practices and company culture, forming... 
    Work experience placement
    Local area

    Indeed

    Philadelphia, PA
    5 days ago
  •  ...communicate complex technical concepts to non-engineering stakeholders in an accessible, outcome...  ...An MS or PhD in Computer Science, Machine Learning, Statistics, or a related field, or...  ...operating large-scale ad delivery and optimization systems. Experience working with... 
    Immediate start
    Worldwide

    Toogeza

    Philadelphia, PA
    8 days ago
  • $163k - $245k

     ...Visits, March 2025) Day to Day As a Machine Learning Engineer III, you will be a team lead. You will...  ...cross-functional teams to enhance and optimize search algorithms for improved...  ...and belonging are fundamental to our hiring practices and company culture, forming... 
    Work experience placement
    Local area

    Indeed

    Philadelphia, PA
    3 days ago
  •  ...Machine Learning / Data Science Engineer CapTech is an award-winning consulting firm that collaborates with clients to achieve what's possible through...  ...). Productionizing ML systems with a focus on optimization and scalability to satisfy clients' requirements. Growing... 
    Work at office
    Remote work
    Visa sponsorship
    Work visa
    Flexible hours

    CapTech Consulting

    Philadelphia, PA
    12 hours ago
  •  ...regular in-person events. Learn more about our flexible approach...  ...About the Role: As a Machine Learning Engineer, you will have the...  ...crucial areas such as routing optimization, pricing, dispatch, and mapping...  ...role is eligible for a new hire equity grant as well as annual... 
    Permanent employment
    Full time
    Work at office
    Remote work
    Work from home
    Flexible hours

    Instacart

    Philadelphia, PA
    12 hours ago
  •  ...Position Overview: We are looking for a Principal Machine Learning Engineer to serve as a hands-on technical leader for machine learning...  ...model deployment, batch scoring, real-time or near-real-time inference, model versioning, monitoring, retraining, and performance... 
    Temporary work
    Remote work

    Medical Guardian

    Philadelphia, PA
    24 days ago
  •  ...driven and highly collaborative, bringing together researchers, engineers, and traders to design and deploy impactful strategies in...  ...unique challenges of global markets, Susquehanna applies machine learning and advanced quantitative research to vast datasets in order... 

    Susquehenna International Group

    Bala Cynwyd, PA
    4 days ago
  • $106.9k - $176.5k

     ...Data and Decision Science – AI Native Engineering AI/Machine Learning Engineer, Senior Consultant...  ...pipelines to ensure data integrity and optimize learning processes, all while collaborating...  ...Augmented Generation (RAG)) and experienced with ML packages like scikit-learn... 
    Full time
    Work experience placement
    Summer holiday
    Flexible hours

    EY

    Philadelphia, PA
    1 day ago
  • $125.5k - $230.2k

     ...Data and Decision Science – AI Native Engineering AI/Machine Learning Engineer, Manager Consultant The...  ...data is clean, accurate, and optimized for XOps processes. Additionally, you...  ...Retrieval Augmented Generation (RAG)) and experienced with ML packages like scikit-learn... 
    Full time
    Work experience placement
    Summer holiday
    Flexible hours

    EY

    Philadelphia, PA
    1 day ago
  • $101.84k - $165.49k

     ...AI/ML Engineer 2 Job Summary: Responsible for the...  ...of advanced AI and machine learning solutions in support...  ...feature engineering, model optimization, deployment, and...  ...technical stakeholders. Experienced in gathering...  ...depending on date of hire, schedule type, and the... 
    Work at office
    Flexible hours

    Day & Zimmermann

    Philadelphia, PA
    3 days ago
  • $95.3k - $158.8k

     ...Are you a collaborative Machine Learning Ops Engineer looking to work for a mission driven global organization...  ..., grounding), and A/B testing. Optimize infrastructure costs through...  ...committed to providing a fair and accessible hiring process. If you have a disability or... 
    Local area

    RELX

    Philadelphia, PA
    9 days ago
  •  ...Overview: Machine Learning Engineer Philadelphia, PA OR Washington, DC | Hybrid: 3-4 days/week 9 + Months Role: Design and validate ML models that support engineering tooling teams. Enhance existing AIML automation tools (e.g., Speech data), implement... 
    3 days per week

    Guru Schools

    Philadelphia, PA
    12 hours ago
  • $107.66k - $161.7k

     ...topics and providing a unique platform to learn and connect with others. Poe : a...  ...About the Team and Role: Our small engineering team works on challenging problems every...  ...product and our company. At Poe, we use Machine Learning in various parts of the product... 
    Remote job
    Full time
    Work experience placement
    Internship

    Quora

    Philadelphia, PA
    12 hours ago
  •  ...networking, and middleware layers integrate with software components What we're looking for Bachelor's degree in Computer Science, Engineering, Mathematics or related discipline or its foreign equivalent. Relevant technical experience may substitute for education. The... 

    SIG Susquehanna

    Bala Cynwyd, PA
    2 days ago
  •  ...Senior Data Engineer URBN is seeking a Senior Data Engineer to join our Enterprise Data...  .... You will collaborate with a deeply experienced team of engineers and managers to deliver...  .... Comfortable using AI assistants to optimize workflows and generate boilerplate code;... 

    Urban Outfitters , Inc.

    Philadelphia, PA
    12 hours ago
  •  ...Services (AWS). Our AWS Data Engineer will play a key role in...  ...for business insights and machine learning models.The ideal candidate...  ...seeking a highly skilled and experienced AWS Data Engineer who will...  ...data solutions. Experience in optimizing Snowflake performance and managing... 
    Permanent employment
    Full time
    Contract work
    Remote work
    Flexible hours

    The Data Sherpas

    Philadelphia, PA
    1 day ago
  •  ...Senior Data Science Software Engineer At T-Mobile Advertising...  ...powered by advanced machine learning, large-scale data processing...  ...scalable data, training, and inference pipelines using distributed...  ...structures, statistical modeling, optimization algorithms, big data, and... 
    Full time
    Temporary work
    Part time
    Work experience placement
    Local area
    Flexible hours

    T Mobile US

    Philadelphia, PA
    12 hours ago
  •  ...at Nuuly: Inventory Science, aimed at optimizing the acquisition and utilization of inventory...  ...involve utilizing data science and machine learning best practices to achieve business...  ...troubleshooting ML model behavior. Conduct causal inference studies and exploratory analyses to... 

    Urban Outfitters , Inc.

    Philadelphia, PA
    2 days ago
  • $116.5k - $210.1k

     ...powered by advanced machine learning, large-scale data processing...  ...Science Software Engineer to join our team. In...  ...data, training, and inference pipelines using distributed...  ...modeling, optimization algorithms, big data,...  ...for this role based on hiring location, At T-Mobile... 
    Full time
    Temporary work
    Part time
    Work experience placement
    Local area
    Flexible hours

    T-Mobile

    Philadelphia, PA
    3 days ago
  • $150k - $300k

     ...work in person, remotely or in a hybrid work environment. Primary Responsibilities: Design and implement multi-period portfolio optimization frameworks incorporating transaction costs, slippage, and other market frictions Leverage MOSEK and other optimization solvers... 
    Remote work

    STEVENS CAPITAL MANAGEMENT LP

    Radnor, PA
    4 days ago
  •  ...Description Job Description Senior AI / ML Engineer – Enterprise Data & GenAI Solutions...  .../ Full-Time Overview: We are hiring a Senior AI/ML Engineer to lead development...  ...frameworks for automation and governance ~ Optimize models and pipelines for performance and... 
    Full time
    Contract work
    Remote work

    NavitasPartners

    Philadelphia, PA
    18 days ago
  •  ...Staff Data Engineer URBN is hiring a Staff Data Engineer to join the development...  .... We are looking for an experienced data engineer to join our...  ...that enable training and inference of ML models in...  ...Experience deploying machine learning models in production.... 

    Urban Outfitters , Inc.

    Philadelphia, PA
    a month ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Machine Learning Engineer- Inference Optimization | Experienced Hire. Be the first to apply!