ML Inference Engineer - Low-Latency GPU Systems
Susquehanna International Group LLP
Susquehanna International Group, LLP is seeking a Machine Learning Engineer in Bala Cynwyd, PA. This role focuses on low-latency inference optimization for high-performance model serving systems. You will collaborate with researchers to optimize performance, evaluate frameworks, and debug GPU memory issues while managing inference workloads effectively. A strong background in modern ML frameworks, programming experience, and an understanding of production environments are essential.
- ...Machine Learning Engineer We are looking for a Machine... ...Engineer focused on low-latency inference optimization to help... ...performance model serving systems. This role sits at the... ...engineering, and GPU performance. You will... ...understanding of modern ML frameworks such as PyTorch...
- ...Cynwyd, PA, seeks an experienced Software Developer for its Front Office team. You will design and optimize prediction market trading systems, collaborating with traders and technologists to enhance trading capabilities. The ideal candidate has a Bachelor's degree in a...
$118k - $176k
...Day The Machine Learning Engineer I role partners closely... ...maintain and improve model inference services. You will learn and... ...Indeed. Work spans classical ML through LLM systems. You improve search and... ...Monitor model quality, latency, and cost Debug data, models... (Work experience placement, Local area)
- ...a versatile Machine Learning Engineer to help build and optimize the... ...and training efficiency to low-level GPU programming and performance tooling. You'll contribute to systems that are reliable, scalable,... ...infrastructure at scale Familiarity of AI/ML workloads Comfortable working...
- ...Reporting to the Director of Data & AI Engineering, the AI/ML Engineer is a mid-level, hands-on... ...pipelines that move data from source systems into Snowflake via Fivetran and dbt.... ...& Cost Optimization: Monitor inference latency, token usage, and compute costs across... (Work at office, Local area, Remote work)
- ...Trading System Engineering Summer Internship Program Susquehanna is looking for highly motivated full-time students for our 10-week trading... ...projects for our trading strategy development teams in low latency C++ trading system development. Additionally, you'll have... (Full time, Summer work, Casual work, Internship, Summer internship, Visa sponsorship)
- ...highly motivated full-time students for our 10-week trading system engineering summer internship program. This is a computationally... ...challenging projects for our trading strategy development teams in low latency C++ trading system development. Additionally, you’ll have... (Full time, Summer work, Casual work, Internship, Summer internship, Visa sponsorship)
$65.5k - $134k
...grade AI agents and multi-agent systems that transform financial... ...of customers with sub-second latency requirements Build robust... ...autonomous systems Optimize inference costs while maintaining performance... ...) Knowledge of prompt engineering and in-context learning optimization... (Summer holiday, Flexible hours)
- ...NLP PEOPLE is seeking a Staff AI/ML Engineer to lead the development of Agentic AI capabilities and LLM-based applications. Located in... ...implementing machine learning algorithms, and designing advanced AI systems. The ideal candidate will have over 8 years of experience in... (Relocation package, Flexible hours)
- ...developing custom solutions and low-code solutions, to... ...role in building and evolving ML/AI applications and services on... ...with product, architecture, and engineering teams to deliver reliable, high... ...Build and deliver scalable systems that incorporate machine learning...
$95.3k - $158.8k
...you a collaborative Machine Learning Ops Engineer looking to work for a mission driven global... ..., reliable, and scalable services. Our systems operate over one of the world's largest medical... .... Key Responsibilities ML & LLM Engineering, Search and Recommendation... (Local area)
- ...roles open across our various ML teams. You can find a blurb... ...modeling, and general causal inference. Search & Discovery ML :... ...works alongside world-class engineers, data scientists, and product... ...solutions, applying LLMs, agentic systems, and computer vision to... (Remote job, Permanent employment, Work experience placement, Internship, Work at office, Work from home, Flexible hours)
- ...life-saving emergency response systems and remote patient monitoring... ...a Principal Machine Learning Engineer to serve as a hands-on technical... .... Develop practical ML models that balance predictive... ..., real-time or near-real-time inference, model versioning, monitoring,... (Temporary work, Remote work)
- ...Field Systems Engineer Eastern Lift Truck Co. is currently seeking a Field Systems Engineer to support our Warehouse Products Group. We offer tremendous opportunity for growth, competitive compensation and benefits for individuals who want a career with a great company... (Temporary work)
- ...Field Systems Engineer (Windows / Endpoint Support) We are looking for a Field Systems Engineer (Windows / Endpoint Support) to support the deployment, configuration, and maintenance of Windows-based systems across operational environments. This role requires a hands...
- ...Description VAST Data is looking for a Senior Systems Engineer to join our growing team! This is a great opportunity to be part... ...them available for real-time data analysis and AI training and inference. Designed from the ground up to make AI simple to deploy and... (Traineeship)
$150k - $210k
...motivated individuals that come together to form high performance team? Come join WWT today! We are looking for a Consulting Systems Engineer to join our Global Enterprise Sales team. What will you be doing? As a Consulting Systems Engineer (CSE), you will lead the... (Full time, Live in, Shift work)
- ...validation and configuration management, the engineer ensures reliability and contributes to... ...and constraints into detailed low-level designs, including drafting solution... ...of the top 15 not-for-profit health care systems in the country and the largest provider in... (Daily paid, Full time, Temporary work, Part time, Work experience placement, Flexible hours, Shift work)
$77k - $202k
...At PwC, our people in data and analytics engineering focus on leveraging advanced technologies... ...developing and implementing advanced AI and ML solutions to drive innovation and enhance... ...and optimising algorithms, models, and systems to enable intelligent decision-making and... (Full time, H1b)
- ...focused on Information Technology and Communications consulting, system engineering, integration, deployment and operation of state of the art... ...information, military or veteran status, citizenship, low-income status or any other status or characteristic protected... (Full time, Contract work, Temporary work, Work at office, Shift work)
$175k - $240k
...Founding AI/ML Engineer - Guthrie AI Location: Philadelphia, PA... ...and ship production-grade AI systems that solve real customer problems... ...Improve model quality, latency, reliability, and cost efficiency... ...workflows into products Inference optimization or evaluation... (Remote job, Full time, For subcontractor, H1b, Work at office, Visa sponsorship)
- ...focused on Information Technology and Communications consulting, system engineering, integration, deployment and operation of state of the art... ...information, military or veteran status, citizenship, low-income status or any other status or characteristic protected... (Full time, Contract work, Temporary work, Work at office, Shift work)
$163k - $245k
...Total Visits, March 2025) Day to Day As a Machine Learning Engineer III, you will be a team lead. You will own one of the team's... ...Machine Learning conferences, such as Neural Information Processing Systems (NeurIPS), the International Conference on Machine Learning (... (Work experience placement, Local area)
- ...AI/ML Engineer (TS/SCI with CI Poly) {S} Job Category: ENG Requisition Number: AIMLE002904... ...datasets and building models to perform inference BS in machine learning, computer science... ...Experience implementing algorithms on the GPU in Python or C++ using CUDA and other CUDA... (Full time, Temporary work, Work at office, Remote work, Visa sponsorship, Relocation package, Flexible hours)
$101.84k - $165.49k
...AI/ML Engineer 2 Job Summary: Responsible for the end-to-end design, development, and deployment of advanced AI and machine learning... ...machine learning, spanning the delivery of LLM-based agents, RAG systems, MCP (Model Context Protocol) servers, and predictive models... (Work at office, Flexible hours)
- ...CVR prediction models that power ad ranking and recommendation systems. Build and refine systems for creative understanding and... ...feedback. Clearly communicate complex technical concepts to non-engineering stakeholders in an accessible, outcome-focused way. What... (Immediate start, Worldwide)
$107.66k - $161.7k
...on our Poe product. About the Team and Role: Our small engineering team works on challenging problems every day. We have a culture... ...Engineers have high impact by advancing the current Machine Learning systems, building performant and reliable LLM applications and... (Remote job, Full time, Work experience placement, Internship)
- ...About the Role: As a Machine Learning Engineer, you will have the opportunity to collaborate... ...and enhance Instacart's marketplace systems. You will use machine learning to devise... ...understand business needs and create impactful ML/AI applications. Actively engage with... (Permanent employment, Full time, Work at office, Remote work, Work from home, Flexible hours)
- ...opportunity now! Position Overview: The Staff AI/ML Engineer (LLMs) will lead the development of... ...retrieval-augmented generation (RAG) systems and semantic search architectures Build... ...factory) Prompt engineering techniques / Inference time techniques (e.g. chain of thought,... (Temporary work, Work at office, Visa sponsorship, Relocation package, Flexible hours)



