ML Systems Engineer: Cloud‑Scale Training Infra
Basis Research Institute
A nonprofit AI research organization in New York City seeks a full-time ML Systems Engineer. This role involves managing distributed training infrastructure, debugging complex issues, and optimizing cloud resources to enhance operational efficiency. Ideal candidates will have expertise in ML systems and cloud administration. Join a team focused on solving impactful problems through advanced AI infrastructure. #J-18808-Ljbffr Basis Research Institute
- ...problems . This means expanding the scale, complexity, and breadth of... ...human values first. About the Role ML Systems Engineers at Basis ensure training and evaluation infrastructure is fast... ...distributed training frameworks through cloud administration, making it possible...TrainingFull timeWork at office
- Prsala is looking for a reliable Systems Administrator to manage and maintain their infrastructure and IT systems. This role supports a... ...stable, secure, and monitored. Responsibilities include managing cloud infrastructure, handling IAM, and implementing security best practices...SuggestedRemote jobFlexible hours
$250k - $350k
...Applied ML Systems Engineer - Finance - NEW YORK - UNITED... ...GPU kernels trying to shave training time. Other weeks you'll be... ...machine" and "it works at scale, reliably, for months" - I must... ...Brain, DeepMind, Ads ML, Infra); Meta (FAIR, Infra, Recsys)...TrainingPermanent employmentFull timeWork experience placementInternshipImmediate startRemote workRelocationRelocation package- Gritt Robotics Inc is seeking a Software - ML & Cloud Infrastructure Engineer to design scalable cloud infrastructure for AI and data pipelines. Join... ...product evolution and develop high-performance ML systems. The ideal candidate has 4+ years of experience in deploying...Suggested
- A dynamic technology firm in New York is seeking a talented Senior/Staff level Systems Engineer to develop and scale a dedicated cloud for CI workloads. The role offers an opportunity to solve complex systems problems and build a new CI cloud from the ground up. Candidates...Suggested
- ...Role Overview Scale’s rapidly growing Global Public... ...high-quality training data for national LLMs... ...supporting end-to-end system reliability, real-time... ...integration, and the resilient cloud infrastructure... ...evolution: Partner with our Engineering and ML teams to ensure the lessons...Training
$250k - $350k
...function of our society. At Scale, our mission is to... ...state of the art post‑training algorithms to reach the... ...world. The Enterprise ML Research Lab works on the... ...As an ML Sys Research Engineer, you’ll work on building... ...to optimize our ML system. Your customer will be...TrainingFull time- Reflection, based in New York, is seeking an experienced professional to build and scale distributed training systems for frontier model pre-training. You will work closely with research teams to design large-scale training runs and optimize training efficiency across...Training
- ...construction of large-scale infrastructure around the globe. Gritt’s systems are already deployed commercially... ...VCs. Role: Software - ML & Cloud Infrastructure Location... ...& Cloud Infrastructure Engineer to join our team. As an... ...and deploy scalable AI training and validation...Training
$141.1k - $262.1k
F. Hoffmann-La Roche AG is seeking a motivated ML Engineer for its Genentech team in New York. The role focuses on designing and maintaining ML infrastructure to support drug discovery initiatives. The ideal candidate will have a strong background in AWS, Python, and C++...- ...powers healthcare AI at scale. New Jersey or... ...building applied AI systems that operate inside... .... This role is for engineers who want to build the foundational ML infrastructure that... ...systems that train, deploy, and monitor... ...) Comfortable with cloud infrastructure (AWS...TrainingRemote jobFull time
- ...Square). About the Role Mirage is seeking an ML Engineer to push the boundaries of large language... ...for building and extending agentic systems that understand and operate over complex... ...creative tasks Develop novel approaches for training and adapting the large language models...TrainingFull timeLocal areaNight shift
$94.4k - $232.75k
Quora, Inc. is looking for a Machine Learning Engineer to architect and maintain large-scale distributed systems that support our ML development workflow. This role is crucial for enhancing collaboration among ML engineers and requires coding proficiency in Python or C++...Remote work- ...Senior GPU Systems / AI Infrastructure Engineer (NYC) Location: New York City (Hybrid... ...A-C / high-growth AI infra) About the Role We’re... ...infrastructure powering large-scale model training and inference. This role... ...Collaborate closely with ML researchers and infra...TrainingPermanent employment
$152k - $272.25k
...Principal Machine Learning Engineer, ML Platform and Systems Architecture****POSITION... ...design and evolution of large-scale machine learning platforms... ...capabilities across training, evaluation, deployment, and... ...distributed data processing, and cloud-native platform...TrainingRemote work$100k - $136k
The Cloud Systems Engineer will play a vital role within the 11:11 Systems Cloud Operations organization... ...an ecosystem of large- and small-scale x86 and Open Systems (Unix/Linux) servers... ...level 11:11 Systems staff. Provide training and mentoring to junior level 11:11...TrainingWork experience placementWork at officeVisa sponsorship- Machinify is looking for a Sr/Director of Engineering to lead our AI/ML Engineering team in the United States. You will oversee a team of engineers... ...the core AI/ML platform and ensure its reliability at scale. The ideal candidate will have extensive experience in backend...Remote job
$137k - $180k
...The Machine Learning Engineering team designs, builds... ...in crafting and scaling foundational backend systems across predictive systems... ...analytics and model training Explore, integrate,... ...experience as a ML Engineer with a strong... ...development Experience with cloud platforms,...TrainingTemporary workRemote workHome office$140k - $155k
...includes robust sensing systems across imaging... ...-driven Field Engineering that drives real transformation... ...annotation, model training, and data... ...Work closely with ML engineers to... ...integrations with scale systems, ERPs, PLCs... ...React / React Native) Cloud computing (GCP,...TrainingWork experience placementFlexible hours$156.5k - $181k
...FinTech) is seeking an experienced Lead Cloud Systems Engineer (Microsoft 365, AWS, Collaboration... ...executes transactions on an extraordinary scale which has bolstered liquidity in the... ...Solutions. Strong communication and training skills for helpdesk enablement and executive...TrainingFull timeH1bWork at officeLocal areaRemote work- LiveKit is seeking a Senior/Staff Engineer to enhance our platform's core services and observability. This role demands expertise in distributed systems and a strong grasp of programming fundamentals. You will design resilient architectures, improve system reliability,...
- ...looking for a Founding Machine Learning Engineer to join their team in New York City. This... ...implementing optical design agents, developing ML models to optimize outcomes. The ideal... ..., you will define data structures and enhance training data. #J-18808-Ljbffr PhotoniumTrainingFull time
$150k
A leading staffing agency is seeking a mid-senior level professional for a Contract opportunity, focusing on multi-cloud architecture and identity management. In this role, you will design and integrate technology solutions supporting acquisitions, manage CI/CD pipelines...Remote jobContract work- Elea Ecuador is looking for a Senior Machine Learning Engineer in New York. You will join a team that focuses on machine learning and music... ...with various cross-functional teams, and improving systems that connect artists and fans. A strong background in machine learning...Remote job
- ...architecture changes, prompt engineering, fine‑tuning, or rule‑... ...teams to deploy, scale, and monitor pipelines... ...and where the current system fails. Run the evaluation... ...Requirements 2+ years building ML/AI systems in... ...infrastructure glue, not just model training scripts Built...Training
- ...progressive financial technology firm is seeking a Senior ML Infrastructure Engineer in Kentucky. The role involves building scalable ML pipelines... ...solutions for credit risk management, and working on large-scale data analytics. Ideal candidates have 5+ years in ML...Remote jobFlexible hours
- Softswiss is seeking a hands-on System Engineer / DevOps - Senior to ensure the design, automation, and maintenance of scalable infrastructure and deployment pipelines. The ideal candidate will have strong Kubernetes experience, knowledge of configuration management, and...
$150k - $175k
...creative scoring system, by leading the... ...of multimodal ML models that... ...end pipelines (training, fine-tuning, deployment... .... Scale distributed training... ...right-sized cloud infrastructure... ...5+ years in ML engineering or MLOps, with... ...abstractions, and infra that compound over...TrainingWork experience placementLocal area- ...global bank is seeking a Machine Learning Engineer to join their innovative team. In this... ...include collaborating with other specialists, scaling models for production, and monitoring... ...will have experience in Python and various ML frameworks. This position offers an...
- Zillow Group Inc. is seeking a Senior Machine Learning Engineer to lead the design and deployment of scalable ML infrastructure for its Agentic Data Foundations... ...requires over 6 years of experience in building and scaling data infrastructures, proficiency in Python and AWS...Remote job
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to ML Systems Engineer: Cloud‑Scale Training Infra. Be the first to apply!
- graduate machine learning engineer New York, NY
- machine learning engineer New York, NY
- data scientist machine learning engineer New York, NY
- junior machine learning research engineer New York, NY
- senior ml engineer New York, NY
- computer vision machine learning engineer New York, NY
- ai ml engineer New York, NY
- machine learning software engineer New York, NY
- machine learning ai engineer New York, NY
- system support engineer New York, NY


