
ML Systems Engineer: Cloud‑Scale Training Infra

Basis Research Institute

A nonprofit AI research organization in New York City seeks a full-time ML Systems Engineer. This role involves managing distributed training infrastructure, debugging complex issues, and optimizing cloud resources to enhance operational efficiency. Ideal candidates will have expertise in ML systems and cloud administration. Join a team focused on solving impactful problems through advanced AI infrastructure.

Vacancy posted 2 days ago
Similar jobs that could be interesting for you
Based on the ML Systems Engineer: Cloud‑Scale Training Infra in New York, NY vacancy
  •  ...problems . This means expanding the scale, complexity, and breadth of...  ...human values first. About the Role ML Systems Engineers at Basis ensure training and evaluation infrastructure is fast...  ...distributed training frameworks through cloud administration, making it possible... 
    Training
    Full time
    Work at office

    Basis Research Institute

    New York, NY
    2 days ago
  • Prsala is looking for a reliable Systems Administrator to manage and maintain their infrastructure and IT systems. This role supports a...  ...stable, secure, and monitored. Responsibilities include managing cloud infrastructure, handling IAM, and implementing security best practices... 
    Suggested
    Remote job
    Flexible hours

    Prsala

    New York, NY
    2 days ago
  • $250k - $350k

     ...Applied ML Systems Engineer  - Finance - NEW YORK - UNITED...  ...GPU kernels trying to shave training time. Other weeks you'll be...  ...machine" and "it works at scale, reliably, for months" - I must...  ...Brain, DeepMind, Ads ML, Infra); Meta (FAIR, Infra, Recsys)... 
    Training
    Permanent employment
    Full time
    Work experience placement
    Internship
    Immediate start
    Remote work
    Relocation
    Relocation package
    New York, NY
    23 days ago
  • Gritt Robotics Inc is seeking a Software - ML & Cloud Infrastructure Engineer to design scalable cloud infrastructure for AI and data pipelines. Join...  ...product evolution and develop high-performance ML systems. The ideal candidate has 4+ years of experience in deploying... 
    Suggested

    Gritt Robotics Inc

    Brooklyn, NY
    2 days ago
  • A dynamic technology firm in New York is seeking a talented Senior/Staff level Systems Engineer to develop and scale a dedicated cloud for CI workloads. The role offers an opportunity to solve complex systems problems and build a new CI cloud from the ground up. Candidates... 
    Suggested

    Crossing Hurdles

    New York, NY
    4 days ago
  •  ...Role Overview Scale’s rapidly growing Global Public...  ...high-quality training data for national LLMs...  ...supporting end-to-end system reliability, real-time...  ...integration, and the resilient cloud infrastructure...  ...evolution: Partner with our Engineering and ML teams to ensure the lessons... 
    Training

    AI Chopping Block, Inc.

    New York, NY
    3 days ago
  • $250k - $350k

     ...function of our society. At Scale, our mission is to...  ...state of the art post‑training algorithms to reach the...  ...world. The Enterprise ML Research Lab works on the...  ...As an ML Sys Research Engineer, you’ll work on building...  ...to optimize our ML system. Your customer will be... 
    Training
    Full time

    Scale AI, Inc.

    New York, NY
    2 days ago
  • Reflection, based in New York, is seeking an experienced professional to build and scale distributed training systems for frontier model pre-training. You will work closely with research teams to design large-scale training runs and optimize training efficiency across... 
    Training

    Reflection

    New York, NY
    4 days ago
  •  ...construction of large-scale infrastructure around the globe. Gritt’s systems are already deployed commercially...  ...VCs. Role: Software - ML & Cloud Infrastructure Location...  ...& Cloud Infrastructure Engineer to join our team. As an...  ...and deploy scalable AI training and validation... 
    Training

    Gritt Robotics Inc

    Brooklyn, NY
    2 days ago
  • $141.1k - $262.1k

    F. Hoffmann-La Roche AG is seeking a motivated ML Engineer for its Genentech team in New York. The role focuses on designing and maintaining ML infrastructure to support drug discovery initiatives. The ideal candidate will have a strong background in AWS, Python, and C++... 

    F. Hoffmann-La Roche AG

    New York, NY
    4 days ago
  •  ...powers healthcare AI at scale. New Jersey or...  ...building applied AI systems that operate inside...  .... This role is for engineers who want to build the foundational ML infrastructure that...  ...systems that train, deploy, and monitor...  ...) Comfortable with cloud infrastructure (AWS... 
    Training
    Remote job
    Full time

    Neurex AI Limited

    New York, NY
    2 days ago
  •  ...Square). About the Role Mirage is seeking an ML Engineer to push the boundaries of large language...  ...for building and extending agentic systems that understand and operate over complex...  ...creative tasks Develop novel approaches for training and adapting the large language models... 
    Training
    Full time
    Local area
    Night shift

    Mirage

    New York, NY
    17 hours ago
  • $94.4k - $232.75k

    Quora, Inc. is looking for a Machine Learning Engineer to architect and maintain large-scale distributed systems that support our ML development workflow. This role is crucial for enhancing collaboration among ML engineers and requires coding proficiency in Python or C++... 
    Remote work

    Quora, Inc.

    New York, NY
    3 days ago
  •  ...Senior GPU Systems / AI Infrastructure Engineer (NYC) Location: New York City (Hybrid...  ...A-C / high-growth AI infra) About the Role We’re...  ...infrastructure powering large-scale model training and inference. This role...  ...Collaborate closely with ML researchers and infra... 
    Training
    Permanent employment
    New York, NY
    more than 2 months ago
  • $152k - $272.25k

     ...Principal Machine Learning Engineer, ML Platform and Systems Architecture POSITION...  ...design and evolution of large-scale machine learning platforms...  ...capabilities across training, evaluation, deployment, and...  ...distributed data processing, and cloud-native platform... 
    Training
    Remote work

    Autodesk, Inc.

    New York, NY
    2 days ago
  • $100k - $136k

    The Cloud Systems Engineer will play a vital role within the 11:11 Systems Cloud Operations organization...  ...an ecosystem of large- and small-scale x86 and Open Systems (Unix/Linux) servers...  ...level 11:11 Systems staff. Provide training and mentoring to junior level 11:11... 
    Training
    Work experience placement
    Work at office
    Visa sponsorship

    Storcom, Inc.

    New York, NY
    1 day ago
  • Machinify is looking for a Sr/Director of Engineering to lead our AI/ML Engineering team in the United States. You will oversee a team of engineers...  ...the core AI/ML platform and ensure its reliability at scale. The ideal candidate will have extensive experience in backend... 
    Remote job

    Machinify

    New York, NY
    1 day ago
  • $137k - $180k

     ...The Machine Learning Engineering team designs, builds...  ...in crafting and scaling foundational backend systems across predictive systems...  ...analytics and model training Explore, integrate,...  ...experience as a ML Engineer with a strong...  ...development Experience with cloud platforms,... 
    Training
    Temporary work
    Remote work
    Home office

    SmartAsset

    New York, NY
    2 days ago
  • $140k - $155k

     ...includes robust sensing systems across imaging...  ...-driven Field Engineering that drives real transformation...  ...annotation, model training, and data...  ...Work closely with ML engineers to...  ...integrations with scale systems, ERPs, PLCs...  ...React / React Native) Cloud computing (GCP,... 
    Training
    Work experience placement
    Flexible hours

    Visia

    New York, NY
    3 days ago
  • $156.5k - $181k

     ...FinTech) is seeking an experienced Lead Cloud Systems Engineer (Microsoft 365, AWS, Collaboration...  ...executes transactions on an extraordinary scale which has bolstered liquidity in the...  ...Solutions. Strong communication and training skills for helpdesk enablement and executive... 
    Training
    Full time
    H1b
    Work at office
    Local area
    Remote work

    U.S. Financial Technology, LLC

    New York, NY
    2 days ago
  • LiveKit is seeking a Senior/Staff Engineer to enhance our platform's core services and observability. This role demands expertise in distributed systems and a strong grasp of programming fundamentals. You will design resilient architectures, improve system reliability,... 

    LiveKit

    New York, NY
    2 days ago
  •  ...looking for a Founding Machine Learning Engineer to join their team in New York City. This...  ...implementing optical design agents, developing ML models to optimize outcomes. The ideal...  ..., you will define data structures and enhance training data.
    Training
    Full time

    Photonium

    New York, NY
    3 days ago
  • $150k

    A leading staffing agency is seeking a mid-senior level professional for a Contract opportunity, focusing on multi-cloud architecture and identity management. In this role, you will design and integrate technology solutions supporting acquisitions, manage CI/CD pipelines... 
    Remote job
    Contract work

    The Planet Group

    New York, NY
    2 days ago
  • Elea Ecuador is looking for a Senior Machine Learning Engineer in New York. You will join a team that focuses on machine learning and music...  ...with various cross-functional teams, and improving systems that connect artists and fans. A strong background in machine learning... 
    Remote job

    Elea Ecuador

    New York, NY
    3 days ago
  •  ...architecture changes, prompt engineering, fine‑tuning, or rule‑...  ...teams to deploy, scale, and monitor pipelines...  ...and where the current system fails. Run the evaluation...  ...Requirements 2+ years building ML/AI systems in...  ...infrastructure glue, not just model training scripts Built... 
    Training

    triomics inc.

    New York, NY
    5 days ago
  •  ...progressive financial technology firm is seeking a Senior ML Infrastructure Engineer in Kentucky. The role involves building scalable ML pipelines...  ...solutions for credit risk management, and working on large-scale data analytics. Ideal candidates have 5+ years in ML... 
    Remote job
    Flexible hours

    Optasia Group

    Brooklyn, NY
    2 days ago
  • Softswiss is seeking a hands-on System Engineer / DevOps - Senior to ensure the design, automation, and maintenance of scalable infrastructure and deployment pipelines. The ideal candidate will have strong Kubernetes experience, knowledge of configuration management, and... 

    Softswiss

    New York, NY
    2 days ago
  • $150k - $175k

     ...creative scoring system, by leading the...  ...of multimodal ML models that...  ...end pipelines (training, fine-tuning, deployment...  .... Scale distributed training...  ...right-sized cloud infrastructure...  ...5+ years in ML engineering or MLOps, with...  ...abstractions, and infra that compound over... 
    Training
    Work experience placement
    Local area

    KARGO

    New York, NY
    17 days ago
  •  ...global bank is seeking a Machine Learning Engineer to join their innovative team. In this...  ...include collaborating with other specialists, scaling models for production, and monitoring...  ...will have experience in Python and various ML frameworks. This position offers an... 

    JPMorgan Chase & Co.

    Brooklyn, NY
    17 hours ago
  • Zillow Group Inc. is seeking a Senior Machine Learning Engineer to lead the design and deployment of scalable ML infrastructure for its Agentic Data Foundations...  ...requires over 6 years of experience in building and scaling data infrastructures, proficiency in Python and AWS... 
    Remote job

    Zillow Group Inc.

    Brooklyn, NY
    4 days ago
