Data Scientist

$119.8k - $234.7k

Microsoft Corporation

Overview

We're looking for data scientists to help build the next generation of post-training methods for frontier models at Microsoft AI. You'll join a small, high-impact team working across all stages of post-training, with a focus on evaluation design, high-quality training data, and scalable data pipelines for state-of-the-art foundation models.

In this role, you'll help turn raw model capability into reliable, aligned, and measurable performance improvements, directly shaping how frontier models behave in real-world deployments.

About the Role:

Microsoft AI is building the next generation of frontier models that power Copilot and other large-scale AI experiences. The Post-Training team is responsible for transforming powerful pretrained models into robust, aligned, and high-performing systems used by millions of people worldwide.

Our work focuses on improving general quality, instruction following, coding and math ability, tool use, agentic behaviors, personality, and other critical model capabilities. We operate across the full post-training lifecycle - from data generation and curation, to evaluation and diagnostics, to reward modeling and reinforcement learning.

We are a small, highly autonomous team that works closely with pre-training, product, and engineering partners to rapidly iterate on ideas, run large-scale experiments, and safely advance model capabilities. Each team member owns meaningful parts of the post-training pipeline and has direct access to the compute, data, and decision-making needed to move quickly from insight to production.

Microsoft Superintelligence Team

This role is part of Microsoft AI's Superintelligence Team. The MAIST is a startup-like team inside Microsoft AI, created to push the boundaries of AI toward Humanist Superintelligence: ultra-capable systems that remain controllable, safety-aligned, and anchored to human values. Our mission is to create AI that amplifies human potential while ensuring humanity remains firmly in control. We aim to deliver breakthroughs that benefit society, advancing science, education, and global well-being.

We're also fortunate to partner with incredible product teams, giving our models the chance to reach billions of users and create immense positive impact. If you're a brilliant, highly ambitious, and low-ego individual, you'll fit right in. Come and join us as we work on our next generation of models!

Responsibilities

  • Design evaluations of advanced model capabilities and use them to drive rapid, high-signal iteration loops

  • Work with vendors to produce high-quality evaluation and training data

  • Build data pipelines to produce high-quality evaluation and training data

  • Build data flywheels to hill-climb on model weaknesses, using data from various surfaces where our models are deployed

  • Ensure optimal quality, quantity and coverage of data across our post-training stages

  • Run post-training experiments and ablations to produce models that climb our evals

  • Embody our culture and values.

We're Looking For People Who:

  • Have deep experience with LLMs, either training them or applying them in production

  • Have developed production-scale data pipelines for synthesizing, curating, or processing large quantities of data

  • Can design, run, and interpret large-scale ML experiments with careful statistical and empirical reasoning.

  • Possess strong generalist engineering and mathematical skills.

  • Have clear written and verbal communication, and the ability to collaborate effectively with researchers, engineers and other disciplines.

  • Bonus skills: Demonstrated SOTA results in any area of large-scale training, inference, or evaluation.




Qualifications
Required skills

  • Hands-on experience with large language models, including training or applying them in production (not just prompting)

  • Designing and running post-training experiments (evals, ablations, preference tuning / RLHF-style methods)

  • Building and owning scalable data pipelines for training and evaluation data

  • Strong Python skills for ML experimentation, data processing, and analysis

  • Solid statistical, experimental, and general engineering fundamentals


Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $119,800.00 - $234,700.00 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $160,200.00 - $261,000.00 per year.

Software Engineering IC5 - The typical base pay range for this role across the U.S. is USD $142,800.00 - $274,800.00 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000.00 - $304,200.00 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Vacancy posted 5 hours ago