Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Software Engineer - AI Inference

$160k - $240k

Bloomberg

Senior Software Engineer - AI Inference

Location

New York

Business Area

Engineering and CTO

Ref #

10050779

Description & Requirements

Our team:

Join the team that is building the core infrastructure for AI at Bloomberg. The Bloomberg AI Inference Platform provides production-grade managed infrastructure for hosting, deploying, and serving all machine learning models, both predictive and cutting-edge generative models. We abstract away infrastructure complexity, empowering engineering teams to focus on creating intelligent applications with guaranteed scalability, performance, and governance. Our platform is built on the open-source KServe project, and the CNCS AI Inference team is a primary contributor to its development.

We'll trust you to:

  • Design and build scalable infrastructure for both online and offline inference workloads.

  • Lead integration of high-performance inference runtimes and serving frameworks, including TensorRT, vLLM, ONNX, and Triton.

  • Drive architecture and technical decisions across Bloomberg's inference platform, balancing latency, throughput, reliability, and cost.

  • Partner across engineering teams to improve model deployment, observability, and production performance.

  • Mentor junior engineers on system design, debugging, and performance optimization.

You'll need to have:

  • 5+ years of professional software engineering experience.

  • Experience designing, building, and operating production distributed systems.

  • Strong systems intuition and a track record of debugging and optimizing performance-critical services.

  • Ability to own problems end-to-end and quickly ramp up in unfamiliar technical areas.

  • 4+ years of demonstrated experience working with an object-oriented programming language.

  • A degree in Computer Science, Electrical Engineering, or equivalent practical experience.

We'd love to see:

  • Experience deploying and operating machine learning systems at scale.

  • Experience with inference optimization techniques such as batching, caching, request scheduling, or memory-aware serving.

  • Familiarity with PyTorch and GPU software stacks such as CUDA and NCCL.

  • Exposure to high-performance interconnects and distributed computing technologies such as NVLink, InfiniBand, or MPI.

  • Experience with Kubernetes and cloud-native infrastructure.

  • Experience with load balancing, request routing, or traffic management systems.

Representative projects:

  • Autoscaling a heterogeneous compute fleet to match supply and demand aross diverse inference workloads.

  • Building production-grade deployment pipelines to safely roll out new models to millions of users.

  • Developing new inference capabilities such as structured sampling, prompt caching, and advanced serving optimizations.

  • Analyzing observability data from real production workloads to improve latency, throughput, and resource efficiency.

Salary Range = 160,000 - 240,000 USD Annual + Benefits + Bonus

The referenced salary range is based on the Company's good faith belief at the time of posting. Actual compensation may vary based on factors such as geographic location, work experience, market conditions, education/training and skill level.

We offer one of the most comprehensive and generous benefits plans available and offer a range of total rewards that may include merit increases, incentive compensation (exempt roles only), paid holidays, paid time off, medical, dental, vision, short and long term disability benefits, 401(k) +match, life insurance, and various wellness programs, among others. The Company does not provide benefits directly to contingent workers/contractors and interns.

Discover what makes Bloomberg unique - watch our for an inside look at our culture, values, and the people behind our success.

Bloomberg is an equal opportunity employer and we value diversity at our company. We do not discriminate on the basis of age, ancestry, color, gender identity or expression, genetic predisposition or carrier status, marital status, national or ethnic origin, race, religion or belief, sex, sexual orientation, sexual and other reproductive health decisions, parental or caring status, physical or mental disability, pregnancy or parental leave, protected veteran status, status as a victim of domestic violence, or any other classification protected by applicable law.

Bloomberg is a disability inclusive employer. Please let us know if you require any reasonable adjustments to be made for the recruitment process. If you would prefer to discuss this confidentially, please email View email address on click.appcast.io

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Senior Software Engineer - AI Inference in New York, NY vacancy
  • $160k - $240k

    Bloomberg L.P. in New York is seeking a Senior Software Engineer for AI Inference to design and build scalable infrastructure for machine learning applications. The ideal candidate will have over 5 years of software engineering experience, expertise in distributed systems... 
    Senior

    Bloomberg L.P.

    New York, NY
    1 day ago
  •  ...Staff + Sr. Software Engineer, Inference Deployment San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for... 
    Senior
    Work at office
    Visa sponsorship
    Flexible hours
    Shift work

    anthropic

    New York, NY
    2 days ago
  • $300k

     ...Staff + Sr. Software Engineer, Inference San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society... 
    Senior
    Work at office
    Worldwide
    Visa sponsorship
    Flexible hours

    anthropic

    New York, NY
    2 days ago
  • $200k - $250k

     ...At Wizard AI, we’re building the top-performing AI Shopping...  ...we’re seeking an experienced Senior MLOps Engineer to take ownership of how our...  ...– for a custom-built inference platform powering a live conversational...  ...and alerting. Apply software engineering best practices including... 
    Senior
    Remote work
    Flexible hours

    Wizard

    New York, NY
    3 days ago
  • $300k

     ...interpretable, and steerable AI systems. We want AI to be safe...  ...of committed researchers, engineers, policy experts, and business...  ...systems. About The Role Our Inference team builds and maintains the...  ...Qualifications Significant software engineering experience, particularly... 
    Senior
    Worldwide

    Menlo Ventures

    New York, NY
    3 days ago
  • $229.9k - $262.4k

    Senior Lead AI Engineer (FM Hosting, LLM Inference) Overview: At Capital One, we are creating responsible and reliable AI systems, changing banking for good...  .... Design, develop, test, deploy, and support AI software components including foundation model training,... 
    Senior
    Full time
    Part time
    Local area
    Immediate start

    Capital One

    New York, NY
    1 day ago
  •  ...is seeking a talented developer to join its Corellium team in Washington, DC. The role focuses on developing scalable, cloud-native AI solutions on AWS, including job orchestration and API integrations. Applicants should have proficiency in Python, Typescript, and hands... 
    Senior

    Cellebrite

    New York, NY
    3 days ago
  •  ...authenticity by building the world's best AI detection systems. We publish research...  ...We are looking for a backend-focused Senior Software Engineer to design, build, and scale the...  ...GPU infrastructure, batch vs. real-time inference) Familiarity with React or frontend... 
    Senior

    Pangram

    New York, NY
    18 hours ago
  • $220k - $270k

     ...Senior Software Engineer USD $220,000 - $270,000 meaningful equity | New York | 5 days onsite Soda has partnered with an AI infrastructure company founded by leaders behind one of the earliest...  ...infrastructure problems around inference, orchestration, context evolution... 
    Senior

    SoDA

    New York, NY
    2 days ago
  • $128.7k - $261.3k

     ...deployment platform within the autonomous vehicle sector. This role involves automating model deployment from training to on-vehicle inference and enhancing developer experience through robust tooling. Candidates should hold a relevant degree and possess significant... 
    Senior

    General Motors

    New York, NY
    3 days ago
  •  ...ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay...  ..., and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE: Voice... 
    Flexible hours

    Baseten

    New York, NY
    3 days ago
  • $200k - $280k

     ...Unify is building the first AI-powered system of action for revenue...  ...into a top-performing growth engine by making go-to-market...  ...reinforcement learning, and LLM inference infrastructure. If you are interested...  ...: You have 3+ years of software engineering experience and at... 
    Senior

    Unify

    New York, NY
    3 days ago
  •  ...Senior Software Engineer Take2 AI is hiring a Senior Software Engineer with deep Voice AI expertise and strong backend engineering skills. Our...  ...~ Shipped production AI applications with real-time inference, integrating ML models into live systems ~ Built and... 
    Senior
    Work at office

    Take2

    New York, NY
    2 days ago
  •  ...Senior Software Engineer Cobalt ID is building the business identity infrastructure for the financial...  ...from synthetic ones. With AI accelerating fraud rings and shell companies...  ...Develop infrastructure that serves inference and network analysis results in real time... 
    Senior
    Full time

    Cobalt Identity Systems

    New York, NY
    4 days ago
  • Description: VAST Data is looking for a Senior Software Engineer to join our growing team! This is a...  ...is the data platform company for the AI era. We are building the enterprise...  ...time data analysis and AI training and inference. Designed from the ground up to make AI... 
    Senior

    VAST Data

    New York, NY
    3 days ago
  • $140k - $160k

     ...Senior Software Engineer – Applied AI & Generative Systems Pearson Learning Studio (PLS) Role Overview Pearson is accelerating the adoption...  ...ingestion & transformation Embeddings & indexing Inference orchestration Evaluation & feedback loops Move... 
    Senior
    Full time
    Work at office

    Pearson Education Services

    New York, NY
    3 days ago
  • $250k - $300k

     ...assembling a world‑class engineering team to redefine how...  ...to support data and AI features Work across...  ...Looking For ~5+ years of software engineering experience...  ...prep, feature stores, inference pipelines) Work...  ...you. Apply today. Seniority Level Mid‑Senior level... 
    Senior
    Full time
    Work at office
    Remote work

    North

    New York, NY
    18 hours ago
  •  ...OKX Wallet, OKLink and more. About the Opportunity As a Senior Engineer on the Anti-Fraud AI Service Platform, you will build the foundational...  ...services, including real-time feature extraction, model inference, and Agentic workflows for automated investigation. AI... 
    Senior
    Contract work

    Framework Ventures

    New York, NY
    3 days ago
  • $175k - $250k

     ...Software Engineer, Machine Learning (MLOps & Data) A Career with Point72's Surveillance Team...  ...sophisticated Artificial Intelligence (AI) solutions, while embracing enterprise...  ...models, from data ingestion to production inference, contributing to the design of our next... 
    Senior
    Work experience placement

    Point72

    New York, NY
    18 hours ago
  • $180k - $225k

     ...sharing localhost or running AI workloads in production...  ...for API delivery, AI inference, device fleets, and...  ...our success! We like software that’s serious and culture...  ...runs entirely on AWS. Engineers develop by SSH’ing into...  .... Compensation Senior Software Engineer... 
    Senior
    Permanent employment
    Full time
    Work at office
    Local area
    Remote work

    GrabJobs

    New York, NY
    1 day ago
  • $160k - $200k

     ...Senior Software Engineer The chemical industry is a $5.2 trillion market that powers 96% of all manufactured...  ...system for chemical distribution—an AI-powered ERP and CRM platform that...  ...the time to understand customer needs, infer patterns, and implement solutions to support... 
    Senior
    Work at office
    10 hours per week
    2 days per week

    Shelfcycle

    New York, NY
    2 days ago
  •  ...for building accurate and performant AI applications at scale in production. Pinecone...  ...the Team and Role: We are hiring a senior software engineer to help design and build core...  ...throughput and cost across large-scale inference and retrieval workloads Drive technical... 
    Senior
    Local area
    Work from home
    Flexible hours

    GrabJobs

    New York, NY
    18 hours ago
  • $139k - $220k

     ...is The Essential Cloud for AI™. Built for pioneers by pioneers...  .... Our team empowers engineers to understand, troubleshoot,...  ...About the role: As a Senior Software Engineer on the Observability...  ...including large-scale training and inference. Wondering if you're a... 
    Senior
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    New York, NY
    18 hours ago
  • $140k - $160k

     ...The New York Times is looking for a Senior Software Engineer, with a focus on ML platform development, to join our Machine Learning & AI team to help solve creative challenges...  ...all of The New York Times real-time ML inference models, including both data and compute... 
    Senior
    Local area
    Flexible hours

    The New York Times

    New York, NY
    4 days ago
  •  ...Software Engineer, Model Routing & Inference Engineering · Full-time · New York; San Francisco Our mission is to automate coding. The first step in...  ...you'll build the inference platform that powers every AI interaction in the product. This team owns the full... 
    Full time
    Work at office

    Anysphere

    New York, NY
    2 days ago
  •  ...Senior Software Engineer (AI Applications) page is loaded## Senior Software Engineer (AI Applications)locations: Remotetime type: Full timeposted...  ...Agent evaluation frameworks.* Familiarity with scalable inference pipelines.* Familiarity with open tool protocols (e.g.,... 
    Senior
    Permanent employment
    Work at office
    Remote work

    Cambium Learning Group

    New York, NY
    3 days ago
  •  ...Get AI-powered advice on this job and more exclusive features. Job Title: Senior Software Engineer, Customer Success Location: 100% Remote Role About Hobson Associates’ Client:...  ...interviewing at Hobson Associates by 2x Inferred from the description for this job Medical... 
    Senior
    Permanent employment
    Full time
    Immediate start
    Remote work
    Work from home
    Worldwide

    Hobson Associates

    New York, NY
    3 days ago
  •  ...addresses from actual orders, not inferred from sessions), granular (full SKU...  ...the Role We're looking for a Senior Full Stack Engineer to help build the AI-driven products at the core of our...  ...Qualifications ~6+ years of professional software development experience ~ Proven... 
    Senior
    Summer work
    Work at office
    Remote work

    Proxima

    New York, NY
    18 hours ago
  • $150k - $250k

     ...innovation at the speed of software. We are building an AI-driven simulation software stack for engineering and manufacturing across advanced...  ...simulation through AI inference across the entire engineering...  ...most important problems. As a Senior Forward Deployed Software... 
    Senior
    Work at office
    Work from home
    Flexible hours

    PhysicsX

    New York, NY
    4 days ago
  • $160k - $190k

     ...and intelligent. Vanilla is the first AI-powered estate advisory platform, built...  ...you. Job Summary Vanilla is seeking a Senior Software Engineer - AI Applications with a strong...  ...to support model training, evaluation, inference serving, monitoring and alerting. You... 
    Senior
    Full time
    Work experience placement
    Work at office
    Local area
    Remote work
    Home office
    Flexible hours

    GrabJobs

    New York, NY
    18 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Software Engineer - AI Inference. Be the first to apply!