Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Software Engineer - AI Inference

$160k - $240k

Bloomberg

Senior Software Engineer - AI Inference

Location

New York

Business Area

Engineering and CTO

Ref #

10050779

Description & Requirements

Our team:

Join the team that is building the core infrastructure for AI at Bloomberg. The Bloomberg AI Inference Platform provides production-grade managed infrastructure for hosting, deploying, and serving all machine learning models, both predictive and cutting-edge generative models. We abstract away infrastructure complexity, empowering engineering teams to focus on creating intelligent applications with guaranteed scalability, performance, and governance. Our platform is built on the open-source KServe project, and the CNCS AI Inference team is a primary contributor to its development.

We'll trust you to:

  • Design and build scalable infrastructure for both online and offline inference workloads.

  • Lead integration of high-performance inference runtimes and serving frameworks, including TensorRT, vLLM, ONNX, and Triton.

  • Drive architecture and technical decisions across Bloomberg's inference platform, balancing latency, throughput, reliability, and cost.

  • Partner across engineering teams to improve model deployment, observability, and production performance.

  • Mentor junior engineers on system design, debugging, and performance optimization.

You'll need to have:

  • 5+ years of professional software engineering experience.

  • Experience designing, building, and operating production distributed systems.

  • Strong systems intuition and a track record of debugging and optimizing performance-critical services.

  • Ability to own problems end-to-end and quickly ramp up in unfamiliar technical areas.

  • 4+ years of demonstrated experience working with an object-oriented programming language.

  • A degree in Computer Science, Electrical Engineering, or equivalent practical experience.

We'd love to see:

  • Experience deploying and operating machine learning systems at scale.

  • Experience with inference optimization techniques such as batching, caching, request scheduling, or memory-aware serving.

  • Familiarity with PyTorch and GPU software stacks such as CUDA and NCCL.

  • Exposure to high-performance interconnects and distributed computing technologies such as NVLink, InfiniBand, or MPI.

  • Experience with Kubernetes and cloud-native infrastructure.

  • Experience with load balancing, request routing, or traffic management systems.

Representative projects:

  • Autoscaling a heterogeneous compute fleet to match supply and demand aross diverse inference workloads.

  • Building production-grade deployment pipelines to safely roll out new models to millions of users.

  • Developing new inference capabilities such as structured sampling, prompt caching, and advanced serving optimizations.

  • Analyzing observability data from real production workloads to improve latency, throughput, and resource efficiency.

Salary Range = 160,000 - 240,000 USD Annual + Benefits + Bonus

The referenced salary range is based on the Company's good faith belief at the time of posting. Actual compensation may vary based on factors such as geographic location, work experience, market conditions, education/training and skill level.

We offer one of the most comprehensive and generous benefits plans available and offer a range of total rewards that may include merit increases, incentive compensation (exempt roles only), paid holidays, paid time off, medical, dental, vision, short and long term disability benefits, 401(k) +match, life insurance, and various wellness programs, among others. The Company does not provide benefits directly to contingent workers/contractors and interns.

Discover what makes Bloomberg unique - watch our for an inside look at our culture, values, and the people behind our success.

Bloomberg is an equal opportunity employer and we value diversity at our company. We do not discriminate on the basis of age, ancestry, color, gender identity or expression, genetic predisposition or carrier status, marital status, national or ethnic origin, race, religion or belief, sex, sexual orientation, sexual and other reproductive health decisions, parental or caring status, physical or mental disability, pregnancy or parental leave, protected veteran status, status as a victim of domestic violence, or any other classification protected by applicable law.

Bloomberg is a disability inclusive employer. Please let us know if you require any reasonable adjustments to be made for the recruitment process. If you would prefer to discuss this confidentially, please email View email address on click.appcast.io

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Senior Software Engineer - AI Inference in New York, NY vacancy
  •  ...transitioning models from research to production environments. Integrate AI features into existing products, enriching them with the...  ...is a bonus. Strong experience with Llama.cpp and ggml inference engines, facilitating the deployment of models to specific GPU architectures... 
    Senior
    Remote job

    Framework Ventures

    New York, NY
    3 days ago
  • $200k - $250k

    At Wizard AI, we’re building the top-performing AI Shopping...  ...we’re seeking an experienced Senior MLOps Engineer to take ownership of how our...  ...- for a custom-built inference platform powering a live conversational...  ...and alerting. Apply software engineering best practices including... 
    Senior
    Remote work
    Flexible hours

    Wizard

    New York, NY
    4 days ago
  • Senior AI Engineer — Inference & Agent Systems United States Title: Applied AI Engineer — Inference & Agent Systems Location: United States What We're Building Arcana is building AI agents that synthesize information across heterogeneous sources and deliver structured... 
    Senior

    Arcana Analytics Inc.

    New York, NY
    3 days ago
  • $220k - $270k

     ...Senior Software Engineer USD $220,000 - $270,000 meaningful equity | New York | 5 days onsite Soda has partnered with an AI infrastructure company founded by leaders behind one of the earliest...  ...infrastructure problems around inference, orchestration, context evolution... 
    Senior

    SoDA

    New York, NY
    3 days ago
  • $128.7k - $261.3k

     ...deployment platform within the autonomous vehicle sector. This role involves automating model deployment from training to on-vehicle inference and enhancing developer experience through robust tooling. Candidates should hold a relevant degree and possess significant... 
    Senior

    General Motors

    New York, NY
    4 days ago
  • $200k - $280k

     ...Unify is building the first AI-powered system of action for revenue...  ...into a top-performing growth engine by making go-to-market...  ...reinforcement learning, and LLM inference infrastructure. If you are interested...  ...: You have 3+ years of software engineering experience and at... 
    Senior

    Unify

    New York, NY
    4 days ago
  •  ...Senior Software Engineer Take2 AI is hiring a Senior Software Engineer with deep Voice AI expertise and strong backend engineering skills. Our...  ...~ Shipped production AI applications with real-time inference, integrating ML models into live systems ~ Built and... 
    Senior
    Work at office

    Take2

    New York, NY
    3 days ago
  •  ...Senior Software Engineer Cobalt ID is building the business identity infrastructure for the financial...  ...from synthetic ones. With AI accelerating fraud rings and shell companies...  ...Develop infrastructure that serves inference and network analysis results in real time... 
    Senior
    Full time

    Cobalt Identity Systems

    New York, NY
    5 days ago
  • $175k - $250k

     ...Software Engineer, Machine Learning (MLOps & Data) A Career with Point72's Surveillance Team...  ...sophisticated Artificial Intelligence (AI) solutions, while embracing enterprise...  ...models, from data ingestion to production inference, contributing to the design of our next... 
    Senior
    Work experience placement

    Point72

    New York, NY
    1 day ago
  •  ...team is focused on using AI to address critical...  ...reliability, real-time inference observability,...  ...orchestration, high-security software integration, and the resilient...  ...clear insights for senior international...  ...evolution: Partner with our Engineering and ML teams to ensure... 
    Senior

    AI Chopping Block, Inc.

    New York, NY
    1 day ago
  • $140k - $160k

     ...Senior Software Engineer – Applied AI & Generative Systems Pearson Learning Studio (PLS) Role Overview Pearson is accelerating the adoption...  ...ingestion & transformation Embeddings & indexing Inference orchestration Evaluation & feedback loops Move... 
    Senior
    Full time
    Work at office

    Pearson Education Services

    Hoboken, NJ
    18 days ago
  •  ...Software Engineer Baton equips business owners and buyers with a modern, data-driven marketplace...  ..., it's product engineering with real AI systems. You'll ship full-stack agentic...  ...that supports production-level inference, evaluation, and monitoring Work across... 
    Senior
    Temporary work
    Flexible hours

    Baton, Inc.

    New York, NY
    2 days ago
  • $160k - $200k

     ...Senior Software Engineer The chemical industry is a $5.2 trillion market that powers 96% of all manufactured...  ...system for chemical distribution—an AI-powered ERP and CRM platform that...  ...the time to understand customer needs, infer patterns, and implement solutions to support... 
    Senior
    Work at office
    10 hours per week
    2 days per week

    Shelfcycle

    New York, NY
    3 days ago
  •  ...learn? Join our team of engineers and researchers at the...  ...the next generation of AI-powered solutions. Cambium...  ...firm with a growing software engineering and product...  ...We are seeking a Lead/Senior Software Engineer (AI Applications...  ...with scalable inference pipelines. Familiarity... 
    Senior
    Permanent employment
    Remote work

    Cambium Learning Group

    New York, NY
    4 days ago
  • $150k - $250k

     ...Senior Forward Deployed Software Engineer New York, United States About us PhysicsX is a deep‑tech company rooted...  ...of software. We are building an AI‑driven simulation software stack for...  ...multi‑physics simulation through AI inference across the entire engineering lifecycle... 
    Senior
    Work at office
    Work from home
    Flexible hours

    PhysicsX Ltd

    New York, NY
    5 days ago
  • $142.8k - $204k

     ...Senior Software Engineer – AI Platform Lead We are looking for a Senior Software Engineer who will act as the primary engineering lead for resolving...  ...diagnostic tools to monitor LLM performance, focusing on inference quality, token efficiency, and hallucination rates.... 
    Senior
    Full time
    Work at office
    Flexible hours

    RingCentral

    Brooklyn, NY
    5 days ago
  • $150k - $250k

     ...innovation at the speed of software. We are building an AI-driven simulation software stack for engineering and manufacturing across advanced...  ...simulation through AI inference across the entire engineering...  ...most important problems. As a Senior Forward Deployed Software... 
    Senior
    Work at office
    Work from home
    Flexible hours

    PhysicsX

    New York, NY
    5 days ago
  • $150k - $180k

     ...rewarding career. We are seeking a Senior Software Engineer - Integration to join a new Bruin Platform...  ...integration experience. You will use AI-assisted engineering tools as a core...  ...agentic coding tools (autocomplete, type inference, AI assistants using the SDK as... 
    Senior
    Contract work

    MetTel

    New York, NY
    3 days ago
  •  ...assembling a world‑class engineering team to redefine how...  ...to support data and AI features Work across multiple...  ...For 5+ years of software engineering experience...  ...prep, feature stores, inference pipelines) Work Setup...  ...from you. Apply today. Seniority Level Mid‑Senior level... 
    Senior
    Full time
    Work at office
    Remote work

    North

    New York, NY
    9 days ago
  • $300k

     ...Ventures is hiring for a role focused on building and maintaining systems for AI applications, optimizing request routing across diverse accelerators. The ideal candidate has strong software engineering skills, particularly in distributed systems, and a passion for advancing... 
    Senior

    Menlo Ventures

    New York, NY
    4 days ago
  • $200k - $400k

     ...Senior Data Infrastructure Engineer Decagon is the leading conversational AI platform empowering every brand to deliver concierge customer experiences. Our technology...  ...Infra: GPU and model‑serving platforms for LLM inference with multi‑provider routing and support for... 
    Senior
    Full time
    Work at office
    Local area

    Decagon

    New York, NY
    9 days ago
  • $250k - $330k

     ...Decagon is the leading conversational AI platform empowering every brand to...  ...and model‑serving platforms for LLM inference with multi‑provider routing and support...  ...accurately. About the Role We’re hiring a Senior Infrastructure Engineer to design, build, and operate... 
    Senior
    Work at office

    Decagon

    New York, NY
    5 days ago
  • $130k - $170k

     ...Mid-Level AI Software Engineer We are seeking a talented Mid-Level AI Software Engineer to join...  ...Experience building real-time inference systems or streaming AI pipelines...  ..., certifications, experience, skills, seniority, geographic location, business sector,... 
    Senior
    Work at office
    Local area
    Shift work
    2 days per week

    Neuberger Berman

    New York, NY
    4 days ago
  • Senior Software Engineer, Knowledge Engine About Pinecone Pinecone is the leading vector database for building accurate and performant AI applications at scale in production. Pinecone's mission is to...  ...and cost across large-scale inference and retrieval workloads Drive... 
    Senior
    Remote job
    Local area
    Work from home
    Flexible hours

    Pinecone

    New York, NY
    4 days ago
  • $150k - $195k

     ...infrastructure platform within Consor Engineers, Cavnue combines advanced...  ...environment, leveraging AI-enabled analytics, sensors,...  ...We are looking for a Senior Software Engineer who is equally comfortable...  ...low level, tracking, inference and sensor fusion algorithms... 
    Senior
    Local area
    Remote work

    Cavnue

    New York, NY
    4 days ago
  • $141k - $208k

    Senior Software Engineer - Python and Data Ecosystem United States (Remote) About ClickHouse Recognized...  ...data warehousing, observability, and AI workloads. The company’s sustained,...  ..., and real‑time feature stores for ML inference. You understand these workflows not from... 
    Senior
    Remote job
    Local area
    Worldwide
    Home office
    Flexible hours
    Shift work

    ClickHouse

    New York, NY
    1 day ago
  •  ...is the leading security-first enterprise AI company. We build cutting-edge...  ...customers. Cohere is a team of researchers, engineers, designers, and more, who are all passionate...  ...understand, trust, and pay for. As a Senior Software Engineer, you will: Improve the... 
    Work at office
    Remote work

    Cohere

    New York, NY
    1 day ago
  •  ...Orium is seeking a skilled Senior Software Developer to collaborate with high-profile clients on cutting-edge eCommerce solutions. You'll support developers, engage with clients, and utilize modern AI-powered tools to enhance development practices. The ideal candidate... 
    Senior
    Remote work
    Flexible hours

    ORIUM

    New York, NY
    4 days ago
  •  ...growing startup in the home services sector seeks a backend engineer to build production AI systems that automate complex decision-making. You will...  ...coding skills, and a drive to deliver meaningful software that impacts real working-class Americans. Competitive salary... 
    Senior

    Probook

    New York, NY
    5 days ago
  • $160k - $230k

     ...A modern connectivity company in New York City is seeking an experienced Software Developer to create and maintain fiber-based services. The ideal candidate should have over 7 years of experience, with expertise in Go and familiarity with Kubernetes and Docker. The role... 
    Senior

    Pilot Fiber

    New York, NY
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Software Engineer - AI Inference. Be the first to apply!