Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Infrastructure Engineer

$163.5k - $212.4k

nio.com

****JOB DESCRIPTION******About NIO**NIO is a pioneer and a leading company in the premium smart electric vehicle market. Founded in November 2014, NIO’s mission is to shape a joyful lifestyle. NIO aims to build a community starting with smart electric vehicles to share joy and grow together with users.NIO designs, develops, jointly manufactures and sells premium smart electric vehicles, driving innovations in next-generation technologies in autonomous driving, digital technologies, electric powertrains and batteries. NIO differentiates itself through its continuous technological breakthroughs and innovations, such as its industry-leading battery swapping technologies, Battery as a Service, or BaaS, as well as its proprietary autonomous driving technologies and Autonomous Driving as a Service, or ADaaS.NIO’s product portfolio consists of the ES8, a six-seater smart electric flagship SUV, the ES7 (or the EL7), a mid-large five-seater smart electric SUV, the ES6, a five-seater all-round smart electric SUV, the EC7, a five-seater smart electric flagship coupe SUV, the EC6, a five-seater smart electric coupe SUV, the ET7, a smart electric flagship sedan, and the ET5, a mid-size smart electric sedan.**About the Position**We are looking for a senior AI Inference Infrastructure Software Engineer with strong hands-on experience building, optimizing, and deploying high-performance, scalable inference systems. This position is focused on designing, implementing, and delivering production-grade software that powers real-world applications of Large Language Models (LLMs) and Vision-Language Models (VLMs).This is an exciting opportunity for an engineer who thrives at the intersection of AI systems, hardware acceleration, and large-scale robust deployment, and who wants to see their contributions ship in production, at scale.In this role, you will directly shape the architecture, roadmap and performance of AI capabilities of our AIOS platform, driving innovations that make LLM/VLM systems fast, efficient, and scalable across cloud, edge, and hybrid edge-cloud environments. You will work closely with system, hardware, and product teams to deliver high-performance inference kernels for hardware accelerators, design scalable inference serving systems, and integrate optimizations such tensor parallelism and custom kernels into production pipelines. Your work will have immediate impact, powering intelligent automotive systems in the next generation of electric vehicles.**Roles and Responsibilities:*** Design and implement high-performance, scalable inference systems for LLMs and VLMs across cloud, edge, and edge-cloud hybrid platforms.* Develop and optimize custom kernels and operators for specific hardware accelerators (GPU, NPU, DSP, etc.), improving throughput, latency, and memory efficiency.* Integrate advanced optimization techniques such as KV-cache management, tensor/model parallelism, quantization, and memory-efficient execution into production inference systems.* Partner with system and hardware teams to ensure tight hardware-software integration and optimal performance across diverse compute environments.* Translate architectural requirements into robust, maintainable, production-ready software that meets performance, safety, and reliability standards.* Define and drive the evolution roadmap for LLM/VLM inference in the AIOS stack, ensuring scalability and adaptability to new workloads.* Stay ahead of industry trends and competitor solutions, applying best practices from both AI and large-scale systems engineering.**Qualifications:*** 5+ years of hands-on software development experience in building and optimizing AI inference systems at scale.* Direct experience in LLM/VLM model internals, including Transformer-based architectures, inference bottlenecks, and optimization techniques.* Strong expertise in performance engineering: kernel development, parallelism strategies, memory optimization, and distributed inference systems.* Proficiency with GPU/NPU programming (CUDA, or vendor-specific SDKs), compiler toolchains, and deep learning frameworks (PyTorch, or TensorFlow).* Strong programming skills in C/C++, with a track record of delivering high-performance, production-grade software.* Solid foundation in computer architecture, systems programming (CPU/GPU pipelines, memory hierarchy, scheduling), and embedded systems.* BS/MS in Computer Science, Computer Engineering, or related technical field.* Excellent communication and collaboration skills, with the ability to work across cross-functional teams.**Preferred Qualifications:*** Master’s or PhD degree in Computer Science, Electrical/Computer Engineering, or related fields, plus 5 years industry experience* Experience building inference serving systems for large models, including batching, scheduling, caching, and load balancing.* Expertise in hardware-aware model optimization (e.g., kernel fusion, mixed precision, quantization, pruning).* Familiarity with edge and embedded AI, including real-time constraints and limited-resource optimization.* Contributions to widely used AI frameworks, libraries, or performance-critical software (open source or proprietary).**Compensation:**The US base salary range for this full-time position is $163,500.00 - $212,400.00.* Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.* Please note that the compensation details listed in US role postings reflect the base salary only. It does not include discretionary bonus, equity, or benefits.**Benefits:**Along with competitive pay, as a full-time NIO employee, you are eligible for the following benefits on the first day you join NIO:* Anthem Blue Cross, HSA, and Kaiser HMO medical plans with $0 for Employee Only Coverage.* Dental (including orthodontic coverage) and vision plan. Both provide options with a $0 paycheck contribution covering you and your eligible dependents.* Company Paid HSA (Health Savings Account) Contribution when enrolled in the High Deductible Anthem Blue Cross medical plan* Healthcare and Dependent Care Flexible Spending Accounts (FSA)* 401(k) with Brokerage Link option* Company paid Basic Life, AD&D, short-term and long-term disability insurance* Employee Assistance Program* Sick and Vacation time* 13 Paid Holidays a year* Paid Parental Leave for first 8 weeks at full pay (eligible after 90 days of employment with NIO)* Paid Disability Leave for first 6 weeks at full pay (eligible after 90 days of employment with NIO)* Voluntary benefits including: Voluntary Life and AD&D options for you, your spouse/domestic partner and dependent child(ren), pet insurance* Commuter benefits* Mobile Cell Phone Credit* Free lunch and snacks* Onsite gym* Employee discounts and perks program #J-18808-Ljbffr nio.com

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the AI Infrastructure Engineer in San Jose, CA vacancy
  •  ...next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded...  ...We are seeking a DevOps / Platform Engineer to join our team building and operating large-scale GPU compute infrastructure that powers AI and ML workloads. The ideal candidate... 
    Suggested

    Advanced Micro Devices , Inc.

    San Jose, CA
    5 days ago
  • $163.5k - $212.4k

     ...security, and dependability. Partner with engineering teams to understand real-world...  ...in software design and development for AI model training, and/or inference optimization...  ...application/project ~ Experience with cloud infrastructure and training (Azure, AWS, etc.) ~ CI/... 
    Suggested
    Full time
    Temporary work
    Flexible hours

    NIO

    San Jose, CA
    1 day ago
  • $94.16k - $141k

     ...are the essential building blocks of the data infrastructure that connects our world. Across enterprise, cloud and AI, and carrier architectures, our innovative technology...  ...Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or a related field... 
    Suggested

    Marvell

    Santa Clara, CA
    2 hours ago
  • $124.09k - $210k

     ...Senior AI Data Infrastructure Engineer Santa Clara, CA XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical... 
    Suggested
    Full time
    Work experience placement

    XPENG

    Santa Clara, CA
    19 hours ago
  • $174k - $252k

    A leading tech company is seeking a Senior Software Engineer for AI and Infrastructure. This position involves writing and testing software, participating in design reviews, and maintaining coding best practices. Candidates should have at least 5 years of programming experience... 
    Suggested

    Google Inc.

    Sunnyvale, CA
    3 days ago
  • $136.5k - $253.5k

    Cadence is seeking a highly skilled AI Systems Engineer to join their team in San Jose, CA. This hands-on, senior role will lead the AI infrastructure development, including architecting high-performance GPU clusters and deploying advanced AI models. Ideal candidates will... 

    Cadence

    San Jose, CA
    3 days ago
  • $163.5k - $212.4k

    NIO is seeking a Senior AI Inference Infrastructure Software Engineer in San Jose, CA, specializing in building scalable inference systems for large language and vision-language models. This role requires over 5 years of software development experience and strong skills... 

    nio.com

    San Jose, CA
    1 day ago
  • $191k - $315k

     ...Senior Staff AI Engineer, Network Growth AI LinkedIn is the world's largest professional network, built to create economic opportunity...  ...discipline. Prior experience with large scale ML data infrastructure Experience with developing and designing production scale recommender... 
    For contractors
    Work at office
    Flexible hours

    LinkedIn

    Sunnyvale, CA
    2 days ago
  • $174k - $252k

    Google Inc. is looking for a Senior Software Engineer in Sunnyvale, CA, to join the AI and Infrastructure team. The role involves developing next-generation technologies, managing project priorities, and working on critical projects that impact billions of users. Candidates... 

    Google Inc.

    Sunnyvale, CA
    4 days ago
  • $160k - $240k

     ...AI Cloud Infrastructure Engineer - Fury Team Sunnyvale, CA The future of defense will be decided by those who field intelligent machines at scale. At Scout AI, we're developing Fury, the first robotic foundation model for defense, to give U.S. forces overwhelming... 
    Full time
    Relocation package

    Scout AI

    Sunnyvale, CA
    4 days ago
  • $184k - $287.5k

     ...Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to the infrastructure that powers our innovative AI research. This team focuses on developing...  ...innovation. We are seeking an AI infrastructure software engineer to join our team. You'll be instrumental in... 

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $314.8k - $359.3k

     ...Sr. Distinguished AI Engineer (Agentic AI Platform) Overview: At Capital One, we are creating responsible and reliable AI systems...  ...customer experiences. Our investments in technology infrastructure and world-class talent - along with our deep experience in machine... 
    Full time
    Part time
    Work at office
    Local area

    Capital One

    San Jose, CA
    1 day ago
  •  ...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides...  ...We are seeking a highly skilled and experienced AI Infrastructure Operations Engineer to manage and operate our cutting-edge machine learning compute... 

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    19 hours ago
  • $168k - $270.25k

     ...looking to hire a deeply technical, creative, and Senior AI Platform Engineer to build, support, and maintain the next generation of AI-...  ...What you will be doing: Define and lead AI-native infrastructure roadmaps and cross-organizational initiatives. Architect... 

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $91.7k - $158.82k

     ...healthy, fulfilling life in and outside of work. Your Mission: We are seeking a highly motivated and talented AI Infrastructure & Platform Ops Engineer to join our team. In this role you will have the opportunity to work on cutting-edge AI technologies and... 
    Full time
    Temporary work
    Work experience placement
    Work at office
    Remote work
    Relocation
    Flexible hours
    Shift work
    3 days per week

    Lockheed Martin Corporation

    Sunnyvale, CA
    3 days ago
  • Google Inc. is seeking a Software Engineering Manager in San Jose, CA to lead a team focused on AI and machine learning infrastructure, optimizing technical leadership across major projects. This role requires at least 8 years of experience with distributed systems and... 

    Google Inc.

    San Jose, CA
    19 hours ago
  • A leading technology company in Santa Clara is seeking a Principal Backend Infrastructure Engineer to drive technical strategies for scalable search platforms. In this role, you will collaborate with backend engineers and machine learning researchers to enhance the functionality... 

    Apple Inc.

    Santa Clara, CA
    2 days ago
  •  ...Title: Prinicipal AI Engineer Location: Sunnyvale, California. Duration: 6 to 12+ Months Job Description:...  ...Proficient PostgreSQL, embeddings/vector search Cloud/Infrastructure Proficient GCP (BigQuery, GCS), Kubernetes, CCM Integration... 
    Contract work

    Redolent

    Sunnyvale, CA
    4 days ago
  •  ...Kai is the AI company rebuilding cybersecurity for the machine-speed era. Founded...  ...class leadership team: Our Heads of AI, Engineering, and Product bring extensive experience...  ...Engineer to drive the security of the Azure infrastructure that powers the Kai AI-native... 

    Kai Cyber, Inc.

    San Jose, CA
    3 days ago
  • $123.2k - $189.1k

     ...releases by turning failures into actionable engineering insights at scale. This is a...  ...vehicle hardware and compute -not cloud infrastructure or hardware design. The mission of...  ...intelligent triage, deep software debugging, and AI-assisted failure analysis across... 
    Local area
    Work from home
    Flexible hours

    General Motors

    Sunnyvale, CA
    2 days ago
  • $215.2k - $245.6k

     ...Lead AI Engineer (Gen AI Platform Services) Overview At Capital One, we are creating responsible and reliable AI systems, changing...  ...customer experiences. Our investments in technology infrastructure and world-class talent - along with our deep experience in machine... 
    Full time
    Part time
    Local area

    Capital One

    San Jose, CA
    1 day ago
  • $229.9k - $262.4k

    Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI) Overview: At Capital One, we are creating responsible...  ...customer experiences. Our investments in technology infrastructure and world-class talent - along with our deep experience in machine... 
    Full time
    Part time
    Local area

    Capital One Financial Corp

    San Jose, CA
    2 days ago
  • $128.4k - $172.3k

     ...position. Meet the Team Join Cisco's Enterprise AI team, the core group enabling Generative AI powered...  ...We operate at the intersection of applied AI, cloud infrastructure and security -partnering across engineering, security, compliance, and product teams to bring... 
    Full time
    Temporary work
    Local area
    Flexible hours

    Cisco

    San Jose, CA
    1 day ago
  • Job Title Bachelor’s degree with 8+ years of experience, or Master’s degree with 6+ years in CS, EE, IT, or related field. 7+ years of hands-on experience in firmware or embedded software development. Strong proficiency in C and/or C++ for embedded systems. ...

    Saxon Global

    Santa Clara, CA
    19 hours ago
  • $229.9k - $262.4k

     ...Sr. Lead AI Engineer (GenAI Platform) Overview: At Capital One, we are creating responsible and reliable AI systems, changing...  ...personalized customer experiences. Our investments in technology infrastructure and world-class talent - along with our deep experience in... 
    Full time
    Part time
    Local area

    Capital One

    San Jose, CA
    1 day ago
  • $181.1k - $318.4k

     ...for its Special Projects team in Cupertino, California. The role focuses on building innovative applications and robust infrastructure to support AI research. Candidates should excel in programming languages like Go or Swift and have experience with web services and containers... 

    Apple Inc.

    Cupertino, CA
    1 day ago
  • $165k - $188k

    A leading IT solutions company in San Jose seeks a Sr. Software Engineer to develop AI/ML infrastructure software. This role requires strong proficiency in Python and expertise in the Nvidia AI/LLM stack. Responsibilities include deploying LLM applications, optimizing... 
    Work at office

    Victrays

    San Jose, CA
    4 days ago
  • A leading technology firm in California is seeking network engineers with hands-on experience in InfiniBand and Ethernet for managing high-performance computing (HPC) and artificial intelligence (AI) environments. Candidates should have advanced knowledge of networking... 

    TechDigital Group

    Santa Clara, CA
    19 hours ago
  • $141k - $202k

    A leading technology firm in Sunnyvale, CA is seeking a Software Engineer to improve compiler integration for TPUs and other accelerators. Ideal candidates hold a Bachelor's degree and possess strong skills in C++ and distributed systems. You'll write and test code, develop... 

    Google Inc.

    Sunnyvale, CA
    1 day ago
  •  ...Corporation is seeking a Senior Software Manager for AI Networking to lead a customer-centered engineering team. The role involves guiding a high-performing...  ...'s networking footprint with major clients and AI infrastructure initiatives. The ideal candidate will have over 12... 

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Infrastructure Engineer. Be the first to apply!