Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Inference Optimization Engineer (local / edge runtime)

$170.5k - $315.49k

Intel

Job Details:

Job Description:

Our Mission

At Intel, our journey is to transform AI into something safer, more trustworthy, and respectful of human privacy by design. We believe transformative AI should have a positive impact on people-powerful in capability, yet honest about its limits and protective of the data and resources it touches.

To get there, we build agentic AI that combines the best of local and cloud intelligence - private, affordable, and sustainable by design. Small, efficient models run directly on the user's machine (AI PC, edge, on-prem, and beyond), keeping data private and token costs low, while powerful cloud models handle the hardest work: planning, reasoning, and complex problem-solving. Today, neither approach can deliver this alone. Together, they give people real capability without compromise-data stays private, spend stays predictable, and energy use stays in check.

We're building intelligence that scales without sacrificing trust, cost, or the planet-because the future of AI should belong to the people it serves

Role Summary

Make models fast on the hardware people actually own. You optimize inference engines (llama.cpp, vLLM) for constrained local and edge environments - GPU/iGPUs, Vulkan backends - not datacenter H100 environment, mostly PC/edge. KV cache, batching, quantization, scheduling, and CPU-overhead reduction are your daily tools.

This is the rare skill that makes a hybrid, low-cost agent product viable.

What you'll do

  • Profile and optimize local inference (llama.cpp-vulkan and vLLM) for latency, throughput, and memory on edge hardware

  • Tune KV cache, continuous batching, and scheduling for interactive agent workloads

  • Drive quantization strategy (GGUF / AWQ / GPTQ) and validate quality impact with the Post-Training team

  • Cut CPU overhead and improve engine startup, model load, and lifecycle (start / stop / health)

  • Benchmark across hardware tiers and publish honest performance comparisons

  • Upstream fixes and patches to open-source engines where it helps us

What you'll learn / grow into

Curiosity is required. You will develop:

  • The internals of modern inference engines and where the milliseconds actually go

  • Hardware-aware optimization across iGPU / CPU paths (Vulkan, SYCL, oneAPI, CUDA where relevant)

  • The quality-vs-speed-vs-memory trade space for small models

  • Interest in local / edge AI and squeezing hardware

IMPORTANT:

Please be informed that Intel is proactively trying

to find candidates for this position which is frequently available

at Intel.

Please note that the position may not be available

at this time. If you would be interested in this position should it

become available, we would encourage you to apply, and our

hiring team will be glad to contact you when/if relevant.

Qualifications:

Minimum qualifications are required to be initially considered for this position. Preferred qualifications are in addition to the minimum requirements and are considered a plus factor in identifying top candidates.

You must possess the minimum qualifications to be initially considered for this position. Preferred qualifications are in addition to the minimum requirements and are considered a plus factor in identifying top candidates.

Required Qualifications

  • BS/MS in CS, EE, Math or related STEM field

  • 5+ years software development background

  • Strong in C++ and/or Python; comfortable reading systems-level code

  • Understands how LLM inference works (attention, KV cache, decoding)

  • Has profiled and optimized real performance problems (CPU or GPU) and can prove the speedup

  • Linux, build systems, and low-level debugging expertise

Preferred Qualifications

  • Hands-on with llama.cpp, vLLM, ggml, or similar engines

  • Experience with GPU / accelerator programming (Vulkan, CUDA, SYCL, Metal) or SIMD / CPU kernels

  • Familiarity with quantization formats and their quality trade-offs

  • Open-source contributions to inference engines

Requirements listed would be obtained through a combination of industry relevant job experience, internship experiences and or schoolwork/classes/research.

Benefits at Intel

Our total rewards package goes above and beyond just a paycheck. Whether you're looking to build your career, improve your health, or protect your wealth, we offer generous benefits to help you achieve your goals. Go to Intel Benefits | Intel Careers ( for details of benefits available to you. Intel reserves the right to modify, change or discontinue benefit plans at any time in its sole discretion.

Job Type:

Shift:

Shift 1 (United States of America)

Primary Location:

US, California, Santa Clara

Additional Locations:

US, Arizona, Phoenix, US, California, Folsom, US, Oregon, Hillsboro

Business group:

The Client Computing Group (CCG) is responsible for driving business strategy and product development for Intel's PC products and platforms, spanning form factors such as notebooks, desktops, 2 in 1s, all in ones. Working with our partners across the industry, we intend to deliver purposeful computing experiences that unlock people's potential - allowing each person use our products to focus, create and connect in ways that matter most to them.

Posting Statement:

All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.

Position of Trust

N/A

Benefits

We offer a total compensation package that ranks among the best in the industry. It consists of competitive pay, stock bonuses, and benefit programs which include health, retirement, and vacation. Find out more about the benefits of working at Intel ( .

Annual Salary Range for jobs which could be performed in the US: $170,500.00-315,490.00 USD

The range displayed on this job posting reflects the minimum and maximum target compensation for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific compensation range for your preferred location during the hiring process.

Work Model for this Role

This role will be eligible for our hybrid work model which allows employees to split their time between working on-site at their assigned Intel site and off-site. * Job posting details (such as work model, location or time type) are subject to change.

ADDITIONAL INFORMATION: Intel is committed to Responsible Business Alliance (RBA) compliance and ethical hiring practices. We do not charge any fees during our hiring process. Candidates should never be required to pay recruitment fees, medical examination fees, or any other charges as a condition of employment. If you are asked to pay any fees during our hiring process, please report this immediately to your recruiter.

Vacancy posted 6 days ago
Similar jobs that could be interesting for youBased on the Inference Optimization Engineer (local / edge runtime) in Folsom, CA vacancy
  • $170.5k - $315.49k

     ...agentic AI that combines the best of local and cloud intelligence — private, affordable...  ...on the user's machine (AI PC, edge, on‑prem, and beyond), keeping data private...  ...the hardware people actually own. You optimize inference engines (llama.cpp, vLLM) for constrained... 
    Local area
    Immediate start
    Shift work

    Intel

    Folsom, CA
    1 day ago
  • Intel is seeking a skilled expert to optimize inference engines for edge environments. The role requires strong C++ and Python skills along with a robust...  ...development. You will be profiling and optimizing local inference strategies while managing latency and throughput... 
    Local area

    Intel

    Folsom, CA
    2 days ago
  • $141.91k - $269.1k

     ...Group (HIPD) within the Central Engineering Organization, where...  ...power integrity simulations to optimize design performance Silicon Validation...  ...the intersection of cutting‑edge IP development and real‑world...  ...characteristic protected by local law, regulation, or ordinance... 
    Local area
    Immediate start
    Remote work
    Shift work

    Intel

    Folsom, CA
    1 day ago
  • $105.65k - $200.34k

     .../ Post‑Silicon PnP Validation Engineer to join our Silicon Architecture...  ..., you will work on cutting‑edge technologies, performing comprehensive...  ...and validation strategies to optimize power and performance debug...  ...characteristic protected by local law, regulation, or ordinance.... 
    Local area
    Work experience placement
    Shift work

    Intel Corporation

    Folsom, CA
    1 day ago
  • $105.65k - $200.34k

    GPU Design Verification Engineer - Intel Join Intel's Graphics IP team...  ...of Intel's leading‑edge GPU architectures. Your work will...  ...guidelines. Analyze timing reports to optimize design approaches for...  ...other characteristic protected by local law, regulation, or ordinance.... 
    Local area

    Intel Corporation

    Folsom, CA
    3 days ago
  • $122.44k - $232.19k

     ...Mixed Signal Design Verification Engineer and play a crucial role in shaping the future of cutting-edge technology. In this role, you...  ...and physical design teams to optimize verification strategies and...  ...other characteristic protected by local law, regulation, or ordinance.... 
    Local area
    Full time
    Internship
    Immediate start
    Shift work

    Intel Corporation

    Folsom, CA
    1 day ago
  •  ...the efforts of groups including Product Engineering, Test, Probe, Process Integration, Assembly...  ...and Marketing to design products that optimize manufacturing functions and assure the...  ...protected by applicable federal, state, or local laws. Micron Prohibits the use of child... 
    Local area
    Worldwide

    1000 Micron Technology, Inc.

    Folsom, CA
    5 days ago
  • $131.56k - $171k

     ...verification teams worldwide to support product engineering, test, probe, process integration,...  ...groups in designing products that optimize cost, quality, reliability, time‑to‑market...  ...protected by applicable federal, state, or local laws. #J-18808-Ljbffr Micron Technology
    Local area
    Full time
    Worldwide

    Micron Technology

    Folsom, CA
    5 days ago
  •  ...impact on the world? We believe building engineering is more than systems and structures, it’...  ..., and collaboration. Whether you’re optimizing energy efficiency, integrating resilient...  ...person office culture Preference given to local candidates Required Qualifications... 
    Local area
    Full time
    Contract work

    Fashion Institute of Design & Merchandising

    Folsom, CA
    3 days ago
  • $120k - $190k

     ...commercial enterprises, and state and local government agencies. At...  ...We leverage proven, cutting‑edge methodologies and technology...  .... Perform a diverse array of engineering tasks related to the...  ...automation Ability to create a runtime environment for autonomous functions... 
    Local area
    Temporary work
    For contractors
    Work at office

    Kratos Defense & Security Solutions, Inc.

    Roseville, CA
    5 days ago
  •  ...main content#Autonomy Engineer page is loaded##...  ...software solutions, optimizing solar power plant performance...  ..., deployment, and inference optimization on constrained...  ...CUDA, TensorRT, ONNX Runtime)* Designing and...  ...sensor fusion, filtering, localization, and/or state... 
    Remote work
    Worldwide

    Nextpower Inc

    Folsom, CA
    1 day ago
  • $120k - $150k

     ...governments, commercial enterprises, and state and local government agencies. At Kratos, we...  ...technology. We leverage proven, cutting-edge methodologies and technology to minimize...  ...for assigned project(s), the Project Engineer supports the development and maintenance... 
    Local area
    Contract work
    For contractors
    For subcontractor
    Weekend work

    Kratos Defense

    Roseville, CA
    5 days ago
  • $77.6k - $176k

     ...Job Number: R0241862 Electrical Engineer The Opportunity: As an Electrical Engineer...  ...integration, and testing efforts, applying leading-edge principles and industry best practices....  ...any other status protected by applicable federal, state, local, or international law.... 
    Local area
    Full time
    Contract work
    Part time
    Work at office
    Remote work

    Booz Allen Hamilton

    Roseville, CA
    5 days ago
  • $95k - $129k

     ...electrical projects in collaboration with other engineering professionals across enterprise in a...  ...potential to collaborate with seasoned local and national teams and the opportunity for...  ...than 75 years, we have created leading-edge environmental solutions for municipalities... 
    Local area
    Contract work
    Temporary work
    Work experience placement
    Work at office
    Remote work

    Brown and Caldwell

    Rancho Cordova, CA
    3 days ago
  • $33 - $35 per hour

     ...documentation and providing safe field execution to support its clients’ projects in line with local, state and federal guidelines and regulations. About this position: Jr. Geologist / Jr. Engineer Location – Folsom, CA / Seattle, WA The Essential Duties and Responsibilities are... 
    Local area
    Contract work
    For contractors
    For subcontractor
    Internship
    Work at office
    Remote work
    Work from home
    Night shift
    2 days per week
    3 days per week

    PARAGON PROFESSIONAL SERVICES LLC

    Folsom, CA
    17 hours ago
  • UNICO Engineering provides high quality Construction Management, Land Surveying, and Systems Integration services to public and private clients...  ...and benefits. Ideal Candidate: Experience leading local, state and federally funded transportation contracts for projects... 
    Local area
    Contract work
    For contractors
    Work at office
    Flexible hours

    UNICO Engineering

    Folsom, CA
    3 days ago
  •  ...in growth mode and currently seeking a Senior Mechanical Design Engineer to join our Allient Sacramento team! The Senior Mechanical...  ...expression, or any other characteristic protected by federal, state, or local laws. This applies to all terms and conditions of employment,... 
    Local area
    For subcontractor
    Work at office

    Allient Incorporated

    Loomis, CA
    21 days ago
  • $95k - $120k

     ...Job Description Job Description Senior Conveyance Project Engineer Location: Rocklin, CA (Travel Required Occasionally) Position...  ...abilities ~ Ability to travel as required (typically local with occasional domestic trips for site visits and meetings) ~... 
    Local area
    Full time

    Sierra Conveyance and Automation

    Rocklin, CA
    22 days ago
  •  ...ASIC Verification Engineer This role has been designed as 'Onsite' with an expectation...  ...Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people...  ...communication skills; mastery in English and local language. Subject matter expertise or... 
    Local area
    Work at office

    Hewlett Packard Enterprise

    Roseville, CA
    2 days ago
  • $95.03k - $133.04k

     ...general direction of the Manager, you will work on highly complex engineering assignments and provide power systems expertise in support of...  ...flow, transient stability and post-transient analyses, and local capacity requirements. Proposes projects and mitigation alternatives... 
    Local area
    Remote job
    Work at office

    California ISO

    Folsom, CA
    4 days ago
  • Senior RF Electrical Engineer page is loaded## Senior RF Electrical Engineerlocations: US -...  ...that wins.**Job Description****Design and optimize high-power microwave amplifiers for...  ...including, but not limited to, location, local regulations (such as minimum wage), education... 
    Local area
    Minimum wage
    Work experience placement
    Worldwide
    Flexible hours

    FLIR Systems, Inc.

    Rancho Cordova, CA
    4 days ago
  • $85k - $105k

     ...States of America Your Role Sukut is seeking a dynamic project engineer. This is an excellent opportunity to start a career with a...  ...expression, or any other characteristic protected by federal, state or local laws. This policy applies to all terms and conditions of... 
    Local area
    Work experience placement
    Work at office

    Journeyfront, Inc.

    Folsom, CA
    4 days ago
  • Senior Mechanical Engineer (PE) - Buildings - ( 191299 ) At HDR, our employee‑owners are fully engaged in creating a welcoming environment...  ...Trane Trace3D, IESVE, or similar software Preference given to local candidates Required Qualifications Bachelor's degree in... 
    Local area
    Full time
    Contract work
    Work at office

    Fashion Institute of Design & Merchandising

    Folsom, CA
    4 days ago
  • $113.6k - $151.4k

     ...production users for data retrieval and use. Engineering Support & Problem Solving Own technical...  ...to manufacturing or microwave device optimization. What We Offer A collaborative and...  ...factor made unlawful by federal, state, or local laws. #J-18808-Ljbffr FLIR Systems,... 
    Local area

    FLIR Systems, Inc.

    Rancho Cordova, CA
    1 day ago
  •  ...Description As a Sr Advanced Project Engineer here at Honeywell, you will play a critical...  ...by driving project execution, optimizing processes, and ensuring adherence to quality...  ...addition to a competitive salary, leading-edge work, and developing solutions side-by-side... 
    Temporary work
    Flexible hours

    Honeywell

    Fair Oaks, CA
    2 days ago
  •  ...Position Description The SCADA Controls Engineer will work with the SCADA Engineering team and project managers. Responsible for integrating...  ...and implement logic and control schemes for remote and local operation of power system equipment Develop programs for the... 
    Local area
    Work at office
    Remote work

    Trimark Associates Inc

    Folsom, CA
    1 day ago
  •  ...Our tradition is centered on precision‑engineered systems for maximum impact, efficiency and...  ...for designing, implementing, and optimizing manufacturing processes to improve efficiency...  ...other group protected by federal, state or local laws. RENK America maintains a drug‑free... 
    Local area
    Full time
    Work at office

    Witt/Kieffer

    Roseville, CA
    1 day ago
  • $72k - $94k

     ...Summary The mission of the Manufacturing Engineering group is to provide manufacturing...  ...process improvement activities to increase/optimize yield, efficiency, and/or throughput. Design...  ...protected by federal, state, or local laws. If you reside in the State of California... 
    Local area
    Temporary work
    Work at office

    Penumbra, Inc.

    Roseville, CA
    4 days ago
  • $35 - $55 per hour

     ...seeking a highly skilled, detail-oriented, and motivated Office Engineer to join our dynamic team in a full-time, on-site role based in...  ...any other status protected under applicable federal, state, or local laws. Notice to Third Party Agencies Please note that Atlas Technical... 
    Local area
    Hourly pay
    Full time
    For contractors
    Work at office

    Atlas

    Rancho Cordova, CA
    5 days ago
  • $153.5k - $310.5k

     ...Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people...  ...leads project teams of Electronic and VLSI engineers and internal and outsourced development...  ...communication skills; mastery in English and local language.* Subject matter expertise or... 
    Local area
    Work experience placement
    Work at office

    Hewlett Packard Enterprise Development LP

    Roseville, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Inference Optimization Engineer (local / edge runtime). Be the first to apply!