Inference Optimization Engineer (local / edge runtime)
$170.5k - $315.49kIntel Corporation
Job Details: Job Description: Our Mission At Intel, our journey is to transform AI into something safer, more trustworthy, and respectful of human privacy by design. We believe transformative AI should have a positive impact on people—powerful in capability, yet honest about its limits and protective of the data and resources it touches. To get there, we build agentic AI that combines the best of local and cloud intelligence — private, affordable, and sustainable by design. Small, efficient models run directly on the user's machine (AI PC, edge, on-prem, and beyond), keeping data private and token costs low, while powerful cloud models handle the hardest work: planning, reasoning, and complex problem-solving. Today, neither approach can deliver this alone. Together, they give people real capability without compromise—data stays private, spend stays predictable, and energy use stays in check. We're building intelligence that scales without sacrificing trust, cost, or the planet—because the future of AI should belong to the people it serves Role Summary Make models fast on the hardware people actually own. You optimize inference engines (llama.cpp, vLLM) for constrained local and edge environments — GPU/iGPUs, Vulkan backends — not datacenter H100 environment, mostly PC/edge. KV cache, batching, quantization, scheduling, and CPU-overhead reduction are your daily tools. This is the rare skill that makes a hybrid, low-cost agent product viable. What you’ll do Profile and optimize local inference (llama.cpp-vulkan and vLLM) for latency, throughput, and memory on edge hardware Tune KV cache, continuous batching, and scheduling for interactive agent workloads Drive quantization strategy (GGUF / AWQ / GPTQ) and validate quality impact with the Post-Training team Cut CPU overhead and improve engine startup, model load, and lifecycle (start / stop / health) Benchmark across hardware tiers and publish honest performance comparisons Upstream fixes and patches to open-source engines where it helps us What you’ll learn / grow into Curiosity is required. You will develop: The internals of modern inference engines and where the milliseconds actually go Hardware-aware optimization across iGPU / CPU paths (Vulkan, SYCL, oneAPI, CUDA where relevant) The quality-vs-speed-vs-memory trade space for small models Interest in local / edge AI and squeezing hardware Qualifications: Minimum qualifications are required to be initially considered for this position. Preferred qualifications are in addition to the minimum requirements and are considered a plus factor in identifying top candidates. You must possess the minimum qualifications to be initially considered for this position. Preferred qualifications are in addition to the minimum requirements and are considered a plus factor in identifying top candidates. Required Qualifications BS/MS in CS, EE, Math or related STEM field 5+ years software development background Strong in C++ and/or Python; comfortable reading systems-level code Understands how LLM inference works (attention, KV cache, decoding) Has profiled and optimized real performance problems (CPU or GPU) and can prove the speedup Linux, build systems, and low-level debugging expertise Preferred Qualifications Hands-on with llama.cpp, vLLM, ggml, or similar engines Experience with GPU / accelerator programming (Vulkan, CUDA, SYCL, Metal) or SIMD / CPU kernels Familiarity with quantization formats and their quality trade-offs Open-source contributions to inference engines Requirements listed would be obtained through a combination of industry relevant job experience, internship experiences and or schoolwork/classes/research. Benefits at Intel Our total rewards package goes above and beyond just a paycheck. Whether you're looking to build your career, improve your health, or protect your wealth, we offer generous benefits to help you achieve your goals. Go to Intel Benefits | Intel Careers for details of benefits available to you. Intel reserves the right to modify, change or discontinue benefit plans at any time in its sole discretion. Job Type: Shift: Shift 1 (United States of America) Primary Location: US, California, Santa Clara Additional Locations: US, Arizona, Phoenix, US, California, Folsom, US, Oregon, Hillsboro Business group: The Client Computing Group (CCG) is responsible for driving business strategy and product development for Intel's PC products and platforms, spanning form factors such as notebooks, desktops, 2 in 1s, all in ones. Working with our partners across the industry, we intend to deliver purposeful computing experiences that unlock people's potential - allowing each person use our products to focus, create and connect in ways that matter most to them. Posting Statement: All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance. Position of Trust N/A Benefits We offer a total compensation package that ranks among the best in the industry. It consists of competitive pay, stock bonuses, and benefit programs which include health, retirement, and vacation. Find out more about the benefits of working at Intel. Annual Salary Range for jobs which could be performed in the US: $170,500.00-315,490.00 USD The range displayed on this job posting reflects the minimum and maximum target compensation for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific compensation range for your preferred location during the hiring process. Work Model for this Role This role will be eligible for our hybrid work model which allows employees to split their time between working on-site at their assigned Intel site and off-site. * Job posting details (such as work model, location or time type) are subject to change. * ADDITIONAL INFORMATION: Intel is committed to Responsible Business Alliance (RBA) compliance and ethical hiring practices. We do not charge any fees during our hiring process. Candidates should never be required to pay recruitment fees, medical examination fees, or any other charges as a condition of employment. If you are asked to pay any fees during our hiring process, please report this immediately to your recruiter. Intel’s official careers website. Find your next job and take on projects that shape tomorrow’s technology. Benefits Internships Life at Intel Locations Recruitment Process
$128.88k - $181.94k
...Join Intel's Compiler Engineering team, where you will collaborate on cutting-edge technologies driving the... ...compiler features and optimizations tailored to Intel... ...delivering IP, SOCs, runtimes, and platforms to support... ...characteristic protected by local law, regulation, or...Local areaImmediate startWorldwideShift work$140.83k - $198.82k
...chain team as a Supply Chain Engineer and play a pivotal role in shaping... ...market challenges, and optimize processes to enhance product quality... ...supplier labor issues (e.g. local FSE labor, supplier performance... ...-- from delivering cutting-edge silicon process and packaging...Local areaImmediate startShift work$111.03k - $211.2k
...like AI, analytics, and cloud-to-edge technology is at the heart of... ...motivated team of top-notch engineers solving challenging technical... ...and propose cross disciplinary optimal solutions.* Self-drive with strong... ...characteristic protected by local law, regulation, or ordinance....Local areaInternshipImmediate startShift work$149.6k - $211.2k
...an experienced MSVC Compiler Engineer to contribute to the development... ...teams to deliver highly optimized code generation for current and... ...responsible for delivering IP, SOCs, runtimes, and platforms to support the... ...characteristic protected by local law, regulation, or ordinance...Local areaShift work$180.77k - $255.2k
...bonding process or equipment engineering, contributing to both first-of... ...) platform development and optimization of manufacturing processes to... ...customers -- from delivering cutting-edge silicon process and packaging... ...characteristic protected by local law, regulation, or ordinance...Local areaImmediate startShift work$100k - $150k
...offering top-tier design and engineering services to a diverse array of... ...experts collaborate closely, optimizing and enhancing our work... ...sessions. By leveraging cutting-edge techniques and state-of-the-art... ...efficiency. Ensure designs adhere to local, state, and national...Local areaCurrently hiring- ...expanded portfolio of leading-edge technologies that include: 3D... ...issues. Onto Innovation strives to optimize customers' critical path of... ...of the Senior Systems Engineer are to develop new optical metrology... ...training documents to educate local engineering teams. This will require...Local areaPermanent employmentWork at office
$103.73k - $198.82k
...transformation as we create exceptionally engineered technology and bring AI... ...standards. Develops and optimizes equipment schedule durations,... ...-- from delivering cutting-edge silicon process and packaging... ...characteristic protected by local law, regulation, or ordinance...Local areaInternshipImmediate startShift work$142.9k - $265.3k
...critical leader within our Global Engineering function, responsible for... ...equipment engineering, optimizing assets, and delivering transformative... ...alignment with cutting‑edge technologies and sustainability... ...with all federal, state, or local laws. If you have a disability...Local areaFor contractorsRemote workWorldwideRelocation package$141.91k - $269.1k
...computing. You'll work with world-class engineers, access cutting-edge Intel technologies, and contribute... ...signal integrity analysis and optimization for qubit chips to ensure optimal performance... ...other characteristic protected by local law, regulation, or ordinance....Local areaImmediate startShift work$164.47k - $269.1k
...As an Analog Circuit Design Engineer, you will be at the forefront... ...designing and developing cutting-edge analog circuits in advanced... ...will play a critical role in optimizing performance, power, area, and... ...characteristic protected by local law, regulation, or ordinance...Local areaFull timeInternshipImmediate startShift work$122.44k - $232.19k
...like AI, analytics, and cloud-to-edge technology is at the heart of... ...Memory Electrical Validation Engineer, where you'll be at the... ...This role is pivotal in ensuring optimized electrical performance, compliance... ...characteristic protected by local law, regulation, or ordinance....Local areaInternshipImmediate startShift work$89.01k - $170.63k
...transformation as we create exceptionally engineered technology and bring AI... ...s ability to deliver cutting-edge technologies to our customers... .... Intel Foundry Capacity Optimization (FCO) is responsible for... ...characteristic protected by local law, regulation, or ordinance...Local areaInternshipImmediate startShift work- ...Field Applications Engineer Tektronix is looking for a customer-focused... ...a wide range of cutting-edge industries. Your primary mission... ...practices and help customers optimize the use of their equipment... ...sales incentives/commissions, in local currency) is 67300.00-124900.0...Local areaRelocation
$95.64k - $181.94k
...transformation as we create exceptionally engineered technology and bring AI... ...and deployment processes to optimize software development... ...customers -- from delivering cutting-edge silicon process and packaging... ...characteristic protected by local law, regulation, or ordinance...Local areaWork experience placementInternshipImmediate startShift work- ...execution stack targeting edge and robotic systems. In... ...of the firmware, runtime, and performance infrastructure... ..., develop, and optimize firmware and runtime components... ...by establishing engineering practices, driving software... ...protected by local law, regulation, or ordinance...Local area
$105.65k - $200.34k
...AI, analytics, and cloud-to-edge technology is at the heart of... ...Responsible for designing and optimizing processors, chipsets and other... ...Post-Silicon Validation Engineer to join our team and play a critical... ...characteristic protected by local law, regulation, or ordinance...Local areaFull timeInternshipImmediate startShift work$170k - $185k
...projects. About This Role The Electrical Engineer will lead the end‑to‑end technical... ...interconnection, substation and transmission design optimization, and MV collection system performance—... ...under applicable federal, state, or local law. These principles apply to all...Local areaHome office$141.91k - $269.1k
...team as an EDA Tools Hardware Engineer, where you will play a pivotal... ...enablement and adoption of cutting-edge hardware design tools, flows,... ...of design processes and optimize power, performance, and technology... ...characteristic protected by local law, regulation, or ordinance....Local areaFull timeInternshipImmediate startWorldwideShift work$164.47k - $311.89k
...our team as a Power and Performance Design Engineer, where you will play a critical role in designing and optimizing Intel's cutting-edge IPs and SoCs. This position offers an... ...or any other characteristic protected by local law, regulation, or ordinance. Position of...Local areaInternshipImmediate startShift work- ...Project Engineer The Project Engineer is responsible for assisting the project manager... ...voucher program ~ Access to StrongerWork optimal mental health services The GCON Way... ...protected under federal, state, or local law. Applicants will be considered regardless...Local areaPermanent employmentContract workFor contractorsWork experience placementInternshipImmediate startFlexible hours
$195.2k - $361.2k
...execution stack targeting edge and robotic systems. In... ...of the firmware, runtime, and performance infrastructure... ..., develop, and optimize firmware and runtime components... ...area by establishing engineering practices, driving... ...characteristic protected by local law, regulation, or...Local areaWork at officeImmediate startShift work$173.66k - $245.16k
...Cloud Software Development Engineer, you will drive innovation... .... You will work on cutting-edge technologies, optimize partner software stacks, and... ...responsible for delivering IP, SOCs, runtimes, and platforms to support... ...protected by local law, regulation, or ordinance...Local areaImmediate startShift work$69.3k - $103.77k
...seeking an opportunity to be on the cutting edge of technology? Join a dedicated team... ...pre-production environment. As a Process Engineer, you will be part of a team to provide development... ...with all applicable federal, state and local laws, regulations, orders and mandates,...Local areaFull timeFor contractorsFor subcontractorInterim roleWork at officeFlexible hours$115.11k - $219.55k
...electron microscopy group supports engineering activities in fabrication,... ...with engineering teams to optimize metrology strategies and troubleshoot... ...-- from delivering cutting-edge silicon process and packaging... ...characteristic protected by local law, regulation, or ordinance...Local areaInternshipImmediate startFlexible hoursShift work- ...Oregon is looking for a CPU Technology Feasibility Implementation Engineer. In this role, you will evaluate technology in the early... ...BS degree and over 10 years of experience, including in PPA optimizations and static timing analysis. Join Apple and help shape the future...
- ...semiconductor technology . Our cutting-edge mask writers power the world's most advanced... ...purpose in their work. Our Field Service Engineer II role emphasizes customer satisfaction,... ...excellent customer relationships with local Tool Owners and other critical customer stakeholders...Local areaWork at office
$50.4k - $66.15k
...services to accelerate profitability by optimizing device performance and advancing yield knowledge... ...: Overview: The Field Service Engineer - Probe Card will provide technical... ...complies with all national, state, and local laws that seek to promote equal...Local areaWork at officeRemote workFlexible hoursShift workDay shift$85.2k - $162.5k
...Process Integration Development Engineer, you will play a pivotal role... ...targeting strategies to optimize product performance and yield.... ...customers -- from delivering cutting-edge silicon process and packaging... ...characteristic protected by local law, regulation, or ordinance....Local areaInternshipImmediate startWorldwideShift work$163.5k - $214.62k
...updated and .Director, Process Development Engineering page is loaded## Director, Process... ...services to accelerate profitability by optimizing device performance and advancing yield knowledge... ...complies with all national, state, and local laws that seek to promote equal...Local areaLive inRemote workFlexible hoursShift workDay shift
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Inference Optimization Engineer (local / edge runtime). Be the first to apply!
- local trucking Hillsboro, OR
- local truck driving Hillsboro, OR
- no experience local truck driver Hillsboro, OR
- cdl class a local driver Hillsboro, OR
- local sales representative Hillsboro, OR
- dedicated local truck driver Hillsboro, OR
- local driver Hillsboro, OR
- local route driver Hillsboro, OR
- local cdl driver Hillsboro, OR
- local content analyst Hillsboro, OR


