AI Performance Optimization Engineer
$100k - $150k
Bright Vision Technologies
Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications.
As we continue to grow, we're looking for a skilled AI Performance Optimization Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology.
This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.
Location: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Salary: $100K - $150K
Experience: 6+ years
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits.
Employment Terms & Visa Policy
This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies.
This role is part of Bright Vision Technologies' in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies - there is no third-party client, vendor, or implementation partner involved.
We do not engage in C2C, 1099, or third-party arrangements for this role. All our roles are W2: strictly no C2C, 1099, or third-party companies, and no third-party brokering, please.
Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.
No new H1B sponsorship is available for this role. However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.
For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.
Job Summary
We are seeking an AI Performance Optimization Engineer to focus on extracting maximum throughput, minimizing latency, and reducing cost across training and inference workloads for large neural network systems. The role spans the full stack from low-level kernel optimization to distributed system tuning, requiring deep understanding of GPU architecture, model parallelism, memory management, and compiler-level optimization. The ideal candidate has demonstrated impact on production AI workloads, with strong instrumentation and measurement discipline that enables rigorous, data-driven optimization decisions.
In this role you will work closely with cross-functional partners - product, design, engineering, operations, and business stakeholders - to translate ambiguous requirements into well-engineered solutions, and will be expected to raise the bar through code review, design review, and mentorship of more junior engineers. The successful candidate brings strong engineering discipline, a clear communication style, and a track record of shipping meaningful work that holds up well in production.
Key Responsibilities
- Profile and optimize end-to-end AI training and inference pipelines for throughput, latency, and cost.
- Identify and eliminate bottlenecks across data loading, model compute, communication, and memory.
- Implement and tune quantization, sparsity, and pruning strategies to reduce model footprint and accelerate inference.
- Optimize distributed training using tensor parallelism, pipeline parallelism, FSDP, and ZeRO-style sharding.
- Tune attention implementations using FlashAttention, paged attention, and related techniques.
- Implement KV cache optimization, continuous batching, and speculative decoding for LLM serving.
- Drive compiler-level optimizations using Triton, XLA, TorchInductor, or TVM, working with the broader ML framework community to land improvements that translate into measurable end-to-end performance gains.
- Optimize data pipelines, sharding strategies, and storage access patterns for high-throughput training.
- Build and maintain rigorous benchmark suites and regression frameworks across workloads.
- Collaborate with ML and platform engineering teams to embed best practices in standard pipelines.
- Drive cost-efficiency improvements through model architecture, hardware selection, and scheduling strategies.
- Evaluate new hardware and software offerings, and advise on adoption.
- Document performance tuning playbooks and share findings broadly across engineering teams.
- Stay current with AI systems research and translate advances into production improvements.
Qualifications
- Bachelor's or Master's degree in Computer Science, Computer Engineering, or a related field.
- Six or more years of experience in performance engineering, ML systems, or HPC.
- Strong proficiency in Python and C++.
- Hands-on experience optimizing deep learning workloads on modern GPUs.
- Deep understanding of distributed training and inference techniques.
- Experience with profiling tools across CPU, GPU, and distributed systems.
- Familiarity with model compression techniques and their accuracy implications.
- Strong grasp of memory hierarchies, communication primitives, and parallelism strategies.
- Excellent measurement, debugging, and analytical reasoning skills.
- Strong communication and collaboration skills.
Preferred Qualifications
- Experience optimizing LLM inference at production scale.
- Contributions to vLLM, TensorRT-LLM, DeepSpeed, or similar projects.
- Familiarity with custom kernel authoring in Triton or CUTLASS.
- Experience with FinOps for AI workloads.
- Publications or talks on AI systems performance.
How to Apply
Would you like to know more about this opportunity?
For immediate consideration, please send your resume to [email protected]
Learn more about Bright Vision Technologies at
We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company.
We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs.
Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.
Position offered by "No Fee Agency."
Equal Employment Opportunity (EEO) Statement
Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall. BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.


