AI Performance Optimization Engineer

$100k - $150k
Full-time

Bright Vision Technologies

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications.
As we continue to grow, we’re looking for a skilled AI Performance Optimization Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology.
This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.

Job Title: AI Performance Optimization Engineer
Location: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Salary: $100K - $150K

Experience: 6+ years
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits.

Employment Terms & Visa Policy
This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies.
This role is part of Bright Vision Technologies’ in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies — there is no third-party client, vendor, or implementation partner involved.
We do not engage in C2C, 1099, or third-party arrangements for this role. All our roles are W2, and we do not accept third-party brokering.
Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.
No new H1B sponsorship is available for this role.

However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.
For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.

Job Summary
We are seeking an AI Performance Optimization Engineer to focus on extracting maximum throughput, minimizing latency, and reducing cost across training and inference workloads for large neural network systems. The role spans the full stack, from low-level kernel optimization to distributed system tuning, and requires a deep understanding of GPU architecture, model parallelism, memory management, and compiler-level optimization. The ideal candidate has demonstrated impact on production AI workloads, with strong instrumentation and measurement discipline that enables rigorous, data-driven optimization decisions.
In this role you will work closely with cross-functional partners (product, design, engineering, operations, and business stakeholders) to translate ambiguous requirements into well-engineered solutions, and you will be expected to raise the bar through code review, design review, and mentorship of more junior engineers. The successful candidate brings strong engineering discipline, a clear communication style, and a track record of shipping meaningful work that holds up well in production.

Key Responsibilities
  • Profile and optimize end-to-end AI training and inference pipelines for throughput, latency, and cost.
  • Identify and eliminate bottlenecks across data loading, model compute, communication, and memory.
  • Implement and tune quantization, sparsity, and pruning strategies to reduce model footprint and accelerate inference.
  • Optimize distributed training using tensor parallelism, pipeline parallelism, FSDP, and ZeRO-style sharding.
  • Tune attention implementations using FlashAttention, paged attention, and related techniques.
  • Implement KV cache optimization, continuous batching, and speculative decoding for LLM serving.
  • Drive compiler-level optimizations using Triton, XLA, TorchInductor, or TVM, working with the broader ML framework community to land improvements that translate into measurable end-to-end performance gains.
  • Optimize data pipelines, sharding strategies, and storage access patterns for high-throughput training.
  • Build and maintain rigorous benchmark suites and regression frameworks across workloads.
  • Collaborate with ML and platform engineering teams to embed best practices in standard pipelines.
  • Drive cost-efficiency improvements through model architecture, hardware selection, and scheduling strategies.
  • Evaluate new hardware and software offerings, and advise on adoption.
  • Document performance tuning playbooks and share findings broadly across engineering teams.
  • Stay current with AI systems research and translate advances into production improvements.

Required Qualifications
  • Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related field.
  • Six or more years of experience in performance engineering, ML systems, or HPC.
  • Strong proficiency in Python and C++.
  • Hands-on experience optimizing deep learning workloads on modern GPUs.
  • Deep understanding of distributed training and inference techniques.
  • Experience with profiling tools across CPU, GPU, and distributed systems.
  • Familiarity with model compression techniques and their accuracy implications.
  • Strong grasp of memory hierarchies, communication primitives, and parallelism strategies.
  • Excellent measurement, debugging, and analytical reasoning skills.
  • Strong communication and collaboration skills.

Preferred Qualifications
  • Experience optimizing LLM inference at production scale.
  • Contributions to vLLM, TensorRT-LLM, DeepSpeed, or similar projects.
  • Familiarity with custom kernel authoring in Triton or CUTLASS.
  • Experience with FinOps for AI workloads.
  • Publications or talks on AI systems performance.

How to Apply
Would you like to know more about this opportunity?
For immediate consideration, please submit your resume through brightvisiontechnologies.applytojob.com.
We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company.
We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs.
Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.
Position offered by “No Fee Agency.”

Equal Employment Opportunity (EEO) Statement

Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall.

BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.

Vacancy posted 4 days ago