AI Performance Engineer

Full-time

Bright Vision Technologies

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications.

As we continue to grow, we’re looking for a skilled AI Performance Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology.

This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.

AI Performance Engineer

Job Title: AI Performance Engineer
Location: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Experience: 6+ years
Salary: 100k - 150k
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits.

Employment Terms & Visa Policy

This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies.
This role is part of Bright Vision Technologies’ in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies — there is no third-party client, vendor, or implementation partner involved.
We do not engage in C2C, 1099, or third-party arrangements for this role.

BUT STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2 AND NO 3RD PARTY BROKERING PLEASE.
Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.
No new H1B sponsorship is available for this role.

However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.
For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.

Job Summary
We are seeking an AI Performance Engineer to focus on extracting maximum throughput, minimizing latency, and reducing cost across training and inference workloads for large neural network systems. The role spans the full stack from low-level kernel optimization to distributed system tuning, requiring deep understanding of GPU architecture, model parallelism, memory management, and compiler-level optimization. The ideal candidate has demonstrated impact on production AI workloads, with strong instrumentation and measurement discipline that enables rigorous, data-driven optimization decisions. In this role you will work closely with cross-functional partners — product, design, engineering, operations, and business stakeholders — to translate ambiguous requirements into well-engineered solutions, and will be expected to raise the bar through code review, design review, and mentorship of more junior engineers. The successful candidate brings strong engineering discipline, a clear communication style, and a track record of shipping meaningful work that holds up well in production.

Key Responsibilities

Profile and optimize end-to-end AI training and inference pipelines for throughput, latency, and cost.
Identify and eliminate bottlenecks across data loading, model compute, communication, and memory.
Implement and tune quantization, sparsity, and pruning strategies to reduce model footprint and accelerate inference.
Optimize distributed training using tensor parallelism, pipeline parallelism, FSDP, and ZeRO-style sharding.
Tune attention implementations using FlashAttention, paged attention, and related techniques.
Implement KV cache optimization, continuous batching, and speculative decoding for LLM serving.
Drive compiler-level optimizations using Triton, XLA, TorchInductor, or TVM, working with the broader ML framework community to land improvements that translate into measurable end-to-end performance gains.
Optimize data pipelines, sharding strategies, and storage access patterns for high-throughput training.
Build and maintain rigorous benchmark suites and regression frameworks across workloads.
Collaborate with ML and platform engineering teams to embed best practices in standard pipelines.
Drive cost-efficiency improvements through model architecture, hardware selection, and scheduling strategies.
Evaluate new hardware and software offerings, and advise on adoption.
Document performance tuning playbooks and share findings broadly across engineering teams.
Stay current with AI systems research and translate advances into production improvements.

Required Qualifications

Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related field.
Six or more years of experience in performance engineering, ML systems, or HPC.
Strong proficiency in Python and C++.
Hands-on experience optimizing deep learning workloads on modern GPUs.
Deep understanding of distributed training and inference techniques.
Experience with profiling tools across CPU, GPU, and distributed systems.
Familiarity with model compression techniques and their accuracy implications.
Strong grasp of memory hierarchies, communication primitives, and parallelism strategies.
Excellent measurement, debugging, and analytical reasoning skills.
Strong communication and collaboration skills.

Preferred Qualifications

Experience optimizing LLM inference at production scale.
Contributions to vLLM, TensorRT-LLM, DeepSpeed, or similar projects.
Familiarity with custom kernel authoring in Triton or CUTLASS.
Experience with FinOps for AI workloads.
Publications or talks on AI systems performance.

How to Apply
Would you like to know more about this opportunity?
For immediate consideration, please send your resume to View email address on brightvisiontechnologies.applytojob.com or contact us at Show phone number. Learn more about Bright Vision Technologies at
We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company.
We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs.
Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.
Position offered by “No Fee Agency.”

Equal Employment Opportunity (EEO) Statement

Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall.

BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the AI Performance Engineer in Prosper, TX vacancy

Edge AI Engineer
...As we continue to grow, we’re looking for a skilled Edge AI Engineer to join our dynamic team and contribute to our mission of transforming... ...to fit models within edge constraints. Tune model performance for latency, energy efficiency, and memory footprint on target...
Performance
Full time
H1b
Local area
Immediate start
Remote work
Visa sponsorship
Work visa
Bright Vision Technologies
Celina, TX
2 days ago
Lead AI Data Engineer - Frisco
...Join our eCommerce Operational Intelligence team as a Lead AI Data Engineer. You will build enterprise-scale data pipelines and analytics... ...reporting. Design analytics-ready schemas and data models for performance and scale. Troubleshoot pipelines, microservices, and APIs;...
Performance
Flexible hours
Mcafee
Frisco, TX
9 days ago
Senior AI Solutions Engineer - Brazil (bi-lingual English/Portuguese)
Senior AI Solutions Engineer - Brazil (bi-lingual English/Portuguese) Inbenta is a global technology company headquartered in Allen, Texas,... ...onboarding and go‑live. Optimization & Support Monitor solution performance and recommend enhancements. Provide technical support and...
Performance
Remote work
Inbenta Technologies Inc.
Allen, TX
4 days ago
GenAI & Agentic AI Engineer
...Job Description Job Description Role: GenAI & Agentic AI Engineer Location: Whippany, NJ (Hybrid) Hire Type: FTE * Must be... ...Monitor, evaluate, and optimize GenAI models for accuracy, performance, and cost. Expertise You'll Bring: ~5+ years of experience...
Performance
Work from home
Flexible hours
Vytwo
Prosper, TX
14 days ago
AI Infrastructure Engineer
.... As we continue to grow, we’re looking for a skilled AI Infrastructure Engineer to join our dynamic team and contribute to our mission of... ...pipeline health across the AI data estate. Optimize cost and performance through compression, format selection, and caching...
Performance
Full time
H1b
Local area
Immediate start
Remote work
Visa sponsorship
Work visa
Bright Vision Technologies
Celina, TX
2 days ago
Senior Python Developer
...enterprise-grade applications, data-intensive services, and automation platforms. This is a hands-on engineering role focused on delivering robust, secure, and high-performance Python systems that operate reliably within distributed and mission-critical production...
Performance
Full time
H1b
Local area
Immediate start
Remote work
Visa sponsorship
Work visa
Bright Vision Technologies
Prosper, TX
4 days ago
Python Developer
...enterprise‑grade applications, data‑intensive services, and automation platforms. This hands‑on engineering role focuses on delivering robust, secure, and high‑performance Python systems that operate reliably within distributed, mission‑critical production environments...
Performance
Full time
H1b
Remote work
Visa sponsorship
Work visa
Bright Vision Technologies
Prosper, TX
2 days ago
Senior AI Engineer - Agentic Systems and LLM
...Job Description Job Description Role: Senior AI Engineer - Agentic Systems and LLM Client Location: Mason, OH 100% Remote... ...and cloud platforms Establish evaluation, reliability, and performance strategies (accuracy, latency, cost) Job Requirements - ~2...
Performance
Remote work
Vytwo
Prosper, TX
13 days ago
Senior Director, Software Engineering - AI ML Engineering
...security at scale. You’ll lead a focused team of 3-5 senior engineers while remaining deeply involved in the code and... ...architecture. Your core responsibility is building high‑performance, privacy‑preserving AI models that run directly on user devices (Mac, iOS, Android...
Performance
Mcafee
Frisco, TX
4 days ago
Lead Software Engineer AI (Staff Engineer)
$127.4k - $236.6k
...Engagement Manager and Guided Assurance. As a Lead Software Engineer AI (Staff Engineer AI), you will own core backend and AI orchestration... ...(OpenAI, Anthropic, and others). Scale, reliability, and performance Design for high‑throughput, low‑latency AI workloads:...
Performance
Full time
Work at office
Local area
Flexible hours
Shift work
2 days per week
3 days per week
Thomson Reuters
Frisco, TX
3 days ago
Principal Software Engineer
...secure, and user‑friendly applications. Principal Software Engineer Job Title: Principal Software Engineer Location: 100%... ...the organization, requiring deep expertise in system design, performance engineering, reliability architecture, and cross‑organizational...
Performance
Full time
H1b
Local area
Immediate start
Remote work
Visa sponsorship
Work visa
Bright Vision Technologies
Prosper, TX
3 days ago
SAP ABAP Developer (S/4HANA)
...requirements analysis, technical design, hands‑on coding, unit testing, performance tuning, and post‑go‑live support in close collaboration with... ...Qualifications Bachelor’s degree in Computer Science, Engineering, or a related technical discipline. Five or more years of...
Performance
Full time
H1b
Local area
Remote work
Visa sponsorship
Work visa
Bright Vision Technologies
Celina, TX
2 days ago
AI Lead Developer
...Lead Developer in Artificial Intelligence (AI) to join our dynamic team. The ideal... ...applications. Collaborate with data scientists, engineers, and other stakeholders to define... ...and algorithms. Ensure the scalability, performance, and reliability of AI systems. Stay updated...
Performance
TechDigital Group
Frisco, TX
4 days ago
AI ML Developer
AI ML Developer needs 3+ years of experience as AI/ML Developer. Experience with AI/ML large language models, implementing or deploying... ...expenditures. Monitor and analyze new technology product performance and resolving issues regarding potential improvements or modifications...
Performance
Global Channel Management
Frisco, TX
4 days ago
AI engineer
...AI Engineer Location: 6565 Headquarters Drive, Plano, Texas 75024. Duration: 6 months. Rate: $80. Role Descriptions Must Have... ...basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote...
Diverse Lynx
Frisco, TX
3 days ago
Remote Senior Data Engineer - Scalable Pipelines & Cloud DW
Vytwo Technologies Inc. is seeking a Senior Data Engineer who will be involved in designing and maintaining scalable data pipelines.... ...Azure. This is a fully remote position offering opportunities for performance recognition and career development. Candidates must possess...
Performance
Remote job
Vytwo Technologies Inc.
Prosper, TX
4 days ago
Remote Senior Financial Analyst - AI Trainer ($50-$60 per hour)
$50 - $60 per hour
...DataAnnotation is committed to creating high-quality AI. Enjoy the flexibility of remote work and the freedom to set your own schedule... ...Evaluate the quality produced by AI models for correctness and performance Qualifications: Fluency in English (native or...
Performance
Hourly pay
Contract work
For contractors
Work experience placement
Remote work
Data Annotation
Prosper, TX
more than 2 months ago
Senior AI Engineer - Legal Tech & Contract AI
Refinitiv is seeking a Senior Software Engineer - AI (Legal) to join their CoCounsel Forward Deployed Engineering team based in Frisco, Texas. You'll collaborate with legal professionals and engineers to develop tailored AI solutions for law firms. The role involves designing...
Contract work
Flexible hours
Refinitiv
Frisco, TX
2 days ago
Senior AI Engineer for Legal Tech - Flexible Remote
Refinitiv is hiring a Senior Software Engineer - AI to design and deploy AI solutions tailored for legal professionals. The role involves building robust systems that enhance legal workflows, such as contract reviews and regulatory analysis, within a collaborative environment...
Remote job
Contract work
Flexible hours
Refinitiv
Frisco, TX
2 days ago
Flexible Remote .NET Tech Lead — High-Throughput & Cloud
...least 7 years of experience in .NET and C#, strong problem-solving skills, and the ability to collaborate across teams. Flexible work-from-home options are available, emphasizing a balanced work environment while delivering high-performance solutions. #J-18808-Ljbffr Vytwo
Performance
Remote job
Work from home
Flexible hours
Vytwo
Prosper, TX
5 days ago
AI-Driven Databricks Data Engineer - Pipelines & MDM
...thinking analytics company is seeking experienced Databricks Data Engineers to architect scalable data pipelines at their Texas-based team.... ...involves designing complex ETL workflows and collaborating with AI/ML engineers. Ideal candidates will have over 4 years of...
Frisco Analytics Inc.
Frisco, TX
1 day ago
Senior Enterprise AI Engineer — Scale AI & LLM Workflows
T-Mobile is seeking a Senior Engineer for Enterprise AI to design and implement AI-powered applications that enhance productivity. You will be central to developing enterprise-grade AI solutions using leading models and frameworks, ensuring smooth integration across teams...
T-Mobile
Frisco, TX
3 days ago
Remote Financial Planning & Analysis Manager - AI Trainer ($50-$60 per hour)
$50 - $60 per hour
...DataAnnotation is committed to creating high-quality AI. Join our team to help train the nextgeneration of AI while enjoying the... ...Evaluate the quality produced by AI models for correctness and performance Qualifications: Fluency in English (native or bilingual level)...
Performance
Hourly pay
Full time
Contract work
Part time
Work experience placement
Remote work
Data Annotation
Prosper, TX
5 days ago
Lead AI Data Engineer: Flexible Hours & Scalable Pipelines
Mcafee is looking for a Lead AI Data Engineer to join their eCommerce Operational Intelligence team in Frisco, Texas. In this role, you will build enterprise-scale data pipelines and analytics foundations that drive operational insights and business impact. Ideal candidates...
Flexible hours
Mcafee
Frisco, TX
9 days ago
AI-Driven Enterprise Data Engineer for eCommerce Analytics
Mcafee is looking for a Lead AI Enterprise Data Engineer in Frisco, TX. This hybrid role focuses on transforming eCommerce operational analytics through scalable data pipelines and AI-powered insights. The ideal candidate will have extensive experience in enterprise development...
Mcafee
Frisco, TX
4 days ago
Senior .NET Lead Engineer - Cloud, High-Volume Apps
...C#, strong object-oriented programming skills, and expertise in SQL/NoSQL databases. The role requires optimizing data pipeline performance and familiarity with Docker, Kubernetes, and distributed event streaming platforms. Applicants must be team players capable of solving...
Performance
Immediate start
Vytwo Technologies Inc
Prosper, TX
1 day ago
Senior Data Engineer INDIA
Senior Data Engineer Remote Work: INDIA *Only Consultants local to INDIA are eligible. *No visa Sponsorship Primary Responsibilities... ...on cloud platforms such as Databricks or Snowflake, ensuring performance, reliability, and cost efficiency Design and implement robust...
Performance
Local area
Remote work
Visa sponsorship
Vytwo
Prosper, TX
2 days ago
Senior AI Solutions Engineer — Remote Enterprise NLP
Inbenta Technologies Inc. seeks a Senior AI Solutions Engineer to design and implement AI-powered solutions for enterprise clients. This role involves collaboration with teams to ensure technical excellence and deliver business value through Inbenta's AI platform. The ideal...
Remote job
Inbenta Technologies Inc.
Allen, TX
4 days ago
Remote Full-Stack AI Engineer (Contract)
Mercor is seeking Full-Stack Engineers to join their Expert Network and connect with leading AI labs. This role involves training AI models, creating deliverables based on real-world scenarios, and providing insights to advance AI research. Qualified candidates will have...
Remote job
Contract work
Flexible hours
Mercor
Frisco, TX
1 day ago
AI Security Engineer
...applications. As we continue to grow, we’re looking for a skilled AI Security Engineer to join our dynamic team and contribute to our mission of... .... Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and...
Full time
H1b
Local area
Immediate start
Remote work
Visa sponsorship
Work visa
Bright Vision Technologies
Prosper, TX
2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Performance Engineer. Be the first to apply!