AI Performance Engineer
Bright Vision Technologies
AI Performance Engineer
Job Title: AI Performance EngineerLocation: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Experience: 6+ years
Salary: 100k - 150k
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits. Employment Terms & Visa Policy This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies.
This role is part of Bright Vision Technologies’ in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies — there is no third-party client, vendor, or implementation partner involved.
We do not engage in C2C, 1099, or third-party arrangements for this role. BUT STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2 AND NO 3RD PARTY BROKERING PLEASE.
Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.
No new H1B sponsorship is available for this role. However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.
For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience. Job Summary
We are seeking an AI Performance Engineer to focus on extracting maximum throughput, minimizing latency, and reducing cost across training and inference workloads for large neural network systems. The role spans the full stack from low-level kernel optimization to distributed system tuning, requiring deep understanding of GPU architecture, model parallelism, memory management, and compiler-level optimization. The ideal candidate has demonstrated impact on production AI workloads, with strong instrumentation and measurement discipline that enables rigorous, data-driven optimization decisions. In this role you will work closely with cross-functional partners — product, design, engineering, operations, and business stakeholders — to translate ambiguous requirements into well-engineered solutions, and will be expected to raise the bar through code review, design review, and mentorship of more junior engineers. The successful candidate brings strong engineering discipline, a clear communication style, and a track record of shipping meaningful work that holds up well in production. Key Responsibilities
- Profile and optimize end-to-end AI training and inference pipelines for throughput, latency, and cost.
- Identify and eliminate bottlenecks across data loading, model compute, communication, and memory.
- Implement and tune quantization, sparsity, and pruning strategies to reduce model footprint and accelerate inference.
- Optimize distributed training using tensor parallelism, pipeline parallelism, FSDP, and ZeRO-style sharding.
- Tune attention implementations using FlashAttention, paged attention, and related techniques.
- Implement KV cache optimization, continuous batching, and speculative decoding for LLM serving.
- Drive compiler-level optimizations using Triton, XLA, TorchInductor, or TVM, working with the broader ML framework community to land improvements that translate into measurable end-to-end performance gains.
- Optimize data pipelines, sharding strategies, and storage access patterns for high-throughput training.
- Build and maintain rigorous benchmark suites and regression frameworks across workloads.
- Collaborate with ML and platform engineering teams to embed best practices in standard pipelines.
- Drive cost-efficiency improvements through model architecture, hardware selection, and scheduling strategies.
- Evaluate new hardware and software offerings, and advise on adoption.
- Document performance tuning playbooks and share findings broadly across engineering teams.
- Stay current with AI systems research and translate advances into production improvements.
- Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related field.
- Six or more years of experience in performance engineering, ML systems, or HPC.
- Strong proficiency in Python and C++.
- Hands-on experience optimizing deep learning workloads on modern GPUs.
- Deep understanding of distributed training and inference techniques.
- Experience with profiling tools across CPU, GPU, and distributed systems.
- Familiarity with model compression techniques and their accuracy implications.
- Strong grasp of memory hierarchies, communication primitives, and parallelism strategies.
- Excellent measurement, debugging, and analytical reasoning skills.
- Strong communication and collaboration skills.
- Experience optimizing LLM inference at production scale.
- Contributions to vLLM, TensorRT-LLM, DeepSpeed, or similar projects.
- Familiarity with custom kernel authoring in Triton or CUTLASS.
- Experience with FinOps for AI workloads.
- Publications or talks on AI systems performance.
Would you like to know more about this opportunity?
For immediate consideration, please send your resume to View email address on brightvisiontechnologies.applytojob.com or contact us at Show phone number. Learn more about Bright Vision Technologies at
We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company.
We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs.
Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.
Position offered by “No Fee Agency.”
Equal Employment Opportunity (EEO) Statement
Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall.
BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.
- ...As we continue to grow, we’re looking for a skilled Edge AI Engineer to join our dynamic team and contribute to our mission of transforming... ...to fit models within edge constraints. Tune model performance for latency, energy efficiency, and memory footprint on target...PerformanceFull timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa
- ...Join our eCommerce Operational Intelligence team as a Lead AI Data Engineer. You will build enterprise-scale data pipelines and analytics... ...reporting. Design analytics-ready schemas and data models for performance and scale. Troubleshoot pipelines, microservices, and APIs;...PerformanceFlexible hours
- Senior AI Solutions Engineer - Brazil (bi-lingual English/Portuguese) Inbenta is a global technology company headquartered in Allen, Texas,... ...onboarding and go‑live. Optimization & Support Monitor solution performance and recommend enhancements. Provide technical support and...PerformanceRemote work
- ...Job Description Job Description Role: GenAI & Agentic AI Engineer Location: Whippany, NJ (Hybrid) Hire Type: FTE * Must be... ...Monitor, evaluate, and optimize GenAI models for accuracy, performance, and cost. Expertise You'll Bring: ~5+ years of experience...PerformanceWork from homeFlexible hours
- .... As we continue to grow, we’re looking for a skilled AI Infrastructure Engineer to join our dynamic team and contribute to our mission of... ...pipeline health across the AI data estate. Optimize cost and performance through compression, format selection, and caching...PerformanceFull timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa
- ...enterprise-grade applications, data-intensive services, and automation platforms. This is a hands-on engineering role focused on delivering robust, secure, and high-performance Python systems that operate reliably within distributed and mission-critical production...PerformanceFull timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa
- ...enterprise‑grade applications, data‑intensive services, and automation platforms. This hands‑on engineering role focuses on delivering robust, secure, and high‑performance Python systems that operate reliably within distributed, mission‑critical production environments...PerformanceFull timeH1bRemote workVisa sponsorshipWork visa
- ...Job Description Job Description Role: Senior AI Engineer - Agentic Systems and LLM Client Location: Mason, OH 100% Remote... ...and cloud platforms Establish evaluation, reliability, and performance strategies (accuracy, latency, cost) Job Requirements - ~2...PerformanceRemote work
- ...security at scale. You’ll lead a focused team of 3-5 senior engineers while remaining deeply involved in the code and... ...architecture. Your core responsibility is building high‑performance, privacy‑preserving AI models that run directly on user devices (Mac, iOS, Android...Performance
$127.4k - $236.6k
...Engagement Manager and Guided Assurance. As a Lead Software Engineer AI (Staff Engineer AI), you will own core backend and AI orchestration... ...(OpenAI, Anthropic, and others). Scale, reliability, and performance Design for high‑throughput, low‑latency AI workloads:...PerformanceFull timeWork at officeLocal areaFlexible hoursShift work2 days per week3 days per week- ...secure, and user‑friendly applications. Principal Software Engineer Job Title: Principal Software Engineer Location: 100%... ...the organization, requiring deep expertise in system design, performance engineering, reliability architecture, and cross‑organizational...PerformanceFull timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa
- ...requirements analysis, technical design, hands‑on coding, unit testing, performance tuning, and post‑go‑live support in close collaboration with... ...Qualifications Bachelor’s degree in Computer Science, Engineering, or a related technical discipline. Five or more years of...PerformanceFull timeH1bLocal areaRemote workVisa sponsorshipWork visa
- ...Lead Developer in Artificial Intelligence (AI) to join our dynamic team. The ideal... ...applications. Collaborate with data scientists, engineers, and other stakeholders to define... ...and algorithms. Ensure the scalability, performance, and reliability of AI systems. Stay updated...Performance
- AI ML Developer needs 3+ years of experience as AI/ML Developer. Experience with AI/ML large language models, implementing or deploying... ...expenditures. Monitor and analyze new technology product performance and resolving issues regarding potential improvements or modifications...Performance
- ...AI Engineer Location: 6565 Headquarters Drive, Plano, Texas 75024. Duration: 6 months. Rate: $80. Role Descriptions Must Have... ...basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote...
- Vytwo Technologies Inc. is seeking a Senior Data Engineer who will be involved in designing and maintaining scalable data pipelines.... ...Azure. This is a fully remote position offering opportunities for performance recognition and career development. Candidates must possess...PerformanceRemote job
$50 - $60 per hour
...DataAnnotation is committed to creating high-quality AI. Enjoy the flexibility of remote work and the freedom to set your own schedule... ...Evaluate the quality produced by AI models for correctness and performance Qualifications: Fluency in English (native or...PerformanceHourly payContract workFor contractorsWork experience placementRemote work- Refinitiv is seeking a Senior Software Engineer - AI (Legal) to join their CoCounsel Forward Deployed Engineering team based in Frisco, Texas. You'll collaborate with legal professionals and engineers to develop tailored AI solutions for law firms. The role involves designing...Contract workFlexible hours
- Refinitiv is hiring a Senior Software Engineer - AI to design and deploy AI solutions tailored for legal professionals. The role involves building robust systems that enhance legal workflows, such as contract reviews and regulatory analysis, within a collaborative environment...Remote jobContract workFlexible hours
- ...least 7 years of experience in .NET and C#, strong problem-solving skills, and the ability to collaborate across teams. Flexible work-from-home options are available, emphasizing a balanced work environment while delivering high-performance solutions. #J-18808-Ljbffr VytwoPerformanceRemote jobWork from homeFlexible hours
- ...thinking analytics company is seeking experienced Databricks Data Engineers to architect scalable data pipelines at their Texas-based team.... ...involves designing complex ETL workflows and collaborating with AI/ML engineers. Ideal candidates will have over 4 years of...
- T-Mobile is seeking a Senior Engineer for Enterprise AI to design and implement AI-powered applications that enhance productivity. You will be central to developing enterprise-grade AI solutions using leading models and frameworks, ensuring smooth integration across teams...
$50 - $60 per hour
...DataAnnotation is committed to creating high-quality AI. Join our team to help train the nextgeneration of AI while enjoying the... ...Evaluate the quality produced by AI models for correctness and performance Qualifications: Fluency in English (native or bilingual level)...PerformanceHourly payFull timeContract workPart timeWork experience placementRemote work- Mcafee is looking for a Lead AI Data Engineer to join their eCommerce Operational Intelligence team in Frisco, Texas. In this role, you will build enterprise-scale data pipelines and analytics foundations that drive operational insights and business impact. Ideal candidates...Flexible hours
- Mcafee is looking for a Lead AI Enterprise Data Engineer in Frisco, TX. This hybrid role focuses on transforming eCommerce operational analytics through scalable data pipelines and AI-powered insights. The ideal candidate will have extensive experience in enterprise development...
- ...C#, strong object-oriented programming skills, and expertise in SQL/NoSQL databases. The role requires optimizing data pipeline performance and familiarity with Docker, Kubernetes, and distributed event streaming platforms. Applicants must be team players capable of solving...PerformanceImmediate start
- Senior Data Engineer Remote Work: INDIA *Only Consultants local to INDIA are eligible. *No visa Sponsorship Primary Responsibilities... ...on cloud platforms such as Databricks or Snowflake, ensuring performance, reliability, and cost efficiency Design and implement robust...PerformanceLocal areaRemote workVisa sponsorship
- Inbenta Technologies Inc. seeks a Senior AI Solutions Engineer to design and implement AI-powered solutions for enterprise clients. This role involves collaboration with teams to ensure technical excellence and deliver business value through Inbenta's AI platform. The ideal...Remote job
- Mercor is seeking Full-Stack Engineers to join their Expert Network and connect with leading AI labs. This role involves training AI models, creating deliverables based on real-world scenarios, and providing insights to advance AI research. Qualified candidates will have...Remote jobContract workFlexible hours
- ...applications. As we continue to grow, we’re looking for a skilled AI Security Engineer to join our dynamic team and contribute to our mission of... .... Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and...Full timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Performance Engineer. Be the first to apply!
- senior performance tester Prosper, TX
- senior performance engineer Prosper, TX
- lead performance test engineer Prosper, TX
- performance testing Prosper, TX
- acting performance Prosper, TX
- performance engineer Prosper, TX
- IT performance management Prosper, TX
- ai engineer
- generative ai engineer
- machine learning ai engineer


