AI Performance Engineer
$100k - $150kBright Vision Technologies
AI Performance Engineer
Job Title: AI Performance EngineerSalary Range: 100k$/Annum-150k$/Annum
Location: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Experience: 6+ years
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits.
Employment Terms & Visa Policy
This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies.
This role is part of Bright Vision Technologies’ in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies — there is no third-party client, vendor, or implementation partner involved.
We do not engage in C2C, 1099, or third-party arrangements for this role.
BUT STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2 AND NO 3RD PARTY BROKERING PLEASE.
Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.
No new H1B sponsorship is available for this role.
However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.
For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.
Job Summary
We are seeking an AI Performance Optimization Engineer to focus on extracting maximum throughput, minimizing latency, and reducing cost across training and inference workloads for large neural network systems. The role spans the full stack from low-level kernel optimization to distributed system tuning, requiring deep understanding of GPU architecture, model parallelism, memory management, and compiler-level optimization. The ideal candidate has demonstrated an impact on production of AI workloads, with strong instrumentation and measurement discipline that enables rigorous, data-driven optimization decisions. In this role you will work closely with cross-functional partners — product, design, engineering, operations, and business stakeholders — to translate ambiguous requirements into well-engineered solutions, and will be expected to raise the bar through code review, design review, and mentorship of more junior engineers. The successful candidate brings strong engineering discipline, a clear communication style, and a track record of shipping meaningful work that holds up well in production.
Key Responsibilities
- Profile and optimize end-to-end AI training and inference pipelines for throughput, latency, and cost.
- Identify and eliminate bottlenecks across data loading, model compute, communication, and memory.
- Implement and tune quantization, sparsity, and pruning strategies to reduce model footprint and accelerate inference.
- Optimize distributed training using tensor parallelism, pipeline parallelism, FSDP, and ZeRO-style sharding.
- Tune attention implementations using Flash Attention, paged attention, and related techniques.
- Implement KV cache optimization, continuous batching, and speculative decoding for LLM serving.
- Drive compiler-level optimizations using Triton, XLA, Torch Inductor, or TVM, working with the broader ML framework community to land improvements that translate into measurable end-to-end performance gains.
- Optimize data pipelines, sharding strategies, and storage access patterns for high-throughput training.
- Build and maintain rigorous benchmark suites and regression frameworks across workloads.
- Collaborate with ML and platform engineering teams to embed best practices in standard pipelines.
- Drive cost-efficiency improvements through model architecture, hardware selection, and scheduling strategies.
- Evaluate new hardware and software offerings and advise on adoption.
- Document performance tuning playbooks and share findings broadly across engineering teams.
- Stay current with AI systems to research and translate advances into production improvements.
- Bachelor's or master's degree in computer science, Computer Engineering, or related field.
- Six or more years of experience in performance engineering, ML systems, or HPC.
- Strong proficiency in Python and C++.
- Hands-on experience optimizing deep learning workloads on modern GPUs.
- Deep understanding of distributed training and inference techniques.
- Experience with profiling tools across CPU, GPU, and distributed systems.
- Familiarity with model compression techniques and their accuracy implications.
- Strong grasp of memory hierarchies, communication primitives, and parallelism strategies.
- Excellent measurement, debugging, and analytical reasoning skills.
- Strong communication and collaboration skills.
- Experience optimizing LLM inference at production scale.
- Contributions to vLLM, TensorRT-LLM, DeepSpeed, or similar projects.
- Familiarity with custom kernel authoring in Triton or CUTLASS.
- Experience with FinOps for AI workloads.
- Publications or talks on AI systems performance.
Would you like to know more about this opportunity?
For immediate consideration, please send your resume to View email address on brightvisiontechnologies.applytojob.com or contact us at Show phone number. Learn more about Bright Vision Technologies at
We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company.
We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs.
Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.
Position offered by “No Fee Agency.”
Equal Employment Opportunity (EEO) Statement
Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall.
BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.
- ...Responsibilities Kforce has a client that is seeking an AI Engineer in Port Washington, NY.Responsibilities: AI Engineer will... ...with existing systems and tools Monitor and optimize model performance and reliability Partner with business and technical teams...PerformanceHourly payContract work
$73.5k - $212.28k
...At PwC, our people in data and analytics engineering focus on leveraging advanced... ...member's unique strengths, and managing performance to deliver on client expectations. With... ...will lead the development of innovative AI solutions that drive remarkable client outcomes...PerformanceFull timeH1b$100k - $150k
...As we continue to grow, we’re looking for a skilled Edge AI Engineer to join our dynamic team and contribute to our mission of transforming... ...to fit models within edge constraints. Tune model performance for latency, energy efficiency, and memory footprint on target...PerformanceFull timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa$100k - $150k
...enterprise-grade applications, data-intensive services, and automation platforms. This is a hands-on engineering role focused on delivering robust, secure, and high-performance Python systems that operate reliably within distributed and mission-critical production...PerformanceFull timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa$100k - $150k
.... As we continue to grow, we’re looking for a skilled AI Infrastructure Engineer to join our dynamic team and contribute to our mission of... ...clusters, distributed training frameworks, scheduling, storage performance, and developer experience for ML engineers and...PerformanceFull timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa$100k - $150k
...applications. As we continue to grow, we’re looking for a skilled AI Data Engineer to join our dynamic team and contribute to our mission of... ...health across the AI data estate. Optimize cost and performance through compression, format selection, and caching...PerformanceFull timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa- ...collaboratively to develop an application using the .NET Stack Perform various types of testing which include unit and integrating... ...Qualifications Bachelor’s degree or higher in Computer Science, Engineering, or related field. 10+ years’ experience in Application Development...Performance
$100k - $150k
...requirements analysis, technical design, hands-on coding, unit testing, performance tuning, and post-go-live support in close collaboration with... ...Qualifications Bachelor's degree in Computer Science, Engineering, or a related technical discipline. Five or more years of...PerformanceFull timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa$100.25k - $164.69k
...teams to support them. Reporting to the Engineering Manager, the SDE II DevOps Toolchain will... ...with data formats and locality with performance considerations. Demonstrated experience... ...related to technology. Familiarity with AI tools and AI first mindset We are an Equal...PerformanceLocal area$50 - $60 per hour
...DataAnnotation is committed to creating high-quality AI. Enjoy the flexibility of remote work and the freedom to set your own schedule... ...Evaluate the quality produced by AI models for correctness and performance Qualifications: Fluency in English (native or...PerformanceHourly payContract workFor contractorsWork experience placementRemote work$110k
...with relevant experience of at least 5 years in web applications development to join our team. Responsibilities Write hands-on high-performance Java code, using Spring/Spring Boot and Python Build cloud services on top of the modern Infrastructure as a Service (IaaS)...PerformanceFull timeRemote workMonday to Friday$50 - $60 per hour
...DataAnnotation is committed to creating high-quality AI. Enjoy the flexibility of remote work and the freedom to set your own schedule... ...Evaluate the quality produced by AI models for correctness and performance Qualifications: Fluency in English (native or...PerformanceHourly payContract workFor contractorsWork experience placementRemote work- A leading education organization is seeking an Artificial Intelligence Developer Intern to support the development of an AI knowledge system. This hybrid internship involves hands-on work with AI, data structuring, and software development to enhance operational efficiency...Internship
$120k - $140k
...in administration, strong knowledge of Linux and Windows, and expertise in Oracle and PostgreSQL. Your role includes optimizing performance, managing backups, and ensuring the security of systems. The position offers a competitive salary range of $120,000 to $140,000 annually...Performance- A US-based technology company in New York is seeking a Sr Java web applications developer to create high-performance applications. The ideal candidate has over 5 years of relevant experience and skills in Java, Spring, and cloud services. This full-time position offers...PerformanceRemote jobFull time
$115k - $140k
The Data Engineer will play a critical role in designing, building, and scaling enterprise data platform in the cloud. This role is responsible... ...analytics, and ensuring the reliability, security, and performance of cloud-based data solutions. While this is a Microsoft-first...Performance$100k - $150k
...Summary We are seeking an experienced MuleSoft Integration Engineer to design, build, and operate enterprise integration solutions... ...Salesforce Platform Events, or Kafka. Optimize integration performance, including pooling, caching, parallel processing, and message...PerformanceFull timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa$100k - $150k
...applications. As we continue to grow, we’re looking for a skilled AI Security Engineer to join our dynamic team and contribute to our mission of... .... Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and...Full timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa$110k - $120k
...programming business. We are seeking an AI Specialist to join our Technology Services... ...AI solutions, assessing feasibility, performance, and scalability. Assess risks and establish... ...with enterprise architects and IT engineering teams to deploy AI solutions into existing...PerformanceWork at officeWork from home1 day per week- ...deliver our clients a competitive edge. Our globally distributed engineering teams focus on adaptable technology and open architecture to... ...for designing, building, and maintaining low latency, high-performance integrations with market data providers and maintaining...PerformanceFlexible hours
- ...Specialist in Bethpage, New York. This role focuses on driving sales performance by designing and executing enablement programs. You'll partner... ...strong skills in facilitation and analytics. Familiarity with AI tools like ChatGPT is vital for improving engagement and...Performance
$50 - $60 per hour
...Data Annotation is committed to creating high-quality AI. Join our team to help train the nextgeneration of AI while enjoying the... ...Evaluate the quality produced by AI models for correctness and performance Qualifications: Fluency in English (native or bilingual level)...PerformanceHourly payFull timeContract workPart timeWork experience placementRemote work$100k - $150k
...applications. As we continue to grow, we’re looking for a skilled AI Research Engineer to join our dynamic team and contribute to our mission of... .... Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and...Full timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa$110k
...Develop software solutions using Java (SE & EE), Salesforce.com API framework, and 3rd party APIs. Improve architecture and optimize performance of very complex software systems. Troubleshoot and resolve problems. Education & Experience Preferred education: Typically a...PerformanceFull timeRemote workMonday to Friday$70k
...Additional Links About Us Social Commitment Diversity & Inclusion Privacy Policy Blog Get In Touch Partners Franchise Opportunity We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. #J-18808-Ljbffr...Full timeRemote workMonday to Friday$1,600 per week
...the backbone of our success, and we work hard to create a safe, supportive environment where your experience is valued and your performance is rewarded. CDL-A Company Driver Overview Top earners reach $90,000 per year.* Strong earnings averaging $1,400-$1,600, with...PerformanceFull time- ...js. The primary focus of the role is to drive the development of front-end components. You will be responsible for high quality, performant deliverables and for helping to create, evangelize, and enforce the standards necessary to meet team and company goals for the customer...PerformanceSummer workWork at officeRemote work
- ...roles/memberships Understanding of how each policy, role, and membership work together to create the users Collaborating with Engineers to create policy Other Skills / Abilities Customer-service oriented with excellent problem-solving skills Ability to...Casual work
- ...When testing is completed, the Software Architect Expert will be required to migrate the programs to the production environment and perform post production validation of the implementation. If you are interested in this exciting opportunity, please submit...Contract work
- ...he Role: Cognizant seeks to hire an Sr. Golang Developer for our IoT Practice. The IoT Practice provides Product Engineering best practice/standards development, deployment, management, and operational support across the communications, media and technology industry...Temporary work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Performance Engineer. Be the first to apply!
- senior performance tester Hicksville, NY
- senior performance engineer Hicksville, NY
- performance testing Hicksville, NY
- acting performance Hicksville, NY
- performance engineer Hicksville, NY
- IT performance management Hicksville, NY
- ai engineer
- generative ai engineer
- machine learning ai engineer
- ai research engineer




