Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Performance Engineer

$100k - $150k
Full-time

Bright Vision Technologies

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications.

As we continue to grow, we’re looking for a skilled AI Performance Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology.

This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.

 

AI Performance Engineer

Job Title: AI Performance Engineer
Salary Range: 100k$/Annum-150k$/Annum
Location: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Experience: 6+ years
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits.
Employment Terms & Visa Policy
This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies.
This role is part of Bright Vision Technologies’ in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies — there is no third-party client, vendor, or implementation partner involved.
We do not engage in C2C, 1099, or third-party arrangements for this role.
BUT STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2 AND NO 3RD PARTY BROKERING PLEASE.
Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.
No new H1B sponsorship is available for this role.
However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.
For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.
Job Summary
We are seeking an AI Performance Optimization Engineer to focus on extracting maximum throughput, minimizing latency, and reducing cost across training and inference workloads for large neural network systems. The role spans the full stack from low-level kernel optimization to distributed system tuning, requiring deep understanding of GPU architecture, model parallelism, memory management, and compiler-level optimization. The ideal candidate has demonstrated an impact on production of AI workloads, with strong instrumentation and measurement discipline that enables rigorous, data-driven optimization decisions. In this role you will work closely with cross-functional partners — product, design, engineering, operations, and business stakeholders — to translate ambiguous requirements into well-engineered solutions, and will be expected to raise the bar through code review, design review, and mentorship of more junior engineers. The successful candidate brings strong engineering discipline, a clear communication style, and a track record of shipping meaningful work that holds up well in production.
Key Responsibilities
  • Profile and optimize end-to-end AI training and inference pipelines for throughput, latency, and cost.
  • Identify and eliminate bottlenecks across data loading, model compute, communication, and memory.
  • Implement and tune quantization, sparsity, and pruning strategies to reduce model footprint and accelerate inference.
  • Optimize distributed training using tensor parallelism, pipeline parallelism, FSDP, and ZeRO-style sharding.
  • Tune attention implementations using Flash Attention, paged attention, and related techniques.
  • Implement KV cache optimization, continuous batching, and speculative decoding for LLM serving.
  • Drive compiler-level optimizations using Triton, XLA, Torch Inductor, or TVM, working with the broader ML framework community to land improvements that translate into measurable end-to-end performance gains.
  • Optimize data pipelines, sharding strategies, and storage access patterns for high-throughput training.
  • Build and maintain rigorous benchmark suites and regression frameworks across workloads.
  • Collaborate with ML and platform engineering teams to embed best practices in standard pipelines.
  • Drive cost-efficiency improvements through model architecture, hardware selection, and scheduling strategies.
  • Evaluate new hardware and software offerings and advise on adoption.
  • Document performance tuning playbooks and share findings broadly across engineering teams.
  • Stay current with AI systems to research and translate advances into production improvements.
Required Qualifications
  • Bachelor's or master's degree in computer science, Computer Engineering, or related field.
  • Six or more years of experience in performance engineering, ML systems, or HPC.
  • Strong proficiency in Python and C++.
  • Hands-on experience optimizing deep learning workloads on modern GPUs.
  • Deep understanding of distributed training and inference techniques.
  • Experience with profiling tools across CPU, GPU, and distributed systems.
  • Familiarity with model compression techniques and their accuracy implications.
  • Strong grasp of memory hierarchies, communication primitives, and parallelism strategies.
  • Excellent measurement, debugging, and analytical reasoning skills.
  • Strong communication and collaboration skills.
Preferred Qualifications
  • Experience optimizing LLM inference at production scale.
  • Contributions to vLLM, TensorRT-LLM, DeepSpeed, or similar projects.
  • Familiarity with custom kernel authoring in Triton or CUTLASS.
  • Experience with FinOps for AI workloads.
  • Publications or talks on AI systems performance.
How to Apply
Would you like to know more about this opportunity?
For immediate consideration, please send your resume to View email address on brightvisiontechnologies.applytojob.com or contact us at Show phone number. Learn more about Bright Vision Technologies at
We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company.
We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs.
Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.
Position offered by “No Fee Agency.”

 

Equal Employment Opportunity (EEO) Statement

Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall.

BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the AI Performance Engineer in Hicksville, NY vacancy
  •  ...Responsibilities Kforce has a client that is seeking an AI Engineer in Port Washington, NY.Responsibilities: AI Engineer will...  ...with existing systems and tools Monitor and optimize model performance and reliability Partner with business and technical teams... 
    Performance
    Hourly pay
    Contract work

    Kforce

    Port Washington, NY
    13 hours ago
  • $73.5k - $212.28k

     ...At PwC, our people in data and analytics engineering focus on leveraging advanced...  ...member's unique strengths, and managing performance to deliver on client expectations. With...  ...will lead the development of innovative AI solutions that drive remarkable client outcomes... 
    Performance
    Full time
    H1b

    PwC

    Melville, NY
    3 days ago
  • $100k - $150k

     ...As we continue to grow, we’re looking for a skilled Edge AI Engineer to join our dynamic team and contribute to our mission of transforming...  ...to fit models within edge constraints. Tune model performance for latency, energy efficiency, and memory footprint on target... 
    Performance
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Hicksville, NY
    23 days ago
  • $100k - $150k

     ...enterprise-grade applications, data-intensive services, and automation platforms. This is a hands-on engineering role focused on delivering robust, secure, and high-performance Python systems that operate reliably within distributed and mission-critical production... 
    Performance
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Hicksville, NY
    1 day ago
  • $100k - $150k

     .... As we continue to grow, we’re looking for a skilled AI Infrastructure Engineer to join our dynamic team and contribute to our mission of...  ...clusters, distributed training frameworks, scheduling, storage performance, and developer experience for ML engineers and... 
    Performance
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Hicksville, NY
    23 days ago
  • $100k - $150k

     ...applications. As we continue to grow, we’re looking for a skilled AI Data Engineer to join our dynamic team and contribute to our mission of...  ...health across the AI data estate. Optimize cost and performance through compression, format selection, and caching... 
    Performance
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Hicksville, NY
    3 days ago
  •  ...collaboratively to develop an application using the .NET Stack Perform various types of testing which include unit and integrating...  ...Qualifications Bachelor’s degree or higher in Computer Science, Engineering, or related field. 10+ years’ experience in Application Development... 
    Performance

    Direct Staffing Inc

    Hicksville, NY
    3 days ago
  • $100k - $150k

     ...requirements analysis, technical design, hands-on coding, unit testing, performance tuning, and post-go-live support in close collaboration with...  ...Qualifications Bachelor's degree in Computer Science, Engineering, or a related technical discipline. Five or more years of... 
    Performance
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Hicksville, NY
    1 day ago
  • $100.25k - $164.69k

     ...teams to support them. Reporting to the Engineering Manager, the SDE II DevOps Toolchain will...  ...with data formats and locality with performance considerations. Demonstrated experience...  ...related to technology. Familiarity with AI tools and AI first mindset We are an Equal... 
    Performance
    Local area

    Optimum

    Bethpage, NY
    3 days ago
  • $50 - $60 per hour

     ...DataAnnotation is committed to creating high-quality AI. Enjoy the flexibility of remote work and the freedom to set your own schedule...  ...Evaluate the quality produced by AI models for correctness and performance   Qualifications: Fluency in English (native or... 
    Performance
    Hourly pay
    Contract work
    For contractors
    Work experience placement
    Remote work

    Data Annotation

    Wantagh, NY
    more than 2 months ago
  • $110k

     ...with relevant experience of at least 5 years in web applications development to join our team. Responsibilities Write hands-on high-performance Java code, using Spring/Spring Boot and Python Build cloud services on top of the modern Infrastructure as a Service (IaaS)... 
    Performance
    Full time
    Remote work
    Monday to Friday

    Gain America

    Hicksville, NY
    3 days ago
  • $50 - $60 per hour

     ...DataAnnotation is committed to creating high-quality AI. Enjoy the flexibility of remote work and the freedom to set your own schedule...  ...Evaluate the quality produced by AI models for correctness and performance   Qualifications: Fluency in English (native or... 
    Performance
    Hourly pay
    Contract work
    For contractors
    Work experience placement
    Remote work

    Data Annotation

    Hicksville, NY
    more than 2 months ago
  • A leading education organization is seeking an Artificial Intelligence Developer Intern to support the development of an AI knowledge system. This hybrid internship involves hands-on work with AI, data structuring, and software development to enhance operational efficiency... 
    Internship

    Red Door Learning Company

    Hicksville, NY
    2 days ago
  • $120k - $140k

     ...in administration, strong knowledge of Linux and Windows, and expertise in Oracle and PostgreSQL. Your role includes optimizing performance, managing backups, and ensuring the security of systems. The position offers a competitive salary range of $120,000 to $140,000 annually... 
    Performance

    Lightpath

    Bethpage, NY
    4 days ago
  • A US-based technology company in New York is seeking a Sr Java web applications developer to create high-performance applications. The ideal candidate has over 5 years of relevant experience and skills in Java, Spring, and cloud services. This full-time position offers... 
    Performance
    Remote job
    Full time

    Gain America

    Hicksville, NY
    13 hours ago
  • $115k - $140k

    The Data Engineer will play a critical role in designing, building, and scaling enterprise data platform in the cloud. This role is responsible...  ...analytics, and ensuring the reliability, security, and performance of cloud-based data solutions. While this is a Microsoft-first... 
    Performance

    MAP SSG Inc

    Jericho, NY
    2 days ago
  • $100k - $150k

     ...Summary We are seeking an experienced MuleSoft Integration Engineer to design, build, and operate enterprise integration solutions...  ...Salesforce Platform Events, or Kafka. Optimize integration performance, including pooling, caching, parallel processing, and message... 
    Performance
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Hicksville, NY
    1 day ago
  • $100k - $150k

     ...applications. As we continue to grow, we’re looking for a skilled AI Security Engineer to join our dynamic team and contribute to our mission of...  .... Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and... 
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Hicksville, NY
    23 days ago
  • $110k - $120k

     ...programming business. We are seeking an AI Specialist to join our Technology Services...  ...AI solutions, assessing feasibility, performance, and scalability. Assess risks and establish...  ...with enterprise architects and IT engineering teams to deploy AI solutions into existing... 
    Performance
    Work at office
    Work from home
    1 day per week

    AMC Networks

    Bethpage, NY
    13 hours ago
  •  ...deliver our clients a competitive edge. Our globally distributed engineering teams focus on adaptable technology and open architecture to...  ...for designing, building, and maintaining low latency, high-performance integrations with market data providers and maintaining... 
    Performance
    Flexible hours

    FlexTrade Systems Inc.

    Great Neck, NY
    3 days ago
  •  ...Specialist in Bethpage, New York. This role focuses on driving sales performance by designing and executing enablement programs. You'll partner...  ...strong skills in facilitation and analytics. Familiarity with AI tools like ChatGPT is vital for improving engagement and... 
    Performance

    Altice USA

    Bethpage, NY
    2 days ago
  • $50 - $60 per hour

     ...Data Annotation is committed to creating high-quality AI. Join our team to help train the nextgeneration of AI while enjoying the...  ...Evaluate the quality produced by AI models for correctness and performance Qualifications: Fluency in English (native or bilingual level)... 
    Performance
    Hourly pay
    Full time
    Contract work
    Part time
    Work experience placement
    Remote work

    Data Annotation

    East Meadow, NY
    1 day ago
  • $100k - $150k

     ...applications. As we continue to grow, we’re looking for a skilled AI Research Engineer to join our dynamic team and contribute to our mission of...  .... Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and... 
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Hicksville, NY
    3 days ago
  • $110k

     ...Develop software solutions using Java (SE & EE), Salesforce.com API framework, and 3rd party APIs. Improve architecture and optimize performance of very complex software systems. Troubleshoot and resolve problems. Education & Experience Preferred education: Typically a... 
    Performance
    Full time
    Remote work
    Monday to Friday

    Gain America

    Hicksville, NY
    13 hours ago
  • $70k

     ...Additional Links About Us Social Commitment Diversity & Inclusion Privacy Policy Blog Get In Touch Partners Franchise Opportunity We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. #J-18808-Ljbffr... 
    Full time
    Remote work
    Monday to Friday

    Gain America

    Hicksville, NY
    2 days ago
  • $1,600 per week

     ...the backbone of our success, and we work hard to create a safe, supportive environment where your experience is valued and your performance is rewarded. CDL-A Company Driver Overview Top earners reach $90,000 per year.* Strong earnings averaging $1,400-$1,600, with... 
    Performance
    Full time

    Kivi Bros Trucking

    Farmingdale, NY
    3 days ago
  •  ...js. The primary focus of the role is to drive the development of front-end components. You will be responsible for high quality, performant deliverables and for helping to create, evangelize, and enforce the standards necessary to meet team and company goals for the customer... 
    Performance
    Summer work
    Work at office
    Remote work

    Kliger-Weiss Infosystems, Inc.

    Melville, NY
    1 day ago
  •  ...roles/memberships Understanding of how each policy, role, and membership work together to create the users Collaborating with Engineers to create policy Other Skills / Abilities Customer-service oriented with excellent problem-solving skills Ability to... 
    Casual work

    eVero Corporation

    Syosset, NY
    13 days ago
  •  ...When testing is completed, the Software Architect Expert will be required to migrate the programs to the production environment and perform post production validation of the implementation. If you are interested in this exciting opportunity, please submit... 
    Contract work

    InterSources

    Syosset, NY
    3 days ago
  •  ...he Role: Cognizant seeks to hire an Sr. Golang Developer for our IoT Practice. The IoT Practice provides Product Engineering best practice/standards development, deployment, management, and operational support across the communications, media and technology industry... 
    Temporary work

    Omni Inclusive

    Bethpage, NY
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Performance Engineer. Be the first to apply!