Model Serving Engineer
Bright Vision Technologies
Model Serving Engineer
Job Title: Model Serving EngineerLocation: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Salary : 100 K - 150 K
Experience: 6+ years
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits. Employment Terms & Visa Policy
This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies.
This role is part of Bright Vision Technologies’ in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies — there is no third-party client, vendor, or implementation partner involved.
We do not engage in C2C, 1099, or third-party arrangements for this role. BUT STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2 AND NO 3RD PARTY BROKERING PLEASE.
Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.
No new H1B sponsorship is available for this role. However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates.
For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience. Job Summary
We are seeking a Model Serving Engineer to design, build, and operate high-performance, highly reliable inference platforms for serving large machine learning models in production. The role focuses on the systems engineering side of AI deployment, including request routing, batching, caching, autoscaling, GPU utilization, and end-to-end observability across diverse model workloads. The ideal candidate brings strong distributed systems and performance engineering expertise, has shipped serving systems at scale, and understands the trade-offs between latency, throughput, cost, and quality in ML serving. Key Responsibilities
- Design and operate model serving platforms supporting diverse workloads including LLMs, vision models, and recommendation systems.
- Optimize inference performance using continuous batching, paged attention, speculative decoding, and request multiplexing.
- Implement multi-tenant routing, rate limiting, and quality-of-service policies across model endpoints.
- Build autoscaling and capacity management systems that balance latency, throughput, and cost.
- Tune GPU utilization, memory management, and KV cache strategies for LLM serving workloads.
- Integrate model serving with API gateways, identity systems, and observability platforms.
- Implement caching, prompt deduplication, and response reuse strategies where appropriate.
- Drive end-to-end observability including latency histograms, queue dynamics, GPU utilization, and error tracking.
- Develop deployment workflows including canary releases, shadow testing, and automated rollback.
- Operate incident response for high-availability AI services and drive durable reliability improvements.
- Collaborate with ML and product teams to support new model releases and capability rollouts.
- Implement security controls including request signing, content filtering, and abuse detection at the serving layer.
- Document operational procedures, performance characteristics, and tuning guidance for internal teams.
- Stay current with AI serving research and translate advances into production capabilities.
- Bachelor’s or Master’s degree in Computer Science or a related field.
- Six or more years of experience in distributed systems, infrastructure, or ML platform engineering.
- Strong proficiency in Python and a systems language such as Go, Rust, or C++.
- Deep experience operating high-throughput, low-latency services in production.
- Hands-on experience with LLM or large model inference frameworks such as vLLM or TensorRT-LLM.
- Strong understanding of GPU architecture, memory hierarchies, and accelerator utilization.
- Familiarity with Kubernetes, autoscaling, and modern cloud platforms.
- Experience with observability stacks including metrics, tracing, and structured logging.
- Solid grounding in performance engineering and capacity planning.
- Strong communication and incident response skills.
- Open-source contributions to model serving infrastructure.
- Experience with multi-region or globally distributed AI serving.
- Familiarity with model quantization, distillation, and compression techniques.
- Exposure to FinOps for AI workloads and cost-efficient serving design.
- Experience supporting external-facing AI APIs at scale.
Would you like to know more about this opportunity?
For immediate consideration, please send your resume to View email address on click.appcast.io or contact us at View phone number on click.appcast.io. Learn more about Bright Vision Technologies at
We recognize that our people are our strength, and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company.
We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs.
Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.
Position offered by “No Fee Agency.”
Equal Employment Opportunity (EEO) Statement
Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall.
BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.
$102.8k
...focus on the health, happiness, and well-being of you and those we serve - we care. What you do at McKesson matters. We foster a culture... ...hear from you. The position as Manager, Pricing MHS Economic Model is an individual contributor, responsible for managing and...SuggestedFull timeContract workWork at officeRemote workWork from home2 days per week- ...Job Title Civil Engineer Company Bio MHES is a well-established engineering firm... ...CAD technicians, and administrative staff serving clients globally. Typical... ...Proficiency with Hydrologic & Hydraulic modeling software such as: HEC-HMS, HEC-RAS, HY-8...SuggestedLocal areaMonday to Friday
$167.9k - $193.1k
...generation and other industrial plants. Summary The Project Engineer will manage the design, technical specifications, equipment... ...and safety standards, on schedule, and within budget. Serve as a key liaison between engineering, contractors, and clients....SuggestedFor contractorsLocal areaMonday to Friday$87.83k - $131.74k
...District/Group: Weeks Marine - Construction Department: Project Engineering Market: Building Employment Type: Full Time Position... ...and diversity of its equipment fleet, enables the company to serve as a one-stop shop for clients in both the private and...SuggestedPermanent employmentFull timeContract workTemporary workWork experience placementRelocationWeekend work$35.62 - $52.99 per hour
...includeseminar course training in cardiac ultrasound Where You'll Work The Woodlands Hospital a primary and secondary care hospital serving North Harris and Montgomery counties. Clinical services include cardiovascular services, diagnostic imaging, women's services (...Suggested- ...MacDonald, we trust our brilliant people to do brilliant things in engineering, management, and development services, supporting multisector... ...positive change for our clients and the communities they serve. You'll be trusted to contribute, encouraged to grow, and supported...Contract workLive inLocal area
- ...Job Title: Senior Project Engineer Location: Tinian Island Position Type: Full-Time Citizenship: U.S. National with a valid... ...site construction challenges. Stakeholder Communication: Serve as the primary technical point of contact for clients,...Full timeFor contractorsFor subcontractorRemote work
- ...Senior Electrical Engineer Pond is seeking a Senior Electrical Engineer to join our growing Energy team in one of the following offices... ...energy and utility projects. The successful candidate will serve as a technical lead within Pond's Energy electrical team, supporting...For contractors
- ...candidate will be placed at title of Project Engineer Sr or Project Engineer Lead based on... ...Project Execution team, the Project Engineer serves as the Owner's Engineer and technical... ...under EPC turnkey and lumpsum contracting models. The position is responsible for providing...Contract workFor contractorsWork at officeLocal areaRelocationVisa sponsorshipWork visa
- ...Infrastructure Systems Engineer Job Description Department: Digital Technology Job Status: Full-time FLSA Status: Salary... ...cloud and on-premises infrastructure solutions. This role serves as a technical expert, ensuring the seamless integration of cloud...Full timeLocal areaMonday to FridayWeekend workAfternoon shiftEarly shift
- ...improvement initiatives that enhance workplace safety while maintaining operational efficiency. Associate Advocacy & Cultural Leadership - Serve as a trusted safety advisor and advocate for associate wellbeing, building strong partnerships across all levels of the organization...Full timeWork at officeRelocation packageFlexible hoursShift workNight shiftWeekend work
- ...one constant remains: our unwavering vision and dedication to serving our customers. We strive to continuously improve and optimize... ...functional teams (Operations, RME (Reliability Maintenance & Engineering), Central Teams, Human Resources, Transportation Operations, and...Full timeSummer workInternshipWork at officeLocal areaRelocationRelocation packageShift workNight shiftWeekend work
- ...resources to impact the communities we collectively serve. Position Summary We are seeking a Mechanical Project Engineer to join our MEP team! This is an opportunity to... ...leadership to refine and implement design and modeling standards and best practices. Provide final...Work at office
- ...building more than structures—we build lasting relationships. Since 1979, our team has been personally invested in the communities we serve, from North Texas to Houston, working on the schools and spaces that shape everyday life. Powered by people, we bring over 45 years...For subcontractorFlexible hours
$2,900 - $5,800 per month
.... At the center of these projects is a talented group of Civil Engineers who help to ensure that each initiative is conceived, planned and... ...may vary depending upon whether you’re currently serving, whether you’ve served before or whether you’ve never served before...Civilian ContractorFull timeContract workPart timeWork at office- ...team! We are hiring an experienced Maintenance Engineer with multi-family experience. In... ...maintenance concerns for repairs on vacants, models, clubhouse, and/or common areas to the Maintenance Supervisor. Serves as the individual responsible for maintenance...Hourly pay16 hoursWork experience placement
- ...The Woodlands, TX is looking for a Senior Automation/Controls Engineer to join their Automation Engineering team. The Senior Automation... ...America and need a strong Management of Change (MOC) background to serve as the technical authority for automationrelated changes across...
- ...Senior Electrical Engineer – Power Systems Studies Pond is seeking an experienced Senior Electrical Engineer – Power Systems Studies... ...system analysis and design. The Senior Electrical Engineer will serve as a technical contributor and task lead, supporting and...
$130.7k - $205.2k
Overview The Value Chain Process Engineer - Source to Pay is responsible for shaping breakthrough... ..., digitally enabled organization. Role Serving as a strategic architect of intelligent... .... Define clear human-AI interaction models (decision rights, escalation paths, exception...Contract workTemporary workFlexible hours- ...Nuclear Quality Assurance Engineer W-Industries is an energy service company that specializes in Automation Solutions, I&E Construction... ..., procurement, manufacturing, and testing activities. Serve as a primary quality interface with customers, regulators, Authorized...Temporary work
- ...Job Description Job Description M&S Engineering is seeking an experienced, licensed Civil Project Engineer (P.E.) to lead and stamp... ...licensure. Responsibilities Design & Technical Leadership Serve as PE of Record — lead design, stamp, and seal engineering...Work at officeLocal area
- ...historical spend analysis and utilization reporting Track and validate billing data accuracy to eliminate errors and discrepancies Serve as a liaison between internal employees and telecom vendors to resolve service, operational, and billing inquiries Collaborate...Full timeContract work
- ...Description Job Description Salary: Civil Engineer: Project Manager Experience:... ...technicians, and administrative staff serving clients globally. Typical Responsibilities... .... ~ Hydrologic & Hydraulic modeling software such as: HEC-HMS, HEC RAS, TR-5...Full timeLocal areaMonday to Friday
$137.3k - $254.9k
...products and more, our 9,000 global employees serve customers in more than 150 countries,... ...have an opportunity for a Senior Sales Engineer, Data Centers to join our Texas team. You... ...experience selling through channel distribution models or equivalent experience selling to end‑...Full time$118.13 per hour
...Permian IC&E teams executing these programs. This position will serve as the primary interface with asset automation and electrical... ...standards Coordinate with operations, automation teams, electricians, engineering contractors, Quest, and asset teams Troubleshoot workflow...Daily paidFor contractors- Overview The Instrument and Controls Engineer is responsible for managing and executing workflows across multiple assets to develop and... ...Drawings, Powerline Mapping, and Power Relay Settings.This role serves as the central coordination point for cross-functional teams,...For contractors
$130.7k - $205.2k
Hire to Retire (H2R) - Value Chain Process Engineer Description - Job Summary The Value Chain... ...-led, digitally enabled enterprise. Serving as a strategic architect of intelligent employee... ...HR to: Define clear human-AI interaction models (decision rights, escalation paths,...Full timeTemporary workLocal areaRelocationFlexible hoursShift work- Graduate Engineers - Conroe Take your career to the next level at a firm known for quality, integrity, and service. Under the supervision... .... Maintain ownership of work products with a desire to serve others through professional engineering. #J-18808-Ljbffr Bleyl...
- ...Houston ExxonMobil's state-of-the-art campus north of Houston serves as home to its Upstream, Product Solutions and Low Carbon... ...Access Management Nearest Major Market: Houston Job Segment: Information Security, Engineer, Research, Technology, EngineeringFull timePart timeWork experience placementImmediate startFlexible hours
- ...Cellipont Bioservices are growing, and we are looking for a MS&T Engineer III who believes in the potential of bridging client's... ...information for implementation of cGMP processes. The individual will serve as a subject matter expert (SME) for Autologous and Allogenic...Flexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Model Serving Engineer. Be the first to apply!



