Senior / Staff Software AI Test Engineer, AI Engineering
TWG AI
At TWG Group Holdings, LLC ("TWG Global"), we drive innovation and business transformation across a range of industries—including financial services, insurance, technology, media, and sports—by leveraging data and AI as core assets. Our AI-first, cloud-native approach delivers real-time intelligence and interactive business applications, empowering informed decision-making for both customers and employees.
We prioritize responsible data and AI practices, ensuring ethical standards and regulatory compliance. Our decentralized structure enables each business unit to operate autonomously, supported by a central AI Solutions Group, while strategic partnerships with leading data and AI vendors fuel game-changing efforts in marketing, operations, and product development.
You will collaborate with management to advance our data and analytics transformation, enhance productivity, and enable agile, data-driven decisions. By leveraging relationships with top tech startups and universities, you will help create competitive advantages and drive enterprise innovation.
At TWG Global, your contributions will support our goal of sustained growth and superior returns, as we deliver rare value and impact across our businesses.
The Role
Senior or Staff AI Software Engineer in Test to join our AI Engineering team building commercial-grade AI products. This is a software engineering role focused on test automation. You won't just write test cases, you'll design and build the frameworks, harnesses, evaluation infrastructure, and tooling that make testing AI agents and LLM-powered applications possible at scale.
Our agents are written in LangGraph and run on Azure on the TWG side, with a parallel Vercel-based stack on the Palantir side. You'll write eval sets against both, and you'll validate the surfaces our users actually touch: iOS apps, plugins, and Chrome extensions, not just the model layer.
You'll work shoulder-to-shoulder with AI engineers and data scientists, contributing production-quality code to shared repositories. The ideal candidate is a strong coder, fluent in Python and Java — who has shipped automated test infrastructure in a production environment and has hands‑on experience evaluating LLM and agentic systems.
Key Responsibilities
Framework and harness engineering
- Design and build scalable, reusable test automation frameworks for AI agents, LLM‑powered applications, and underlying APIs.
- Write clean, maintainable Python for test harnesses, eval pipelines, synthetic data generation utilities, and internal tooling.
- Treat test code as production code: code review, type hints, documentation, library design.
Evaluation infrastructure
- Build evaluation infrastructure for benchmarking agent performance against SOTA LLMs, competitors, and internal baselines.
- Own regression suites, golden datasets, rubric‑based evals, and metric dashboards.
- Build tooling for synthetic test data generation, edge‑case discovery, and adversarial testing.
Resilience and load
- Design and run release, system, performance, and load tests against streaming, stateful, and async systems.
- Build chaos and fault injection tooling for token expiry, connection pool exhaustion, provider failover, and cache pressure scenarios.
- Drive contract testing across LLM providers (Bedrock, Anthropic, OpenAI) to catch parity drift.
CI/CD and observability
- Integrate automated tests into CI/CD so every model, prompt, and code change is validated before it ships.
- Build trace‑based assertions on LangGraph state, tool calls, and agent decisions — debugging an agent failure means replaying graph state, not re‑running a prompt.
- Make observability a first‑class testing surface (LangSmith, audit logs).
Human‑in‑the‑loop and partnership
- Implement HIL review workflows where automation alone cannot validate quality, then push the automation boundary outward.
- Partner with AI engineers and data scientists on model evaluation, training and eval data prep, and root‑cause debugging of complex end‑to‑end failures.
- Champion quality engineering practices across the team: code review, coverage standards, observability, reproducibility.
- Ensure user‑centric validation so AI outputs are accurate, reliable, and meet real‑world application needs.
Requirements
- 3‑7 years of software engineering experience, with a meaningful portion focused on test automation, SDET, or software engineering in test roles.
- Expert‑level Python. You write Python every day, design libraries other engineers use, and apply OOP and clean‑code practices.
- Hands‑on Java experience, enough to read, write, and test Java services, not just touch them.
- Working understanding of the LangGraph or Vercel frameworks: graph state, nodes, edges, tool calls, and how to write evals against agentic flows.
- Demonstrated experience building eval sets for LLM models (this is critical to the role).
- Experience testing across multiple client surfaces: iOS apps, plugins, and Chrome extensions.
- Hands‑on experience building automated test suites with frameworks such as pytest, Selenium, Playwright, Cypress, or similar.
- Proven experience integrating test automation into CI/CD systems (GitHub Actions, Jenkins, CircleCI, GitLab CI, or similar).
- Strong skills in data manipulation, test data preparation, and SQL.
- Bachelor's degree or higher in Computer Science, Engineering, or a related field.
Strongly Preferred
- Experience with Azure (our primary cloud) and containerization (Docker).
- Experience testing RAG pipelines, agentic workflows, or multi‑step tool‑calling systems.
Benefits
Position Location
Santa Monica, CA (on‑site).
Compensation
The base pay for this position is $190,000‑250,000. A bonus will be provided as part of the compensation package, in addition to a full range of medical, financial, and/or other benefits.
TWG is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
#J-18808-Ljbffr- Obsidian is seeking highly skilled Image Generation Experts & Prompt Engineers to support a leading AI research lab's model training and data quality efforts. Ideal candidates will have a strong focus on crafting precise prompts and generating high-fidelity images using...SeniorFull timePart time
- writer.com, based in New York City, is looking for a Software Quality Engineer who will ensure top quality for AI applications. This hybrid role involves defining... ..., with experience in Python/Typescript and modern testing frameworks. Competitive compensation and benefits...SeniorFlexible hours
$190.8k - $267.1k
...Senior Software Engineer, Android Test Engineering 5 days ago Be among the first 25 applicants Reddit is a community of communities. It’s built on... ..., transcribed and summarized by artificial intelligence (AI). You will have the opportunity to opt out of recording,...SeniorFull timeFor contractorsWork experience placementFlexible hours$154.7k - $265.2k
Integral Ad Science, Inc. is seeking a Staff Product Manager for Signal Dashboards and AI, focused on driving marketing spend optimization through compelling insights. The role requires 10+ years of experience in product management, preferably in analytics platforms, balancing...Senior- Integral Ad Science (IAS) is seeking a Staff Product Manager in New York to lead the Signal dashboards roadmap, emphasizing AI capabilities for global agencies and brands. This role involves engaging directly with clients, driving data-driven insights, and collaborating...Senior
- ...Job Description Job Description We are currently looking for a Software Test Engineer who plays key role in supporting all phases of SDLC and also testing trading applications. You will be joining a lean QA team and will have the ability to work across the entire...Senior
$212.27k - $265.46k
...What's exciting about the role The Senior Staff Technical Program Manager is an... ...Verifone's most complex, cross-functional engineering programs. You will own 2-3 large-scale... ...Bachelor's degree in Computer Science, Software Engineering, Chemical Engineering, or related...SeniorContract workWorldwide- A leading technology company is seeking a Senior Staff Software Engineer for its Search Ads Bidding team in New York. The ideal candidate will have extensive experience in C++ programming and software design, along with a strong background in algorithm implementation and...Senior
$172.8k - $297k
...A leading financial technology company in New York is seeking a Senior Staff Software Engineer to lead impactful projects and mentor teams. This role requires 10+ years of programming experience, advanced knowledge of distributed systems, and strong communication skills...Senior- ...FactSet Research Systems Inc. is seeking an experienced software engineer to design and implement scalable solutions that improve operational... ...full stack development using Python, as well as familiarity with AI frameworks. Strong communication and collaboration skills are...Senior
- A leading tech company in New York is seeking a Senior Staff Backend Software Engineer to lead architecture for high-throughput systems. The successful candidate will mentor engineering staff, drive consistency in API guidelines, and ensure scalable architecture. Applicants...Senior
- What’s Exciting About The Role The Senior Staff Technical Program Manager is an executive-facing... ...'s most complex, cross-functional engineering programs. You will own 2-3 large-scale... ...Bachelor's degree in Computer Science, Software Engineering, Chemical Engineering, or related...SeniorContract work
- ...developers and enterprises who are building AI systems to power magical experiences like... .... Cohere is a team of researchers, engineers, designers, and more, who are passionate... ...Engineer" role. As a Member of Technical Staff, Applied ML, you will: # Work directly...SeniorFull timeWork at officeRemote workFlexible hours
- A leading accounting firm in New Jersey is seeking a skilled Tax Senior to manage client engagements and lead a small team of professionals. The successful candidate will have over 4 years of public accounting experience and strong technical and project management skills...Senior
$281k - $356k
...need an experienced, technically hands‑on engineering manager with strong quantitative skills... ...this hybrid role, you will report to a Senior Staff manager. You will: Develop tools for... ...methods including hypothesis testing, experiment design, and sampling Familiarity...SeniorFull timeWork experience placementRemote work$168k - $208k
...Senior Staff Total Rewards Partner Are you a total rewards leader who thrives in building... ...single, real-time data set and embedded AI, Ridgeline helps firms automate complexity... ...Innovation Leader," and by The Software Report as a "Top 100 Software Company."...SeniorLocal area- ...Arya Health is looking for a Senior Software Engineer in AI to enhance its workforce platform using cutting-edge AI tools. In this role, you will design AI-powered workflows, integrate LLMs, and build reliable APIs that improve healthcare interactions. You will collaborate...Senior
$160k - $170k
...You get to focus on your patients - and have work/life balance! Senior Staff Veterinarians possess advanced community medicine skills and a... ...owners and dispensing medications. Document exam findings, test results and treatment recommendations according to ASPCA guidelines...SeniorWork experience placementFlexible hoursShift work$150k - $200k
...Hazel, located in New York City, is looking for a Software Engineer to join the founding team. You will work on building the action layer of Hazel, connecting to systems like Shopify and Klaviyo. The ideal candidate has experience shipping production LLM features and ownership...Senior- ...agenda relating to antitrust, economics and business law. The Senior Staff Associate II will be responsible for independently designing and... ....g. economics, business, public policy, government, technology/AI, media/journalism, or other related field). 10+ years of relevant...Senior
- ...dba ZT Systems is seeking a visionary Sr. Staff Software Developer with 8 years of experience to lead the development of manufacturing test software. The role entails defining... ...candidates will have a Bachelor's degree in engineering or related fields, with strong skills in...Senior
- ...A forward-thinking technology company is seeking a Senior Software Engineer to design and scale intelligent systems for their innovative platform. This hybrid role requires significant experience in AI workflow automation and building production software. Candidates should...Senior
- ...The BayTech Group is seeking experienced software engineers to join their AI Engineering team in New York City. You will be involved in building real-world GenAI products that create measurable business impact across the enterprise. This role requires strong backgrounds...Senior
- ...FanDuel is looking for an AI Software Engineer to join our Tech Transformation team in New York. The role involves building innovative solutions in the fast-paced sports-betting environment, where you will work on multi-disciplinary teams to enhance customer experiences...Senior
- ...A forward-thinking tech startup is looking for a Senior Software Engineer, AI, to design and develop core AI technology. In this remote-friendly position, you will collaborate with cross-functional teams and drive initiatives from concept to completion. The ideal candidate...SeniorRemote work
- ...Job Description MANHATTAN & UPSTATE SENIOR HOUSEKEEPER New York, NY (Manhattan & Dutchess County) Candidates must have their... ...Concierge and collaborates with other traveling household staff. The Heart of the Role This is more than a cleaning position...SeniorFull timeSeasonal workLive out2 days per week
- ...Cloaked is seeking a Senior Software Engineer to lead the development of Call Guard, an AI-driven call screening product that enhances user privacy. This role involves owning the technical roadmap and architecture, optimizing voice AI stacks, and building defenses against...Senior
$16 per hour
A leading senior living provider in New York seeks a compassionate Housekeeper for day shifts. Your role includes maintaining cleanliness in various areas, providing excellent customer service, and managing supplies. Ideal candidates have experience in hospitality or healthcare...SeniorHourly payFlexible hoursDay shift$180k - $264k
...A leading AI solutions firm in New York is looking for a Senior Software Engineer, Forward Deployed Engineer, to develop AI-powered solutions. This hybrid role requires collaboration with clients and teams, focusing on optimizing operations and building scalable systems...Senior- ...General on Sexual Violence in Conflict (OSRSG-SVC) in New York. The Senior Staff Assistant reports directly to the Special Representative of the... ...in different duty stations under conditions established in ST/AI/2023/3 on Mobility, as may be amended or revised. This...SeniorFull timeFixed term contractWork experience placementWork at officeOverseas
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior / Staff Software AI Test Engineer, AI Engineering. Be the first to apply!
- senior software test automation engineer New York, NY
- qa automation engineer New York, NY
- qa engineer New York, NY
- software test engineer New York, NY
- senior automation test engineer New York, NY
- junior software test automation engineer New York, NY
- qa engineering manager New York, NY
- qa test engineer New York, NY
- mobile qa automation engineer New York, NY
- lead software test engineer New York, NY


