Archer Data Scientist
Archer Integrated Risk Management
Archer is a leading provider of integrated risk management (IRM) solutions that enable customers to improve strategic decision-making and operational resilience with a modern technology platform that supports qualitative and quantitative analysis driven by both business and IT impacts. As true pioneers in GRC software, Archer remains solely dedicated to helping customers manage risk and compliance domains, from traditional operational risk to emerging issues such as ESG. With over 20 years in the risk management industry, the Archer customer base represents one of the largest pure risk management communities globally, with more than 1,200 customers including more than 50% of the Fortune 500.
Data Scientist – LLM & Data Pipeline Engineering (LegalTech / RegTech AI)
Overview:
We are seeking an experienced Data Scientist with a strong background in AI model integration, data pipeline development, and knowledge base (KB) engineering to support our next-generation LegalTech / RegTech AI platform.
This role blends applied machine learning, data engineering, and software development, focusing on building scalable pipelines that connect large language models (LLMs) to structured and unstructured data through retrieval-augmented generation (RAG) and vector database architectures.
The ideal candidate is passionate about operationalizing AI — from training and fine-tuning models to deploying intelligent retrieval systems in AWS cloud environments.
Key Responsibilities
1. AI Model Integration & Development
- Design, train, and evaluate LLM-based pipelines for document understanding, obligation extraction, and regulatory reasoning.
- Implement and optimize RAG architectures, combining LLMs with vector databases for semantic retrieval.
- Develop and maintain model fine-tuning workflows, embedding generation, and knowledge distillation.
- Collaborate with MLOps teams to integrate AI models into production-ready APIs and services on AWS.
- Measure and improve model precision, recall, latency, and interpretability.
1.5 Agentic and MCP Knowledge Integration:
- Design and maintain agentic multi-component processes (MCPs) that enable context-aware reasoning across multiple data sources and agents.
- Implement AI agents capable of dynamic tool use, autonomous task decomposition, and multi-context knowledge retrieval.
- Develop pipelines that support agent memory, self-reflection, and knowledge synthesis across distributed systems and knowledge bases.
- Collaborate with engineering teams to integrate MCP-driven agents with retrieval, analytics, and workflow orchestration layers, ensuring compliance with regulatory reasoning frameworks.
2. Data Pipeline Engineering
- Build and manage end-to-end data pipelines for ingestion, transformation, embedding, and indexing of legal and compliance data.
- Orchestrate data workflows leveraging AWS services (e.g., S3, Lambda, Glue, SageMaker, Step Functions, RDS).
- Develop scalable ETL/ELT processes to feed both relational (PostgreSQL) and vector databases (e.g., Pinecone, FAISS, Weaviate, Elastic Vector Search).
- Ensure data lineage, reproducibility, and version control across AI and analytics pipelines.
- Automate retraining and evaluation pipelines for continuous learning from user feedback.
3. Knowledge Base & Information Retrieval
- Architect and maintain intelligent Knowledge Bases (KBs) to support AI-driven search, summarization, and compliance reasoning.
- Implement advanced retrieval techniques using Elasticsearch/Elastic Vector Search and embedding-based retrieval.
- Align KB structures with business ontologies and regulatory taxonomies to support explainable AI outputs.
- Collaborate with domain experts and PMs to enrich KB metadata and enhance model context relevance.
4. AWS & Deployment
- Deploy and scale AI pipelines using AWS services such as SageMaker, Lambda, ECS/EKS, API Gateway, and CloudFormation/Terraform.
- Implement model and data monitoring solutions for drift detection, latency management, and cost optimization.
- Collaborate with DevOps to maintain secure, reliable, and compliant cloud environments.
5. Cross-Functional Collaboration
- Partner with engineering, product, and compliance teams to align AI models with regulatory and data governance requirements.
- Work closely with QA and Professional Services teams to validate AI outputs and improve client-facing performance.
- Document architectures, experiment results, and data flows to ensure transparency and reproducibility.
Preferred Experience
- Experience building AI products for LegalTech, RegTech, or compliance automation.
- Familiarity with agentic AI frameworks (e.g., OpenAI MCP, CrewAI, LangGraph, or AutoGen).
- Background in document intelligence systems, multi-agent orchestration, or knowledge graph integration.
- Experience with LangChain, LlamaIndex, or similar frameworks for RAG orchestration.
- Hands-on knowledge of MLOps tools and data versioning (DVC, MLflow, Weights & Biases).
- Understanding of governance, interpretability, and ethical AI.
Qualifications
- 5+ years of experience in data science, ML engineering, or AI-driven software development.
- Strong programming skills in Python (NumPy, Pandas, PyTorch/TensorFlow, LangChain, or equivalent).
- Experience with vector databases and retrieval systems (Pinecone, FAISS, Weaviate, Qdrant, or Elastic Vector Search).
- Hands-on experience with RAG pipelines, embedding models, and LLM orchestration (OpenAI, Bedrock, Hugging Face, etc.).
- Solid understanding of data pipelines, ETL frameworks, and cloud-native deployment on AWS.
- Familiarity with Elasticsearch, PostgreSQL, and API integration patterns.
- Knowledge of ML lifecycle management, including model training, evaluation, and monitoring.
Soft Skills
- Strong problem-solving and system design capabilities.
- Excellent communication skills for cross-disciplinary collaboration.
- Passion for structured documentation, reproducibility, and experimentation.
- Adaptable mindset with focus on performance, scalability, and reliability.
Success Indicators
- Scalable and well-documented RAG pipelines supporting production AI workloads.
- High model accuracy and retrieval quality, with low latency.
- Reliable data flow from ingestion to inference with minimal manual intervention.
- Increased explainability and