Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Archer Data Scientist

emergemarket.com

Remote - California 123 Main St Livermore, CA 94551, USA Archer is a leading provider of integrated risk management (IRM) solutions that enable customers to improve strategic decision-making and operational resilience with a modern technology platform that supports qualitative and quantitative analysis driven by both business and IT impacts. As true pioneers in GRC software, Archer remains solely dedicated to helping customers manage risk and compliance domains, from traditional operational risk to emerging issues such as ESG. With over 20 years in the risk management industry, the Archer customer base represents one of the largest pure risk management communities globally, with more than 1,200 customers including more than 50% of the Fortune 500. Learn more at . Data Scientist – LLM & Data Pipeline Engineering (LegalTech / RegTech AI) Overview We are seeking an experienced Data Scientist with a strong background in AI model integration, data pipeline development, and knowledge base (KB) engineering to support our next-generation LegalTech / RegTech AI platform. This role blends applied machine learning , data engineering , and software development , focusing on building scalable pipelines that connect large language models (LLMs) to structured and unstructured data through retrieval-augmented generation (RAG) and vector database architectures. The ideal candidate is passionate about operationalizing AI — from training and fine-tuning models to deploying intelligent retrieval systems in AWS cloud environments. Key Responsibilities Design, train, and evaluate LLM-based pipelines for document understanding, obligation extraction, and regulatory reasoning. Implement and optimize RAG architectures , combining LLMs with vector databases for semantic retrieval. Develop and maintain model fine-tuning workflows, embedding generation, and knowledge distillation. Collaborate with ML Ops teams to integrate AI models into production-ready APIs and services on AWS . Measure and improve model precision, recall, latency, and interpretability. 1.5 Agentic and MCP Knowledge Integration Design and maintain agentic multi-component processes (MCPs) that enable context-aware reasoning across multiple data sources and agents. Implement AI agents capable of dynamic tool use, autonomous task decomposition, and multi-context knowledge retrieval. Develop pipelines that support agent memory , self-reflection , and knowledge synthesis across distributed systems and knowledge bases. Collaborate with engineering teams to integrate MCP-driven agents with retrieval, analytics, and workflow orchestration layers, ensuring compliance with regulatory reasoning frameworks. Build and manage end-to-end data pipelines for ingestion, transformation, embedding, and indexing of legal and compliance data. Orchestrate data workflows leveraging AWS services (e.g., S3 , Lambda , Glue , SageMaker , Step Functions , RDS ). Develop scalable ETL/ELT processes to feed both relational ( PostgreSQL ) and vector databases (e.g., Pinecone , FAISS , Weaviate , Elastic Vector Search ). Ensure data lineage, reproducibility, and version control across AI and analytics pipelines. Automate retraining and evaluation pipelines for continuous learning from user feedback. 3. Knowledge Base & Information Retrieval Architect and maintain intelligent Knowledge Bases (KBs) to support AI-driven search, summarization, and compliance reasoning. Implement advanced retrieval techniques using ElasticSearch / Elastic Vector Search and embedding-based retrieval. Align KB structures with business ontologies and regulatory taxonomies to support explainable AI outputs. Collaborate with domain experts and PMs to enrich KB metadata and enhance model context relevance. 4. AWS & Deployment Deploy and scale AI pipelines using AWS services such as SageMaker , Lambda , ECS/EKS , API Gateway , and CloudFormation/Terraform . Implement model and data monitoring solutions for drift detection, latency management, and cost optimization. Collaborate with DevOps to maintain secure, reliable, and compliant cloud environments. 5. Cross-Functional Collaboration Partner with engineering, product, and compliance teams to align AI models with regulatory and data governance requirements. Work closely with QA and Professional Services teams to validate AI outputs and improve client-facing performance. Document architectures, experiment results, and data flows to ensure transparency and reproducibility. Preferred Experience Experience building AI products for LegalTech, RegTech, or compliance automation . Familiarity with agentic AI frameworks (e.g., OpenAI MCP, CrewAI, LangGraph, or AutoGen). Background in document intelligence systems , multi-agent orchestration , or knowledge graph integration . Experience with LangChain , LlamaIndex , or similar frameworks for RAG orchestration. Hands-on knowledge of MLOps tools and data versioning (DVC, MLflow, Weights & Biases). Understanding of governance, interpretability , and ethical AI Qualifications 5+ years of experience in data science, ML engineering, or AI-driven software development . Strong programming skills in Python (NumPy, Pandas, PyTorch/TensorFlow, LangChain, or equivalent). Experience with vector databases and retrieval systems (Pinecone, FAISS, Weaviate, Qdrant, or Elastic Vector Search). Hands-on experience with RAG pipelines , embedding models , and LLM orchestration (OpenAI, Bedrock, Hugging Face, etc.). Solid understanding of data pipelines , ETL frameworks , and cloud-native deployment on AWS . Familiarity with Elasticsearch , PostgreSQL , and API integration patterns. Knowledge of ML lifecycle management , including model training, evaluation, and monitoring. Soft Skills Strong problem-solving and system design capabilities. Excellent communication skills for cross-disciplinary collaboration. Passion for structured documentation, reproducibility, and experimentation. Adaptable mindset with focus on performance, scalability, and reliability. Success Indicators Scalable and well-documented RAG pipelines supporting production of AI workloads. High model accuracy, retrievability, and latency efficiency. Reliable data flow from ingestion to inference with minimal manual intervention. Increased explainability and compliance assurance across AI outputs. Additional Information About Archer’s Culture and Work Environment: Our people, team collaboration and dynamic leadership is the centerpiece of our great culture and the reason for Archer’s 25 years of success. Over the years, many companies and global organizations have been faced with tough decisions. Layoffs, reorganizations, acquisitions, and mergers. Yet, throughout these challenging times, Archer has exemplified strong innovation and growth and a commitment to our employees. Why is this possible? Collaboration is the key to our success. It inspires great innovation and innovative ideas. It is why Archer's is a household name in the GRC space. Companies, from F500 – F1000, come to Archer first - for our thought leadership and for our ability to meet customers where they are. As we continue to grow and evolve, our focus will remain the same: continue innovating, support our customers and employees and continue driving the risk management industry to new levels. Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities and activities may change at any time with or without notice at management discretion based on business need. Archer is committed to the principle of equal employment opportunity for all employees and applicants for employment and to providing employees with a work environment free of discrimination and harassment. All employment decisions at Archer are based on business needs, job requirements and individual qualifications, without regard to race, color, religion, national origin, sex (including pregnancy), age, disability, sexual orientation, gender identity and/or expression, marital, civil union or domestic partnership status, protected veteran status, genetic information, or any other characteristics protected by federal, state or local laws. Archer will not tolerate discrimination or harassment based on any of these characteristics. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training. All Archer employees are expected to support this policy and contribute to an environment of equal opportunity. If you need a reasonable accommodation during the application process, please contact View email address on click.appcast.io. All employees must be legally authorized to work in Country they are applying for. Archer and its approved consultants will never ask you for a fee to process or consider your application for a career with Archer. Archer reserves the right to amend or withdraw any job posting at any time, including prior to the advertised closing date. Pay Transparency Notice: We’re committed to fair and transparent pay practices. In line with state pay transparency laws, the salary range for this role is available upon request. Please contact our Talent Acquisition team at View email address on click.appcast.io for the range and related compensation details. Actual pay may vary based on location, experience, skills, and internal equity. Equal Opportunity Employer This employer is required to notify all applicants of their rights pursuant to federal employment laws.For further information, please review the Know Your Rights notice from the Department of Labor. #J-18808-Ljbffr emergemarket.com

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Archer Data Scientist in Livermore, CA vacancy
  •  ...Archer is a leading provider of integrated risk management (IRM) solutions that enable customers to improve strategic decision-making...  ...including more than 50% of the Fortune 500. Learn more at Data Scientist - LLM & Data Pipeline Engineering (LegalTech / RegTech AI)... 
    Suggested
    Local area

    Archer Technologies

    Livermore, CA
    1 day ago
  •  ...Title: Senior Data Engineer Location: Pleasanton, California (hybrid work) Role Overview: As a Senior/Lead Data Engineer, you will lead the design, development, and ownership of core data infrastructure-from pipelines to storage to data products. You... 
    Suggested

    SnapCode, Inc.

    Pleasanton, CA
    1 day ago
  •  ...computer science, applied math, physics, engineering, statistics, economics or related field. 3+ years of industry experience in Data Engineering 3+ years of work experience including hands-on technical experience with SQL, Python, PySpark, Jupyter Notebook,... 
    Suggested
    Work experience placement

    Apex Informatics

    Pleasanton, CA
    17 hours ago
  • $115k - $175k

     ...sciences industry , committed to making a positive impact on its customers, employees, and communities. The Role A Senior Data Engineer who can lead the design and implementation of our next-generation Data Lakehouse. In this role, you will be designing and... 
    Suggested
    Work at office
    Local area
    Remote work
    Work from home
    Flexible hours
    3 days per week

    Veeva Systems

    Pleasanton, CA
    17 hours ago
  •  ...Job Title : Data Engineer Location : Pleasanton CA Type : Contract Long Term Hybrid (3days onsite) Roles & Responsibilities ~ Bachelor's degree ~5+ years of data manipulation, analysis, etc. ~ Expert level MS Excel ~ Database experience... 
    Suggested
    Contract work

    Perfict Global, Inc.

    Pleasanton, CA
    1 day ago
  •  ...Data Scientist VentureSoft Global, Inc. is a technology services company specializing in software development and IT consulting. We offer a range of services including custom software development, mobile app development, cloud solutions, Data, AI/ML Engineering and... 
    Remote work
    Flexible hours

    Venturesoft

    Pleasanton, CA
    2 days ago
  •  ...Job Details: Data Scientist Location: Pleasanton, CA Top Skill: Qualifications for Data Scientist Strong problem solving skills with an emphasis on product development. Experience using statistical computer languages (R, Python, SQL, etc... 

    Apex Informatics

    Pleasanton, CA
    17 hours ago
  • $146.34k - $222.56k

     ...real impact. Job Description Wehave multiple openings for a Data Science Engineer with a background in applied machine learning...  ...cybersecurity experts, power systems engineers, and computer scientists. Support building research prototypes and capabilities for critical... 
    Minimum wage
    For contractors
    Local area
    Work from home
    Relocation package
    Flexible hours
    1 day per week

    Lawrence Livermore National Laboratory

    Livermore, CA
    4 days ago
  • $146.34k - $222.56k

     ...impact. Job Description We have multiple openings for a Data Science Engineer with a background in applied machine learning...  ...cybersecurity experts, power systems engineers, and computer scientists. Support building research prototypes and capabilities for critical... 
    Minimum wage
    For contractors
    Local area
    Work from home
    Relocation package
    Flexible hours
    1 day per week

    LLNL

    Livermore, CA
    3 days ago
  • $120k - $135k

     ...teamwork, and a commitment to excellence. Plus, we know how to have fun while getting the job done! About the Role   We are seeking a Data Acquisition Engineer to design, build, and scale systems that collect, process, and maintain high-quality external data from the web... 
    Monday to Friday

    Vagaro Inc

    Pleasanton, CA
    4 days ago
  •  ...Title: Data Engineering Power BI Architect Location: Pleasanton, CA 94588(Remote) Duration: 12 Months Responsibilities: Resources should have 10+ years of experience in Power BI with health care knowledge. Having Experience in developing Power... 
    Remote work

    VDart

    Pleasanton, CA
    3 days ago
  •  ...Data Analytics Engineer VentureSoft Global, Inc. is a technology services company specializing in software development and IT consulting. We offer a range of services including custom software development, mobile app development, cloud solutions, Data, AI/ML Engineering... 
    Remote work
    Flexible hours

    Venturesoft

    Pleasanton, CA
    2 days ago
  • Oracle is seeking a Senior Software Engineer to assist in developing software applications tied to AI in Supply Chain Management. In this role, you will define specifications for new projects, develop software solutions, and collaborate with cross-functional teams. The ...

    Oracle

    Pleasanton, CA
    3 days ago
  • $135k - $150k

     ...stakeholders, developing dashboards with Power BI, and managing ETL pipelines. Ideal candidates will have over 7 years of experience in data analysis and strong SQL skills. The position offers a competitive salary from $135,000 to $150,000 plus annual bonuses and... 

    Vagaro Inc

    Pleasanton, CA
    2 days ago
  • A fast-growing organization in life sciences is seeking a Senior Data Engineer to lead the design and implementation of a scalable Data Lakehouse. You will build real-time and batch data ingestion pipelines and develop complex ETL workflows to optimize large-scale data... 

    Veeva Systems

    Pleasanton, CA
    2 days ago
  • Vagaro in Pleasanton, CA is seeking a Data Acquisition Engineer to design and build scalable systems for collecting and processing high-quality external data. Candidates should have 3+ years of experience in web crawling and scraping, along with strong programming skills... 

    Vagaro

    Pleasanton, CA
    17 hours ago
  • $65 - $70 per hour

     ...Data Scientist / Databricks / Onsite in Pleasanton, CA Data Scientist / Databricks / Onsite in Pleasanton, CA This range is provided by Motion Recruitment. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base... 
    16 hours
    Full time
    Contract work
    Temporary work
    Flexible hours

    Motion Recruitment

    Pleasanton, CA
    3 days ago
  • $146.34k - $222.56k

     ...LLNL is a place where your expertise can make a real impact. Job Description We have an opening for an Energy Resilience Data Scientist .This role sits at the intersection of data science, energy systems, and critical infrastructure resilience, helping transform... 
    Minimum wage
    For contractors
    Local area
    Relocation package
    Flexible hours

    Lawrence Livermore National Laboratory

    Livermore, CA
    4 days ago
  •  ...Role: Senior Data Scientist - Marketing & NLP Location: Pleasanton CA We are seeking a Senior Data Scientist with strong expertise in developing marketing analytics models and Natural Language Processing (NLP) solutions. The ideal candidate will leverage... 

    Lorven Technologies

    Pleasanton, CA
    1 day ago
  • $126k - $179.3k

     ...Reliability Analytics and is responsible for developing advanced data science models and industry-leading anomaly detection...  ...part of cross-functional teams, including data engineers, data scientists, technologists, and subject matter experts – this individual will... 
    Work at office
    Remote work

    PG&E Corporation

    Livermore, CA
    1 hour ago
  • $144k

     ...for the delivery of greater than $7B in annual Capital work, encompassing Distribution, Substation & Transmission work. Workplan and Data Management is Portfolio Operations’ centralized data insights and analytics team focused on in-depth assessments of Portfolio and... 
    Work at office
    Remote work
    2 days per week

    PG&E Corporation

    Pleasanton, CA
    4 days ago
  • A rapidly growing biotech organization in Pleasanton, CA, is looking for a Senior Software Engineer to lead the design and implementation of core software for complex systems. The role requires proficiency in C and Python, with at least 3 years of experience in developing...

    N6.com

    Pleasanton, CA
    3 days ago
  • Pacific Gas and Electric Company is seeking a Senior Manager, Workplan Data Platforms & Engineering, responsible for leading the strategy and execution of data management across their electric operations. This role focuses on the architecture and reliability of Workplan... 

    Pacific Gas and Electric Company

    Pleasanton, CA
    3 days ago
  • A technology services company is seeking a Data Analytics Engineer to develop and maintain data pipelines and visualizations to drive organizational insights. The role requires a Master's degree and 5+ years of experience in data analytics. The candidate must be proficient... 
    Flexible hours

    Premier Inn Hotels LLC (UAE)

    Pleasanton, CA
    1 day ago
  • $100k - $150k

     ...technologies to create scalable, secure, and user-friendly applications. As we continue to grow, we’re looking for a skilled SAP Data Migration Engineer (LTMC / SLT / LVM) to join our dynamic team and contribute to our mission of transforming business processes through... 
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Pleasanton, CA
    4 days ago
  • A leading technology firm is seeking an experienced Data Scientist to support their next-generation AI platform. The role involves AI model integration, data pipeline development, and knowledge base engineering with a focus on LegalTech and RegTech. The ideal candidate... 
    Remote job

    emergemarket.com

    Livermore, CA
    4 days ago
  • Workday in Pleasanton, CA, is seeking a Full Stack Engineer to enhance our data engineering team. In this role, you'll develop user-facing solutions and integrate AI capabilities into our HR and Finance workflows. This is an opportunity to make a real impact as Workday... 
    Remote job
    Flexible hours

    HR Tech Job

    Pleasanton, CA
    4 days ago
  • $198.5k - $268.5k

     ...network which means forecasting and territory analytics here are genuinely hard and genuinely consequential. We're hiring a Staff Data Scientist to own the sales analytics function within our Business Insights & Analytics team. You'll report to the Head of Business... 
    Full time

    10X Genomics

    Pleasanton, CA
    8 days ago
  •  ...consulting services. We are in search of a highly motivated candidate to join our talented Team. Job Title: Simulation Developer - Data Scientist (Simulation & Modeling) Location(s): Pleasanton, CA. About the Role: We are seeking a highly skilled and motivated Data... 

    Ampcus, Inc

    Pleasanton, CA
    1 day ago
  • $88.1k - $141k

     ...We're looking for a Data Engineer - United States This role is Hybrid, Dublin Office Data Engineer - Hybrid(Dublin,...  ...technical and non-technical stakeholders. • Partner with data scientists and functional leaders in sales, marketing, and product to deploy... 
    Full time
    Work at office
    Local area

    Cornerstone OnDemand

    Dublin, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Archer Data Scientist. Be the first to apply!