Archer Data Scientist
emergemarket.com
Remote - California 123 Main St Livermore, CA 94551, USA Archer is a leading provider of integrated risk management (IRM) solutions that enable customers to improve strategic decision-making and operational resilience with a modern technology platform that supports qualitative and quantitative analysis driven by both business and IT impacts. As true pioneers in GRC software, Archer remains solely dedicated to helping customers manage risk and compliance domains, from traditional operational risk to emerging issues such as ESG. With over 20 years in the risk management industry, the Archer customer base represents one of the largest pure risk management communities globally, with more than 1,200 customers including more than 50% of the Fortune 500. Learn more at . Data Scientist – LLM & Data Pipeline Engineering (LegalTech / RegTech AI) Overview We are seeking an experienced Data Scientist with a strong background in AI model integration, data pipeline development, and knowledge base (KB) engineering to support our next-generation LegalTech / RegTech AI platform. This role blends applied machine learning , data engineering , and software development , focusing on building scalable pipelines that connect large language models (LLMs) to structured and unstructured data through retrieval-augmented generation (RAG) and vector database architectures. The ideal candidate is passionate about operationalizing AI — from training and fine-tuning models to deploying intelligent retrieval systems in AWS cloud environments. Key Responsibilities Design, train, and evaluate LLM-based pipelines for document understanding, obligation extraction, and regulatory reasoning. Implement and optimize RAG architectures , combining LLMs with vector databases for semantic retrieval. Develop and maintain model fine-tuning workflows, embedding generation, and knowledge distillation. Collaborate with ML Ops teams to integrate AI models into production-ready APIs and services on AWS . Measure and improve model precision, recall, latency, and interpretability. 1.5 Agentic and MCP Knowledge Integration Design and maintain agentic multi-component processes (MCPs) that enable context-aware reasoning across multiple data sources and agents. Implement AI agents capable of dynamic tool use, autonomous task decomposition, and multi-context knowledge retrieval. Develop pipelines that support agent memory , self-reflection , and knowledge synthesis across distributed systems and knowledge bases. Collaborate with engineering teams to integrate MCP-driven agents with retrieval, analytics, and workflow orchestration layers, ensuring compliance with regulatory reasoning frameworks. Build and manage end-to-end data pipelines for ingestion, transformation, embedding, and indexing of legal and compliance data. Orchestrate data workflows leveraging AWS services (e.g., S3 , Lambda , Glue , SageMaker , Step Functions , RDS ). Develop scalable ETL/ELT processes to feed both relational ( PostgreSQL ) and vector databases (e.g., Pinecone , FAISS , Weaviate , Elastic Vector Search ). Ensure data lineage, reproducibility, and version control across AI and analytics pipelines. Automate retraining and evaluation pipelines for continuous learning from user feedback. 3. Knowledge Base & Information Retrieval Architect and maintain intelligent Knowledge Bases (KBs) to support AI-driven search, summarization, and compliance reasoning. Implement advanced retrieval techniques using ElasticSearch / Elastic Vector Search and embedding-based retrieval. Align KB structures with business ontologies and regulatory taxonomies to support explainable AI outputs. Collaborate with domain experts and PMs to enrich KB metadata and enhance model context relevance. 4. AWS & Deployment Deploy and scale AI pipelines using AWS services such as SageMaker , Lambda , ECS/EKS , API Gateway , and CloudFormation/Terraform . Implement model and data monitoring solutions for drift detection, latency management, and cost optimization. Collaborate with DevOps to maintain secure, reliable, and compliant cloud environments. 5. Cross-Functional Collaboration Partner with engineering, product, and compliance teams to align AI models with regulatory and data governance requirements. Work closely with QA and Professional Services teams to validate AI outputs and improve client-facing performance. Document architectures, experiment results, and data flows to ensure transparency and reproducibility. Preferred Experience Experience building AI products for LegalTech, RegTech, or compliance automation . Familiarity with agentic AI frameworks (e.g., OpenAI MCP, CrewAI, LangGraph, or AutoGen). Background in document intelligence systems , multi-agent orchestration , or knowledge graph integration . Experience with LangChain , LlamaIndex , or similar frameworks for RAG orchestration. Hands-on knowledge of MLOps tools and data versioning (DVC, MLflow, Weights & Biases). Understanding of governance, interpretability , and ethical AI Qualifications 5+ years of experience in data science, ML engineering, or AI-driven software development . Strong programming skills in Python (NumPy, Pandas, PyTorch/TensorFlow, LangChain, or equivalent). Experience with vector databases and retrieval systems (Pinecone, FAISS, Weaviate, Qdrant, or Elastic Vector Search). Hands-on experience with RAG pipelines , embedding models , and LLM orchestration (OpenAI, Bedrock, Hugging Face, etc.). Solid understanding of data pipelines , ETL frameworks , and cloud-native deployment on AWS . Familiarity with Elasticsearch , PostgreSQL , and API integration patterns. Knowledge of ML lifecycle management , including model training, evaluation, and monitoring. Soft Skills Strong problem-solving and system design capabilities. Excellent communication skills for cross-disciplinary collaboration. Passion for structured documentation, reproducibility, and experimentation. Adaptable mindset with focus on performance, scalability, and reliability. Success Indicators Scalable and well-documented RAG pipelines supporting production of AI workloads. High model accuracy, retrievability, and latency efficiency. Reliable data flow from ingestion to inference with minimal manual intervention. Increased explainability and compliance assurance across AI outputs. Additional Information About Archer’s Culture and Work Environment: Our people, team collaboration and dynamic leadership is the centerpiece of our great culture and the reason for Archer’s 25 years of success. Over the years, many companies and global organizations have been faced with tough decisions. Layoffs, reorganizations, acquisitions, and mergers. Yet, throughout these challenging times, Archer has exemplified strong innovation and growth and a commitment to our employees. Why is this possible? Collaboration is the key to our success. It inspires great innovation and innovative ideas. It is why Archer's is a household name in the GRC space. Companies, from F500 – F1000, come to Archer first - for our thought leadership and for our ability to meet customers where they are. As we continue to grow and evolve, our focus will remain the same: continue innovating, support our customers and employees and continue driving the risk management industry to new levels. Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities and activities may change at any time with or without notice at management discretion based on business need. Archer is committed to the principle of equal employment opportunity for all employees and applicants for employment and to providing employees with a work environment free of discrimination and harassment. All employment decisions at Archer are based on business needs, job requirements and individual qualifications, without regard to race, color, religion, national origin, sex (including pregnancy), age, disability, sexual orientation, gender identity and/or expression, marital, civil union or domestic partnership status, protected veteran status, genetic information, or any other characteristics protected by federal, state or local laws. Archer will not tolerate discrimination or harassment based on any of these characteristics. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training. All Archer employees are expected to support this policy and contribute to an environment of equal opportunity. If you need a reasonable accommodation during the application process, please contact View email address on click.appcast.io. All employees must be legally authorized to work in Country they are applying for. Archer and its approved consultants will never ask you for a fee to process or consider your application for a career with Archer. Archer reserves the right to amend or withdraw any job posting at any time, including prior to the advertised closing date. Pay Transparency Notice: We’re committed to fair and transparent pay practices. In line with state pay transparency laws, the salary range for this role is available upon request. Please contact our Talent Acquisition team at View email address on click.appcast.io for the range and related compensation details. Actual pay may vary based on location, experience, skills, and internal equity. Equal Opportunity Employer This employer is required to notify all applicants of their rights pursuant to federal employment laws.For further information, please review the Know Your Rights notice from the Department of Labor. #J-18808-Ljbffr emergemarket.com
- ...this role, you will own the technical roadmap for an AI-native Data Science organization that builds the predictive, prescriptive,... ...and AI architecture and technical strategy for a team of data scientists that will build data science and AI capabilities to optimize operations...SuggestedWeekly payMinimum wageLocal areaFlexible hours
- ...Overview: Job Summary: The Senior Data Engineer will be responsible for designing, building, and maintaining robust data... ...organization and accessibility. Collaborate with data analysts, scientists, and business teams to understand data requirements and deliver...Suggested
- ...computer science, applied math, physics, engineering, statistics, economics or related field. 3+ years of industry experience in Data Engineering 3+ years of work experience including hands-on technical experience with SQL, Python, PySpark, Jupyter Notebook,...SuggestedWork experience placement
$215k
...Scion Staffing has been engaged to conduct a search for a Senior Data Engineer for an innovative and rapidly growing transportation and logistics organization focused on modernizing enterprise data and analytics capabilities across multiple business units. This is a hybrid...SuggestedTemporary workInterim roleImmediate start- ...highly skilled Senior Azure Databricks (ADB) Developer to join our Data Engineering team. This role involves developing large-scale... ...Stakeholder Communication: Interface with product owners, data scientists, and business analysts to translate data requirements into production...Suggested
- ...Senior Data Engineer Location: Pleasanton, California (hybrid work) Role Overview As a Senior/Lead Data Engineer, you will lead the design, development, and ownership of core data infrastructure—from pipelines to storage to data products. You'll be a strategic...
$125k - $145k
...Job Title: Data Scientist Location: Pleasanton,CA Duration: Fulltime Skills: Python Salary: $125K - $145K/Year Must Have Technical/Functional Skills: Hands-on experience on: Programming Languages Strong Python familiarity...Full time- ...Overview: We are seeking a Senior Data Scientist with strong expertise in developing marketing models and Natural Language Processing (NLP) solutions. This role will focus on leveraging data-driven insights to optimize marketing strategies and enhance decision...
$115k - $175k
...sciences industry , committed to making a positive impact on its customers, employees, and communities. The Role A Senior Data Engineer who can lead the design and implementation of our next-generation Data Lakehouse. In this role, you will be designing and...Work at officeLocal areaRemote workWork from homeFlexible hours3 days per week- ...Job Summary: This individual contributor is primarily responsible for designing and developing data pipelines and automation for data acquisition and ingestion of raw data from multiple data sources and data formats by transforming, cleansing, and storing data...
$149k - $187k
...skillset that will accelerate their careers. Work, Play and Grow at BlackLine! Make Your Mark: We are looking for a Data Scientist to join our Product & Technology organization and play a pivotal role in transforming data into actionable intelligence and...Temporary workWork experience placementWork at officeShift work3 days per week- ...Business consulting services. We are in search of a highly motivated candidate to join our talented Team. Job Title: Data Scientist Location(s): Pleasanton, CA. About the Role: We are hiring a Data Scientist with strong data engineering and...Remote work
- ...Data Scientist VentureSoft Global, Inc. is a technology services company specializing in software development and IT consulting. We offer a range of services including custom software development, mobile app development, cloud solutions, Data, AI/ML Engineering and...Remote workFlexible hours
- ...Job Details: Data Scientist Location: Pleasanton, CA Top Skill: Qualifications for Data Scientist Strong problem solving skills with an emphasis on product development. Experience using statistical computer languages (R, Python, SQL, etc...
- ...on the World! Job Description We have multiple openings for a Data Science Engineer with a background in applied machine learning... ...including cybersecurity experts, power systems engineers, and computer scientists. Support building research prototypes and capabilities for...Minimum wageLocal areaWork from homeRelocation packageFlexible hours1 day per week
$146.34k - $222.56k
...impact. Job Description We have multiple openings for a Data Science Engineer with a background in applied machine learning... ...cybersecurity experts, power systems engineers, and computer scientists. Support building research prototypes and capabilities for critical...Minimum wageFor contractorsLocal areaWork from homeRelocation packageFlexible hours1 day per week- ...Overview: Role Summary: Looking for a Senior Data Scientist with strong experience in marketing models and NLP to support customer analytics and marketing optimization initiatives. Key Responsibilities: Build and optimize marketing models (segmentation...
- A technology services company is seeking a Data Analytics Engineer to develop and maintain data pipelines and visualizations to drive organizational insights. The role requires a Master's degree and 5+ years of experience in data analytics. The candidate must be proficient...Flexible hours
- ...development and IT consulting. We offer a range of services including custom software development, mobile app development, cloud solutions, Data, AI/ML Engineering and digital transformation services. The company prides itself on delivering innovative and tailored solutions to...Remote workFlexible hours
- Apex Systems is seeking a Senior Data Scientist to support Principal Investigators in Pleasanton, California. This remote role involves translating complex research into machine learning workflows and developing prototypes using tools like Kubernetes and Docker. The ideal...Remote work
$175.5k - $263.3k
...Workday to make better people decisions by pioneering innovative people analytics and insights. About the Role As a People Analytics Data Scientist, you will deliver analytics and insights related to our people, culture, and leadership, working closely with HR Partners,...Work at officeRemote workFlexible hours- Data Piper in Pleasanton, California is seeking a full-time developer experienced in Palantir Foundry. The role involves architecting solutions, collaborating across global teams, and developing frameworks for client-specific offerings. The ideal candidate will have hands...Full time
$87.8k - $131.6k
...rotation will be required to address critical infrastructure events outside business hours. Occasional travel to support other OpenText data centers. Compensation At OpenText, we offer a thoughtfully designed benefits package that supports your physical, emotional, and...Local areaRemote work- A fast-growing organization in life sciences is seeking a Senior Data Engineer to lead the design and implementation of a scalable Data Lakehouse. You will build real-time and batch data ingestion pipelines and develop complex ETL workflows to optimize large-scale data...
- Veeva Systems, Inc. is looking for a Senior Data Engineer to oversee the design and implementation of our next-generation Data Lakehouse. You will be responsible for creating efficient data ingestion and storage solutions for large datasets, optimizing for performance...Work at officeWork from homeFlexible hours
- The American Physical Society in Livermore, California, is seeking a Data Science Engineer to develop and apply advanced data science algorithms for cybersecurity and power systems. The ideal candidate will have expertise in machine learning and experience in designing...Flexible hours
- Scion Staffing is seeking a Senior Data Engineer / AI Data Platform Engineer for a dynamic organization in Pleasanton, California. This hybrid opportunity invites candidates with strong expertise in Databricks, SQL, Python, and PySpark to support enterprise cloud data...
$190.1k - $285.1k
...Sr Software Development Engineer in Pleasanton, CA to architect and deliver platform capabilities for their Apache Iceberg zero-copy data lake solution. The role requires deep experience in software engineering, specifically with Java, distributed systems, and large-...$20 - $50 per unit
...candidate will have 2-6 years of experience in mechanical engineering and the ability to collaborate with multidisciplinary teams. A competitive rate of $20-$50 per task is offered, reflecting the role’s impact in high-quality AI training data. #J-18808-Ljbffr Micro1Remote job- VentureSoft in Pleasanton, United States, is looking for a Data Analytics Engineer to build and maintain data solutions that enable organizational insights. The ideal candidate will develop KPI reports, partner with engineers on data architecture, and foster data-driven...Remote job
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Archer Data Scientist. Be the first to apply!
- entry level data scientist remote Livermore, CA
- data cabling installation Livermore, CA
- data recovery Livermore, CA
- data capturer Livermore, CA
- sap master data Livermore, CA
- data loss prevention engineer Livermore, CA
- data technician Livermore, CA
- data analysis part time Livermore, CA
- data operator Livermore, CA
- data cabling Livermore, CA

