Archer Data Scientist
emergemarket.com
Remote - California
123 Main St Livermore, CA 94551, USA

Archer is a leading provider of integrated risk management (IRM) solutions that enable customers to improve strategic decision-making and operational resilience with a modern technology platform that supports qualitative and quantitative analysis driven by both business and IT impacts. As true pioneers in GRC software, Archer remains solely dedicated to helping customers manage risk and compliance domains, from traditional operational risk to emerging issues such as ESG. With over 20 years in the risk management industry, the Archer customer base represents one of the largest pure risk management communities globally, with more than 1,200 customers including more than 50% of the Fortune 500. Learn more at .

Data Scientist – LLM & Data Pipeline Engineering (LegalTech / RegTech AI)

Overview

We are seeking an experienced Data Scientist with a strong background in AI model integration, data pipeline development, and knowledge base (KB) engineering to support our next-generation LegalTech / RegTech AI platform. This role blends applied machine learning, data engineering, and software development, focusing on building scalable pipelines that connect large language models (LLMs) to structured and unstructured data through retrieval-augmented generation (RAG) and vector database architectures. The ideal candidate is passionate about operationalizing AI, from training and fine-tuning models to deploying intelligent retrieval systems in AWS cloud environments.

Key Responsibilities

- Design, train, and evaluate LLM-based pipelines for document understanding, obligation extraction, and regulatory reasoning.
- Implement and optimize RAG architectures, combining LLMs with vector databases for semantic retrieval.
- Develop and maintain model fine-tuning workflows, embedding generation, and knowledge distillation.
- Collaborate with MLOps teams to integrate AI models into production-ready APIs and services on AWS.
- Measure and improve model precision, recall, latency, and interpretability.

1.5 Agentic and MCP Knowledge Integration

- Design and maintain agentic multi-component processes (MCPs) that enable context-aware reasoning across multiple data sources and agents.
- Implement AI agents capable of dynamic tool use, autonomous task decomposition, and multi-context knowledge retrieval.
- Develop pipelines that support agent memory, self-reflection, and knowledge synthesis across distributed systems and knowledge bases.
- Collaborate with engineering teams to integrate MCP-driven agents with retrieval, analytics, and workflow orchestration layers, ensuring compliance with regulatory reasoning frameworks.

- Build and manage end-to-end data pipelines for ingestion, transformation, embedding, and indexing of legal and compliance data.
- Orchestrate data workflows leveraging AWS services (e.g., S3, Lambda, Glue, SageMaker, Step Functions, RDS).
- Develop scalable ETL/ELT processes to feed both relational (PostgreSQL) and vector databases (e.g., Pinecone, FAISS, Weaviate, Elastic Vector Search).
- Ensure data lineage, reproducibility, and version control across AI and analytics pipelines.
- Automate retraining and evaluation pipelines for continuous learning from user feedback.

3. Knowledge Base & Information Retrieval

- Architect and maintain intelligent Knowledge Bases (KBs) to support AI-driven search, summarization, and compliance reasoning.
- Implement advanced retrieval techniques using Elasticsearch / Elastic Vector Search and embedding-based retrieval.
- Align KB structures with business ontologies and regulatory taxonomies to support explainable AI outputs.
- Collaborate with domain experts and PMs to enrich KB metadata and enhance model context relevance.

4. AWS & Deployment

- Deploy and scale AI pipelines using AWS services such as SageMaker, Lambda, ECS/EKS, API Gateway, and CloudFormation/Terraform.
- Implement model and data monitoring solutions for drift detection, latency management, and cost optimization.
- Collaborate with DevOps to maintain secure, reliable, and compliant cloud environments.

5. Cross-Functional Collaboration

- Partner with engineering, product, and compliance teams to align AI models with regulatory and data governance requirements.
- Work closely with QA and Professional Services teams to validate AI outputs and improve client-facing performance.
- Document architectures, experiment results, and data flows to ensure transparency and reproducibility.

Preferred Experience

- Experience building AI products for LegalTech, RegTech, or compliance automation.
- Familiarity with agentic AI frameworks (e.g., OpenAI MCP, CrewAI, LangGraph, or AutoGen).
- Background in document intelligence systems, multi-agent orchestration, or knowledge graph integration.
- Experience with LangChain, LlamaIndex, or similar frameworks for RAG orchestration.
- Hands-on knowledge of MLOps tools and data versioning (DVC, MLflow, Weights & Biases).
- Understanding of governance, interpretability, and ethical AI.

Qualifications

- 5+ years of experience in data science, ML engineering, or AI-driven software development.
- Strong programming skills in Python (NumPy, Pandas, PyTorch/TensorFlow, LangChain, or equivalent).
- Experience with vector databases and retrieval systems (Pinecone, FAISS, Weaviate, Qdrant, or Elastic Vector Search).
- Hands-on experience with RAG pipelines, embedding models, and LLM orchestration (OpenAI, Bedrock, Hugging Face, etc.).
- Solid understanding of data pipelines, ETL frameworks, and cloud-native deployment on AWS.
- Familiarity with Elasticsearch, PostgreSQL, and API integration patterns.
- Knowledge of ML lifecycle management, including model training, evaluation, and monitoring.

Soft Skills

- Strong problem-solving and system design capabilities.
- Excellent communication skills for cross-disciplinary collaboration.
- Passion for structured documentation, reproducibility, and experimentation.
- Adaptable mindset with a focus on performance, scalability, and reliability.

Success Indicators

- Scalable and well-documented RAG pipelines supporting production AI workloads.
- High model accuracy, retrievability, and latency efficiency.
- Reliable data flow from ingestion to inference with minimal manual intervention.
- Increased explainability and compliance assurance across AI outputs.

Additional Information

About Archer’s Culture and Work Environment:

Our people, team collaboration, and dynamic leadership are the centerpiece of our great culture and the reason for Archer’s 25 years of success. Over the years, many companies and global organizations have faced tough decisions: layoffs, reorganizations, acquisitions, and mergers. Yet throughout these challenging times, Archer has exemplified strong innovation and growth and a commitment to our employees. Why is this possible? Collaboration is the key to our success. It inspires innovation and new ideas. It is why Archer is a household name in the GRC space. Companies, from F500 – F1000, come to Archer first for our thought leadership and for our ability to meet customers where they are. As we continue to grow and evolve, our focus will remain the same: continue innovating, support our customers and employees, and continue driving the risk management industry to new levels.

Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities and activities may change at any time with or without notice at management discretion based on business need.

Archer is committed to the principle of equal employment opportunity for all employees and applicants for employment and to providing employees with a work environment free of discrimination and harassment.
All employment decisions at Archer are based on business needs, job requirements and individual qualifications, without regard to race, color, religion, national origin, sex (including pregnancy), age, disability, sexual orientation, gender identity and/or expression, marital, civil union or domestic partnership status, protected veteran status, genetic information, or any other characteristics protected by federal, state or local laws. Archer will not tolerate discrimination or harassment based on any of these characteristics. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training. All Archer employees are expected to support this policy and contribute to an environment of equal opportunity.

If you need a reasonable accommodation during the application process, please contact Archer at the email address listed on click.appcast.io. All employees must be legally authorized to work in the country they are applying for.

Archer and its approved consultants will never ask you for a fee to process or consider your application for a career with Archer. Archer reserves the right to amend or withdraw any job posting at any time, including prior to the advertised closing date.

Pay Transparency Notice: We’re committed to fair and transparent pay practices. In line with state pay transparency laws, the salary range for this role is available upon request. Please contact our Talent Acquisition team at the email address listed on click.appcast.io for the range and related compensation details. Actual pay may vary based on location, experience, skills, and internal equity.

Equal Opportunity Employer

This employer is required to notify all applicants of their rights pursuant to federal employment laws. For further information, please review the Know Your Rights notice from the Department of Labor.

