Data Scientist II - Big Data R&D, Identity Graph & KYC

$140k - $170k

Socure

About the Role

The Big Data R&D team is responsible for building the core identity graph and entity-resolution capabilities that power Socure’s KYC and compliance products. In this role, you will help develop graph-based algorithms and data pipelines on massive PII datasets, support modelers with high-quality features, and evaluate new data sources that feed our identity and fraud products. You will work closely with senior data scientists and engineers while developing your skills in large‑scale ML, distributed systems, and graph analytics.

What You’ll Do

  • Contribute to the design and implementation of machine learning, data mining, statistical, and graph‑based algorithms to analyze very large datasets for identity verification and anomaly detection.
  • Analyze large datasets to help develop and refine entity‑resolution and identity‑matching algorithms that drive Socure’s KYC and compliance solutions.
  • Build and maintain components of data‑processing pipelines (ETL, feature generation, normalization) using tools such as Spark/PySpark and AWS (e.g., EMR, S3).
  • Support senior data scientists with feature engineering, data exploration, error analysis, and A/B test setup for new models and signals.
  • Help evaluate new third‑party and internal data sources: profile data quality, design offline experiments, and summarize impact on coverage and model performance.
  • Implement and maintain SQL and Python/R code for data extraction, transformation, and validation; contribute to code reviews and basic testing.
  • Provide analytical support to compliance and regulatory product teams, including ad hoc investigations, simple dashboards, and data deep dives.
  • Communicate findings in a clear, structured way to peers and cross‑functional partners (Product, Engineering, Client Analysis), focusing on key insights and trade‑offs.
  • Work effectively in a fast‑paced, cross‑functional environment; demonstrate ownership of well‑scoped tasks and follow through to completion.
What You Bring

  • Master’s degree with 2+ years of experience, or Ph.D. with 1+ years of experience, in a data science or analytics role, or equivalent practical experience.
  • Proficiency in at least one general-purpose programming language used in data science (Python or Scala).
  • Solid experience writing and optimizing SQL for large datasets; comfort working in data lake / warehouse environments.
  • Hands‑on experience with Spark or PySpark and common ML libraries (e.g., scikit‑learn, XGBoost; TensorFlow/PyTorch a plus).
  • Familiarity with UNIX environments and the AWS ecosystem (e.g., EMR, S3); Databricks experience is a plus.
  • Working knowledge of supervised/unsupervised ML and basic statistics (similarity measures, clustering, evaluation metrics).
  • Exposure to graph techniques or graph databases (Neo4j, AWS Neptune, GraphFrames) is a strong plus.
  • Bonus: experience with Elasticsearch or DynamoDB, or with workflow tools such as Airflow for automating data pipelines.
  • Ability to break down loosely defined problems, ask good clarifying questions, and iterate quickly with feedback.

Please note that sponsorship is not available at this time and that you must be located within 45 miles of a talent hub to be considered.

Socure is an equal opportunity employer that values diversity in all its forms within our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

If you need an accommodation during any stage of the application or hiring process, including interview or onboarding support, please reach out to your Socure recruiting partner directly.

Compensation Range: $140K – $170K

Vacancy posted 4 days ago
Similar jobs that could be interesting for you
Based on the Data Scientist II - Big Data R&D, Identity Graph & KYC in San Francisco, CA vacancy
  • $130.05k - $175.95k

    The Onyx Research Data Platform organization represents...  ...investment by GSK R&D and Digital & Tech, designed...  ...experience for GSK’s scientists, engineers, and...  ...requirements. A Data Engineer II is a technical...  ...sexual orientation, gender identity/expression, age, disability... 
    Suggested
    Local area

    GlaxoSmithKline

    San Francisco, CA
    5 days ago
  •  ...about — in real time, at scale, across every screen. Our data exists with the consent of over a billion people,...  ...because it’s the most culturally relevant.  As a Data Scientist on Samba's Knowledge Graph & Identity team in Warsaw, you will own end-to-end delivery of significant... 
    Suggested

    Samba

    San Francisco, CA
    a month ago
  • $162k - $258.75k

     ...About the Principal Data Scientist at Headspace We are looking for an innovative, strategic...  ...preferred). ~7+ years of expertise in big data technologies (e.g., Redshift, S3, Databricks...  ...race, color, religion, gender, gender identity, gender expression, sexual orientation,... 
    Big data
    Full time
    Currently hiring
    Work at office
    Local area
    Remote work
    Shift work
    3 days per week

    Human Ventures, LLC.

    San Francisco, CA
    3 days ago
  • $148.5k - $260.1k

     ...the next-generation Enterprise Knowledge Graph platform to power AI-driven experiences,...  ...applications, semantic search, enterprise data discovery, and intelligent decision-making...  ...sexual orientation, gender expression or identity, transgender status, age, disability,... 
    Suggested
    Contract work

    SwiftCruit

    San Francisco, CA
    5 days ago
  • Data Engineer II - QuantumBlack, AI by McKinsey Mar 09, 2026 Your Impact As a Data Engineer...  ...cross‑functional Agile teams with Data Scientists, Machine Learning Engineers, Designers...  ...employment without regard to sex, gender identity, sexual orientation, race, color,... 
    Suggested
    Apprenticeship

    McKinsey & Company

    San Francisco, CA
    2 days ago
  • $112.5k - $150k

    Analytics Engineer As the Analytics Engineer II, you will: Build and improve data models that enable faster, more reliable decision‑making across Product...  ...sex, national origin, age, sexual orientation, gender identity, veteran status, disability or genetics. Qualified... 
    Work from home
    Home office

    Earnest

    San Francisco, CA
    3 days ago
  • $130k - $196.5k

    Senior Software Engineer - Big Data...  ...privacy, data ethics, and foundational identity, LiveRamp is setting the new standard for...  ...Identity Engineering team maintain the graph processing engine that connects the... 
    Big data
    Work at office
    Work from home
    Worldwide
    Flexible hours
    Night shift

    LiveRamp

    San Francisco, CA
    5 days ago
  • A leading data collaboration company in San Francisco is seeking a Senior Software Engineer to join their Identity Engineering team. This role involves leading complex projects focused on big data technologies and collaborative software development. Candidates should have... 
    Big data
    Remote work

    LiveRamp

    San Francisco, CA
    5 days ago
  • $103.52k - $143.1k

     ...Services LLC Position: Business Analyst II - AMZ26516.2 Location: San Francisco,...  ...metrics reporting and performing data mining and big data analysis to provide strategic advice...  ...Female / Disability / Veteran / Gender Identity / Sexual Orientation Basic Qualifications... 
    Big data
    Local area
    Relocation package

    Amazon

    San Francisco, CA
    4 days ago
  • $144.77k - $209.11k

     ...Sr. Data Engineer Employment Type: Full-Time, Mid-level Department: Business Intelligence...  ...Python, R, SQL, SAS). -Strong knowledge of big data analysis and storage tools and...  ...religion, sex, sexual orientation, gender identity, national origin, disability, or status as... 
    Big data
    Full time
    Flexible hours

    Contact Government Services, LLC

    San Francisco, CA
    16 hours ago
  • $106.9k - $176.5k

     ...seeking a highly skilled Senior Consultant Data Engineer with expertise in cloud data...  ...Databricks and experience with Spark for big data processing. Proven experience in at least...  ...religion, age, sex, sexual orientation, gender identity/expression, pregnancy, genetic information... 
    Big data
    Summer holiday
    Flexible hours

    EY

    San Francisco, CA
    1 day ago
  •  ...Socure is building the identity trust...  ...starts. The mission is big, the problems are complex...  ...role will lead the Data Science function across...  ..., including KYC, International KYC,...  ...Prefill, and Identity Graph . In addition to advancing...  ...team of data scientists and applied... 

    Socure Inc.

    San Francisco, CA
    4 days ago
  • $205k - $235k

     ...EY-Parthenon – EY Growth Platforms - Data Scientist – Director The opportunity EY-Parthenon...  ...top of trends in the Data Science and Big Data industry. If you have a genuine passion...  ..., age, sex, sexual orientation, gender identity/expression, pregnancy, genetic... 
    Big data
    Work experience placement
    Summer holiday
    Flexible hours

    Ernst & Young Oman

    San Francisco, CA
    4 days ago
  • $130k - $196.5k

    You will Design, build, and optimize scalable data processing pipelines using Spark, Airflow, and related big data technologies to support both batch and real‑time...  ...veteran, disability, sexual orientation, gender identity, genetics or other protected status. Qualified... 
    Big data
    Work from home
    Flexible hours
    Night shift

    LiveRamp

    San Francisco, CA
    5 days ago
  • $171.12k - $213.9k

     ...Twilio's next Staff Software Engineer on our Data & Analytics Platform Who we are & why...  ...distributed systems. ~ Expertise in big data technologies such as Hadoop, Spark,...  ...medical conditions), sexual orientation, gender identity, gender expression, age, status as a... 
    Big data
    Local area
    Remote work
    Worldwide

    Twilio

    San Francisco, CA
    2 days ago
  •  ...for a Staff Software Engineer to join the Data Infrastructure team within the broader Data...  ...scale. Strong technical background with big data and infrastructure technologies such...  ...national origin, sex, sexual orientation, gender identity or expression, transgender status, age,... 
    Big data

    100 Salesforce, Inc.

    San Francisco, CA
    2 days ago
  • $192k - $264k

     ...we’re using the power of tech, data, and machine learning to...  ...everywhere can compete with these big box and e-commerce giants. By...  ...team to work closely with Data Scientists, Product Analysts and Software...  ...genetics, sexual orientation, gender identity or gender expression. Faire is... 
    Big data
    Work experience placement
    Work at office
    Local area
    Remote work
    Monday to Friday
    Flexible hours
    3 days per week

    Faire Inc

    San Francisco, CA
    3 days ago
  • $110k - $135k

     ...platform for providers. We are venture backed and have partnered with multiple US-based health systems and data providers. What you’ll do As a Data Engineer II, you’ll be a foundational member of a small, high‑impact team building the data backbone of our clinical AI... 
    Full time
    Internship

    Knit Health

    San Francisco, CA
    3 days ago
  • $99k - $149k

    Day to Day This role’s primary responsibility is to integrate data from a variety of sources into common data domain models, supporting...  ...without regard to race, color, religion, gender, gender identity or expression, family status, marital status, sexual orientation... 

    Indeed

    San Francisco, CA
    3 days ago
  • $140k - $200k

     ...About the Role Help us advance our robotics moonshot by scaling our data engineering efforts. Drive design and development of data...  ...condition, genetic information, marital status, sex, gender, gender identity, gender expression, sexual orientation, age, military or veteran... 
    Full time
    Local area
    Flexible hours
    Weekend work

    Nimble Robotics

    San Francisco, CA
    3 days ago
  • Mithrl is seeking a Data Engineer, Knowledge Graphs to build the infrastructure for their biological knowledge layer. In this role, you will partner closely with data scientists to create scalable ETL pipelines and efficient APIs for data access. Your work will have significant... 
    Work at office

    Mithrl

    San Francisco, CA
    3 days ago
  • $132.26k - $155.6k

     ...you excel at—all from Day One. Job Description Responsible for big data/analytics projects that gather and integrate large volumes of data...  ..., color, sex, national origin, age, sexual orientation, gender identity, disability or veteran status, and other factors protected... 
    Big data
    Full time
    Temporary work
    Work experience placement
    Local area

    U.S. Bank

    San Francisco, CA
    3 days ago
  • $169k - $232k

    Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making...  ...ethnicity, ancestry, national origin, religion, sex, gender, gender identity, gender expression, sexual orientation, age, disability, veteran... 
    Work at office
    Local area

    Adyen

    San Francisco, CA
    2 days ago
  •  ...commercially available AI Co-Scientist. It is a discovery engine that transforms messy biological data into insights in minutes. Scientists...  ...by leading biotechs and big pharma across three continents...  ...hiring a Data Engineer, Knowledge Graphs to build the infrastructure... 
    Work at office

    Mithrl

    San Francisco, CA
    3 days ago
  • $110k - $135k

    Knit Health in San Francisco is seeking a Data Engineer II to be a key member of a small team focused on developing data backbone for clinical...  ...data quality and efficiency while working closely with data scientists to support AI model development. The ideal candidate should... 

    Knit Health

    San Francisco, CA
    2 days ago
  • $181.1k - $272.1k

    Senior Software Data Engineer, App Store San Francisco, California, United States Software...  ...-solving skills Hands‑on experience in big data technologies such as Hadoop, Spark/Flink...  ...religion, sex, sexual orientation, gender identity, national origin, disability, Veteran... 
    Big data
    Relocation package

    Apple Inc.

    San Francisco, CA
    6 days ago
  • $125.5k - $230.2k

     ...build a better working world. Technology - Data and Decision Science - Data Engineering -...  ...Databricks and experience with Spark for big data processing. Strong background in data...  ..., age, sex, sexual orientation, gender identity/expression, pregnancy, genetic information... 
    Big data
    Summer holiday
    Flexible hours

    Ernst & Young Oman

    San Francisco, CA
    5 days ago
  • $123.7k - $254.67k

     ...platform purpose-built for performance marketers. We leverage massive data and cutting‑edge science to automate and optimize TV advertising...  ...related medical conditions), sexual orientation, gender, gender identity, gender expression, age, marital status, status as a protected... 
    Big data
    Work at office
    Local area
    Relocation
    Relocation package

    Pinterest

    San Francisco, CA
    5 days ago
  • $98k - $164k

     ...BrazeAI, we’re expanding our team! Join our Forward-Deployed Data Scientist group of creative technical experts who partner with...  ...inclusive experience - regardless of age, color, disability, gender identity, marital status, maternity, national origin, pregnancy, race... 
    Full time
    Work at office
    Local area
    Flexible hours

    Braze

    San Francisco, CA
    1 day ago
  • $114.3k - $235.32k

     ...decisions for both Pinners and the business. We’re looking for a Data Scientist to join our Infrastructure Data Science team. In this role,...  ...medical conditions), sexual orientation, gender, gender identity, gender expression, age, marital status, status as a protected... 
    Full time
    Work at office
    Local area
    Remote work
    Relocation
    Relocation package

    Pinterest

    San Francisco, CA
    1 day ago
