Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Data Scientist - Big Data R&D, Identity Graph & KYC

Socure Inc

Why Socure?

Socure is building the identity trust infrastructure for the digital economy — verifying 100% of good identities in real time and stopping fraud before it starts. The mission is big, the problems are complex, and the impact is felt by businesses, governments, and millions of people every day.

We hire people who want that level of responsibility. People who move fast, think critically, act like owners, and care deeply about solving customer problems with precision. If you want predictability or narrow scope, this won't be your place. If you want to help build the future of identity with a team that holds a high bar for itself — keep reading.

About the Role

The Big Data R&D team develops cutting-edge big data and graph-based solutions for entity search, entity resolution, and identity matching that power Socure's KYC and compliance products.

As a Senior Data Scientist I, you will lead the design and deployment of advanced ML and graph algorithms on large-scale PII datasets, own end-to-end projects from problem definition through production validation, and serve as a key technical partner to Product, Engineering, and Client-facing teams. You will help define standards for feature engineering, experimentation, and data quality across our identity graph stack, with substantial impact on coverage, accuracy, and fairness.

What You'll Do
  • Own the design, development, and evaluation of machine learning, statistical, and graph-based algorithms for entity-resolution, identity trust scoring, and anomaly detection on massive datasets.

  • Architect and optimize graph-based identity representations (identity graph structure, linkage rules, clustering) to improve match rates, reduce false positives/negatives, and support downstream fraud and KYC models.

  • Build and maintain scalable data pipelines and feature stores in Spark/PySpark (or Scala), including data normalization, deduplication, and feature computation across large PII datasets in AWS/Databricks environments.

  • Lead A/B tests and offline/online experimentation for new models, features, and data sources; define success metrics, design experiments, and ensure rigorous validation before rollout.

  • Evaluate new internal and external data sources: explore signal quality, design backtests, quantify incremental value, and provide clear recommendations on vendor selection and integration.

  • Partner closely with product managers and engineers to translate ambiguous business and regulatory requirements (e.g., KYC coverage, watchlist matching) into concrete modeling and data roadmaps.

  • Provide deep analytical support to Socure's compliance and regulatory product suite, including investigative analyses, root-cause analysis for anomalies, and clear narratives for internal and external stakeholders.

  • Contribute to model governance and documentation: clearly explain model logic, data dependencies, limitations, and monitoring plans to internal risk/compliance stakeholders.

  • Mentor junior data scientists and engineers on best practices in data exploration, feature engineering, experimentation, and code quality.

  • Communicate complex technical concepts and trade-offs in a concise, structured way to both technical and non-technical audiences (e.g., product reviews, customer meetings, internal briefings).

What You Bring
  • Master's degree with 3+ years of relevant industry experience, or Ph.D. with 1+ years of experience in applied ML / data science roles; background in Computer Science, Statistics, Mathematics, or related quantitative fields preferred.

  • Strong proficiency in Python (preferred) or Scala, including experience with ML libraries such as scikit-learn, XGBoost, TensorFlow or PyTorch.

  • Extensive experience with Spark or PySpark and distributed data systems (e.g., AWS EMR, Databricks) working on very large, messy datasets.

  • Deep understanding of supervised and unsupervised learning, feature engineering, model evaluation, and experiment design (A/B testing, holdout strategies, stratification).

  • Experience developing production-quality data pipelines and automated workflows using Airflow or similar orchestration tools.

  • Practical familiarity with graph databases and/or graph frameworks (Neo4j, AWS Neptune, GraphFrames, DGL, PyTorch Geometric) and graph algorithms for clustering, link prediction, and community detection is strongly preferred.

  • Solid SQL skills and experience working with large-scale analytical data stores.

  • Experience in at least one of: identity verification, fraud detection, credit risk, or adjacent high-stakes domains is a plus.

  • Demonstrated ability to lead medium-to-large projects end-to-end, make sound trade-off decisions under ambiguity, and influence cross-functional stakeholders with data and clear reasoning.

Please note that sponsorship is not available at this time; and that you must be located within 45 miles of a talent hub to be considered.

Socure is an equal opportunity employer that values diversity in all its forms within our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. If you need an accommodation during any stage of the application or hiring process—including interview or onboarding support—please reach out to your Socure recruiting partner directly.

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Senior Data Scientist - Big Data R&D, Identity Graph & KYC in United States vacancy
  •  ...is building the identity trust infrastructure...  .... The mission is big, the problems are...  ...The Big Data R&D team is responsible...  ...the core identity graph and entity-resolution...  ...that power Socure's KYC and compliance...  ...work closely with senior data scientists and engineers... 
    Big data

    Socure Inc

    New York, NY
    3 days ago
  •  ...Socure is building the identity trust infrastructure...  .... The mission is big, the problems are complex...  ...Role The Big Data R&D team builds the core...  ...Socure's Verify and KYC products. As a Senior Data Scientist focused on...  ...design and deploy ML and graph-based systems tailored... 
    Senior
    Big data
    Work at office
    Local area

    Socure Inc

    New York, NY
    2 days ago
  • Socure is seeking a Data Scientist to join their Big Data R&D team in Miami, Florida. In this role, you will design algorithms and develop data pipelines for their identity verification solutions. The ideal candidate will have a Master’s degree or Ph.D. with relevant experience... 
    Big data

    Socure

    Miami, FL
    1 day ago
  • $183.1k - $274.7k

    The Trade Desk, Inc. in Bellevue is seeking a Senior Identity Engineer to design and operate large-scale distributed systems and data products. The ideal candidate has over 7...  ...experience in software engineering, especially with big data processing, distributed systems, and... 
    Senior
    Big data

    The Trade Desk

    Bellevue, WA
    3 days ago
  •  ...Socure is seeking a Senior Data Scientist I in Carson City, Nevada, to lead the design and deployment of advanced machine learning and graph algorithms on large datasets. This role involves significant collaboration with Product and Engineering teams to define standards... 
    Senior

    Socure Inc

    Seattle, WA
    4 days ago
  •  ...Have: We are looking for a Senior Data Scientist, with experience processing...  ...and analysis Apply big data analytic tools to large...  ...Analytics, Network Analytics, Graphing Data that assist the analytical...  ...sexual orientation, gender identity, national origin, disability... 
    Senior
    Big data
    Immediate start

    GRVTY

    Springfield, VA
    15 days ago
  •  ...scientific workflows, accelerating R&D across biology and chemistry...  ...partnered to hire multiple data engineers (mid and senior) to join a small team...  ...across ETL and solving complex big data problems....  ...various database technologies (graph, MongoDB, SQL, etc.). Hands-... 
    Senior
    Big data
    Remote work

    Talener

    Boston, MA
    2 days ago
  • $173k - $197.4k

     ...America, McLean, Virginia Principal Data Scientist, Consumer Identity Machine Learning Data is at the center...  ...billions of customer records to unlock the big opportunities that help everyday...  ...NLP, Data-centric AI, and Knowledge Graphs. Track record of driving innovation,... 
    Big data
    Full time
    Part time
    Local area

    Capital One National Association

    San Jose, CA
    3 days ago
  •  ...scientific workflows, accelerating R&D across Biology and Chemistry...  ...them to hire multiple data engineers (Mid and Senior level) to join a small team...  ...across ETL and solving complex big data problems. Job Title...  ...various database technologies (Graph, Mongo, SQL, etc) and... 
    Senior
    Big data
    Immediate start
    Remote work

    Talener

    Boston, MA
    3 days ago
  •  ...NTT DATA strives to hire exceptional, innovative and...  ...seeking am AWS Data Engineer Senior Specialist to join our...  ...Data Engineering and Big Data processing environments...  ....6 billion each year in R&D to help organizations...  ...sexual orientation, gender identity, national origin,... 
    Senior
    Big data
    Work experience placement

    NTT DATA Americas, Inc.

    Charlotte, NC
    5 days ago
  • $65.5k - $134k

     ...a better working world. Senior Data Scientist EY is the only professional...  ...MLLib, SparkNLP). Utilize big data frameworks (Hive, Hue;...  ...Good to have: Exposure to graph neural networks and graph databases...  ...sexual orientation, gender identity/expression, pregnancy,... 
    Senior
    Big data
    Summer holiday
    Flexible hours

    EY

    Tampa, FL
    3 days ago
  •  ...Job Title Senior Data Scientist Location McLean, VA 22102 US (Primary...  ...experience with databases design for big data, such as management of...  ...data mining, statistics, or graph algorithms to support...  ...sexual orientation, gender identity, national origin, disability... 
    Senior
    Big data
    Full time
    Work at office
    Remote work

    Prescient Edge

    McLean, VA
    2 days ago
  • Responsibilities Collaborate with Data Scientists and Platform Engineers to...  ...or AWS CDK Experience with graph technologies and knowledge...  ...semantic modeling Background in big data environments and real-...  ..., sexual orientation, gender identity, age, disability, national origin... 
    Senior
    Big data
    Permanent employment
    Contract work
    Local area

    Cloud Hybrid Technologies, LLC

    Houston, TX
    11 hours ago
  • $200k - $220k

     ...Senior Staff Data Infrastructure Engineer Armis, the cyber exposure management...  ...join our Data Infrastructure R&D Group as a dedicated expert...  .... Familiarity with Big Data frameworks or NoSQL databases...  ..., sexual orientation, gender identity, age, disability, veteran... 
    Senior
    Big data
    Local area
    Remote work

    Armis Security

    United States
    3 days ago
  •  ...D.C. Responsibilities As a Senior Data Engineer / Data Architect, you...  ..., distributed computing, and big data processing systems. Proficiency...  .... Experience with NoSQL and graph databases. Experience...  ..., sexual orientation, gender identity or any other legally... 
    Senior
    Big data

    ECCO Select

    Washington DC
    3 days ago
  •  ...Senior IT Data Engineer The Senior IT Data Engineers are experts in data...  ...projects, using advanced big data technologies, and ensuring...  ...pipelines, and RAG/knowledge graph platforms. Establish AI governance...  ..., sexual orientation, gender identity, disability or veteran status... 
    Senior
    Big data
    Work experience placement
    Relocation package
    Shift work
    Day shift

    Tyson Foods Inc.

    Springdale, AR
    2 days ago
  •  ...Johnson & Johnson Innovative Medicine is looking for a Principal Data Scientist – Knowledge Graph Engineer (Immunology) to join their Immunology R&D Data Science & Digital Health team. This role involves designing scalable knowledge graph infrastructures and collaborating... 
    Senior

    Johnson & Johnson Innovative Medicine

    Nacogdoches, TX
    3 days ago
  •  ...at scale, across every screen. Our data exists with the consent of over a billion...  ...culturally relevant.  As a Data Scientist on Samba's Knowledge Graph & Identity team in Warsaw, you will own end-to...  ...and deployment — with support from senior team members Develop and test... 

    Samba

    San Francisco, CA
    7 days ago
  • $100k - $300k

     ...seeking a forward-thinking AI Data Engineer to bridge the gap between...  ..., BigQuery, ClickHouse) and Big Data frameworks (Spark, Flink)...  .... Knowledge of Knowledge Graphs (Neo4j, NebulaGraph) and how to...  ...status, disability, gender identity or Veteran status. We also consider... 
    Senior
    Big data
    Full time

    OPPO US Research Center

    Palo Alto, CA
    1 day ago
  • $170k - $215k

    Senior Data Scientist Company: Norstella Location: Remote, United States Date...  ...fine tuning, and knowledge graphs Experience with...  ...retrieval) Experience with Big Data tools like Apache Spark...  ...veteran status, gender, gender identity or expression, sexual orientation... 
    Senior
    Big data
    Full time
    Contract work
    Temporary work
    Work experience placement
    Local area
    Remote work
    Flexible hours

    Norstella

    Annapolis, MD
    4 days ago
  •  ...makes play happen. The Senior Data Engineer is...  ...works closely with data scientists, analysts, AI/ML engineers...  ...embeddings, knowledge graphs, semantic layers, APIs...  ...platforms Experience with big data technologies such...  ..., sex, gender, gender identity or expression, sexual... 
    Senior
    Big data
    Local area

    Electronic Arts

    Austin, TX
    1 day ago
  •  ...Anywhere in Country AI and Data - Data Scientist - Senior Manager EY delivers...  ...in artificial intelligence, big data and cloud engineering....  ...memory and grounding, knowledge graphs, foundation models, optimisation...  ...sexual orientation, gender identity/expression, pregnancy,... 
    Senior
    Big data

    Ernst & Young Oman

    Kansas City, MO
    21 hours ago
  • $65.5k - $134k

     ...better working world. Senior Machine Learning...  ...! The opportunity Data has yet to be...  ...closely with data scientists, product managers,...  ...pipelines. Proficiency in big data frameworks (...  ...have: Experience with graph neural networks,...  ...orientation, gender identity/expression, pregnancy... 
    Senior
    Big data
    Summer holiday
    Flexible hours

    Ernst & Young Oman

    Tampa, FL
    2 days ago
  • MFour Mobile Research, Inc. seeks a Senior Data Engineer in Kansas City to own the data pipeline essential for AI-driven insights. This role involves ingesting and cleaning data across multiple sources, ensuring quality and compliance while collaborating with various teams... 
    Senior

    MFour Mobile Research, Inc.

    Kansas City, MO
    21 hours ago
  • $120k - $202.5k

     ...motivated and technically skilled Data Scientist to join our AI/ML science...  ...and maintain knowledge graphs and embeddings for semantic search...  ...Prior work on agentic AI is a big plus Salary Range $120...  ..., sexual orientation, gender identity or expression, citizenship,... 
    Senior
    Temporary work
    Flexible hours

    State Street Corporation

    Quincy, MA
    3 days ago
  • $100.9k - $187.3k

     ...What You’ll Do: As a Senior Software Engineer, you will be responsible...  ...suite to deliver global data using advanced search, big data ingestion and...  ...desired Neo4J or other graph technology Kafka Grails...  ...Assistance Program; Group Legal Identity Theft Protection benefit... 
    Senior
    Big data
    Contract work
    Work experience placement
    Local area
    Flexible hours

    Thomson Reuters

    McLean, VA
    3 days ago
  • $135k - $220k

     ...intelligent business identity platform that...  ...& identity graph network leverage...  ...proprietary data sources to prevent...  ...like KYC/KYB or financial...  ...closely with data scientists, ML engineers,...  ...Innovation & R&D: Stay on the cutting...  ....05% – 0.25% Seniority level Mid‑Senior... 
    Senior
    Full time
    Work at office
    Flexible hours
    3 days per week

    BASELAYER

    San Francisco, CA
    3 days ago
  •  ...support our NATO customer, SimIS seeks a Senior Data Scientist to apply business and data manipulation...  ...designing, developing, and maintaining big data platforms/fabrics. ~ Experience...  ...Provide cloud-based solutioning for internal R&D efforts to develop software prototypes... 
    Senior
    Big data
    Contract work
    Temporary work

    SimIS Inc.

    Norfolk, VA
    28 days ago
  • $130k - $196.5k

    Senior Software Engineer - Big Data page is loaded## Senior Software Engineer - Big Datalocations: San Franciscotime...  ..., data ethics, and foundational identity, LiveRamp is setting the new...  ...Identity Engineering team maintain the graph processing engine that connects the... 
    Senior
    Big data
    Work at office
    Work from home
    Worldwide
    Flexible hours
    Night shift

    LiveRamp

    San Francisco, CA
    2 days ago
  •  ...National Capitol Contracting is seeking Data Scientist II to support the Naval Sea Systems...  ...prescriptive analytics. Knowledge of big data technology infrastructure and environments...  ...sexual orientation, gender, gender identity, and gender expression, familial status,... 
    Senior
    Big data
    For contractors
    Work at office

    National Capitol Contracting LLC

    Norfolk, VA
    22 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Data Scientist - Big Data R&D, Identity Graph & KYC. Be the first to apply!