Senior Data Scientist - Big Data R&D, Identity Graph & KYC
Socure Inc
Why Socure?
Socure is building the identity trust infrastructure for the digital economy — verifying 100% of good identities in real time and stopping fraud before it starts. The mission is big, the problems are complex, and the impact is felt by businesses, governments, and millions of people every day.
We hire people who want that level of responsibility. People who move fast, think critically, act like owners, and care deeply about solving customer problems with precision. If you want predictability or narrow scope, this won't be your place. If you want to help build the future of identity with a team that holds a high bar for itself — keep reading.
About the Role
The Big Data R&D team develops cutting-edge big data and graph-based solutions for entity search, entity resolution, and identity matching that power Socure's KYC and compliance products.
As a Senior Data Scientist I, you will lead the design and deployment of advanced ML and graph algorithms on large-scale PII datasets, own end-to-end projects from problem definition through production validation, and serve as a key technical partner to Product, Engineering, and Client-facing teams. You will help define standards for feature engineering, experimentation, and data quality across our identity graph stack, with substantial impact on coverage, accuracy, and fairness.
What You'll Do
Own the design, development, and evaluation of machine learning, statistical, and graph-based algorithms for entity-resolution, identity trust scoring, and anomaly detection on massive datasets.
Architect and optimize graph-based identity representations (identity graph structure, linkage rules, clustering) to improve match rates, reduce false positives/negatives, and support downstream fraud and KYC models.
Build and maintain scalable data pipelines and feature stores in Spark/PySpark (or Scala), including data normalization, deduplication, and feature computation across large PII datasets in AWS/Databricks environments.
Lead A/B tests and offline/online experimentation for new models, features, and data sources; define success metrics, design experiments, and ensure rigorous validation before rollout.
Evaluate new internal and external data sources: explore signal quality, design backtests, quantify incremental value, and provide clear recommendations on vendor selection and integration.
Partner closely with product managers and engineers to translate ambiguous business and regulatory requirements (e.g., KYC coverage, watchlist matching) into concrete modeling and data roadmaps.
Provide deep analytical support to Socure's compliance and regulatory product suite, including investigative analyses, root-cause analysis for anomalies, and clear narratives for internal and external stakeholders.
Contribute to model governance and documentation: clearly explain model logic, data dependencies, limitations, and monitoring plans to internal risk/compliance stakeholders.
Mentor junior data scientists and engineers on best practices in data exploration, feature engineering, experimentation, and code quality.
Communicate complex technical concepts and trade-offs in a concise, structured way to both technical and non-technical audiences (e.g., product reviews, customer meetings, internal briefings).
What You Bring
Master's degree with 3+ years of relevant industry experience, or Ph.D. with 1+ years of experience in applied ML / data science roles; background in Computer Science, Statistics, Mathematics, or related quantitative fields preferred.
Strong proficiency in Python (preferred) or Scala, including experience with ML libraries such as scikit-learn, XGBoost, TensorFlow or PyTorch.
Extensive experience with Spark or PySpark and distributed data systems (e.g., AWS EMR, Databricks) working on very large, messy datasets.
Deep understanding of supervised and unsupervised learning, feature engineering, model evaluation, and experiment design (A/B testing, holdout strategies, stratification).
Experience developing production-quality data pipelines and automated workflows using Airflow or similar orchestration tools.
Practical familiarity with graph databases and/or graph frameworks (Neo4j, AWS Neptune, GraphFrames, DGL, PyTorch Geometric) and graph algorithms for clustering, link prediction, and community detection is strongly preferred.
Solid SQL skills and experience working with large-scale analytical data stores.
Experience in at least one of: identity verification, fraud detection, credit risk, or adjacent high-stakes domains is a plus.
Demonstrated ability to lead medium-to-large projects end-to-end, make sound trade-off decisions under ambiguity, and influence cross-functional stakeholders with data and clear reasoning.
Please note that sponsorship is not available at this time; and that you must be located within 45 miles of a talent hub to be considered.
Socure is an equal opportunity employer that values diversity in all its forms within our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. If you need an accommodation during any stage of the application or hiring process—including interview or onboarding support—please reach out to your Socure recruiting partner directly.
- ...is building the identity trust infrastructure... .... The mission is big, the problems are... ...The Big Data R&D team is responsible... ...the core identity graph and entity-resolution... ...that power Socure's KYC and compliance... ...work closely with senior data scientists and engineers...Big data
- ...Socure is building the identity trust infrastructure... .... The mission is big, the problems are complex... ...Role The Big Data R&D team builds the core... ...Socure's Verify and KYC products. As a Senior Data Scientist focused on... ...design and deploy ML and graph-based systems tailored...SeniorBig dataWork at officeLocal area
- Socure is seeking a Data Scientist to join their Big Data R&D team in Miami, Florida. In this role, you will design algorithms and develop data pipelines for their identity verification solutions. The ideal candidate will have a Master’s degree or Ph.D. with relevant experience...Big data
$183.1k - $274.7k
The Trade Desk, Inc. in Bellevue is seeking a Senior Identity Engineer to design and operate large-scale distributed systems and data products. The ideal candidate has over 7... ...experience in software engineering, especially with big data processing, distributed systems, and...SeniorBig data- ...Socure is seeking a Senior Data Scientist I in Carson City, Nevada, to lead the design and deployment of advanced machine learning and graph algorithms on large datasets. This role involves significant collaboration with Product and Engineering teams to define standards...Senior
- ...Have: We are looking for a Senior Data Scientist, with experience processing... ...and analysis Apply big data analytic tools to large... ...Analytics, Network Analytics, Graphing Data that assist the analytical... ...sexual orientation, gender identity, national origin, disability...SeniorBig dataImmediate start
- ...scientific workflows, accelerating R&D across biology and chemistry... ...partnered to hire multiple data engineers (mid and senior) to join a small team... ...across ETL and solving complex big data problems.... ...various database technologies (graph, MongoDB, SQL, etc.). Hands-...SeniorBig dataRemote work
$173k - $197.4k
...America, McLean, Virginia Principal Data Scientist, Consumer Identity Machine Learning Data is at the center... ...billions of customer records to unlock the big opportunities that help everyday... ...NLP, Data-centric AI, and Knowledge Graphs. Track record of driving innovation,...Big dataFull timePart timeLocal area- ...scientific workflows, accelerating R&D across Biology and Chemistry... ...them to hire multiple data engineers (Mid and Senior level) to join a small team... ...across ETL and solving complex big data problems. Job Title... ...various database technologies (Graph, Mongo, SQL, etc) and...SeniorBig dataImmediate startRemote work
- ...NTT DATA strives to hire exceptional, innovative and... ...seeking am AWS Data Engineer Senior Specialist to join our... ...Data Engineering and Big Data processing environments... ....6 billion each year in R&D to help organizations... ...sexual orientation, gender identity, national origin,...SeniorBig dataWork experience placement
$65.5k - $134k
...a better working world. Senior Data Scientist EY is the only professional... ...MLLib, SparkNLP). Utilize big data frameworks (Hive, Hue;... ...Good to have: Exposure to graph neural networks and graph databases... ...sexual orientation, gender identity/expression, pregnancy,...SeniorBig dataSummer holidayFlexible hours- ...Job Title Senior Data Scientist Location McLean, VA 22102 US (Primary... ...experience with databases design for big data, such as management of... ...data mining, statistics, or graph algorithms to support... ...sexual orientation, gender identity, national origin, disability...SeniorBig dataFull timeWork at officeRemote work
- Responsibilities Collaborate with Data Scientists and Platform Engineers to... ...or AWS CDK Experience with graph technologies and knowledge... ...semantic modeling Background in big data environments and real-... ..., sexual orientation, gender identity, age, disability, national origin...SeniorBig dataPermanent employmentContract workLocal area
$200k - $220k
...Senior Staff Data Infrastructure Engineer Armis, the cyber exposure management... ...join our Data Infrastructure R&D Group as a dedicated expert... .... Familiarity with Big Data frameworks or NoSQL databases... ..., sexual orientation, gender identity, age, disability, veteran...SeniorBig dataLocal areaRemote work- ...D.C. Responsibilities As a Senior Data Engineer / Data Architect, you... ..., distributed computing, and big data processing systems. Proficiency... .... Experience with NoSQL and graph databases. Experience... ..., sexual orientation, gender identity or any other legally...SeniorBig data
- ...Senior IT Data Engineer The Senior IT Data Engineers are experts in data... ...projects, using advanced big data technologies, and ensuring... ...pipelines, and RAG/knowledge graph platforms. Establish AI governance... ..., sexual orientation, gender identity, disability or veteran status...SeniorBig dataWork experience placementRelocation packageShift workDay shift
- ...Johnson & Johnson Innovative Medicine is looking for a Principal Data Scientist – Knowledge Graph Engineer (Immunology) to join their Immunology R&D Data Science & Digital Health team. This role involves designing scalable knowledge graph infrastructures and collaborating...Senior
- ...at scale, across every screen. Our data exists with the consent of over a billion... ...culturally relevant. As a Data Scientist on Samba's Knowledge Graph & Identity team in Warsaw, you will own end-to... ...and deployment — with support from senior team members Develop and test...
$100k - $300k
...seeking a forward-thinking AI Data Engineer to bridge the gap between... ..., BigQuery, ClickHouse) and Big Data frameworks (Spark, Flink)... .... Knowledge of Knowledge Graphs (Neo4j, NebulaGraph) and how to... ...status, disability, gender identity or Veteran status. We also consider...SeniorBig dataFull time$170k - $215k
Senior Data Scientist Company: Norstella Location: Remote, United States Date... ...fine tuning, and knowledge graphs Experience with... ...retrieval) Experience with Big Data tools like Apache Spark... ...veteran status, gender, gender identity or expression, sexual orientation...SeniorBig dataFull timeContract workTemporary workWork experience placementLocal areaRemote workFlexible hours- ...makes play happen. The Senior Data Engineer is... ...works closely with data scientists, analysts, AI/ML engineers... ...embeddings, knowledge graphs, semantic layers, APIs... ...platforms Experience with big data technologies such... ..., sex, gender, gender identity or expression, sexual...SeniorBig dataLocal area
- ...Anywhere in Country AI and Data - Data Scientist - Senior Manager EY delivers... ...in artificial intelligence, big data and cloud engineering.... ...memory and grounding, knowledge graphs, foundation models, optimisation... ...sexual orientation, gender identity/expression, pregnancy,...SeniorBig data
$65.5k - $134k
...better working world. Senior Machine Learning... ...! The opportunity Data has yet to be... ...closely with data scientists, product managers,... ...pipelines. Proficiency in big data frameworks (... ...have: Experience with graph neural networks,... ...orientation, gender identity/expression, pregnancy...SeniorBig dataSummer holidayFlexible hours- MFour Mobile Research, Inc. seeks a Senior Data Engineer in Kansas City to own the data pipeline essential for AI-driven insights. This role involves ingesting and cleaning data across multiple sources, ensuring quality and compliance while collaborating with various teams...Senior
$120k - $202.5k
...motivated and technically skilled Data Scientist to join our AI/ML science... ...and maintain knowledge graphs and embeddings for semantic search... ...Prior work on agentic AI is a big plus Salary Range $120... ..., sexual orientation, gender identity or expression, citizenship,...SeniorTemporary workFlexible hours$100.9k - $187.3k
...What You’ll Do: As a Senior Software Engineer, you will be responsible... ...suite to deliver global data using advanced search, big data ingestion and... ...desired Neo4J or other graph technology Kafka Grails... ...Assistance Program; Group Legal Identity Theft Protection benefit...SeniorBig dataContract workWork experience placementLocal areaFlexible hours$135k - $220k
...intelligent business identity platform that... ...& identity graph network leverage... ...proprietary data sources to prevent... ...like KYC/KYB or financial... ...closely with data scientists, ML engineers,... ...Innovation & R&D: Stay on the cutting... ....05% – 0.25% Seniority level Mid‑Senior...SeniorFull timeWork at officeFlexible hours3 days per week- ...support our NATO customer, SimIS seeks a Senior Data Scientist to apply business and data manipulation... ...designing, developing, and maintaining big data platforms/fabrics. ~ Experience... ...Provide cloud-based solutioning for internal R&D efforts to develop software prototypes...SeniorBig dataContract workTemporary work
$130k - $196.5k
Senior Software Engineer - Big Data page is loaded## Senior Software Engineer - Big Datalocations: San Franciscotime... ..., data ethics, and foundational identity, LiveRamp is setting the new... ...Identity Engineering team maintain the graph processing engine that connects the...SeniorBig dataWork at officeWork from homeWorldwideFlexible hoursNight shift- ...National Capitol Contracting is seeking Data Scientist II to support the Naval Sea Systems... ...prescriptive analytics. Knowledge of big data technology infrastructure and environments... ...sexual orientation, gender, gender identity, and gender expression, familial status,...SeniorBig dataFor contractorsWork at office
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Data Scientist - Big Data R&D, Identity Graph & KYC. Be the first to apply!
- entry level data scientist remote United States
- senior data scientist United States
- data scientist no experience United States
- entry level data scientist United States
- work from home data scientist United States
- healthcare data scientist United States
- python data scientist United States
- associate data scientist United States
- data scientist (hedge fund) United States
- energy data scientist United States


