Data Scientist II - Big Data R&D, Identity Graph & KYC
$140k - $170kSocure
About the Role The Big Data R&D team is responsible for building the core identity graph and entity-resolution capabilities that power Socure’s KYC and compliance products. In this role, you will help develop graph-based algorithms and data pipelines on massive PII datasets, support modelers with high-quality features, and evaluate new data sources that feed our identity and fraud products. You will work closely with senior data scientists and engineers while developing your skills in large‑scale ML, distributed systems, and graph analytics. What You’ll Do Contribute to the design and implementation of machine learning, data mining, statistical, and graph‑based algorithms to analyze very large datasets for identity verification and anomaly detection. Analyze large datasets to help develop and refine entity‑resolution and identity‑matching algorithms that drive Socure’s KYC and compliance solutions. Build and maintain components of data‑processing pipelines (ETL, feature generation, normalization) using tools such as Spark/PySpark and AWS (e.g., EMR, S3). Support senior data scientists with feature engineering, data exploration, error analysis, and A/B test setup for new models and signals. Help evaluate new third‑party and internal data sources: profile data quality, design offline experiments, and summarize impact on coverage and model performance. Implement and maintain SQL and Python/R code for data extraction, transformation, and validation; contribute to code reviews and basic testing. Provide analytical support to compliance and regulatory product teams, including ad hoc investigations, simple dashboards, and data deep dives. Communicate findings in a clear, structured way to peers and cross‑functional partners (Product, Engineering, Client Analysis), focusing on key insights and trade‑offs. Work effectively in a fast‑paced, cross‑functional environment; demonstrate ownership of well‑scoped tasks and follow through to completion. What You Bring Master’s degree with 2+ years of experience, or Ph.D. with 1+ years of experience in a data science or analytics role, or equivalent practical experience. Proficiency in at least one general-purpose programming language used in data science (Python, or Scala). Solid experience writing and optimizing SQL for large datasets; comfort working in data lake / warehouse environments. Hands‑on experience with Spark or PySpark and common ML libraries (e.g., scikit‑learn, XGBoost, TensorFlow/PyTorch a plus). Familiarity with UNIX environments and the AWS ecosystem (e.g., EMR, S3); Databricks experience is a plus. Working knowledge of supervised/unsupervised ML and basic statistics (similarity measures, clustering, evaluation metrics). Exposure to graph techniques or graph databases (Neo4j, AWS Neptune, GraphFrames) is a strong plus. Bonus: experience with Elasticsearch or DynamoDB; workflow tools such as Airflow for automating data pipelines. Ability to break down loosely defined problems, ask good clarifying questions, and iterate quickly with feedback. Please note sponsorship is not available at this time; and that you must be located within 45 miles of a talent hub to be considered. Socure is an equal opportunity employer that values diversity in all its forms within our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. If you need an accommodation during any stage of the application or hiring process—including interview or onboarding support—please reach out to your Socure recruiting partner directly. Compensation Range: $140K – $170K #J-18808-Ljbffr Socure
$130.05k - $175.95k
The Onyx Research Data Platform organization represents... ...investment by GSK R&D and Digital & Tech, designed... ...experience for GSK’s scientists, engineers, and... ...requirements. A Data Engineer II is a technical... ...sexual orientation, gender identity/expression, age, disability...SuggestedLocal area- ...about — in real time, at scale, across every screen. Our data exists with the consent of over a billion people,... ...because it’s the most culturally relevant. As a Data Scientist on Samba's Knowledge Graph & Identity team in Warsaw, you will own end-to-end delivery of significant...Suggested
$162k - $258.75k
...About the Principal Data Scientist at Headspace We are looking for an innovative, strategic... ...preferred). ~7+ years of expertise in big data technologies (e.g., Redshift, S3, Databricks... ...race, color, religion, gender, gender identity, gender expression, sexual orientation,...Big dataFull timeCurrently hiringWork at officeLocal areaRemote workShift work3 days per week$148.5k - $260.1k
...the next-generation Enterprise Knowledge Graph platform to power AI-driven experiences,... ...applications, semantic search, enterprise data discovery, and intelligent decision-making... ...sexual orientation, gender expression or identity, transgender status, age, disability,...SuggestedContract work- Data Engineer II - QuantumBlack, AI by McKinsey Mar 09, 2026 Your Impact As a Data Engineer... ...cross‑functional Agile teams with Data Scientists, Machine Learning Engineers, Designers... ...employment without regard to sex, gender identity, sexual orientation, race, color,...SuggestedApprenticeship
$112.5k - $150k
Analytics Engineer As the Analytics Engineer II, you will: Build and improve data models that enable faster, more reliable decision‑making across Product... ...sex, national origin, age, sexual orientation, gender identity, veteran status, disability or genetics. Qualified...Work from homeHome office$130k - $196.5k
Senior Software Engineer - Big Data page is loaded## Senior Software Engineer - Big Datalocations... ...privacy, data ethics, and foundational identity, LiveRamp is setting the new standard for... ...Identity Engineering team maintain the graph processing engine that connects the...Big dataWork at officeWork from homeWorldwideFlexible hoursNight shift- A leading data collaboration company in San Francisco is seeking a Senior Software Engineer to join their Identity Engineering team. This role involves leading complex projects focused on big data technologies and collaborative software development. Candidates should have...Big dataRemote work
$103.52k - $143.1k
...Services LLC Position: Business Analyst II - AMZ26516.2 Location: San Francisco,... ...metrics reporting and performing data mining and big data analysis to provide strategic advice... ...Female / Disability / Veteran / Gender Identity / Sexual Orientation Basic Qualifications...Big dataLocal areaRelocation package$144.77k - $209.11k
...Sr. Data Engineer Employment Type: Full-Time, Mid-level Department: Business Intelligence... ...Python, R, SQL, SAS). -Strong knowledge of big data analysis and storage tools and... ...religion, sex, sexual orientation, gender identity, national origin, disability, or status as...Big dataFull timeFlexible hours$106.9k - $176.5k
...seeking a highly skilled Senior Consultant Data Engineer with expertise in cloud data... ...Databricks and experience with Spark for big data processing. Proven experience in at least... ...religion, age, sex, sexual orientation, gender identity/expression, pregnancy, genetic information...Big dataSummer holidayFlexible hours- ...Socure is building the identity trust... ...starts. The mission is big, the problems are complex... ...role will lead the Data Science function across... ..., including KYC, International KYC,... ...Prefill, and Identity Graph . In addition to advancing... ...team of data scientists and applied...
$205k - $235k
...EY-Parthenon – EY Growth Platforms - Data Scientist – Director The opportunity EY-Parthenon... ...top of trends in the Data Science and Big Data industry. If you have a genuine passion... ..., age, sex, sexual orientation, gender identity/expression, pregnancy, genetic...Big dataWork experience placementSummer holidayFlexible hours$130k - $196.5k
You will Design, build, and optimize scalable data processing pipelines using Spark, Airflow, and related big data technologies to support both batch and real‑time... ...veteran, disability, sexual orientation, gender identity, genetics or other protected status. Qualified...Big dataWork from homeFlexible hoursNight shift$171.12k - $213.9k
...Twilio's next Staff Software Engineer on our Data & Analytics Platform Who we are & why... ...distributed systems. ~ Expertise in big data technologies such as Hadoop, Spark,... ...medical conditions), sexual orientation, gender identity, gender expression, age, status as a...Big dataLocal areaRemote workWorldwide- ...for a Staff Software Engineer to join the Data Infrastructure team within the broader Data... ...scale. Strong technical background with big data and infrastructure technologies such... ...national origin, sex, sexual orientation, gender identity or expression, transgender status, age,...Big data
$192k - $264k
...we’re using the power of tech, data, and machine learning to... ...everywhere can compete with these big box and e-commerce giants. By... ...team to work closely with Data Scientists, Product Analysts and Software... ...genetics, sexual orientation, gender identity or gender expression. Faire is...Big dataWork experience placementWork at officeLocal areaRemote workMonday to FridayFlexible hours3 days per week$110k - $135k
...platform for providers. We are venture backed and have partnered with multiple US-based health systems and data providers. What you’ll do As a Data Engineer II, you’ll be a foundational member of a small, high‑impact team building the data backbone of our clinical AI...Full timeInternship$99k - $149k
Day to Day This role’s primary responsibility is to integrate data from a variety of sources into common data domain models, supporting... ...without regard to race, color, religion, gender, gender identity or expression, family status, marital status, sexual orientation...$140k - $200k
...About the Role Help us advance our robotics moonshot by scaling our data engineering efforts. Drive design and development of data... ...condition, genetic information, marital status, sex, gender, gender identity, gender expression, sexual orientation, age, military or veteran...Full timeLocal areaFlexible hoursWeekend work- Mithrl is seeking a Data Engineer, Knowledge Graphs to build the infrastructure for their biological knowledge layer. In this role, you will partner closely with data scientists to create scalable ETL pipelines and efficient APIs for data access. Your work will have significant...Work at office
$132.26k - $155.6k
...you excel at—all from Day One. Job Description Responsible for big data/analytics projects that gather and integrate large volumes of data... ..., color, sex, national origin, age, sexual orientation, gender identity, disability or veteran status, and other factors protected...Big dataFull timeTemporary workWork experience placementLocal area$169k - $232k
Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making... ...ethnicity, ancestry, national origin, religion, sex, gender, gender identity, gender expression, sexual orientation, age, disability, veteran...Work at officeLocal area- ...commercially available AI Co-Scientist. It is a discovery engine that transforms messy biological data into insights in minutes. Scientists... ...by leading biotechs and big pharma across three continents... ...hiring a Data Engineer, Knowledge Graphs to build the infrastructure...Work at office
$110k - $135k
Knit Health in San Francisco is seeking a Data Engineer II to be a key member of a small team focused on developing data backbone for clinical... ...data quality and efficiency while working closely with data scientists to support AI model development. The ideal candidate should...$181.1k - $272.1k
Senior Software Data Engineer, App Store San Francisco, California, United States Software... ...-solving skills Hands‑on experience in big data technologies such as Hadoop, Spark/Flink... ...religion, sex, sexual orientation, gender identity, national origin, disability, Veteran...Big dataRelocation package$125.5k - $230.2k
...build a better working world. Technology - Data and Decision Science - Data Engineering -... ...Databricks and experience with Spark for big data processing. Strong background in data... ..., age, sex, sexual orientation, gender identity/expression, pregnancy, genetic information...Big dataSummer holidayFlexible hours$123.7k - $254.67k
...platform purpose-built for performance marketers. We leverage massive data and cutting‑edge science to automate and optimize TV advertising... ...related medical conditions), sexual orientation, gender, gender identity, gender expression, age, marital status, status as a protected...Big dataWork at officeLocal areaRelocationRelocation package$98k - $164k
...BrazeAI, we’re expanding our team! Join our Forward-Deployed Data Scientist group of creative technical experts who partner with... ...inclusive experience - regardless of age, color, disability, gender identity, marital status, maternity, national origin, pregnancy, race...Full timeWork at officeLocal areaFlexible hours$114.3k - $235.32k
...decisions for both Pinners and the business. We’re looking for a Data Scientist to join our Infrastructure Data Science team. In this role,... ...medical conditions), sexual orientation, gender, gender identity, gender expression, age, marital status, status as a protected...Full timeWork at officeLocal areaRemote workRelocationRelocation package
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Data Scientist II - Big Data R&D, Identity Graph & KYC. Be the first to apply!
- principal data scientist San Francisco, CA
- python data scientist San Francisco, CA
- healthcare data scientist San Francisco, CA
- part time data scientist San Francisco, CA
- work from home data scientist San Francisco, CA
- data scientist machine learning engineer San Francisco, CA
- data scientist (hedge fund) San Francisco, CA
- energy data scientist San Francisco, CA
- data scientist San Francisco, CA
- entry level data scientist remote San Francisco, CA


