Member of Technical Staff, Data Researcher
Inception
Inception creates the world’s fastest, most efficient AI models. Our Mercury model is the world’s fastest reasoning LLM and first commercially available diffusion LLM, delivering 5x greater speed and efficiency than today’s LLMs, with best‑in‑class quality. We are the AI researchers and engineers behind such breakthrough AI technologies as diffusion models, flash attention, and DPO.

The Role
We seek experienced engineers and scientists to shape how we collect, process, and curate the datasets that power our models. You'll combine engineering expertise with research insight to build scalable data pipelines, develop synthetic data generation techniques, and ensure our models are trained on high‑quality, diverse data.

Key Responsibilities
- Develop data mixes for training LLMs, including by leveraging open-source datasets, synthetically generated data, and curated human feedback.
- Design and implement data pipelines for processing petabyte‑scale datasets.
- Build systems for web crawling, data ingestion, and real‑time data processing to support model training.
- Develop tools and frameworks for efficient data storage, retrieval, and versioning across distributed systems.
- Create evaluation frameworks to measure data diversity, quality, and representativeness.
- Ensure data collection adheres to privacy regulations.

Qualifications
- BS/MS/PhD in Computer Science, Machine Learning, or a related field (or equivalent experience).
- 3+ years of experience building data processing pipelines at scale, particularly with AI/ML applications.
- Strong proficiency in Python and experience with data processing frameworks (Apache Spark, Beam, Airflow).
- Familiarity with synthetic data generation techniques and data augmentation strategies.
- Familiarity with web scraping, crawling technologies, and Common Crawl datasets.
- Solid understanding of machine learning fundamentals and experience with ML frameworks (PyTorch, TensorFlow).
- Experience with SQL and NoSQL databases for managing structured and unstructured data.

Preferred Skills
- Experience with large language models and understanding of tokenization, embeddings, and model architectures.
- Experience managing human annotation workflows and quality control processes.
- Experience with vector databases and embedding‑based retrieval systems.
- Knowledge of data privacy regulations and ethical AI practices.
- Experience with distributed computing and large‑scale data storage systems (HDFS, S3, BigQuery).

Why Join Inception
- Work with World‑Class Talent: Collaborate with the inventors of diffusion models and leading AI researchers.
- Shape Foundational Technology: Your decisions will influence how the next generation of AI products is built and used.
- Immediate Impact: Join at the ground floor where your contributions directly shape product direction and company trajectory.
- Competitive salary and equity in a rapidly growing startup
- Flexible vacation and paid time off (PTO)
- Health, dental, and vision insurance
- Catered meals (breakfast, lunch, & dinner)
- A collaborative and inclusive culture

About Us
Inception creates the world’s fastest, most efficient AI models. Today’s autoregressive LLMs generate tokens sequentially, which makes them painfully slow and expensive. Inception’s diffusion‑based LLMs (dLLMs) generate answers in parallel. They are 5x faster and more efficient, while delivering best‑in‑class quality.

Inception was co‑founded by Stanford professor Stefano Ermon, who co‑invented such breakthrough AI technologies as diffusion models, flash attention, and DPO; UCLA professor Aditya Grover, who co‑invented node2vec, decision transformers, and d1 reasoning; and Cornell professor and Afresh co‑founder Volodymyr Kuleshov, who co‑invented MDLM and Block Diffusion. We pioneered the application of diffusion to language, with the world’s first (and only) commercially available dLLM, Mercury.

We are currently deploying our large‑scale diffusion LLMs at Fortune 500 companies. Diffusion is the technology behind today’s image and video AI, and we’re making it the standard for LLMs as well. Our team includes engineers from AWS, Google DeepMind, Meta AI, Microsoft, HashiCorp, and OpenAI. Based in Palo Alto, CA, we are backed by top‑tier venture capitalists, including Menlo Ventures, Mayfield, M12 (Microsoft’s venture fund), Snowflake Ventures, Databricks, and Innovation Endeavors, and by tech luminaries such as Andrew Ng, Andrej Karpathy, and Eric Schmidt.

If you are talented, innovative, and ambitious, come help us invent the future of AI. We are an equal opportunity employer and encourage candidates of all backgrounds to apply.

