Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Member of Technical Staff, Web Data

Cohere

Who are we?

Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.

We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what's best for our customers.

Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products.

Join us on our mission and shape the future!

As a Senior Member of Technical Staff specializing in web data for pre-training, you will play a pivotal role in developing the large scale web data pipeline that underpins Cohere's advanced language models. In this role, you will work extensively with Common Crawl and other large-scale web corpora, transforming raw, noisy internet data into high-quality training data for pretraining. You will own key components of the data pipeline, including extraction, parsing, deduplication, and filtering. You will also analyze the composition and quality of web data, study its impact on downstream model performance, and collaborate closely with the broader data and evaluation teams to iterate on the training corpus.

Your work will be essential to Cohere's mission of delivering efficient and reliable language understanding and generation capabilities, driving innovation in natural language processing. If you are passionate about transforming data into the foundation of AI systems, this role offers a unique opportunity to make a meaningful impact.

Please Note: We have offices in London, Paris, Toronto, San Francisco and New York but also embrace being remote-friendly! There are no restrictions on where you can be located for this role. (EST/EU)

As a Senior Member of Technical Staff, Web Data, you will:
  • Maintain large-scale pipelines for processing web corpora.
  • Work on filtering and quality-scoring systems to identify high-value web documents.
  • Analyze web data composition across domains, languages and time periods.
  • Develop and maintain highly-performant deduplication pipelines.
  • Collaborate with cross-functional teams, including researchers and engineers, to ensure data pipelines meet the demands of cutting-edge language models.
You may be a good fit if you have:
  • Strong software engineering skills, with proficiency in Python and experience building data pipelines.
  • Familiarity with data processing frameworks such as Apache Spark, Apache Beam, Pandas, or similar tools.
  • Experience working with large-scale web datasets.
  • Knowledge of data quality assessment techniques and experimentation with data mixtures.
  • A passion for bridging research and engineering to solve complex data-related challenges in AI model training.

Bonus: paper at top-tier venues (such as NeurIPS, ICML, ICLR, AIStats, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP).

If some of the above doesn't line up perfectly with your experience, we still encourage you to apply!

We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.

If some of the above doesn't line up perfectly with your experience, we still encourage you to apply!


We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.

Full-Time Employees at Cohere enjoy these Perks:

An open and inclusive culture and work environment

Work closely with a team on the cutting edge of AI research

Weekly lunch stipend, in-office lunches & snacks

Full health and dental benefits, including a separate budget to take care of your mental health

100% Parental Leave top-up for up to 6 months

Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement

Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend

6 weeks of vacation (30 working days!)
Vacancy posted 17 hours ago
Similar jobs that could be interesting for youBased on the Senior Member of Technical Staff, Web Data in New York, NY vacancy
  •  ...mobile AI experiences. As a key member of our team, you'll push the...  ...of iOS development, combining technical excellence with design finesse...  ...syncing, on-device ML, efficient data flow). Iterate quickly on...  ...(iOS + Android or iOS + Web). Led iOS architecture: modular... 
    Data
    Web
    Shift work

    ATG intelligence

    New York, NY
    2 days ago
  •  ...help create something truly transformative. The Role As a Member of Technical Staff, you'll be a core technical contributor building high‑impact...  ...engine: agentic workflows, intelligent routing, and real‑time data orchestration. AI Optimization: Craft precise prompting... 
    Data
    Senior
    Local area

    Atomic

    New York, NY
    4 days ago
  •  ...Page One. The Role We're hiring a Member of Technical Staff - Internal AI Harness to build the...  ...Generation Systems Build and maintain data pipelines that support go-to-market...  ...building backend services and modern web applications. Have experience designing... 
    Data
    Web
    Full time
    Flexible hours

    Stuut

    New York, NY
    17 hours ago
  •  ...infrastructure. You may be the person building APIs, integrations, data pipelines, and distributed systems — or you may be the...  ..., mention it in your application. About the Role Members of Technical Staff at Anterior own problems end-to-end — from system design through... 
    Data
    Senior
    Apprenticeship
    Remote work
    Flexible hours

    Anterior, Inc.

    New York, NY
    2 days ago
  • $50k - $120k

     ...Member of the Technical Staff, Full-Stack Web Developer (UI/UX) Unitary Foundation is a non-profit research group helping build a quantum technology industry...  ...of this role will be on the Quantum Benchmarking data visualization platform. We are building an open-source... 
    Data
    Web
    Local area
    Remote work
    Flexible hours

    Second Renaissance

    New York, NY
    4 days ago
  • $170k - $220k

     ...precision-based care for Atria members and their families. All...  ...a highly experienced Senior Product Manager to help...  ...mobile (iOS/Android), web, onboarding/...  ...Partner with engineering and data teams to build AI-assisted...  ...user impact, risk, and technical feasibility. Cross-... 
    Data
    Senior
    Web
    Flexible hours

    Atria Physician Practice New York PC

    New York, NY
    2 days ago
  •  ...make it accessible to all. About The Role Data plays a crucial role at the frontier of AI...  ...architectures, but from better data. As a member of the Data Team, your mission is to build...  ...the ingestion systems that turn the open web and other large‑scale data sources into reliable... 
    Data
    Web
    Relocation package

    Reflection

    New York, NY
    3 days ago
  • $139.9k - $274.8k

     ...benefits.???? Microsoft AI (MS AI) is seeking a experienced Member of Technical Staff - Data Engineer - Microsoft AI - Copilot to help build mission...  ...trends, best practices, and emerging technologies in web development and AI.? ~ Ability to work in a fast-paced environment... 
    Data
    Web
    Ongoing contract
    Work at office
    Local area

    Microsoft Corporation

    New York, NY
    2 days ago
  • Member of Technical Staff - Software Engineer Valthos | Posted Mar 3 Full-time Negotiable Advanced (5...  ...high-scale, opinionated APIs that are data/GPU intensive. Prototype and develop interactive...  ...of the full application stack (Web UX frameworks, API design and... 
    Data
    Web
    Full time
    Work at office

    Valthos

    New York, NY
    1 day ago
  •  ...to function as CurRent Senior Application Specialist,...  ...collaboration tools to assist team members in resolving incidents...  ..., systems programming, data communications,...  ..., mobile development, web development and design;...  ...from an accredited technical school (post high school... 
    Data
    Senior
    Web
    Permanent employment
    Full time
    Work at office
    Immediate start
    Remote work
    Shift work

    CITY OF NEW YORK INC

    Brooklyn, NY
    5 days ago
  •  ...Requirements: Experience building and shipping modern web applications end-to-end. We care more about what you've built than...  ...the frontend, Python services on the backend, and ClickHouse for data and analytics. Deep knowledge of observability tools and patterns... 
    Data
    Web
    Work at office

    Modal

    New York, NY
    4 days ago
  •  ...driver of front-end architecture, technical direction, and engineering...  ...professional experience building web applications and UIs, with...  ...concurrency, algorithms, and data structures (formal CS degree NOT...  ...diverse team of more than 600 members, we are united in one common goal... 
    Data
    Web
    Full time

    Anchorage Digital

    New York, NY
    18 hours ago
  •  ...is uniquely designed to make it easy for both engineers and non-technical teams to build agents on a single platform. Our customers include...  ...with TypeScript, with a deep understanding of end-to-end web fundamentals System design skills: you can architect APIs and... 
    Senior
    Web

    Inkeep

    New York, NY
    4 days ago
  •  ...: We're looking for a Growth Engineer to own the technical foundation of Modal's marketing and developer-facing web surfaces: the marketing site, docs site, growth landing...  ...You'll partner with Product Engineering, Design, Data, and Growth to ship polished, measurable web... 
    Data
    Web
    Work at office

    Modal

    New York, NY
    4 days ago
  • $159.1k - $196.48k

     ...Senior Software Engineer I, Member Growth, Care Guide Experience New York (Hybrid)...  ...Expand and maintain existing web applications and infrastructure...  ...and efficiently Provide technical leadership in architectural...  ...with product, design, data, or other teams, contributing... 
    Data
    Senior
    Web
    Work at office
    Relocation
    Sleeping nights
    2 days per week
    3 days per week

    Spring Health

    New York, NY
    3 days ago
  •  ...Design and build interfaces that make complex data, analysis results, workflows, alerts,...  ...and building production-quality web applications. Strong frontend engineering...  ...where product direction, customer needs, and technical constraints are still evolving.... 
    Data
    Web
    Work at office

    Valthos

    New York, NY
    2 days ago
  • $180k

     ...intersection of AI and finance. RESPONSIBILITIES: * Develop backend services, APIs, and data models to support high-volume, multi-user environments. * Work with iOS, Android & Web client engineers to ship products. * Design robust infrastructure and microservices for... 
    Data
    Web
    Full time
    Temporary work

    xAI

    New York, NY
    5 days ago
  •  ...Pace Technical Staff Role Pace is an AI-native business process outsourcer for insurers. We...  ...companies in the world. We're looking for a Member of Technical Staff who will partner with...  ..., 2) Full-stack engineer to own RPA/web automation features, 3) Enterprise... 
    Web

    Pace

    New York, NY
    1 day ago
  • $81k - $115.5k

     ...- 3 days per week Role Overview: As a Senior Business/Technical Analyst within the Tech Data AI Ventures team at New York Life, you will serve as...  ...systems with a multi-tier architectures consisting of web and legacy applications. • Experience working in... 
    Data
    Senior
    Web
    Local area
    Shift work
    3 days per week

    New York Life

    New York, NY
    3 days ago
  •  .... We are looking for a Senior Salesforce Developer who...  ...comprehensive technical solutions, establish and...  ...addition to pairing with team members on functional and nonfunctional...  ...platform principles, data model, integration...  ...Services, SOQL, Apex Web Service Callouts, Scheduled... 
    Data
    Senior
    Web
    Remote work

    BXGI Consulting

    New York, NY
    4 days ago
  •  ...quarterly in‑person collaboration days to work together and further deepen our Village. A successful member of this team would help bridge business and technical uses of data, creating an easy to use platform for Data and Data Tooling for Engineering, Reporting, Analytics,... 
    Data
    Work at office
    Remote work

    Crypto Pro Network

    New York, NY
    4 days ago
  •  ...US-based 501(c)(3). Job Description: We are looking for a Member of Technical Staff, Research to investigate, design, test and develop state of...  ...independent and internal thought experiments. Design and automate data ingestion pipelines in collaboration with our Data... 
    Data
    Remote work

    Firstprinciples

    New York, NY
    4 days ago
  •  ...Senior React Developer Location: Weehawken, NJ -...  ...mainframe system to a new technical, highly-scalable...  ...allowing to query underlying data from third-party applications...  ...front and front-end web development Maintain...  ...and lead other team members Design and code cloud... 
    Data
    Senior
    Web
    Contract work
    Work at office

    Futran Tech Solutions Pvt. Ltd.

    Weehawken, NJ
    3 days ago
  • $200k - $270k

     ...drive long-term success for both clients and candidates. Member of Technical Staff Location: New York City Company Stage of Funding:...  ...Do Design and build backend services, APIs, and data pipelines that power AI-driven healthcare workflows Work... 
    Data
    Work at office
    Visa sponsorship

    Recruiting from Scratch

    New York, NY
    2 days ago
  • $80 per hour

     ...Title - Senior Salesforce Developer Location - New...  ...Development, Integration and technical execution for...  ...delivering complex Lightning Web Components (LWC) user...  ...understanding of Salesforce data modeling, sharing rules...  ...and mentor team members Good to have... 
    Data
    Senior
    Web
    Work at office

    SysMind Tech

    Jersey City, NJ
    3 days ago
  •  ...role We're looking for a Senior Product Designer with a...  ...finance - translating technical constraints and...  ...experiences across our web and mobile products - from...  ...Translate DeFi mechanics and data-heavy flows into...  ...views each and every team member as a separate individual... 
    Data
    Senior
    Web

    Aave Labs

    New York, NY
    4 days ago
  • $189.59k

     ...anchorage.com, on X @Anchorage, and on LinkedIn. Job title: Member of Technical Staff, Banking SolutionsCompany name: Anchor LabsJob site address...  ...humans. If you would like more information about how your data is processed, please contact us.Listed in: , , , , , , , ,... 
    Data
    Bank staff
    Remote work

    Crypto Pro Network

    New York, NY
    1 day ago
  •  ...Activant, 1984 Ventures and Page One. The Role We’re hiring a Member of Technical Staff - AI/ML to design, build, and deploy AI-powered systems...  ...financial use cases, including fine-tuning on proprietary data where it moves the needle Build robust AI pipelines from ingestion... 
    Data
    Full time
    Flexible hours

    Stuut

    New York, NY
    2 days ago
  •  ...Purpose/Role We are seeking a Senior Synon Developer to develop, enhance, and support our Member Benefits System (MBS). The...  ...of Synon functionality and the data exchange integrations we employ...  ...PageDNA, LegaSuite, Java, Apex, Query Tools, or web design is a plus.
    Data
    Senior
    Web

    RIT Solutions, Inc.

    New York, NY
    3 days ago
  • $175k - $220k

     ...Member Of Technical Staff, Cloud Infrastructure New York, NY; San Mateo, CA At Fireworks, we're building the future of generative AI infrastructure...  ...to support distributed training, inference, and data processing pipelines. Lead technical design discussions,... 
    Data

    Fireworks AI

    New York, NY
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Member of Technical Staff, Web Data. Be the first to apply!