Senior Member of Technical Staff, Web Data
Cohere
Who are we? Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what's best for our customers. Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products. Join us on our mission and shape the future! As a Senior Member of Technical Staff specializing in web data for pre-training, you will play a pivotal role in developing the large scale web data pipeline that underpins Cohere's advanced language models. In this role, you will work extensively with large-scale web corpora, transforming raw, noisy internet data into high-quality training data for pretraining. You will own key components of the data pipeline, including extraction, parsing, deduplication, and filtering. You will also analyze the composition and quality of web data, study its impact on downstream model performance, and collaborate closely with the broader data and evaluation teams to iterate on the training corpus. Your work will be essential to Cohere's mission of delivering efficient and reliable language understanding and generation capabilities, driving innovation in natural language processing. If you are passionate about transforming data into the foundation of AI systems, this role offers a unique opportunity to make a meaningful impact. Please Note: We have offices in London, Paris, Toronto, San Francisco and New York but also embrace being remote-friendly! There are no restrictions on where you can be located for this role. (EST/EU) As a Senior Member of Technical Staff, Web Data, you will:
Bonus: paper at top-tier venues (such as NeurIPS, ICML, ICLR, AIStats, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP). If some of the above doesn't line up perfectly with your experience, we still encourage you to apply! We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs. If some of the above doesn't line up perfectly with your experience, we still encourage you to apply!
We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.
We may use AI-enabled tools to screen and assess applicants against the criteria for this position. This helps our recruiters identify potentially qualified candidates, but it doesn't limit the applications our recruiters may review or consider. Full-Time Employees at Cohere enjoy these Perks: An open and inclusive culture and work environment Work closely with a team on the cutting edge of AI research Weekly lunch stipend, in-office lunches & snacks Full health and dental benefits, including a separate budget to take care of your mental health 100% Parental Leave top-up for up to 6 months Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend 6 weeks of vacation (30 working days!)
- Maintain large-scale pipelines for processing web corpora.
- Work on filtering and quality-scoring systems to identify high-value web documents.
- Analyze web data composition across domains, languages and time periods.
- Develop and maintain highly-performant deduplication pipelines.
- Collaborate with cross-functional teams, including researchers and engineers, to ensure data pipelines meet the demands of cutting-edge language models.
- Strong software engineering skills, with proficiency in Python and experience building data pipelines.
- Familiarity with data processing frameworks such as Apache Spark, Apache Beam, Pandas, or similar tools.
- Experience working with large-scale web datasets.
- Knowledge of data quality assessment techniques and experimentation with data mixtures.
- A passion for bridging research and engineering to solve complex data-related challenges in AI model training.
Bonus: paper at top-tier venues (such as NeurIPS, ICML, ICLR, AIStats, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP). If some of the above doesn't line up perfectly with your experience, we still encourage you to apply! We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs. If some of the above doesn't line up perfectly with your experience, we still encourage you to apply!
We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.
We may use AI-enabled tools to screen and assess applicants against the criteria for this position. This helps our recruiters identify potentially qualified candidates, but it doesn't limit the applications our recruiters may review or consider. Full-Time Employees at Cohere enjoy these Perks: An open and inclusive culture and work environment Work closely with a team on the cutting edge of AI research Weekly lunch stipend, in-office lunches & snacks Full health and dental benefits, including a separate budget to take care of your mental health 100% Parental Leave top-up for up to 6 months Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend 6 weeks of vacation (30 working days!)
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Senior Member of Technical Staff, Web Data in New York, NY vacancy
- ...mobile AI experiences. As a key member of our team, you'll push the... ...of iOS development, combining technical excellence with design finesse... ...syncing, on-device ML, efficient data flow). Iterate quickly on... ...(iOS + Android or iOS + Web). Led iOS architecture: modular...DataWebShift work
$170k - $270k
...to some of your work and let's talk anyway. About the Role Members of Technical Staff at Anterior own problems end-to-end — from system design through... ...Designing and building backend services, APIs, and data pipelines that power our AI platform Helping us migrate off...DataSeniorFull timeApprenticeshipFlexible hours- ...help create something truly transformative. The Role As a Member of Technical Staff, you'll be a core technical contributor building high‑impact... ...: agentic workflows, intelligent routing, and real‑time data orchestration. AI Optimization: Craft precise prompting systems...DataSeniorLocal area
$170k - $220k
...Atria Health Institute Senior Product Manager The... ...precision-based care for Atria members and their families. All... ...mobile (iOS/Android), web, onboarding/... ...Partner with engineering and data teams to build AI-... ...user impact, risk, and technical feasibility. Serve...DataSeniorWeb- ...make it accessible to all. About The Role Data plays a crucial role at the frontier of AI... ...architectures, but from better data. As a member of the Data Team, your mission is to build... ...the ingestion systems that turn the open web and other large‑scale data sources into reliable...DataWebRelocation package
- ...lead the delivery of complex technical projects , ~ Experience... ...Associate Director of Engineering, Member Growth, this senior engineer will develop full-... ...and maintain existing web applications and infrastructure... ...with product, design, data, or other teams, contributing...DataSeniorWeb
- Member of Technical Staff - Software Engineer Valthos | Posted Mar 3 Full-time Negotiable Advanced (5... ...high-scale, opinionated APIs that are data/GPU intensive. Prototype and develop interactive... ...of the full application stack (Web UX frameworks, API design and...DataWebFull timeWork at office
$167k - $230k
As a member of the Banking Solutions team, you will be responsible... ...Anchorage’s NeoBank functionality. Technical Skills Participate in task... ...simplify complex financial data into digestible, actionable insights... ...distributed systems and web applications from scratch. Experience...DataWebFull timeBank staff- ...Requirements: Experience building and shipping modern web applications end-to-end. We care more about what you've built than... ...the frontend, Python services on the backend, and ClickHouse for data and analytics. Deep knowledge of observability tools and patterns...DataWebWork at office
$160k - $300k
...applications and rapidly iterating technical approaches at the frontiers of... ...on real biopharma marketing data. Wrangle proprietary... ...Strong proficiency in full-stack web development technologies such... ...calibrated to reflect experience, seniority, and the level of ownership you...DataWebH1bVisa sponsorshipRelocation packageFlexible hoursNight shift- ...: We're looking for a Growth Engineer to own the technical foundation of Modal's marketing and developer-facing web surfaces: the marketing site, docs site, growth landing... ...You'll partner with Product Engineering, Design, Data, and Growth to ship polished, measurable web...DataWebWork at office
- ...is uniquely designed to make it easy for both engineers and non-technical teams to build agents on a single platform. Our customers include... ...with TypeScript, with a deep understanding of end-to-end web fundamentals System design skills: you can architect APIs and...SeniorWeb
$180k
...intersection of AI and finance. RESPONSIBILITIES: Develop backend services, APIs, and data models to support high‑volume, multi‑user environments. Work with iOS, Android & Web client engineers to ship products. Design robust infrastructure and microservices for payments...DataWebTemporary work$159.1k - $213.57k
...Senior Software Engineer I, Member Growth, Care Guide Experience New York (Hybrid)... ...Expand and maintain existing web applications and infrastructure... ...and efficiently Provide technical leadership in architectural... ...with product, design, data, or other teams, contributing...DataSeniorWebWork at officeSleeping nights2 days per week3 days per week$159.1k - $196.48k
...Senior Software Engineer I, Member Growth, Care Guide Experience New York (Hybrid)... ...Expand and maintain existing web applications and infrastructure... ...and efficiently Provide technical leadership in architectural... ...with product, design, data, or other teams, contributing...DataSeniorWebWork at officeRelocationSleeping nights2 days per week3 days per week- ...trustworthy answers grounded in the live web and backed by clear citations. It combines... ...across structured and unstructured data, while Perplexity Computer extends this... ...working cross-functionally and engaging both technical teams and executive stakeholders Nice...DataWeb
- ...Design and build interfaces that make complex data, analysis results, workflows, alerts,... ...and building production-quality web applications. Strong frontend engineering... ...where product direction, customer needs, and technical constraints are still evolving....DataWebWork at office
- ...'s curiosity with fast, trustworthy answers grounded in the live web and backed by clear citations. It combines multiple leading models... ...that powers how Perplexity persists, retrieves, and manages data across all systems, ensuring high availability, performance, and...DataWeb
- ...answers grounded in the live web and backed by clear citations... .... About the Role The Data Platform team owns the end-to... ..., and ClickHouse. In this senior/staff role, you will shape architecture... ..., and drive the long-term technical direction of Perplexity's...DataWeb
$81k - $115.5k
...- 3 days per week Role Overview: As a Senior Business/Technical Analyst within the Tech Data AI Ventures team at New York Life, you will serve as... ...systems with a multi-tier architectures consisting of web and legacy applications. • Experience working in...DataSeniorWebLocal areaShift work3 days per week- ...Pace Technical Staff Role Pace is an AI-native business process outsourcer for insurers. We... ...companies in the world. We're looking for a Member of Technical Staff who will partner with... ..., 2) Full-stack engineer to own RPA/web automation features, 3) Enterprise...Web
$80 per hour
...Title - Senior Salesforce Developer Location - New... ...Development, Integration and technical execution for... ...delivering complex Lightning Web Components (LWC) user... ...understanding of Salesforce data modeling, sharing rules... ...and mentor team members Good to have...DataSeniorWebWork at office- ...We are looking for a Senior Salesforce Developer who... ...architect comprehensive technical solutions, establish and... ...to pairing with team members on functional and nonfunctional... ...platform principles, data model, integration... ...Email Services, SOQL, Apex Web Service Callouts,...DataSeniorWebRemote work
- Splice is searching for a Senior Frontend Engineer to join their Core... ...work culture with global team members. -Collaborative and supportive... ...managers, designers, and data scientists to build user-centric... ...Stay up-to-date on the latest web technologies and best practices...DataSeniorWebRemote job
- ...build, and maintain backend services and web applications Work on real-time web... ...to have: Experience with distributed data or batch processing frameworks (e.g.,... ...who we/you are looking for in a new team member/team Technical assessments: 2-3 (a mix of interview...DataSeniorWebRemote work
$100k - $200k
...sophisticated ML models Advance data architecture for accounting-... ...our tech and product Lead technical decision making on accounting... ...on frontend infrastructure, web server design, abstractions, etc... ...-term alongside founding team members, and committed to mentoring...DataSeniorWebWork at office- ...Technical Intern Opportunity Adaptive ML is a frontier... ...soon. Our Technical Staff develops the... ...systematic way; Build data pipelines to support reinforcement... ...Nearly all members of our Technical Staff... ...close collaboration with senior engineers and researchers...DataInternshipLive inWork at office
- Member of Technical Staff (Full-Stack) - Series A, NYC I’m working with a Series A NYC-based AI company building an AI platform for life sciences... ...(e.g., Veeva, Adobe, Figma) Develop document + data pipelines (PDFs, OCR, embeddings, metadata) What they’re looking...Data
$250k
...We’re searching for Senior Software Engineering tale... ...Engineering on architecture and technical direction Build... ...systems and modern web applications Help shape... ...and mentor early team members Contribute hands-on... ...Python, especially in data or AI-related workflows...DataSeniorWeb$100k - $125k
...learners. About The Role As a Senior Experience Designer (UX/UI) we... ...with clients and team members to find the strongest solution... ...Expertise in motion design and web or mobile development is a bonus... ...abilities and alignment with market data. While we sincerely...DataSeniorWebWork from homeFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Member of Technical Staff, Web Data. Be the first to apply!
Related searches
- IT assistant New York, NY
- desktop support analyst New York, NY
- senior IT support technician New York, NY
- personal computer support technician New York, NY
- technical analyst New York, NY
- customer support technician New York, NY
- tech assistant New York, NY
- technical support assistant New York, NY
- product support analyst New York, NY
- customer support analyst New York, NY


