Research Crawling Engineer
$80k - $175k
MLabs
Location: Remote (must have a 6-hour overlap with EST)
Employment type: Full-time
Compensation: $80K - $175K

We are hiring on behalf of our client, a technical infrastructure firm specializing in the delivery of massive-scale web data to organizations developing advanced artificial intelligence models. The organization supports high-capacity bandwidth-sharing networks and operates a distributed crawler capable of accessing high-quality public web data at a global scale. The team has also engineered pipelines for the ingestion, segmentation, and annotation of billions of multimedia files, facilitating dataset creation for frontier research labs. The organization operates as a lean, technical team that prioritizes speed and direct execution.

As a Research Crawling Engineer, the successful candidate will design and operate large-scale web data acquisition systems. This role encompasses distributed systems, scraping infrastructure, and data pipelines, with a focus on providing high-quality inputs for research and model development.

Key Responsibilities
- Construct and maintain large-scale web crawlers across diverse domains.
- Design high-throughput, fault-tolerant systems for data collection, managing volumes ranging from millions to billions of URLs per day.
- Navigate anti-bot systems, rate limits, and dynamic, JavaScript-heavy websites.
- Develop robust pipelines for data cleaning, deduplication, filtering, and normalization.
- Build and maintain datasets specifically structured for research and machine learning model training.
- Monitor and optimize crawl performance, coverage, and data quality through rapid iteration.
- Collaborate with research teams to ensure data collection efforts align with modeling requirements.
- Optimize infrastructure to ensure cost-efficiency, low latency, and reliability.

Core Qualifications
- Extensive programming experience in one or more of the following: Go, Rust, Python, Java, or C++.
- Proven experience in building web crawlers or large-scale data pipelines.
- Solid understanding of networking protocols and browser behavior.
- Familiarity with distributed systems and parallel processing techniques.
- Experience handling large datasets, ideally at the terabyte to petabyte scale.
- Demonstrated ability to debug and maintain systems within unstable or adversarial environments.

Preferred Qualifications
- Experience with NLP pipelines or dataset curation for machine learning.
- Familiarity with LLM pre-training data or retrieval systems.
- Practical experience with headless browsers (e.g., Playwright, Puppeteer, or Chrome DevTools Protocol).
- Knowledge of proxy systems, IP rotation, and large-scale request orchestration.
- Background in data quality evaluation or benchmarking.
- Experience running workloads on cloud or bare-metal infrastructure.

Benefits
- Impactful Opportunity: Contribute to the development of a web-scale crawler and knowledge graph at the forefront of AI data accessibility.
- High-Performance Culture: Join a lean, low-ego team that prioritizes high output and professional growth.
- Remote Work: This position is part of a fully remote team, offering flexibility and autonomy.
- Competitive Compensation: A package including a competitive salary, comprehensive benefits, and equity, commensurate with experience and the ability to operate at scale.

Interview Process
- Recruiter Coordination Call
- Hiring Manager Interview
- Founder / CEO Interview
- Secondary Executive Interview
- Final Interview

Due to the high volume of applications we anticipate, we regret that we are unable to provide individual feedback to all candidates. If you do not hear back from us within 4 weeks of your application, please assume that you have not been successful on this occasion. We genuinely appreciate your interest and wish you the best in your job search.

Commitment to Equality and Accessibility
At MLabs, we are committed to offering equal opportunities to all candidates. We ensure no discrimination, publish accessible job adverts, and provide information in accessible formats. Our goal is to foster a diverse, inclusive workplace with equal opportunities for all. If you need any reasonable adjustments during any part of the hiring process, or you would like to see the job advert in an accessible format, please let us know at the earliest opportunity by email (the email address is available via click.appcast.io).

MLabs Ltd collects and processes the personal information you provide, such as your contact details, work history, resume, and other relevant data, for recruitment purposes only. This information is managed securely in accordance with MLabs Ltd’s Privacy Policy and Information Security Policy, and in compliance with applicable data protection laws. Your data may be shared only with clients and trusted partners where necessary for recruitment purposes. You may request the deletion of your data or withdraw your consent at any time by email (the email address is available via click.appcast.io).

