Research Engineer (Agentic Behavior - Kotlin AI Value Stream)
JetBrains
At JetBrains, code is our passion. Ever since we started, back in 2000, we've been striving to make the strongest, most effective developer tools on earth. Today, AI-powered coding agents are becoming a core part of how developers write Kotlin – and we want to make sure they write it well.
The Kotlin AI Value Stream team is responsible for how AI agents understand, generate, and improve Kotlin code across all platforms: Android, Kotlin Multiplatform, server-side, web, desktop, and others. We build the evaluation infrastructure, error analysis tools, and post-training pipelines that measure and improve agent behavior on real Kotlin developer tasks.
As a Research Engineer on this team, you'll own the end-to-end loop: Analyze how agents fail on Kotlin → build evals that capture those failures → research and implement methods to fix them → measure the improvement. Your work will directly shape how millions of developers experience Kotlin through AI coding agents.
As part of our team, you will:
Build tools for agentic error analysis
- Design and implement tooling to systematically capture, classify, and analyze errors that AI coding agents make when generating Kotlin code.
- Build observability pipelines over agentic traces – mining patterns from agent sessions in JetBrains IDEs, Junie, Claude Code, Cursor, and other coding agents.
Build evaluation pipelines
- Design, implement, and maintain evaluation pipelines that measure Kotlin code generation quality across dimensions, including correctness, idiomaticity, build success, framework usage, and test coverage.
- Build simulation environments where coding agents can be measured on realistic Kotlin developer tasks – from greenfield KMP projects and Gradle dependency management to migrating Spring applications from Java to Kotlin.
- Own evaluation infrastructure: metrics, experiment tracking, automated regression checks, and reproducible benchmarking.
Research methods for improving agent and model behavior on Kotlin
- Experiment with post-training techniques (SFT, DPO, GRPO) to improve how models handle Kotlin-specific patterns, idioms, and frameworks.
- Investigate context engineering approaches: CLAUDE.md/AGENTS.md files, compiler-as-verifier feedback loops, Kotlin LSP integration, and MCP-based tooling.
- Run experiments to measure impact: A/B comparisons, benchmark suites, and before/after analyses on real codebases.
- Collaborate with model providers (Anthropic, OpenAI, and Google) to translate Kotlin-specific findings into model improvements.
Build public Kotlin benchmarks
- Design and build open-source benchmarks that measure AI coding agent performance on Kotlin tasks and eventually become the standard reference for the ecosystem.
- Create task datasets covering the breadth of Kotlin usage: the server side (Spring, Ktor), multiplatform projects (KMP), build systems (Gradle), Android, library development, and others.
- Include both mined real-world tasks and carefully designed synthetic tasks that test specific Kotlin capabilities.
- Maintain and evolve benchmarks as models improve, ensuring they remain challenging, relevant, and contamination-resistant.
We'll be happy to have you on board if you have:
- Hands-on experience building evaluation or analysis pipelines for LLMs or AI coding agents in a research or production setting.
- Strong Python engineering skills (at least three years), with the ability to write clean, maintainable code in data-heavy and ML-adjacent codebases.
- Experience with data analysis at scale: querying large datasets (SQL/Athena), building data pipelines, and performing statistical analysis of experimental results.
- The ability to own projects end to end – from identifying a problem in agent traces to designing an eval, running experiments, and shipping a fix.
- A product-aware mindset: You care about how agents are actually used by developers and can translate real failure modes into evaluation and training work.
- Familiarity with Kotlin or a strong willingness to develop deep Kotlin expertise (you'll be living in Kotlin codebases daily).
Our ideal candidate would also have experience with:
- Post-training LLMs: SFT, RLHF, DPO, GRPO – either hands-on training or designing the data and reward pipelines that feed into training.
- Modern deep learning frameworks (PyTorch) and LLM training stacks (TRL, verl, Megatron, or similar).
- AI agent development: tool-using agents, multi-step coding workflows, agentic frameworks.
- Evaluation frameworks and tools: Inspect AI, Promptfoo, LM-evaluation-harness, or custom eval pipelines.
- Experiment tracking and observability: Weights & Biases, MLflow, Langfuse, or similar.
- The Kotlin ecosystem: Android, Gradle, KMP, Spring, Ktor – with an understanding of the developer workflows that agents need to support.
- Contributing to or maintaining open-source projects, especially benchmarks or evaluation tools.
Don't check every box? That's okay – if you're excited about this work and bring strong fundamentals, we'd love to hear from you. We're happy to talk and provide the training you need to grow into the role.
Why join JetBrains?
- Strong base salary. We offer competitive pay that reflects your skills and experience.
- Flexible work location. Enjoy the freedom to work from home or from the office.
- Remote work. Spend up to 30 days per year working remotely from abroad.
- Extra time off. More days to relax, recharge, and do the things you love.
- Medical insurance allowance. Enjoy peace of mind for you and your family.
- Learning and development opportunities. Access to conferences, courses, and language classes.
- Relocation support. We help make your move as smooth and stress-free as possible.
- Language classes. Pick up the local language or sharpen your English skills.
- Fuel your day. Enjoy a hot meal or receive a lunch allowance on workdays.
- Mental health support. To help you feel your best, we provide easy access to professional mental health services.
- Sports benefit. Enjoy an on-site gym or sports club stipend.
- Internal events. Join company-wide celebrations and team gatherings.
*Some benefits may vary depending on location.
We are an equal opportunity employer
We know great ideas can come from anyone, anywhere. That's why we do our best to create an open and inclusive workplace – one that welcomes everyone regardless of their background, identity, religion, age, accessibility needs, or orientation.
We process the data provided in your job application in accordance with the Recruitment Privacy Policy.


