Reinforcement Learning Tutorial Python

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

DR Tulu-8B is the first open Deep Research (DR) model trained for long-form DR tasks. DR Tulu-8B matches OpenAI DR on long-form DR benchmarks. Feburary 9, 2026: 🔥 We released a free interactive demo ...

IEEE

Reinforcement Learning-powered Effectiveness and Efficiency Few-shot Jailbreaking Attack LLMs

Abstract: The widespread use of large language models (LLMs) has brought about security risks, including biases, discrimination, and ethical concerns. Reinforcement Learning from Human Feedback (RLHF) ...

IEEE

Semantically-Driven Deep Reinforcement Learning for Inspection Path Planning

Abstract: This letter introduces a novel semantics-aware inspection planning policy derived through deep reinforcement learning. Reflecting the fact that within autonomous informative path planning ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Reinforcement Learning-powered Effectiveness and Efficiency Few-shot Jailbreaking Attack LLMs

Semantically-Driven Deep Reinforcement Learning for Inspection Path Planning

Trending now