Reinforcement Learning Environment Design

News

CoSN 2025: Universal Design for Learning Applies to Tech, Too

A school technology leader from Indiana improved accessibility and inclusion for his district by including UDL principles in ...

GitHub, 2d ago

verl: Volcano Engine Reinforcement Learning for LLMs

verl is a flexible, efficient, and production-ready RL training library for large language models (LLMs). It is the open-source implementation of the paper "HybridFlow: A Flexible and Efficient RLHF Framework".

Le Lézard, 6d ago

Z.ai Unveils New GLM Open-Source Models with World-Class Reasoning Performance

Z.ai (formerly Zhipu) announces the open-sourcing of its 32B and 9B GLM model series, including base, reasoning, and rumination models, all under the MIT license. These models are now available for ...

eLife, 6d ago

Neural mechanisms of credit assignment for delayed outcomes during contingent learning

Orbitofrontal cortex and hippocampus reinstate representations of causal choices to associate with delayed outcomes, and the frontal pole supports this credit assignment process by maintaining pending ...

OfficeChai, 6d ago

We Built An AI System That Designed Its Own Reinforcement Learning System: Google DeepMind’s David Silver

There has been much talk about how AI could recursively self-improve in the coming years, but it appears that Google ...

eLife, 6d ago

Neural signatures of model-based and model-free reinforcement learning across prefrontal cortex and striatum

This important study presents single-unit activity collected during model-based (MB) and model-free (MF) reinforcement learning in non-human primates. The dataset was carefully collected, and the ...
