News

The folks over at buzzy Chinese AI lab, DeepSeek, are working on a new series of AI models called DeepSeek-GRM that employ a ...
Chinese startup DeepSeek, led by Liang Wenfeng, is developing generative reward modeling (GRM) to enhance AI efficiency and ...
Chinese AI startup DeepSeek is collaborating with Tsinghua University to reduce the training required for its AI models, ...
DeepSeek is working with Tsinghua University on reducing the training its AI models need in an effort to lower operational ...
DeepSeek AI, in collaboration with Tsinghua University, unveiled a new research study to improve reward modelling in large ...
to make AI models smarter and more efficient in a self-improving way. The Chinese startup is calling these new models DeepSeek-GRM and plans to release them on an open source basis, just like its ...
DeepSeek is calling these new models DeepSeek-GRM — short for ... are also pushing into a new frontier of improving reasoning and self-refining capabilities while an AI model is performing ...
Reward models holding back AI? DeepSeek's SPCT creates self-guiding critiques, promising more scalable intelligence for enterprise LLMs.
In collaboration with Tsinghua University, DeepSeek developed a technique combining reasoning methods to guide AI models ...