News
verl is a flexible, efficient, and production-ready RL training library for large language models (LLMs). verl is the open-source version of the paper "HybridFlow: A Flexible and Efficient RLHF Framework".
AI models are numerous and confusing to navigate, and the benchmarks used to measure their performance are just as challenging.
Meta faces challenges in AI as Chinese models like DeepSeek's R1 outperform with cost-effective innovation. Read an analysis ...
Comparing AI reasoning abilities reveals OpenAI's o1 model surpasses DeepSeek's R1 in generating accurate, sentence-level ...
Can they do it? Or not? AI companies claim (and very enthusiastically so) that their models vary between good and amazing, at ...
Meta has launched Llama 4, its latest family of open-weight AI models, including Scout, Maverick, and Behemoth, offering advanced multimodal capabilities. These models excel in text, image, and video processing ...
Grok-3 represents scale without compromise: 200,000 NVIDIA H100s chasing frontier gains, while DeepSeek-R1 delivers similar performance using a fraction of the compute, signalling that innovative ...
DeepSeek claims that R1, released in January, performs as well as OpenAI’s o1 model on key benchmarks. Being a reasoning model, R1 effectively fact-checks itself, which helps it to avoid some of the ...
Benchmarks Find ‘DeepSeek-V3-0324 Is More Vulnerable Than Qwen2.5-Max’. While the latest iteration of Qwen2.5-Max outperforms DeepSeek-V3 on security, the AI model lags ...
The company claims GLM-Z1-Air matches rival DeepSeek's R1 in performance while running up to eight times ... claims its latest large language model GLM4 outperforms OpenAI's GPT-4 on several ...