LLM Model Size Diagram

News

Tech Xplore on MSN2d

Over-training large language models may make them harder to fine-tune

A small team of AI researchers from Carnegie Mellon University, Stanford University, Harvard University and Princeton ...

Computer Weekly3d

The role of small language models in enterprise AI

Small language models do not require vast amounts of expensive computational resources and can be trained on business data ...

Nvidia’s new Llama-3.1 Nemotron Ultra outperforms DeepSeek R1 at half the size

Compared to DeepSeek R1, Llama-3.1-Nemotron-Ultra-253B shows competitive results despite having less than half the parameters.

Tech Xplore9d

Small model approach could be more effective than LLMs

Small language models are more reliable and secure than their large counterparts, primarily because they draw information ...

the-decoder16d

OpenAI plans to release open-weight reasoning LLM without usage restrictions

OpenAI announces the release of a new open-weight language model. It will have reasoning capabilities and will be available without usage restrictions. OpenAI plans to release a language model as open ...

MIT Technology Review21d

Anthropic can now track the bizarre inner workings of a large language model

Anthropic used circuit tracing to watch its LLM Claude 3.5 Haiku carry out various tasks. The second (titled “On the Biology of a Large Language Model”) details what the team discovered when ...

IEEE22d

LLM-Based Automation of COSMIC Functional Size Measurement from Use Cases

Abstract: COmmon Software Measurement International Consortium (COSMIC) Functional Size Measurement is a method widely used in the software industry to quantify user functionality and measure software ...

Wired22d

Databricks Has a Trick That Lets AI Models Improve Themselves

The company previously revealed to WIRED how it developed DBX, a cutting-edge open source large language model (LLM) from scratch. Without well-labeled, carefully curated data, it is challenging ...

cryptopolitan22d

Chinese AI startups change business models after DeepSeek’s boom

He explained that WanZhi Enterprise’s LLM model offered integrated software-hardware solutions at lower prices than its competitors. Kai-Fu noted that the enthusiasm around DeepSeek in China mirrored ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results