[Paper] The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

rufus@discuss.tchncs.de · edit-2 8 months ago

[Paper] The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

ffhein@lemmy.world · 8 months ago

Ah, I thought you meant why the researchers themselves hadn’t produced any larger models. AFAIK neither MS or OAI has released even a 7b model, they might have larger BitNet models which they only use internally.

[Paper] The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

[Paper] The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper page - The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits