-
OPTML-Group/SimNPO-TOFU-forget05-Llama-2-7b-chat
Text Generation • 7B • Updated • 26 -
OPTML-Group/SimNPO-TOFU-forget10-Llama-2-7b-chat
Text Generation • 7B • Updated • 17 -
OPTML-Group/SimNPO-MUSE-Books-iclm-7b
Text Generation • 7B • Updated • 48 -
OPTML-Group/SimNPO-MUSE-News-Llama-2-7b
Text Generation • 7B • Updated • 12
Collections
Discover the best community collections!
Collections trending this week
-
mixedbread-ai/mxbai-embed-large-v1
Feature Extraction • 0.3B • Updated • 1.74M • • 773 -
nomic-ai/nomic-embed-text-v1.5-GGUF
Sentence Similarity • 0.1B • Updated • 186k • 93 -
bartowski/reader-lm-1.5b-GGUF
Text Generation • 2B • Updated • 413 • 15 -
bartowski/reader-lm-0.5b-GGUF
Text Generation • 0.5B • Updated • 192 • 2
-
RefalMachine/RuadaptQwen2.5-32B-Pro-Beta
Text Generation • 33B • Updated • 1.56k • 12 -
RefalMachine/RuadaptQwen2.5-7B-Lite-Beta
Text Generation • 8B • Updated • 82 • 10 -
RefalMachine/RuadaptQwen2.5-14B-Instruct
Text Generation • 15B • Updated • 57 • 5 -
msu-rcc-lair/RuadaptQwen2.5-32B-Instruct
Text Generation • 33B • Updated • 80 • 48
-
Dissociating language and thought in large language models: a cognitive perspective
Paper • 2301.06627 • Published • 1 -
A Latent Space Theory for Emergent Abilities in Large Language Models
Paper • 2304.09960 • Published • 3 -
Are Emergent Abilities of Large Language Models a Mirage?
Paper • 2304.15004 • Published • 8 -
Do LLMs Really Adapt to Domains? An Ontology Learning Perspective
Paper • 2407.19998 • Published • 1
-
OPTML-Group/SimNPO-TOFU-forget05-Llama-2-7b-chat
Text Generation • 7B • Updated • 26 -
OPTML-Group/SimNPO-TOFU-forget10-Llama-2-7b-chat
Text Generation • 7B • Updated • 17 -
OPTML-Group/SimNPO-MUSE-Books-iclm-7b
Text Generation • 7B • Updated • 48 -
OPTML-Group/SimNPO-MUSE-News-Llama-2-7b
Text Generation • 7B • Updated • 12
-
mixedbread-ai/mxbai-embed-large-v1
Feature Extraction • 0.3B • Updated • 1.74M • • 773 -
nomic-ai/nomic-embed-text-v1.5-GGUF
Sentence Similarity • 0.1B • Updated • 186k • 93 -
bartowski/reader-lm-1.5b-GGUF
Text Generation • 2B • Updated • 413 • 15 -
bartowski/reader-lm-0.5b-GGUF
Text Generation • 0.5B • Updated • 192 • 2
-
RefalMachine/RuadaptQwen2.5-32B-Pro-Beta
Text Generation • 33B • Updated • 1.56k • 12 -
RefalMachine/RuadaptQwen2.5-7B-Lite-Beta
Text Generation • 8B • Updated • 82 • 10 -
RefalMachine/RuadaptQwen2.5-14B-Instruct
Text Generation • 15B • Updated • 57 • 5 -
msu-rcc-lair/RuadaptQwen2.5-32B-Instruct
Text Generation • 33B • Updated • 80 • 48
-
Dissociating language and thought in large language models: a cognitive perspective
Paper • 2301.06627 • Published • 1 -
A Latent Space Theory for Emergent Abilities in Large Language Models
Paper • 2304.09960 • Published • 3 -
Are Emergent Abilities of Large Language Models a Mirage?
Paper • 2304.15004 • Published • 8 -
Do LLMs Really Adapt to Domains? An Ontology Learning Perspective
Paper • 2407.19998 • Published • 1