LLM Collection by yzqtdu Nov 15, 2023 - BitNet: Scaling 1-bit Transformers for Large Language Models Paper • 2310.11453 • Published Oct 17, 2023 • 106
BitNet: Scaling 1-bit Transformers for Large Language Models Paper • 2310.11453 • Published Oct 17, 2023 • 106
RL Collection by matthh Mar 19, 2024 - Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 64 PERL: Parameter Efficient Reinforcement Learning from Human Feedback Paper • 2403.10704 • Published Mar 15, 2024 • 60
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 64
PERL: Parameter Efficient Reinforcement Learning from Human Feedback Paper • 2403.10704 • Published Mar 15, 2024 • 60
LLMs Collection by CoreMega Nov 15, 2023 - meta-llama/Llama-2-7b-hf Text Generation • 7B • Updated Apr 17, 2024 • 904k • 2.28k
model Collection by Steven1991 Nov 15, 2023 - meta-llama/Llama-2-7b-chat-hf Text Generation • Updated Apr 17, 2024 • 400k • 4.72k
LLM Collection by qidianlinjin Nov 15, 2023 - Running Featured 229 Distil Whisper Web 👀 229 Transcribe audio files to text instantly Runtime error Featured 1.2k Explore Llamav2 With TGI 💻 1.2k
Llama-2-13b-hf Collection by mostafaamiri Nov 15, 2023 - meta-llama/Llama-2-13b-hf Text Generation • Updated Apr 17, 2024 • 33.1k • 621
LLM Collection by yzqtdu Nov 15, 2023 - BitNet: Scaling 1-bit Transformers for Large Language Models Paper • 2310.11453 • Published Oct 17, 2023 • 106
BitNet: Scaling 1-bit Transformers for Large Language Models Paper • 2310.11453 • Published Oct 17, 2023 • 106
LLM Collection by qidianlinjin Nov 15, 2023 - Running Featured 229 Distil Whisper Web 👀 229 Transcribe audio files to text instantly Runtime error Featured 1.2k Explore Llamav2 With TGI 💻 1.2k
RL Collection by matthh Mar 19, 2024 - Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 64 PERL: Parameter Efficient Reinforcement Learning from Human Feedback Paper • 2403.10704 • Published Mar 15, 2024 • 60
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 64
PERL: Parameter Efficient Reinforcement Learning from Human Feedback Paper • 2403.10704 • Published Mar 15, 2024 • 60
LLMs Collection by CoreMega Nov 15, 2023 - meta-llama/Llama-2-7b-hf Text Generation • 7B • Updated Apr 17, 2024 • 904k • 2.28k
model Collection by Steven1991 Nov 15, 2023 - meta-llama/Llama-2-7b-chat-hf Text Generation • Updated Apr 17, 2024 • 400k • 4.72k
Llama-2-13b-hf Collection by mostafaamiri Nov 15, 2023 - meta-llama/Llama-2-13b-hf Text Generation • Updated Apr 17, 2024 • 33.1k • 621