roshniramesh
's Collections
fp8 llm
updated
nvidia/Llama-3.1-8B-Instruct-FP8
Text Generation
•
8B
•
Updated
•
46.5k
•
•
32
amd/Llama-3.1-8B-Instruct-FP8-KV
8B
•
Updated
•
24.9k
•
6
amd/Mixtral-8x7B-Instruct-v0.1-FP8-KV
3B
•
Updated
•
2.99k
•
3
amd/Meta-Llama-3-8B_fp8_quark
Text Generation
•
8B
•
Updated
•
5
ibm-ai-platform/Bamba-9B-2T-fp8
Text Generation
•
10B
•
Updated
•
1
•
2
ibm-ai-platform/Bamba-9B-fp8
Text Generation
•
10B
•
Updated
•
37
•
2
ibm-ai-platform/Bamba-9B-1.8T-fp8
Text Generation
•
10B
•
Updated
•
1
•
2
RedHatAI/Meta-Llama-3-8B-Instruct-FP8
Text Generation
•
8B
•
Updated
•
2.12k
•
•
24
RedHatAI/Meta-Llama-3-8B-Instruct-FP8-KV
Text Generation
•
8B
•
Updated
•
9.89k
•
•
8
RedHatAI/Qwen2-7B-Instruct-FP8
Text Generation
•
8B
•
Updated
•
4.56k
•
•
2
RedHatAI/Qwen2-1.5B-Instruct-FP8
Text Generation
•
2B
•
Updated
•
17.1k
RedHatAI/Mistral-7B-Instruct-v0.3-FP8
Text Generation
•
7B
•
Updated
•
1.79k
•
3
RedHatAI/Llama-2-7b-chat-hf-FP8
Text Generation
•
7B
•
Updated
•
191
RedHatAI/gemma-2-9b-it-FP8
Text Generation
•
9B
•
Updated
•
269
•
5
RedHatAI/DeepSeek-Coder-V2-Lite-Instruct-FP8
Text Generation
•
16B
•
Updated
•
57.2k
•
9
FriendliAI/Llama-2-13b-chat-hf-fp8
Text Generation
•
Updated
•
2
•
8
FriendliAI/Meta-Llama-3-8B-Instruct-fp8
Text Generation
•
8B
•
Updated
•
21
•
2
FriendliAI/Meta-Llama-3-8B-fp8
Text Generation
•
Updated
•
7
•
3
FriendliAI/Meta-Llama-3.1-8B-Instruct-fp8
Text Generation
•
8B
•
Updated
•
2.15k
amd/Llama-3.2-3B-Instruct-FP8-KV
3B
•
Updated
•
24
amd/Llama-3.2-1B-Instruct-FP8-KV
1B
•
Updated
•
754
3B
•
Updated
•
1
1B
•
Updated
•
1
amd/Meta-Llama-3.1-8B-Instruct-fp8-quark-vllm