vikarti-anatra 's Collections Interesting ones
updated
LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B
Paper
• 2310.20624
• Published
• 13
Unleashing the Power of Pre-trained Language Models for Offline
Reinforcement Learning
Paper
• 2310.20587
• Published
• 18
BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B
Paper
• 2311.00117
• Published
VideoFusion: Decomposed Diffusion Models for High-Quality Video
Generation
Paper
• 2303.08320
• Published
• 3
Vikhrmodels/Vikhr-7B-instruct_0.4
Text Generation
• 8B • Updated
• 143
• 35
IlyaGusev/saiga_llama3_8b
Text Generation
• 8B • Updated
• 399k
• • 137
QuixiAI/wizard_vicuna_70k_unfiltered
Viewer
• Updated
• 34.6k • 301
• 174
failspy/llama-3-70B-Instruct-abliterated
Text Generation
• Updated
• 9.03k
• • 127
Zoyd/Sao10K_L3-8B-Stheno-v3.1-8_0bpw_exl2
Text Generation
• Updated
• 1
• 3
Zoyd/Sao10K_L3-8B-Stheno-v3.1-6_5bpw_exl2
Text Generation
• Updated
• 2
• 1
sophosympatheia/Aurora-Nights-70B-v1.0
Text Generation
• 69B • Updated
• 876
• 22
PygmalionAI/mythalion-13b
Text Generation
• 13B • Updated
• 1.13k
• • 161
Nitral-AI/Poppy_Porpoise-1.0-L3-8B
Text Generation
• 8B • Updated
• 27
• 26
NeverSleep/Noromaid-v0.4-Mixtral-Instruct-8x7b-Zloss
Text Generation
• 47B • Updated
• 24
• 38
microsoft/Phi-3-medium-128k-instruct
Text Generation
• Updated
• 139k
• 387
Text Generation
• 8B • Updated
• 2
• 3
Lewdiculous/Poppy_Porpoise-1.0-L3-8B-GGUF-IQ-Imatrix
8B • Updated
• 285
• 15
ACECODER: Acing Coder RL via Automated Test-Case Synthesis
Paper
• 2502.01718
• Published
• 28
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training
Tokens
Paper
• 2504.07096
• Published
• 77