Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Asilarknes
/
70m-adamwgrok
like
1
Text Generation
PyTorch
Russian
russian
language-model
from-scratch
grokadamw
causal-lm
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
70m-adamwgrok
4.08 GB
Ctrl+K
Ctrl+K
1 contributor
History:
35 commits
Asilarknes
Upload README.md with huggingface_hub
acc2a1c
verified
6 days ago
.gitattributes
1.57 kB
Upload distill.jsonl with huggingface_hub
6 days ago
README.md
29.1 kB
Upload README.md with huggingface_hub
6 days ago
RU_LM_REPORT.md
Safe
7.52 kB
Add 70M Russian LM (GrokAdamW): checkpoint, tokenizer, code, report
6 days ago
chat_gen.py
3.1 kB
Upload chat_gen.py with huggingface_hub
6 days ago
chat_rag_ckpt.pt
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
284 MB
xet
Upload chat_rag_ckpt.pt with huggingface_hub
6 days ago
distill.jsonl
26.3 MB
xet
Upload distill.jsonl with huggingface_hub
6 days ago
distill_data.py
4.34 kB
Upload distill_data.py with huggingface_hub
6 days ago
embed_kb.py
1.27 kB
Upload embed_kb.py with huggingface_hub
6 days ago
eval_factual.py
7.72 kB
Upload eval_factual.py with huggingface_hub
6 days ago
eval_verify.py
11.2 kB
Upload eval_verify.py with huggingface_hub
6 days ago
gen.py
Safe
1.9 kB
Add 70M Russian LM (GrokAdamW): checkpoint, tokenizer, code, report
6 days ago
kb.tar.gz
Safe
218 MB
xet
Upload kb.tar.gz with huggingface_hub
6 days ago
kb2.tar.gz
2.41 GB
xet
Upload kb2.tar.gz with huggingface_hub
6 days ago
kb_build.py
Safe
1.81 kB
Upload kb_build.py with huggingface_hub
6 days ago
kb_build2.py
2.13 kB
Upload kb_build2.py with huggingface_hub
6 days ago
kb_reindex.py
Safe
740 Bytes
Upload kb_reindex.py with huggingface_hub
6 days ago
lm_ckpt.pt
Safe
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.FloatStorage"
What is a pickle import?
284 MB
xet
Add 70M Russian LM (GrokAdamW): checkpoint, tokenizer, code, report
6 days ago
lm_model.py
Safe
5.09 kB
Add 70M Russian LM (GrokAdamW): checkpoint, tokenizer, code, report
6 days ago
lm_state.py
3.04 kB
Upload lm_state.py with huggingface_hub
6 days ago
rag2_gen.py
4.78 kB
Upload rag2_gen.py with huggingface_hub
6 days ago
rag_ckpt.pt
Safe
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.FloatStorage"
What is a pickle import?
284 MB
xet
Upload rag_ckpt.pt with huggingface_hub
6 days ago
rag_gen.py
Safe
4.18 kB
Upload rag_gen.py with huggingface_hub
6 days ago
rag_state.pt
Safe
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
286 MB
xet
Upload rag_state.pt with huggingface_hub
6 days ago
rag_state_gen.py
4.8 kB
Upload rag_state_gen.py with huggingface_hub
6 days ago
rag_verify_gen.py
5.27 kB
Upload rag_verify_gen.py with huggingface_hub
6 days ago
ru_lm_curves.png
Safe
70.5 kB
Add 70M Russian LM (GrokAdamW): checkpoint, tokenizer, code, report
6 days ago
ru_tok.json
Safe
2.4 MB
Add 70M Russian LM (GrokAdamW): checkpoint, tokenizer, code, report
6 days ago
sft_ckpt.pt
Safe
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
284 MB
xet
Upload sft_ckpt.pt with huggingface_hub
6 days ago
sft_data.py
Safe
2.08 kB
Upload sft_data.py with huggingface_hub
6 days ago
sft_gen.py
Safe
2.07 kB
Upload sft_gen.py with huggingface_hub
6 days ago
sft_rag_data.py
Safe
2.81 kB
Upload sft_rag_data.py with huggingface_hub
6 days ago
sft_train.py
Safe
6.11 kB
Upload sft_train.py with huggingface_hub
6 days ago
train_state.py
6.97 kB
Upload train_state.py with huggingface_hub
6 days ago