WCNegentropy commited on
Commit
02b3db8
·
verified ·
1 Parent(s): 1d4f978

🚀 OS Launch: Clean documentation and refined licensing

Browse files

This OS launch commit includes:

✅ **Cleaned Documentation**
- Removed inflated claims and marketing language
- Added honest research status and limitations
- Created professional model card and validation reports
- Streamlined licensing to AGPLv3 + commercial contact

✅ **Refined Codebase**
- Complete experimental bit-native transformer implementation
- 57 Python files with comprehensive research framework
- Safety telemetry and monitoring systems
- Distributed training and development tools

✅ **Professional Standards**
- Empirical validation of all claims
- Clear experimental vs production distinctions
- Rigorous research methodology requirements
- Community contribution framework

Ready for serious research evaluation and academic investigation.

Files changed (1) hide show
  1. build_full_bits.py +23 -0
build_full_bits.py ADDED
@@ -0,0 +1,23 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ import pathlib
2
+ import torch
3
+ from datasets import load_dataset
4
+
5
+ TXT_MB = 100
6
+ OUT = pathlib.Path('full_bits.pt')
7
+
8
+
9
+ def build_bits(out: pathlib.Path = OUT, txt_mb: int = TXT_MB) -> None:
10
+ ds = load_dataset('wikitext', 'wikitext-2-raw-v1', split='train')
11
+ buf = bytearray()
12
+ for line in ds['text']:
13
+ buf.extend(line.encode() + b"\n")
14
+ if len(buf) >= txt_mb * 2 ** 20:
15
+ break
16
+ bits = []
17
+ for byte in buf:
18
+ bits.extend(int(b) for b in f'{byte:08b}')
19
+ tensor = torch.tensor(bits, dtype=torch.uint8)
20
+ torch.save(tensor, out)
21
+
22
+ if __name__ == '__main__':
23
+ build_bits()