rtferraz/domainTokenizer

domainTokenizer / docs
  • 1 contributor
History: 8 commits
rtferraz
Add e-commerce pre-training report - successful demo, behavioral clusters found, future improvements noted
2b3e3af verified 1 day ago
  • adr
    Add ADR-002: Dataset selection for Phase 3 demos - research findings, rationale, phased plan 7 days ago
  • reports
    Add e-commerce pre-training report - successful demo, behavioral clusters found, future improvements noted 1 day ago
  • nubank_nuformer_analysis.md
    29.9 kB
    Add Nubank nuFormer reverse-engineering analysis - full pipeline reconstruction 8 days ago
  • phase2_implementation_report.md
    19.2 kB
    Update implementation report: add Phase 2D, update header to v0.4.0 / 139 tests, update cumulative summary and API 7 days ago
  • research_report.md
    52.8 kB
    Add comprehensive research report on domain-specific tokenization 8 days ago