Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

jshuadvd
/
LongRoPE

Text Generation
large-language-models
context-extension
transformer-models
fine-tuning
long-contexts
natural-language-processing
context-window
context-length
nlp
llm
llm-context-window
llm-context-length
Model card Files Files and versions
xet

You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Gated model
You can list files but not access them

Preview of files found in this repository
  • .github
    correct pipline errors over 1 year ago
  • images
    update README.md almost 2 years ago
  • notebooks
    Update the training notebook with the latest training updates over 1 year ago
  • src
    Update comments over 1 year ago
  • tests
    Fix the short_context_recovery function call over 1 year ago
  • .gitignore
    3.13 kB
    Remove notebooks from .gitignore over 1 year ago
  • .pre-commit-config.yaml
    376 Bytes
    Initial LongRoPE model implementation almost 2 years ago
  • Dockerfile
    659 Bytes
    Initial LongRoPE model implementation almost 2 years ago
  • README.md
    11.3 kB
    Update README.md to be more detailed over 1 year ago
  • evaluation.py
    2.53 kB
    Forgot wandb import over 1 year ago
  • poetry.lock
    300 kB
    Add compute_complexity method over 1 year ago
  • pyproject.toml
    604 Bytes
    Add compute_complexity method over 1 year ago
  • train.py
    18.1 kB
    Update the training notebook with the latest training updates over 1 year ago