Clue-instruct Clue-instruct dataset and different models fine-tuned on it. Clue-Instruct: Text-Based Clue Generation for Educational Crossword Puzzles Paper • 2404.06186 • Published Apr 9, 2024 • 3 azugarini/clue-instruct Viewer • Updated Jul 11, 2024 • 44.1k • 12 azugarini/crossword-clues-QA Viewer • Updated Mar 21, 2025 • 1.17k • 24 azugarini/clue-instruct-llama-13b Text Generation • 13B • Updated Jul 11, 2024 • 1
Clue-Instruct: Text-Based Clue Generation for Educational Crossword Puzzles Paper • 2404.06186 • Published Apr 9, 2024 • 3
Tokenizer Adaptation Collection of research on tokenizers' adaptation to specific domains and/or languages. Special focus on sequence compression directions Fast Vocabulary Transfer for Language Model Compression Paper • 2402.09977 • Published Feb 15, 2024 • 2 Multi-Word Tokenization for Sequence Compression Paper • 2402.09949 • Published Feb 15, 2024 Zero-Shot Tokenizer Transfer Paper • 2405.07883 • Published May 13, 2024 • 5 Language Model Tokenizers Introduce Unfairness Between Languages Paper • 2305.15425 • Published May 17, 2023 • 1
Fast Vocabulary Transfer for Language Model Compression Paper • 2402.09977 • Published Feb 15, 2024 • 2
Language Model Tokenizers Introduce Unfairness Between Languages Paper • 2305.15425 • Published May 17, 2023 • 1
Clue-instruct Clue-instruct dataset and different models fine-tuned on it. Clue-Instruct: Text-Based Clue Generation for Educational Crossword Puzzles Paper • 2404.06186 • Published Apr 9, 2024 • 3 azugarini/clue-instruct Viewer • Updated Jul 11, 2024 • 44.1k • 12 azugarini/crossword-clues-QA Viewer • Updated Mar 21, 2025 • 1.17k • 24 azugarini/clue-instruct-llama-13b Text Generation • 13B • Updated Jul 11, 2024 • 1
Clue-Instruct: Text-Based Clue Generation for Educational Crossword Puzzles Paper • 2404.06186 • Published Apr 9, 2024 • 3
Tokenizer Adaptation Collection of research on tokenizers' adaptation to specific domains and/or languages. Special focus on sequence compression directions Fast Vocabulary Transfer for Language Model Compression Paper • 2402.09977 • Published Feb 15, 2024 • 2 Multi-Word Tokenization for Sequence Compression Paper • 2402.09949 • Published Feb 15, 2024 Zero-Shot Tokenizer Transfer Paper • 2405.07883 • Published May 13, 2024 • 5 Language Model Tokenizers Introduce Unfairness Between Languages Paper • 2305.15425 • Published May 17, 2023 • 1
Fast Vocabulary Transfer for Language Model Compression Paper • 2402.09977 • Published Feb 15, 2024 • 2
Language Model Tokenizers Introduce Unfairness Between Languages Paper • 2305.15425 • Published May 17, 2023 • 1