-
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Paper • 2402.14083 • Published • 47 -
Linear Transformers are Versatile In-Context Learners
Paper • 2402.14180 • Published • 7 -
Training-Free Long-Context Scaling of Large Language Models
Paper • 2402.17463 • Published • 23 -
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 627
Yang Lee
innovation64
AI & ML interests
AGI
Organizations
RAG
RAG research
-
Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs
Paper • 2404.15676 • Published -
How faithful are RAG models? Quantifying the tug-of-war between RAG and LLMs' internal prior
Paper • 2404.10198 • Published • 8 -
RAFT: Adapting Language Model to Domain Specific RAG
Paper • 2403.10131 • Published • 72 -
FaaF: Facts as a Function for the evaluation of RAG systems
Paper • 2403.03888 • Published
papaer selecting
-
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Paper • 2402.14083 • Published • 47 -
Linear Transformers are Versatile In-Context Learners
Paper • 2402.14180 • Published • 7 -
Training-Free Long-Context Scaling of Large Language Models
Paper • 2402.17463 • Published • 23 -
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 627
RAG
RAG research
-
Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs
Paper • 2404.15676 • Published -
How faithful are RAG models? Quantifying the tug-of-war between RAG and LLMs' internal prior
Paper • 2404.10198 • Published • 8 -
RAFT: Adapting Language Model to Domain Specific RAG
Paper • 2403.10131 • Published • 72 -
FaaF: Facts as a Function for the evaluation of RAG systems
Paper • 2403.03888 • Published
models
24
innovation64/gemma-2-2B-it-thinking-function_calling-V0
Updated
innovation64/llama3.1-nli
Updated
innovation64/llama3.1-8B-instruct-4bit-ruozhiba-4bit
Text Generation
•
8B
•
Updated
•
1
innovation64/llama3.1-8B-instruct-4bit-ruozhiba-GGUF
8B
•
Updated
•
38
innovation64/llama3.1-8B-instruct-4bit-ruozhiba-lora
Updated
innovation64/llama3.1-8B-instruct-4bit-ruozhiba-16
Text Generation
•
8B
•
Updated
•
1
innovation64/speecht5_finetuned_voxpopuli_sl
Text-to-Speech
•
Updated
•
4
innovation64/whisper-tiny-dv
Automatic Speech Recognition
•
Updated
•
3
innovation64/distilhubert-finetuned-gtzan
Audio Classification
•
Updated
•
1
innovation64/poca-aSoccerTwos
Reinforcement Learning
•
Updated
•
4
datasets
0
None public yet