AccRL

community

AI & ML interests

None defined yet.

Recent Activity

CharyZeng authored a paper 11 days ago

HierSVA: A Data Synthesis Pipeline, Dataset, and Benchmark for LLM-Driven Hierarchical Hardware Formal Verification

CharyZeng authored a paper 26 days ago

Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regression

CharyZeng authored a paper 26 days ago

Parallax: Parameterized Local Linear Attention for Language Modeling

View all activity

authored a paper 11 days ago

HierSVA: A Data Synthesis Pipeline, Dataset, and Benchmark for LLM-Driven Hierarchical Hardware Formal Verification

Paper • 2606.13706 • Published 18 days ago

authored 2 papers 26 days ago

Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regression

Paper • 2510.01450 • Published Oct 1, 2025 • 2

Parallax: Parameterized Local Linear Attention for Language Modeling

Paper • 2605.29157 • Published May 27 • 11

authored 2 papers 2 months ago

CATS: Contextually-Aware Thresholding for Sparsity in Large Language Models

Paper • 2404.08763 • Published Apr 12, 2024 • 2

AccelOpt: A Self-Improving LLM Agentic System for AI Accelerator Kernel Optimization

Paper • 2511.15915 • Published Apr 15 • 4

submitted a paper to Daily Papers 2 months ago

AccelOpt: A Self-Improving LLM Agentic System for AI Accelerator Kernel Optimization

Paper • 2511.15915 • Published Apr 15 • 4

authored a paper 4 months ago

ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning

Paper • 2603.10160 • Published Mar 10 • 26

authored a paper over 1 year ago

Exploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACs

Paper • 2503.06342 • Published Mar 8, 2025 • 1

authored a paper over 1 year ago

Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity

Paper • 2501.16295 • Published Jan 27, 2025 • 9

authored 3 papers over 1 year ago

EN-T: Optimizing Tensor Computing Engines Performance via Encoder-Based Methodology

Paper • 2404.11887 • Published Apr 18, 2024

LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration

Paper • 2408.06003 • Published Aug 12, 2024

SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs

Paper • 2410.13276 • Published Oct 17, 2024 • 29

authored 2 papers almost 2 years ago

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Paper • 2407.04620 • Published Jul 5, 2024 • 34

MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression

Paper • 2406.14909 • Published Jun 21, 2024 • 16