Georgia Institute of Technology

university

Verified

https://gatech.edu

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

fei-yang-wu updated a dataset about 1 month ago

GeorgiaTech/g1_lafan1_50hz

fei-yang-wu published a dataset about 1 month ago

GeorgiaTech/g1_lafan1_50hz

hyungjoochae submitted a paper about 2 months ago

Safe and Scalable Web Agent Learning via Recreated Websites

View all activity

Papers

Safe and Scalable Web Agent Learning via Recreated Websites

Do Vision-Language Models Respect Contextual Integrity in Location Disclosure?

View all Papers

updated a dataset about 1 month ago

GeorgiaTech/g1_lafan1_50hz

Updated Mar 23 • 3

published a dataset about 1 month ago

GeorgiaTech/g1_lafan1_50hz

Updated Mar 23 • 3

submitted a paper to Daily Papers about 2 months ago

Safe and Scalable Web Agent Learning via Recreated Websites

Paper • 2603.10505 • Published Mar 11 • 27

updated a model 3 months ago

GeorgiaTech/t5-small-finetuned

Updated Feb 17 • 11 • 1

RayY

submitted a paper to Daily Papers 3 months ago

Do Vision-Language Models Respect Contextual Integrity in Location Disclosure?

Paper • 2602.05023 • Published Feb 4 • 2

submitted a paper to Daily Papers 3 months ago

Behavior Knowledge Merge in Reinforced Agentic Models

Paper • 2601.13572 • Published Jan 20 • 27

authored a paper 7 months ago

Learning to Reason as Action Abstractions with Scalable Mid-Training RL

Paper • 2509.25810 • Published Sep 30, 2025 • 6

authored a paper 11 months ago

Beyond Markovian: Reflective Exploration via Bayes-Adaptive RL for LLM Reasoning

Paper • 2505.20561 • Published May 26, 2025 • 7

updated a Space about 1 year ago

CS6460 EdTech

CS6460 Ed Tech Presentation Dashbaord

published a Space about 1 year ago

CS6460 EdTech

CS6460 Ed Tech Presentation Dashbaord

updated a model about 1 year ago

GeorgiaTech/sonic

Updated Feb 24, 2025

published a model about 1 year ago

GeorgiaTech/sonic

Updated Feb 24, 2025

updated a Space over 1 year ago

Arxiv Summarizer

summarize arixv papers and chat with your data

published a Space over 1 year ago

Arxiv Summarizer

summarize arixv papers and chat with your data

authored 4 papers over 1 year ago

Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer

Paper • 2405.16436 • Published May 26, 2024 • 1

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

Paper • 2410.08067 • Published Oct 10, 2024 • 2

DSTC: Direct Preference Learning with Only Self-Generated Tests and Code to Improve Code LMs

Paper • 2411.13611 • Published Nov 20, 2024

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38

authored a paper almost 2 years ago

Self-Exploring Language Models: Active Preference Elicitation for Online Alignment

Paper • 2405.19332 • Published May 29, 2024 • 22

updated a model almost 2 years ago

GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_3

Text Generation • 8B • Updated May 13, 2024 • 4