smart glasses reading list
updated
Human-inspired Perspectives: A Survey on AI Long-term Memory
Paper
•
2411.00489
•
Published
•
1
Multimodal Fusion with LLMs for Engagement Prediction in Natural
Conversation
Paper
•
2409.09135
•
Published
•
2
Reading Recognition in the Wild
Paper
•
2505.24848
•
Published
•
1
EgoLife: Towards Egocentric Life Assistant
Paper
•
2503.03803
•
Published
•
46
AIMI: Leveraging Future Knowledge and Personalization in Sparse Event
Forecasting for Treatment Adherence
Paper
•
2503.16091
•
Published
•
1
LiveVLM: Efficient Online Video Understanding via Streaming-Oriented KV
Cache and Retrieval
Paper
•
2505.15269
•
Published
•
1
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale
Paper
•
2504.16030
•
Published
•
36
Cooperative Face Liveness Detection from Optical Flow
Paper
•
2508.10786
•
Published
CLIPC8: Face liveness detection algorithm based on image-text pairs and
contrastive learning
Paper
•
2311.17583
•
Published
Camera-Driven Representation Learning for Unsupervised Domain Adaptive
Person Re-identification
Paper
•
2308.11901
•
Published
LiveStar: Live Streaming Assistant for Real-World Online Video Understanding
Paper
•
2511.05299
•
Published
•
2
YOLO-World: Real-Time Open-Vocabulary Object Detection
Paper
•
2401.17270
•
Published
•
42
YOLO-TS: Real-Time Traffic Sign Detection with Enhanced Accuracy Using
Optimized Receptive Fields and Anchor-Free Fusion
Paper
•
2410.17144
•
Published
YOLOE: Real-Time Seeing Anything
Paper
•
2503.07465
•
Published
•
16
MediaPipe Hands: On-device Real-time Hand Tracking
Paper
•
2006.10214
•
Published
MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices
Paper
•
2407.05712
•
Published
Sharing emotions at scale: The Vent dataset
Paper
•
1901.04856
•
Published
Natural Language Processing for Cognitive Analysis of Emotions
Paper
•
2210.05296
•
Published
•
1
How you feelin'? Learning Emotions and Mental States in Movie Scenes
Paper
•
2304.05634
•
Published
A Brain Wave Encodes a Thousand Tokens: Modeling Inter-Cortical Neural Interactions for Effective EEG-based Emotion Recognition
Paper
•
2511.13954
•
Published
•
3
EmoNet-Face: An Expert-Annotated Benchmark for Synthetic Emotion
Recognition
Paper
•
2505.20033
•
Published
•
4
Beyond Emotion Recognition: A Multi-Turn Multimodal Emotion
Understanding and Reasoning Benchmark
Paper
•
2508.16859
•
Published
"Only ChatGPT gets me": An Empirical Analysis of GPT versus other Large
Language Models for Emotion Detection in Text
Paper
•
2503.04831
•
Published
•
1
OV-MER: Towards Open-Vocabulary Multimodal Emotion Recognition
Paper
•
2410.01495
•
Published
Don't Judge Before You CLIP: A Unified Approach for Perceptual Tasks
Paper
•
2503.13260
•
Published
•
2
Gaze into the Heart: A Multi-View Video Dataset for rPPG and Health
Biomarkers Estimation
Paper
•
2508.17924
•
Published
•
14
R2I-rPPG: A Robust Region of Interest Selection Method for Remote
Photoplethysmography to Extract Heart Rate
Paper
•
2410.15851
•
Published
rPPG-Toolbox: Deep Remote PPG Toolbox
Paper
•
2210.00716
•
Published
RPGBENCH: Evaluating Large Language Models as Role-Playing Game Engines
Paper
•
2502.00595
•
Published
Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in
Large Language Models
Paper
•
2505.02847
•
Published
•
28
CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset
for Conversational AI
Paper
•
2205.14727
•
Published
MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent
Systems
Paper
•
2505.18943
•
Published
•
24
VIBE: Can a VLM Read the Room?
Paper
•
2506.11162
•
Published
BlazePose: On-device Real-time Body Pose tracking
Paper
•
2006.10204
•
Published
QBitOpt: Fast and Accurate Bitwidth Reallocation during Training
Paper
•
2307.04535
•
Published
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary
Detection
Paper
•
2409.08513
•
Published
•
14
FER-YOLO-Mamba: Facial Expression Detection and Classification Based on
Selective State Space
Paper
•
2405.01828
•
Published
•
1
QuickSRNet: Plain Single-Image Super-Resolution Architecture for Faster
Inference on Mobile Platforms
Paper
•
2303.04336
•
Published
Real-Time Neural Light Field on Mobile Devices
Paper
•
2212.08057
•
Published
ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory
Perceptions
Paper
•
2505.14668
•
Published
Computational Life: How Well-formed, Self-replicating Programs Emerge
from Simple Interaction
Paper
•
2406.19108
•
Published
Synheart Emotion: Privacy-Preserving On-Device Emotion Recognition from Biosignals
Paper
•
2511.06231
•
Published
•
1
EgoPet: Egomotion and Interaction Data from an Animal's Perspective
Paper
•
2404.09991
•
Published
LLMs Learn to Deceive Unintentionally: Emergent Misalignment in
Dishonesty from Misaligned Samples to Biased Human-AI Interactions
Paper
•
2510.08211
•
Published
•
22
Put Myself in Your Shoes: Lifting the Egocentric Perspective from
Exocentric Videos
Paper
•
2403.06351
•
Published
SELF-PERCEPT: Introspection Improves Large Language Models' Detection of
Multi-Person Mental Manipulation in Conversations
Paper
•
2505.20679
•
Published
LALM: Long-Term Action Anticipation with Language Models
Paper
•
2311.17944
•
Published
HumanAgencyBench: Scalable Evaluation of Human Agency Support in AI
Assistants
Paper
•
2509.08494
•
Published
•
1
EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in
the Wild
Paper
•
2502.14892
•
Published
•
6
Decision-Oriented Dialogue for Human-AI Collaboration
Paper
•
2305.20076
•
Published
AI for Service: Proactive Assistance with AI Glasses
Paper
•
2510.14359
•
Published
•
74
COPILOT: Human-Environment Collision Prediction and Localization from
Egocentric Videos
Paper
•
2210.01781
•
Published
TeleEgo: Benchmarking Egocentric AI Assistants in the Wild
Paper
•
2510.23981
•
Published
Ego-EXTRA: video-language Egocentric Dataset for EXpert-TRAinee assistance
Paper
•
2512.13238
•
Published
•
1
In the Eye of MLLM: Benchmarking Egocentric Video Intent Understanding
with Gaze-Guided Prompting
Paper
•
2509.07447
•
Published
•
1
Proactive Hearing Assistants that Isolate Egocentric Conversations
Paper
•
2511.11473
•
Published
•
6
AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video
Understanding
Paper
•
2406.13807
•
Published
LifelongMemory: Leveraging LLMs for Answering Queries in Egocentric
Videos
Paper
•
2312.05269
•
Published
Proactive Assistant Dialogue Generation from Streaming Egocentric Videos
Paper
•
2506.05904
•
Published
•
2
Semantic MapNet: Building Allocentric Semantic Maps and Representations
from Egocentric Views
Paper
•
2010.01191
•
Published
EgoM2P: Egocentric Multimodal Multitask Pretraining
Paper
•
2506.07886
•
Published
•
1
EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT
Paper
•
2510.23569
•
Published
•
3
Vinci: A Real-time Embodied Smart Assistant based on Egocentric
Vision-Language Model
Paper
•
2412.21080
•
Published
MM-Ego: Towards Building Egocentric Multimodal LLMs
Paper
•
2410.07177
•
Published
•
22
Listen to Look into the Future: Audio-Visual Egocentric Gaze
Anticipation
Paper
•
2305.03907
•
Published
•
1
Project Aria: A New Tool for Egocentric Multi-Modal AI Research
Paper
•
2308.13561
•
Published
EgoMe: Follow Me via Egocentric View in Real World
Paper
•
2501.19061
•
Published
Entering Real Social World! Benchmarking the Theory of Mind and
Socialization Capabilities of LLMs from a First-person Perspective
Paper
•
2410.06195
•
Published
State Your Intention to Steer Your Attention: An AI Assistant for
Intentional Digital Living
Paper
•
2510.14513
•
Published
•
1
Mixed-Session Conversation with Egocentric Memory
Paper
•
2410.02503
•
Published
•
8
Multi-Advisor Reinforcement Learning
Paper
•
1704.00756
•
Published
•
1
EgoPrivacy: What Your First-Person Camera Says About You?
Paper
•
2506.12258
•
Published
•
3
Can Vision-Language Models Think from a First-Person Perspective?
Paper
•
2311.15596
•
Published
•
3
Multimodal Distillation for Egocentric Action Recognition
Paper
•
2307.07483
•
Published
EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
Paper
•
2502.07411
•
Published
AssistantX: An LLM-Powered Proactive Assistant in Collaborative
Human-Populated Environment
Paper
•
2409.17655
•
Published
EgoVLM: Policy Optimization for Egocentric Video Understanding
Paper
•
2506.03097
•
Published
Embodied VideoAgent: Persistent Memory from Egocentric Videos and
Embodied Sensors Enables Dynamic Scene Understanding
Paper
•
2501.00358
•
Published
Aligning VLM Assistants with Personalized Situated Cognition
Paper
•
2506.00930
•
Published
•
2
ProPerSim: Developing Proactive and Personalized AI Assistants through
User-Assistant Simulation
Paper
•
2509.21730
•
Published
HAPRec: Hybrid Activity and Plan Recognizer
Paper
•
2004.13482
•
Published
SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in
Cyber World
Paper
•
2412.07472
•
Published
Game-theoretic LLM: Agent Workflow for Negotiation Games
Paper
•
2411.05990
•
Published
•
8
GRIM: GRaph-based Interactive narrative visualization for gaMes
Paper
•
2311.09213
•
Published
•
13
GTAlign: Game-Theoretic Alignment of LLM Assistants for Mutual Welfare
Paper
•
2510.08872
•
Published
•
3
Game-TARS: Pretrained Foundation Models for Scalable Generalist
Multimodal Game Agents
Paper
•
2510.23691
•
Published
•
52
Game Theory with Simulation in the Presence of Unpredictable
Randomisation
Paper
•
2410.14311
•
Published
A Survey on Large Language Model-Based Game Agents
Paper
•
2404.02039
•
Published
Persuasion for Good: Towards a Personalized Persuasive Dialogue System
for Social Good
Paper
•
1906.06725
•
Published
•
1
Make an Offer They Can't Refuse: Grounding Bayesian Persuasion in Real-World Dialogues without Pre-Commitment
Paper
•
2510.13387
•
Published
Persuasion at Play: Understanding Misinformation Dynamics in
Demographic-Aware Human-LLM Interactions
Paper
•
2503.02038
•
Published
Monopoly Deal: A Benchmark Environment for Bounded One-Sided Response
Games
Paper
•
2510.25080
•
Published
•
1
Context versus Prior Knowledge in Language Models
Paper
•
2404.04633
•
Published
•
5
Sotopia-RL: Reward Design for Social Intelligence
Paper
•
2508.03905
•
Published
•
23
Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy
Planning
Paper
•
2305.13660
•
Published
The Persuasive Power of Large Language Models
Paper
•
2312.15523
•
Published
PRINCIPLES: Synthetic Strategy Memory for Proactive Dialogue Agents
Paper
•
2509.17459
•
Published
ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind
Paper
•
2505.22961
•
Published
•
8
Communication is All You Need: Persuasion Dataset Construction via
Multi-LLM Communication
Paper
•
2502.08896
•
Published
Human Choice Prediction in Language-based Persuasion Games:
Simulation-based Off-Policy Evaluation
Paper
•
2305.10361
•
Published
•
1
Language of Persuasion and Misrepresentation in Business Communication:
A Textual Detection Approach
Paper
•
2508.09935
•
Published
Persuasion Should be Double-Blind: A Multi-Domain Dialogue Dataset With
Faithfulness Based on Causal Theory of Mind
Paper
•
2502.21297
•
Published