EgoLife: Towards Egocentric Life Assistant

Yang, Jingkang; Liu, Shuai; Guo, Hongming; Dong, Yuhao; Zhang, Xiamengwei; Zhang, Sicheng; Wang, Pengyun; Zhou, Zitang; Xie, Binzhu; Wang, Ziyue; Ouyang, Bei; Lin, Zhengyu; Cominelli, Marco; Cai, Zhongang; Li, Bo; Zhang, Yuanhan; Zhang, Peiyuan; Hong, Fangzhou; Widmer, Joerg; Gringoli, Francesco; Yang, Lei; Liu, Ziwei

dc.contributor.author	Yang, Jingkang
dc.contributor.author	Liu, Shuai
dc.contributor.author	Guo, Hongming
dc.contributor.author	Dong, Yuhao
dc.contributor.author	Zhang, Xiamengwei
dc.contributor.author	Zhang, Sicheng
dc.contributor.author	Wang, Pengyun
dc.contributor.author	Zhou, Zitang
dc.contributor.author	Xie, Binzhu
dc.contributor.author	Wang, Ziyue
dc.contributor.author	Ouyang, Bei
dc.contributor.author	Lin, Zhengyu
dc.contributor.author	Cominelli, Marco
dc.contributor.author	Cai, Zhongang
dc.contributor.author	Li, Bo
dc.contributor.author	Zhang, Yuanhan
dc.contributor.author	Zhang, Peiyuan
dc.contributor.author	Hong, Fangzhou
dc.contributor.author	Widmer, Joerg
dc.contributor.author	Gringoli, Francesco
dc.contributor.author	Yang, Lei
dc.contributor.author	Liu, Ziwei
dc.date.accessioned	2026-06-15T10:36:22Z
dc.date.available	2026-06-15T10:36:22Z
dc.date.issued	2025-06-15
dc.identifier.uri	https://hdl.handle.net/20.500.12761/2041
dc.description.abstract	We introduce EgoLife, a project to develop an egocentric life assistant that accompanies and enhances personal efficiency through AI-powered wearable glasses. To lay the foundation for this assistant, we conducted a comprehensive data collection study where six participants lived together for one week, continuously recording their daily activities - including discussions, shopping, cooking, socializing, and entertainment - using AI glasses for multimodal egocentric video capture, along with synchronized third-person-view video references. This effort resulted in the EgoLife Dataset, a comprehensive 300-hour egocentric, interpersonal, multiview, and multimodal daily life dataset with intensive annotation. Leveraging this dataset, we introduce EgoLifeQA, a suite of long-context, life-oriented question-answering tasks designed to provide meaningful assistance in daily life by addressing practical questions such as recalling past relevant events, monitoring health habits, and offering personalized recommendations. To address the key technical challenges of (1) developing robust visual-audio models for egocentric data, (2) enabling identity recognition, and (3) facilitating long-context question answering over extensive temporal information, we introduce EgoButler, an integrated system comprising EgoGPT and EgoRAG. EgoGPT is an omni-modal model trained on egocentric datasets, achieving state-of-the-art performance on egocentric video understanding. EgoRAG is a retrieval-based component that supports answering ultra-long-context questions. Our experimental studies verify their working mechanisms and reveal critical factors and bottlenecks, guiding future improvements. By releasing our datasets, models, and benchmarks, we aim to stimulate further research in egocentric AI assistants.	es
dc.language.iso	eng	es
dc.title	EgoLife: Towards Egocentric Life Assistant	es
dc.type	conference object	es
dc.conference.date	11-15 June 2025	es
dc.conference.place	Music City Center in Nashville, Tennessee, USA	es
dc.conference.title	IEEE/CVF Conference on Computer Vision and Pattern Recognition	*
dc.event.type	conference	es
dc.pres.type	poster	es
dc.type.hasVersion	VoR	es
dc.rights.accessRights	open access	es
dc.description.refereed	TRUE	es
dc.description.status	pub	es

Files in this item

Name:: Yang_EgoLife_Towards_Egocentri ...
Size:: 3.888Mb
Format:: PDF
Description:: Main article

This item appears in the following Collection(s)

IMDEA Networks

Show simple item record