Demonstrating Distributed Inference in the User Plane with DUNE

Bütün, Beyza; de Andrés Hernández, David; Aguilar, Jose; Gucciardo, Michele; Fiore, Marco

dc.contributor.author	Bütün, Beyza
dc.contributor.author	de Andrés Hernández, David
dc.contributor.author	Aguilar, Jose
dc.contributor.author	Gucciardo, Michele
dc.contributor.author	Fiore, Marco
dc.date.accessioned	2025-02-21T16:01:30Z
dc.date.available	2025-02-21T16:01:30Z
dc.date.issued	2025-05
dc.identifier.uri	https://hdl.handle.net/20.500.12761/1905
dc.description.abstract	Deploying Machine Learning (ML) models in the user plane enables low-latency and scalable in-network inference, but integrating them into programmable devices faces stringent constraints in terms of memory resources and computing capabilities. In this demo, we show how the newly proposed DUNE, a novel framework for distributed user-plane inference across multiple programmable network devices by automating the decomposition of large ML models into smaller sub-models, mitigates the limitations of traditional monolithic ML designs. We run experiments on a testbed with Intel Tofino switches using measurement data and show how DUNE not only improves the accuracy that the traditional single-device monolithic approach gets but also maintains a comparable per-switch latency.	es
dc.language.iso	eng	es
dc.title	Demonstrating Distributed Inference in the User Plane with DUNE	es
dc.type	conference object	es
dc.conference.date	19-22 May 2025	es
dc.conference.place	London, United Kingdom	es
dc.conference.title	IEEE International Conference on Computer Communications	*
dc.event.type	conference	es
dc.pres.type	demo	es
dc.rights.accessRights	open access	es
dc.acronym	INFOCOM	*
dc.rank	A*	*
dc.description.refereed	TRUE	es
dc.description.status	inpress	es

Ficheros en el ítem

Nombre:: INFOCOM2025_DEMO_DSpace.pdf
Tamaño:: 878.8Kb
Formato:: PDF

Este ítem aparece en la(s) siguiente(s) colección(ones)

IMDEA Networks

Mostrar el registro sencillo del ítem