Mostrar el registro sencillo del ítem
Demonstrating Distributed Inference in the User Plane with DUNE
dc.contributor.author | Bütün, Beyza | |
dc.contributor.author | de Andrés Hernández, David | |
dc.contributor.author | Aguilar, Jose | |
dc.contributor.author | Gucciardo, Michele | |
dc.contributor.author | Fiore, Marco | |
dc.date.accessioned | 2025-02-21T16:01:30Z | |
dc.date.available | 2025-02-21T16:01:30Z | |
dc.date.issued | 2025-05 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12761/1905 | |
dc.description.abstract | Deploying Machine Learning (ML) models in the user plane enables low-latency and scalable in-network inference, but integrating them into programmable devices faces stringent constraints in terms of memory resources and computing capabilities. In this demo, we show how the newly proposed DUNE, a novel framework for distributed user-plane inference across multiple programmable network devices by automating the decomposition of large ML models into smaller sub-models, mitigates the limitations of traditional monolithic ML designs. We run experiments on a testbed with Intel Tofino switches using measurement data and show how DUNE not only improves the accuracy that the traditional single-device monolithic approach gets but also maintains a comparable per-switch latency. | es |
dc.language.iso | eng | es |
dc.title | Demonstrating Distributed Inference in the User Plane with DUNE | es |
dc.type | conference object | es |
dc.conference.date | 19-22 May 2025 | es |
dc.conference.place | London, United Kingdom | es |
dc.conference.title | IEEE International Conference on Computer Communications | * |
dc.event.type | conference | es |
dc.pres.type | demo | es |
dc.rights.accessRights | open access | es |
dc.acronym | INFOCOM | * |
dc.rank | A* | * |
dc.description.refereed | TRUE | es |
dc.description.status | inpress | es |