dc.contributor.author | Akem, Aristide Tanyi-Jong | |
dc.contributor.author | Bütün, Beyza | |
dc.contributor.author | Gucciardo, Michele | |
dc.contributor.author | Fiore, Marco | |
dc.date.accessioned | 2023-01-09T09:52:36Z | |
dc.date.available | 2023-01-09T09:52:36Z | |
dc.date.issued | 2022-12-09 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12761/1648 | |
dc.description.abstract | The recent proliferation of programmable network equipment has opened up new possibilities for embedding intelligence into the data plane. Deploying models directly in the data plane promises to achieve high-throughput, low-latency inference capabilities that cannot be attained with traditional closed loops involving control-plane operations. Recent efforts have paved the way for the integration of trained machine learning models in resource-constrained programmable switches, yet current solutions have significant limitations that translate into performance barriers when coping with complex inference tasks. In this paper, we present Henna, a first in-switch implementation of a hierarchical classification system. The concept underpinning our solution is that of splitting a difficult classification task into easier cascaded decisions, which can then be addressed with separate, resource-efficient tree-based classifiers. We propose a design of Henna that aligns with the internal organization of the Protocol Independent Switch Architecture (PISA), and integrates state-of-the-art strategies for mapping decision trees to switch hardware. We then implement Henna in a real testbed with off-the-shelf Intel Tofino programmable switches using the P4 language. Experiments with a complex 21-category classification task based on measurement data demonstrate how Henna improves the F1 score of an advanced single-stage model by 21%, while keeping usage of switch resources at 8% on average. | es |
dc.description.sponsorship | European Union Horizon 2020 research and innovation program under Marie Skłodowska-Curie grant agreement no. 860239 “BANYAN” | es |
dc.description.sponsorship | CHIST-ERA grant no. CHIST-ERA-20-SICT-001 “ECOMOME”, via grant PCI2022-133013 of Agencia Estatal de Investigación | es |
dc.description.sponsorship | European Union Horizon 2020 research and innovation program under grant agreement no. 101017109 “DAEMON” | es |
dc.language.iso | eng | es |
dc.title | Henna: hierarchical machine learning inference in programmable switches | es |
dc.type | conference object | es |
dc.conference.date | 9 December 2022 | es |
dc.conference.place | Rome, Italy | es |
dc.conference.title | International Workshop on Native Network Intelligence | * |
dc.event.type | workshop | es |
dc.pres.type | paper | es |
dc.type.hasVersion | AM | es |
dc.rights.accessRights | open access | es |
dc.page.final | 7 | es |
dc.page.initial | 1 | es |
dc.relation.projectID | info:eu-repo/grantAgreement/EC/H2020/860239/EU/Big dAta aNalYtics for radio Access Networks/BANYAN | es |
dc.relation.projectID | info:eu-repo/grantAgreement/EC/H2020/101017109/EU/Network intelligence for aDAptive and sElf-Learning MObile Networks/DAEMON | es |
dc.relation.projectName | BANYAN (Big dAta aNalYtics for radio Access Networks) | es |
dc.relation.projectName | DAEMON (Network intelligence for aDAptive and sElf-Learning MObile Networks) | es |
dc.relation.projectName | ECOMOME (Energy COnsumption Measurements and Optimization in Mobile nEtworks) | es |
dc.subject.keyword | Programmable switch | es |
dc.subject.keyword | machine learning | es |
dc.subject.keyword | in-switch inference | es |
dc.subject.keyword | P4 | es |
dc.description.refereed | TRUE | es |
dc.description.status | pub | es |