Show simple item record

dc.contributor.authorLo Schiavo, Leonardo 
dc.contributor.authorAyala-Romero, Jose A.
dc.contributor.authorGarcia-Saavedra, Andres
dc.contributor.authorFiore, Marco 
dc.contributor.authorCosta-Perez, Xavier
dc.date.accessioned2024-01-22T18:05:43Z
dc.date.available2024-01-22T18:05:43Z
dc.date.issued2024-05-20
dc.identifier.urihttps://hdl.handle.net/20.500.12761/1784
dc.description.abstractRAN virtualization is revolutionizing the telco industry, enabling 5G Distributed Units to run using general-purpose platforms equipped with Hardware Accelerators (HAs). Recently, GPUs have been proposed as HAs, hinging on their unique capability to execute 5G PHY operations efficiently while also processing Machine Learning (ML) workloads. While this ambivalence makes GPUs attractive for cost-effective deployments, we experimentally demonstrate that multiplexing 5G and ML workloads in GPUs is in fact challenging, and that using conventional GPU-sharing methods can severely disrupt 5G operations. We then introduce YinYangRAN, an innovative O-RAN-compliant solution that supervises GPU-based HAs so as to ensure reliability in the 5G processing pipeline while maximizing the throughput of concurrent ML services. YinYangRAN performs GPU resource allocation decisions via a computationally-efficient approximate dynamic programming technique, which is informed by a neural network trained on real-world measurements. Using workloads collected in real RANs, we demonstrate that YinYangRAN can achieve over 50\% higher 5G processing reliability than conventional GPU sharing models with minimal impact on co-located ML workloads. To our knowledge, this is the first work identifying and addressing the complex problem of HA management in emerging GPU-accelerated vRANs, and represents a promising step towards multiplexing PHY and ML workloads in mobile networks.es
dc.description.sponsorshipSpanish Ministry of Economic Affairs and Digital Transformation and the European Union-NextGenerationEU through the UNICO 5G I+D project no.TSI-063000-2021-52 “AEON-ZERO"es
dc.description.sponsorshipEuropean Commission through Grant No. SNS-JU-101097083 (BeGREEN), 101139270 (ORIGAMI), and 101017109 (DAEMON)es
dc.language.isoenges
dc.titleYinYangRAN: Resource Multiplexing in GPU-Accelerated Virtualized RANses
dc.typeconference objectes
dc.conference.date20-23 May 2024es
dc.conference.placeVancouver, Canadaes
dc.conference.titleIEEE International Conference on Computer Communications *
dc.event.typeconferencees
dc.pres.typepaperes
dc.type.hasVersionAMes
dc.rights.accessRightsopen accesses
dc.acronymINFOCOM*
dc.rankA**
dc.relation.projectNameBeGREEN (Beyond 5G Artificial Intelligence Assisted Energy Efficient Open Radio Access Network)es
dc.relation.projectNameORIGAMI (Optimized resource integration and global architecture for mobile infrastructure for 6G)es
dc.relation.projectNameDAEMON (Network intelligence for aDAptive and sElf-Learning MObile Networks)es
dc.relation.projectNameAEON-ZERO (Network Intelligence for zero-touch orchestration and anomaly detection)es
dc.subject.keywordvRANes
dc.subject.keywordO-RANes
dc.subject.keywordMobile Networkses
dc.subject.keywordGPU Sharinges
dc.description.refereedTRUEes
dc.description.statusinpresses


Files in this item

This item appears in the following Collection(s)

Show simple item record