dc.contributor.author | Lo Schiavo, Leonardo | |
dc.contributor.author | Ayala-Romero, Jose A. | |
dc.contributor.author | Garcia-Saavedra, Andres | |
dc.contributor.author | Fiore, Marco | |
dc.contributor.author | Costa-Perez, Xavier | |
dc.date.accessioned | 2024-01-22T18:05:43Z | |
dc.date.available | 2024-01-22T18:05:43Z | |
dc.date.issued | 2024-05-20 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12761/1784 | |
dc.description.abstract | RAN virtualization is revolutionizing the telco industry, enabling 5G Distributed Units to run using general-purpose platforms equipped with Hardware Accelerators (HAs). Recently, GPUs have been proposed as HAs, hinging on their unique capability to execute 5G PHY operations efficiently while also processing Machine Learning (ML) workloads. While this ambivalence makes GPUs attractive for cost-effective deployments, we experimentally demonstrate that multiplexing 5G and ML workloads in GPUs is in fact challenging, and that using conventional GPU-sharing methods can severely disrupt 5G operations. We then introduce YinYangRAN, an innovative O-RAN-compliant solution that supervises GPU-based HAs so as to ensure reliability in the 5G processing pipeline while maximizing the throughput of concurrent ML services. YinYangRAN performs GPU resource allocation decisions via a computationally-efficient approximate dynamic programming technique, which is informed by a neural network trained on real-world measurements. Using workloads collected in real RANs, we demonstrate that YinYangRAN can achieve over 50\% higher 5G processing reliability than conventional GPU sharing models with minimal impact on co-located ML workloads. To our knowledge, this is the first work identifying and addressing the complex problem of HA management in emerging GPU-accelerated vRANs, and represents a promising step towards multiplexing PHY and ML workloads in mobile networks. | es |
dc.description.sponsorship | Spanish Ministry of Economic Affairs and Digital Transformation and the European Union-NextGenerationEU through the UNICO 5G I+D project no.TSI-063000-2021-52 “AEON-ZERO" | es |
dc.description.sponsorship | European Commission through Grant No. SNS-JU-101097083 (BeGREEN), 101139270 (ORIGAMI), and 101017109 (DAEMON) | es |
dc.language.iso | eng | es |
dc.title | YinYangRAN: Resource Multiplexing in GPU-Accelerated Virtualized RANs | es |
dc.type | conference object | es |
dc.conference.date | 20-23 May 2024 | es |
dc.conference.place | Vancouver, Canada | es |
dc.conference.title | IEEE International Conference on Computer Communications | * |
dc.event.type | conference | es |
dc.pres.type | paper | es |
dc.type.hasVersion | AM | es |
dc.rights.accessRights | open access | es |
dc.acronym | INFOCOM | * |
dc.rank | A* | * |
dc.relation.projectName | BeGREEN (Beyond 5G Artificial Intelligence Assisted Energy Efficient Open Radio Access Network) | es |
dc.relation.projectName | ORIGAMI (Optimized resource integration and global architecture for mobile infrastructure for 6G) | es |
dc.relation.projectName | DAEMON (Network intelligence for aDAptive and sElf-Learning MObile Networks) | es |
dc.relation.projectName | AEON-ZERO (Network Intelligence for zero-touch orchestration and anomaly detection) | es |
dc.subject.keyword | vRAN | es |
dc.subject.keyword | O-RAN | es |
dc.subject.keyword | Mobile Networks | es |
dc.subject.keyword | GPU Sharing | es |
dc.description.refereed | TRUE | es |
dc.description.status | inpress | es |