Show simple item record

dc.contributor.authorCulhane, William
dc.contributor.authorKogan, Kirill 
dc.contributor.authorJayalath, Chamikara
dc.contributor.authorEugster, Patrick
dc.date.accessioned2021-07-13T10:19:35Z
dc.date.available2021-07-13T10:19:35Z
dc.date.issued2015-04-26
dc.identifier.urihttp://hdl.handle.net/20.500.12761/1460
dc.description.abstractAggregation of computed sets of results fundamentally underlies the distillation of information in many of today’s big data applications. To this end there are many systems which have been introduced which allow users to obtain aggregate results by aggregating along communication structures such as trees, but they do not focus on optimizing performance by optimizing the underlying structure to perform the aggregation. We consider two cases of the problem – aggregation of (1) single blocks of data, and of (2) streaming input. For each case we determine which metric of “fast” completion is the most relevant and mathematically model resulting systems based on aggregation trees to optimize that metric. Our assumptions and model are laid out in depth. From our model we determine how to create a provably ideal aggregation tree (i.e., with optimal fanin) using only limited information about the aggregation function being applied. Experiments in the Amazon Elastic Compute Cloud (EC2) confirm the validity of our models in practice.
dc.language.isoeng
dc.titleOptimal Communication Structures for Big Data Aggregationen
dc.typeconference object
dc.conference.date26 April - 1 May 2015
dc.conference.placeHong Kong, China
dc.conference.titleThe 34th IEEE International Conference on Computer Communications (IEEE INFOCOM 2015)*
dc.event.typeconference
dc.pres.typepaper
dc.type.hasVersionVoR
dc.rights.accessRightsopen access
dc.page.final9
dc.page.initial1
dc.description.refereedTRUE
dc.description.statuspub
dc.eprint.idhttp://eprints.networks.imdea.org/id/eprint/948


Files in this item

This item appears in the following Collection(s)

Show simple item record