A Simple Message-Optimal Algorithm for Random Sampling from a Distributed Stream

dc.contributor.author Chung, Yung-Yu
dc.contributor.author Tirthapura, Srikanta
dc.contributor.department Department of Electrical and Computer Engineering
dc.date 2018-02-19T07:26:15.000
dc.date.accessioned 2020-06-30T02:02:20Z
dc.date.available 2020-06-30T02:02:20Z
dc.date.copyright Fri Jan 01 00:00:00 UTC 2016
dc.date.issued 2016-06-01
dc.description.abstract <p>We present a simple, message-optimal algorithm for maintaining a random sample from a large data stream whose input elements are distributed across multiple sites that communicate via a central coordinator. At any point in time, the set of elements held by the coordinator represent a uniform random sample from the set of all the elements observed so far. When compared with prior work, our algorithms asymptotically improve the total number of messages sent in the system. We present a matching lower bound, showing that our protocol sends the optimal number of messages up to a constant factor with large probability. We also consider the important case when the distribution of elements across different sites is non-uniform, and show that for such inputs, our algorithm significantly outperforms prior solutions.</p>
dc.description.comments <p>This is a manuscript of an article published as Chung, Yung-Yu, Srikanta Tirthapura, and David P. Woodruff. "A Simple Message-Optimal Algorithm for Random Sampling from a Distributed Stream." IEEE Transactions on Knowledge and Data Engineering 28, no. 6 (2016): 1356-1368. <a href="http://dx.doi.org/10.1109" target="_blank">10.1109/TKDE.2016.2518679</a>. Posted with permission.</p>
dc.format.mimetype application/pdf
dc.identifier archive/lib.dr.iastate.edu/ece_pubs/149/
dc.identifier.articleid 1150
dc.identifier.contextkey 11358052
dc.identifier.s3bucket isulib-bepress-aws-west
dc.identifier.submissionpath ece_pubs/149
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/20971
dc.language.iso en
dc.source.bitstream archive/lib.dr.iastate.edu/ece_pubs/149/2016_Tirthapura_SimpleMessage.pdf|||Fri Jan 14 20:28:27 UTC 2022
dc.source.uri 10.1109/TKDE.2016.2518679
dc.subject.disciplines Electrical and Computer Engineering
dc.subject.keywords Complexity theory
dc.subject.keywords Algorithm design and analysis
dc.subject.keywords Protocols
dc.subject.keywords reservoirs
dc.subject.keywords Silicon
dc.subject.keywords Distributed databases
dc.subject.keywords Electronic mail
dc.title A Simple Message-Optimal Algorithm for Random Sampling from a Distributed Stream
dc.type article
dc.type.genre article
dspace.entity.type Publication
relation.isAuthorOfPublication b0235db2-0a72-4dd1-8d5f-08e5e2e2bf7d
relation.isOrgUnitOfPublication a75a044c-d11e-44cd-af4f-dab1d83339ff
File
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
2016_Tirthapura_SimpleMessage.pdf
Size:
751.31 KB
Format:
Adobe Portable Document Format
Description:
Collections