A Simple Message-Optimal Algorithm for Random Sampling from a Distributed Stream
dc.contributor.author | Chung, Yung-Yu | |
dc.contributor.author | Tirthapura, Srikanta | |
dc.contributor.department | Department of Electrical and Computer Engineering | |
dc.date | 2018-02-19T07:26:15.000 | |
dc.date.accessioned | 2020-06-30T02:02:20Z | |
dc.date.available | 2020-06-30T02:02:20Z | |
dc.date.copyright | Fri Jan 01 00:00:00 UTC 2016 | |
dc.date.issued | 2016-06-01 | |
dc.description.abstract | <p>We present a simple, message-optimal algorithm for maintaining a random sample from a large data stream whose input elements are distributed across multiple sites that communicate via a central coordinator. At any point in time, the set of elements held by the coordinator represent a uniform random sample from the set of all the elements observed so far. When compared with prior work, our algorithms asymptotically improve the total number of messages sent in the system. We present a matching lower bound, showing that our protocol sends the optimal number of messages up to a constant factor with large probability. We also consider the important case when the distribution of elements across different sites is non-uniform, and show that for such inputs, our algorithm significantly outperforms prior solutions.</p> | |
dc.description.comments | <p>This is a manuscript of an article published as Chung, Yung-Yu, Srikanta Tirthapura, and David P. Woodruff. "A Simple Message-Optimal Algorithm for Random Sampling from a Distributed Stream." IEEE Transactions on Knowledge and Data Engineering 28, no. 6 (2016): 1356-1368. <a href="http://dx.doi.org/10.1109" target="_blank">10.1109/TKDE.2016.2518679</a>. Posted with permission.</p> | |
dc.format.mimetype | application/pdf | |
dc.identifier | archive/lib.dr.iastate.edu/ece_pubs/149/ | |
dc.identifier.articleid | 1150 | |
dc.identifier.contextkey | 11358052 | |
dc.identifier.s3bucket | isulib-bepress-aws-west | |
dc.identifier.submissionpath | ece_pubs/149 | |
dc.identifier.uri | https://dr.lib.iastate.edu/handle/20.500.12876/20971 | |
dc.language.iso | en | |
dc.source.bitstream | archive/lib.dr.iastate.edu/ece_pubs/149/2016_Tirthapura_SimpleMessage.pdf|||Fri Jan 14 20:28:27 UTC 2022 | |
dc.source.uri | 10.1109/TKDE.2016.2518679 | |
dc.subject.disciplines | Electrical and Computer Engineering | |
dc.subject.keywords | Complexity theory | |
dc.subject.keywords | Algorithm design and analysis | |
dc.subject.keywords | Protocols | |
dc.subject.keywords | reservoirs | |
dc.subject.keywords | Silicon | |
dc.subject.keywords | Distributed databases | |
dc.subject.keywords | Electronic mail | |
dc.title | A Simple Message-Optimal Algorithm for Random Sampling from a Distributed Stream | |
dc.type | article | |
dc.type.genre | article | |
dspace.entity.type | Publication | |
relation.isAuthorOfPublication | b0235db2-0a72-4dd1-8d5f-08e5e2e2bf7d | |
relation.isOrgUnitOfPublication | a75a044c-d11e-44cd-af4f-dab1d83339ff |
File
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- 2016_Tirthapura_SimpleMessage.pdf
- Size:
- 751.31 KB
- Format:
- Adobe Portable Document Format
- Description: