Free delivery for purchases over 1 299 Kč
PPL Parcel Shop 54 Czech Post 74 Balíkovna 49 GLS point 54 GLS courier 64 Zásilkovna 44 PPL courier 99

Synopses for Massive Data

Language EnglishEnglish
Book Paperback
Book Synopses for Massive Data Graham Cormode
Libristo code: 04834883
Publishers Now Publishers Inc, December 2011
Synopses for Massive Data: Samples, Histograms, Wavelets, Sketches describes basic principles and re... Full description
? points 242 b
2 416 včetně DPH
In stock at our supplier Shipping in 15-20 days

30-day return policy


You might also be interested in


Synopses for Massive Data: Samples, Histograms, Wavelets, Sketches describes basic principles and recent developments in building approximate synopses (i.e., lossy, compressed representations) of massive data. Such synopses enable approximate query processing, in which the user's query is executed against the synopsis instead of the original data. The monograph focuses on the four main families of synopses: random samples, histograms, wavelets, and sketches. A random sample comprises a "representative" subset of the data values of interest, obtained via a stochastic mechanism. Samples can be quick to obtain, and can be used to approximately answer a wide range of queries. A histogram summarizes a data set by grouping the data values into subsets, or "buckets," and then, for each bucket, computing a small set of summary statistics that can be used to approximately reconstruct the data in the bucket. Histograms have been extensively studied and have been incorporated into the query optimizers of virtually all commercial relational DBMSs. Wavelet-based synopses were originally developed in the context of image and signal processing. The data set is viewed as a set of M elements in a vector - i.e., as a function defined on the set {0, 1, 2, ..., M-1} - and the wavelet transform of this function is found as a weighted sum of wavelet "basis functions." The weights, or coefficients, can then be "thresholded", for example, by eliminating coefficients that are close to zero in magnitude. The remaining small set of coefficients serves as the synopsis. Wavelets are good at capturing features of the data set at various scales. Sketch summaries are particularly well suited to streaming data. Linear sketches, for example, view a numerical data set as a vector or matrix, and multiply the data by a fixed matrix. Such sketches are massively parallelizable. They can accommodate streams of transactions in which data is both inserted and removed. Sketches have also been used successfully to estimate the answer to COUNT DISTINCT queries, a notoriously hard problem. Synopses for Massive Data describes and compares the different synopsis methods. It also discusses the use of AQP within research systems, and discusses challenges and future directions. It is essential reading for anyone working with, or doing research on massive data.

About the book

Full name Synopses for Massive Data
Language English
Binding Book - Paperback
Date of issue 2011
Number of pages 308
EAN 9781601985163
ISBN 1601985169
Libristo code 04834883
Publishers Now Publishers Inc
Weight 452
Dimensions 156 x 234 x 17
Give this book today
It's easy
1 Add to cart and choose Deliver as present at the checkout 2 We'll send you a voucher 3 The book will arrive at the recipient's address

Login

Log in to your account. Don't have a Libristo account? Create one now!

 
mandatory
mandatory

Don’t have an account? Discover the benefits of having a Libristo account!

With a Libristo account, you'll have everything under control.

Create a Libristo account