My Account Log in

1 option

Statistical analysis of massive data streams : proceedings of a workshop / Committee on Applied and Theoretical Statistics.

Ebook Central Academic Complete Available online

View online
Format:
Book
Language:
English
Subjects (All):
Mathematical statistics.
Statistics--Data processing.
Statistics.
Physical Description:
1 online resource (395 p.)
Edition:
1st ed.
Place of Publication:
Washington, D.C. : National Academy Press, 2004.
Language Note:
English
Summary:
Massive data streams--large quantities of data that arrive continuously--are becoming increasingly commonplace in many areas of science and technology. Consequently development of analytical methods for such streams is of growing importance. To address this issue, the National Security Agency asked the NRC to hold a workshop to explore methods for analysis of streams of data so as to stimulate progress in the field. This report presents the results of that workshop. It provides presentations that focused on five different research areas where massive data streams are present: atmospheric and meteorological data; high-energy physics; integrated data systems; network traffic; and mining commercial data streams. The goals of the report are to improve communication among researchers in the field and to increase relevant statistical science activity.
Contents:
STATISTICAL ANALYSIS OF MASSIVE DATA STREAMS
Copyright
ACKNOWLEDGEMENT OF REVIEWERS
Preface and Workshop Rationale
Sallie Keller-McNulty Welcome and Overview of Sessions
TRANSCRIPT OF PRESENTATION
James Schatz Welcome and Overview of Sessions
Douglas Nychka, Chair of Session on Atmospheric and Meteorological Data Introduction by Session Chair
John Bates Exploratory Climate Analysis Tools for Environmental Satellite and Weather Radar Data
ABSTRACT OF PRESENTATION
1. Introduction
2. Philosophy of the use of remote sensing data for climate monitoring
Amy Braverman Statistical Challenges in the Production and Analysis of Remote Sensing Earth Science Data at the Jet…
Ralph Milliff Global and Regional Surface Wind Field Inferences from Spaceborne Scatterometer Data
Global and Regional Surface Wind Field Inferences Given Spaceborne Scatterometer Data Ralph F.Milliff
GLOBAL AND REGIONAL SURFACE WIND FIELD INFERENCES FROM SPACE-BORNE SCATTEROMETER DATA
Blending QSCAT and Weather-Center Analysis Winds
Bayesian Inference for Surface Winds in the Labrador Sea
Bayesian Hierarchical Model for Surface Winds in the Tropics
A Bayesian Hierarchical Air-Sea Interaction Model
References
Figure Captions
Summary
Report from Breakout Group
David Scott, Chair of Session on High-Energy Physics Introduction by Session Chair
Robert Jacobsen Statistical Analysis of High Energy Physics Data
Paul Padley Some Challenges in Experimental Particle Physics Data Streams
TRANSCRIPT OF PRESENTATION.
Miron Livny Data Grids (or, A Distributed Computing View of High Energy Physics)
Daryl Pregibon Keynote Address: Graph Mining-Discovery in Large Networks
Sallie Keller-McNulty, Chair of Session on Integrated Data Systems Introduction by Session Chair
J.Douglas Beason Global Situational Awareness
Kevin Vixie Incorporating Invariants in Mahalanobis Distance-Based Classifiers: Applications to Face Recognition
INCORPORATING INVARIANTS IN MAHALANOBIS DISTANCE BASED CLASSIFIERS: APPLICATION TO FACE RECOGNITION
I. INTRODUCTION
II. COMBINING WITHIN CLASS COVARIANCES AND LINEAR APPROXIMATIONS TO INVARIANCES
III. FACE RECOGNITION RESULTS
IV. CONCLUSIONS
V. ACKNOWLEDGMENT
REFERENCES
John Elder Ensembles of Models: Simplicity (of Function) Through Complexity (of Form)
Mark Hansen Untitled Presentation
Wendy Martinez, Chair of Session on Network Traffic Introduction by Session Chair
William Cleveland FSD Models for Open-Loop Generation of Internet Packet Traffic
Johannes Gehrke Processing Aggregate Queries over Continuous Data Streams
Edward Wegman Visualization of Internet Packet Headers
Paul Whitney Toward the Routine Analysis of Moderate to Large-Size Data
Leland Wilkinson, Chair of Session on Mining Commercial Streams of Data Introduction by Session Chair
Lee Rhodes A Stream Processor for Extracting Usage Intelligence from High-Momentum Internet Data
A STREAM PROCESSOR FOR EXTRACTING USAGE INTELLIGENCE FROM HIGH-MOMENTUM INTERNET DATA
1. INTRODUCTION
2. BUSINESS CHALLENGES FOR THE NSPs
3. SOURCES AND TYPES OF DATA
3.1 USAGE MEs
3.2 SESSION MEs
3.3 REFERENCE DATA
4. DATA STREAMS AND RIVERS
5. IUM HIGH-LEVEL ARCHITECTURE
6. STREAM COLLECTION AND NORMALIZATION
7. STREAM RULE PROCESSING
8. RULE CHAINS AND ASSOCIATED DATA STRUCTURES
9. STATISTICS FROM STREAMS
9.1 CAPTURE MODELS
9.2 CAPTURE MODEL AGGREGATION
9.3 DRILL FORWARD
9.4 USER INTERACTION WITH STREAMING MODELS
10. SUMMARY
ACKNOWLEDGMENTS
Pedro Domingos A General Framework for Mining Massive Data Streams
A GENERAL FRAMEWORK FOR MINING MASSIVE DATA STREAMS
Abstract
1 The Problem
2 The Framework
3 Time-Changing Data
4 Conclusion
Reference
Andrew Moore kd- R- Ball- and Ad- Trees: Scalable Massive Science Data Analysis
Concluding Comments.
Notes:
Bibliographic Level Mode of Issuance: Monograph
ISBN:
1-280-20873-2
9786610208739
0-309-59302-6
OCLC:
567850849

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Library Catalog Using Articles+ Library Account