My Account Log in

1 option

Core Concepts in Data Analysis: Summarization, Correlation and Visualization / by Boris Mirkin.

SpringerLink Books Computer Science (2011-2024) Available online

View online
Format:
Book
Author/Creator:
Mirkin, Boris, 1937- author.
Contributor:
SpringerLink (Online service)
Series:
Computer Science (Springer-11645)
Undergraduate topics in computer science 1863-7310
Undergraduate Topics in Computer Science, 1863-7310
Language:
English
Subjects (All):
Computer science--Mathematics.
Computer science.
Mathematical statistics.
Artificial intelligence.
Pattern perception.
Discrete Mathematics in Computer Science.
Probability and Statistics in Computer Science.
Math Applications in Computer Science.
Artificial Intelligence.
Pattern Recognition.
Local Subjects:
Discrete Mathematics in Computer Science.
Probability and Statistics in Computer Science.
Math Applications in Computer Science.
Artificial Intelligence.
Pattern Recognition.
Physical Description:
1 online resource (XX, 390 pages) : 129 illustrations.
Edition:
First edition 2011.
Contained In:
Springer eBooks
Place of Publication:
London : Springer London : Imprint: Springer, 2011.
System Details:
text file PDF
Summary:
Core Concepts in Data Analysis: Summarization, Correlation and Visualization provides in-depth descriptions of those data analysis approaches that either summarize data (principal component analysis and clustering, including hierarchical and network clustering) or correlate different aspects of data (decision trees, linear rules, neuron networks, and Bayes rule). Boris Mirkin takes an unconventional approach and introduces the concept of multivariate data summarization as a counterpart to conventional machine learning prediction schemes, utilizing techniques from statistics, data analysis, data mining, machine learning, computational intelligence, and information retrieval. Innovations following from his in-depth analysis of the models underlying summarization techniques are introduced, and applied to challenging issues such as the number of clusters, mixed scale data standardization, interpretation of the solutions, as well as relations between seemingly unrelated concepts: goodness-of-fit functions for classification trees and data standardization, spectral clustering and additive clustering, correlation and visualization of contingency data. The mathematical detail is encapsulated in the so-called "formulation" parts, whereas most material is delivered through "presentation" parts that explain the methods by applying them to small real-world data sets; concise "computation" parts inform of the algorithmic and coding issues. Four layers of active learning and self-study exercises are provided: worked examples, case studies, projects and questions. .
Other Format:
Printed edition:
ISBN:
978-0-85729-287-2
9780857292872
Access Restriction:
Restricted for use by site license.

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

Find

Home Release notes

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Find catalog Using Articles+ Using your account