Apache Mahout clustering designs : explore the clustering algorithms used with Apache Mahout / Ashish Gupta.
- Format:
- Book
- Author/Creator:
- Gupta, Ashish, author.
- Series:
- Community experience distilled.
- Language:
- English
- Subjects (All):
- Apache (Computer file : Apache Group).
- Machine learning.
- Web site development.
- Java (Computer program language).
- Distributed algorithms.
- Data mining.
- Physical Description:
- 1 online resource (131 p.)
- Place of Publication:
- Birmingham : Packt Publishing, 2015.
- Language Note:
- English
- Summary:
- About This Book: Use Mahout for clustering datasets and gain useful insights. Explore the different clustering algorithms used in day-to-day work. A practical guide to create and evaluate your own clustering models using real world data sets.
- Who This Book Is For: This book is for developers who want to try out clustering on large datasets using Mahout. It will also be useful for those users who don't have a background in Mahout, but have knowledge of basic programming and are familiar with the basics of machine learning and clustering. It will be helpful if you know about clustering techniques for some other tool.
- What You Will Learn: Explore clustering algorithms and cluster evaluation techniques. Learn different types of clustering and distance measuring techniques. Perform clustering on your data using K-means clustering. Discover how Canopy clustering is used as a preprocess step for K-means. Use the Fuzzy K-means algorithm in Apache Mahout. Implement Streaming K-means clustering in Mahout. Learn the Spectral K-means clustering implementation of Mahout.
- In Detail: As more and more organizations are discovering the use of big data analytics, interest in platforms that provide storage, computational, and analytical capabilities has increased. Apache Mahout caters to this need and paves the way for the implementation of complex algorithms in the field of machine learning, in order to better analyze your data and gain useful insight into it. Starting with the introduction of clustering algorithms, this book provides an insight into Apache Mahout and different algorithms it uses for clustering data. It provides a general introduction to algorithms such as K-means, Fuzzy K-means, Streaming K-means, and how to use Mahout to cluster your data using a particular algorithm. You will study the different types of clustering and learn how to use Apache Mahout with real-world datasets to implement and evaluate your clusters.
- Notes:
- Includes index.
- Description based on online resource; title from PDF title page (ebrary, viewed January 4, 2016).
- ISBN:
- 9781783284443
- 1783284447