1 option

Knowledge Discovery from Multi-Sourced Data / by Chen Ye, Hongzhi Wang, Guojun Dai.

SpringerLink Books Computer Science (2011-2024) Available online

Format:: Book
Author/Creator:: Ye, Chen, Author.; Wang, Hongzhi, Author.; Dai, Guojun., Author.
Contributor:: SpringerLink (Online service)
Series:: Computer Science (SpringerNature-11645); SpringerBriefs in computer science 2191-5776; SpringerBriefs in Computer Science, 2191-5776
Language:: English
Subjects (All):: Data mining.; Database management.; Artificial intelligence-Data processing.; Data Mining and Knowledge Discovery.; Database Management.; Data Science.
Local Subjects:: Data Mining and Knowledge Discovery.; Database Management.; Data Science.
Physical Description:: 1 online resource (XII, 83 pages) : 14 illustrations, 9 illustrations in color.
Edition:: 1st ed. 2022.
Contained In:: Springer Nature eBook
Place of Publication:: Singapore : Springer Nature Singapore : Imprint: Springer, 2022.
System Details:: text file PDF
Summary:: This book addresses several knowledge discovery problems on multi-sourced data where the theories, techniques, and methods in data cleaning, data mining, and natural language processing are synthetically used. This book mainly focuses on three data models: the multi-sourced isomorphic data, the multi-sourced heterogeneous data, and the text data. On the basis of three data models, this book studies the knowledge discovery problems including truth discovery and fact discovery on multi-sourced data from four important properties: relevance, inconsistency, sparseness, and heterogeneity, which is useful for specialists as well as graduate students. Data, even describing the same object or event, can come from a variety of sources such as crowd workers and social media users. However, noisy pieces of data or information are unavoidable. Facing the daunting scale of data, it is unrealistic to expect humans to "label" or tell which data source is more reliable. Hence, it is crucial to identify trustworthy information from multiple noisy information sources, referring to the task of knowledge discovery. At present, the knowledge discovery research for multi-sourced data mainly faces two challenges. On the structural level, it is essential to consider the different characteristics of data composition and application scenarios and define the knowledge discovery problem on different occasions. On the algorithm level, the knowledge discovery task needs to consider different levels of information conflicts and design efficient algorithms to mine more valuable information using multiple clues. Existing knowledge discovery methods have defects on both the structural level and the algorithm level, making the knowledge discovery problem far from totally solved.
Contents:: 1. Introduction; 2. Functional-dependency-based truth discovery for isomorphic data; 3. Denial-constraint-based truth discovery for isomorphic data; 4. Pattern discovery for heterogeneous data; 5. Deep fact discovery for text data.
Other Format:: Printed edition:
ISBN:: 978-981-19-1879-7; 9789811918797
Access Restriction:: Restricted for use by site license.

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

1 option

Knowledge Discovery from Multi-Sourced Data / by Chen Ye, Hongzhi Wang, Guojun Dai.

Find

My Account

Guides