Automatic Disambiguation of Author Names in Bibliographic Repositories

Author : Anderson A. Ferreira
Publisher : Morgan & Claypool Publishers
Total Pages : 148
Release : 2020-06-01
ISBN-10 : 1681738589
ISBN-13 : 9781681738581

Book Synopsis: Automatic Disambiguation of Author Names in Bibliographic Repositories by Anderson A. Ferreira

Automatic Disambiguation of Author Names in Bibliographic Repositories was written by Anderson A. Ferreira and published by Morgan & Claypool Publishers. It was released on 2020-06-01 and has 148 pages. Available in PDF, EPUB and Kindle.

Book excerpt: This book deals with a hard problem that is inherent to human language: ambiguity. In particular, we focus on author name ambiguity, a type of ambiguity that exists in digital bibliographic repositories and that occurs when an author publishes works under distinct names or when distinct authors publish works under similar names. This problem has a number of causes, including the lack of standards and common practices, and the decentralized generation of bibliographic content. As a consequence, the quality of the main services of digital bibliographic repositories, such as search, browsing, and recommendation, may be severely affected by author name ambiguity. The book focuses on automatic methods, since manual solutions do not scale to the size of current repositories or to the speed at which they are updated. Accordingly, we provide a broad view of the problem of automatic disambiguation of author names, summarizing the results of more than a decade of research on this topic conducted by our group, reported in more than a dozen publications that have received over 900 citations so far, according to Google Scholar.

We start by discussing the issues that motivate the problem (Chapter 1). Next, we formally define the author name disambiguation task (Chapter 2) and use this formalization to provide a brief, taxonomically organized overview of the literature on the topic (Chapter 3). We then organize, summarize, and integrate our own group's efforts to develop solutions for the problem, which produced results that were state of the art, in terms of disambiguation quality, at the time they were proposed.
Thus, Chapter 4 covers HHC - Heuristic-based Hierarchical Clustering, an author name disambiguation method that is based on two specific real-world assumptions regarding scientific authorship. Then, Chapter 5 describes SAND - Self-training Author Name Disambiguator, and Chapter 6 presents two incremental author name disambiguation methods, namely INDi - Incremental Unsupervised Name Disambiguation and INC - Incremental Nearest Cluster. Finally, Chapter 7 provides an overview of recent author name disambiguation methods that explore newer approaches such as graph-based representations, alternative predefined similarity functions, visualization facilities, and artificial neural networks. The chapters are followed by three appendices that cover, respectively: (i) a pattern matching function for comparing proper names, used by some of the methods addressed in this book; (ii) a tool for generating synthetic collections of citation records for distinct experimental tasks; and (iii) a number of datasets commonly used to evaluate author name disambiguation methods. In summary, the book organizes a large body of knowledge and work on author name disambiguation from the last decade, hoping to consolidate a solid basis for future developments in the field.


Automatic Disambiguation of Author Names in Bibliographic Repositories Related Books

Automatic Disambiguation of Author Names in Bibliographic Repositories
Language: en
Pages: 148
Authors: Anderson A. Ferreira
Categories: Computers
Type: BOOK - Published: 2020-06-01 - Publisher: Morgan & Claypool Publishers


This book deals with a hard problem that is inherent to human language: ambiguity. In particular, we focus on author name ambiguity, a type of ambiguity that ex
Automatic Disambiguation of Author Names in Bibliographic Repositories
Language: en
Pages: 126
Authors: Anderson A. Ferreira
Categories: Computers
Type: BOOK - Published: 2022-06-01 - Publisher: Springer Nature


This book deals with a hard problem that is inherent to human language: ambiguity. In particular, we focus on author name ambiguity, a type of ambiguity that ex
Knowledge Graphs and Semantic Web
Language: en
Pages: 355
Authors: Boris Villazón-Terrazas
Categories: Computers
Type: BOOK - Published: 2022-11-12 - Publisher: Springer Nature


This book constitutes the proceedings of the 4th Iberoamerican Conference and third Indo-American Conference on Knowledge Graphs and Semantic Web, KGSWC 2022, w
Information Management and Big Data
Language: en
Pages: 563
Authors: Juan Antonio Lossio-Ventura
Categories: Computers
Type: BOOK - Published: 2021-05-11 - Publisher: Springer Nature


This book constitutes the refereed proceedings of the 7th International Conference on Information Management and Big Data, SIMBig 2020, held in Lima, Peru, in O
International Conference on Digital Libraries (ICDL) 2013
Language: en
Pages: 1230
Authors: Shantanu Ganguly
Categories: Language Arts & Disciplines
Type: BOOK - Published: 2013-11-29 - Publisher: The Energy and Resources Institute (TERI)


ICDL conferences are recognized as one of the most important platforms in the world where noted experts share their experiences. Many DL experts have contributed thou