dblp.zip

This database is used for tasks related to author name disambiguation. It contains 69,574,243 records and 10 columns. It was obtained from the DBLP repository and preprocessed to extract all possible pairs of authors (2,665,634 unique authors) from the 5,299,929 papers in the database. There are some cases in the database where a single author is duplicated.

Attributes

- Record ID
- Publication ID
- Target Author
- Target Author's First Name
- Target Author's Last Name
- Co-author's First Name
- Co-author's Last Name
- Publication Title
- Year of Publication
- Source (Venue)

Note that Target Author = Target Author's First Name + Target Author's Last Name + Suffix. The suffix is added to the target author's name to ensure that it refers to a specific, unique person in the real world.

Example: Given the following reference string:

Boukhers, Zeyd, and Asundi, Nagaraj Bahubali. "Deep Author Name Disambiguation Using Bibliographic Data." International Conference on Theory and Practice of Digital Libraries. Springer, Cham, 2022.
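As a rough illustration of the pairing described above, the sketch below enumerates every ordered (target author, co-author) pair of one publication, including each author paired with themselves. It is not the preprocessing code used to build the database: the function name `extract_records`, the dictionary keys, and the way the suffix is passed in are all assumptions made for this example.

```python
from itertools import product


def extract_records(publication_id, authors, title, year, venue, first_record_id=1):
    """Build one record per ordered (target, co-author) pair of a publication.

    `authors` is a list of (first_name, last_name, suffix) tuples; the suffix
    is an empty string when no disambiguating suffix is needed.
    All names here are hypothetical and chosen for illustration only.
    """
    records = []
    pairs = product(authors, repeat=2)  # ordered pairs, self-pairs included
    for record_id, (target, coauthor) in enumerate(pairs, start=first_record_id):
        t_first, t_last, t_suffix = target
        c_first, c_last, _ = coauthor
        records.append({
            "record_id": record_id,
            "publication_id": publication_id,
            # Target Author = first name + last name + suffix
            "target_author": f"{t_first} {t_last}{t_suffix}",
            "target_first_name": t_first,
            "target_last_name": t_last,
            "coauthor_first_name": c_first,
            "coauthor_last_name": c_last,
            "title": title,
            "year": year,
            "venue": venue,
        })
    return records


records = extract_records(
    publication_id=1,
    authors=[("Zeyd", "Boukhers", ""), ("Nagaraj Bahubali", "Asundi", "001")],
    title="Deep Author Name Disambiguation Using Bibliographic Data",
    year=2022,
    venue="International Conference on Theory and Practice of Digital Libraries",
)
for record in records:
    print(record["record_id"], record["target_author"], "|",
          record["coauthor_first_name"], record["coauthor_last_name"])
```

A publication with two authors therefore yields four records.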
The following records are extracted:

| Record ID | Publication ID | Target Author | Target Author's First Name | Target Author's Last Name | Co-author's First Name | Co-author's Last Name | Publication Title | Year of Publication | Source (Venue) |
|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | Zeyd Boukhers | Zeyd | Boukhers | Zeyd | Boukhers | Deep Author Name Disambiguation Using Bibliographic Data | 2022 | International Conference on Theory and Practice of Digital Libraries |
| 2 | 1 | Zeyd Boukhers | Zeyd | Boukhers | Nagaraj Bahubali | Asundi | Deep Author Name Disambiguation Using Bibliographic Data | 2022 | International Conference on Theory and Practice of Digital Libraries |
| 3 | 1 | Nagaraj Bahubali Asundi001 | Nagaraj Bahubali | Asundi | Zeyd | Boukhers | Deep Author Name Disambiguation Using Bibliographic Data | 2022 | International Conference on Theory and Practice of Digital Libraries |
| 4 | 1 | Nagaraj Bahubali Asundi001 | Nagaraj Bahubali | Asundi | Nagaraj Bahubali | Asundi | Deep Author Name Disambiguation Using Bibliographic Data | 2022 | International Conference on Theory and Practice of Digital Libraries |

data.zip

It contains pickle files in the format <n>_<first_name_acronym> <full_last_name>.pickle, each of which refers to an atomic name (i.e. the acronym of the first name and the full last name), where <n> denotes the number of real-world authors sharing the atomic name. The pickle file contains the indices of these real-wo...