The incompleteness of author names is a well-known issue in the MEDLINE database. Full author names have been systematically indexed in MEDLINE only since 2002. Although many full author names have since been added, we still found a significant number of abbreviated names in papers published after 2002.
Here we built an enhanced author name dataset for MEDLINE, called EAN, by linking all of PubMed to other large literature databases and then comparing and restoring names at scale using the author names obtained from these multiple sources. Our evaluation shows that more than 90% of author names in EAN are complete, compared with ~60% in MEDLINE.
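
To illustrate the kind of name comparison and restoration described above, the following is a minimal sketch and not the actual EAN pipeline. It assumes that a MEDLINE record has already been linked to the same paper in another database, that each author is a (last name, forename) pair, and that initials are separated by spaces or periods. The function names `tokens`, `is_compatible`, and `restore_name` are hypothetical.

```python
import unicodedata


def tokens(name):
    """Lowercase, strip accents, and split a name into tokens."""
    decomposed = unicodedata.normalize("NFKD", name)
    stripped = "".join(c for c in decomposed if not unicodedata.combining(c))
    return stripped.lower().replace(".", " ").replace("-", " ").split()


def is_compatible(abbrev_fore, full_fore):
    """True if every token of the abbreviated forename is a prefix of the
    token at the same position in the candidate full forename."""
    a, f = tokens(abbrev_fore), tokens(full_fore)
    if not a or len(a) > len(f):
        return False
    return all(ft.startswith(at) for at, ft in zip(a, f))


def restore_name(medline_author, candidates):
    """Return the author with a fuller forename if exactly one candidate
    from the linked source matches; otherwise return the author unchanged.

    medline_author: (last, fore) as indexed in MEDLINE (assumed layout)
    candidates: list of (last, fore) for the same paper in another source
    """
    last, fore = medline_author
    matches = [
        c for c in candidates
        if tokens(c[0]) == tokens(last) and is_compatible(fore, c[1])
    ]
    # Restore only when the match is unambiguous and actually adds information.
    if len(matches) == 1 and len(matches[0][1]) > len(fore):
        return (last, matches[0][1])
    return medline_author
```

For example, `restore_name(("Smith", "J."), [("Smith", "John"), ("Lee", "Jia")])` yields `("Smith", "John")`, while an abbreviated name that matches two candidates with the same last name is left unchanged rather than guessed.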