Detecting anomalous referencing patterns in PubMed papers suggestive of author-centric reference list manipulation.

Scientometrics

Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, 825 N.E. 13th Street, Oklahoma City, OK 73104-5005, USA.

Published: October 2022


Article Abstract

Although citations are used as a quantifiable, objective metric of academic influence, references could be added to a paper solely to inflate the perceived influence of a body of research. This reference list manipulation (RLM) could take place during the peer-review process, or prior to it. Surveys have estimated how many people may have been affected by coercive RLM at one time or another, but it is not known how many authors engage in RLM, nor to what degree. By examining a subset of active, highly published authors (n = 20,803) in PubMed, we find the frequency of non-self-citations (NSC) to one author coming from a single paper approximates Zipf's law. Author-centric deviations from it are approximately normally distributed, permitting deviations to be quantified statistically. Framed as an anomaly detection problem, statistical confidence increases when an author is an outlier by multiple metrics. Anomalies are not proof of RLM, but authors engaged in RLM will almost unavoidably create anomalies. We find the NSC Gini Index correlates highly with anomalous patterns across multiple "red flags", each suggestive of RLM. Between 81 (0.4%, FDR < 0.05) and 231 (1.1%, FDR < 0.10) authors are outliers on the curve, suggestive of chronic, repeated RLM. Approximately 16% of all authors may have engaged in RLM to some degree. Authors who use 18% or more of their references for self-citation are significantly more likely to have NSC Gini distortions, suggesting a potential willingness to coerce others to cite them.
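The abstract does not spell out how the NSC Gini Index is computed, so the following is only a minimal sketch. It assumes the standard Gini coefficient applied to the number of non-self-citations one author receives from each citing paper. The function name `gini` and the sample counts are hypothetical illustrations, not data from the study.

```python
def gini(counts):
    """Standard Gini coefficient of non-negative counts.

    Returns 0.0 when every citing paper contributes the same number of
    citations, and approaches 1.0 when a few papers contribute nearly all.
    Assumption: this textbook definition stands in for the paper's NSC Gini Index.
    """
    values = sorted(counts)
    n = len(values)
    total = sum(values)
    if n == 0 or total == 0:
        return 0.0
    # Rank-weighted form: G = 2 * sum(i * x_i) / (n * sum(x)) - (n + 1) / n,
    # with x sorted ascending and ranks i = 1..n.
    weighted = sum(rank * x for rank, x in enumerate(values, start=1))
    return 2.0 * weighted / (n * total) - (n + 1.0) / n


# Hypothetical per-paper NSC counts for two authors (illustrative only).
even_author = [1, 1, 1, 1, 1, 1, 1, 1]      # each citing paper cites the author once
skewed_author = [1, 1, 1, 1, 1, 1, 1, 25]   # one paper supplies most of the citations

print(gini(even_author))
print(gini(skewed_author))
```

Under this assumption, an author whose non-self-citations are concentrated in a handful of citing papers gets a higher value than one cited evenly, which is the kind of distortion the abstract treats as a red flag rather than as proof of RLM.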

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10836843
DOI: http://dx.doi.org/10.1007/s11192-022-04503-6
