Research Article |
Corresponding author: R. Henrik Nilsson ( henrik.nilsson@bioenv.gu.se ) Academic editor: Thorsten Lumbsch
© 2024 R. Henrik Nilsson, Arnold Tobias Jansson, Christian Wurzbacher, Sten Anslan, Pauline Belford, Natàlia Corcoll, Alexandra Dombrowski, Masoomeh Ghobad-Nejhad, Mikael Gustavsson, Daniela Gómez-Martínez, Faheema Kalsoom Khan, Maryia Khomich, Charlotte Lennartsdotter, David Lund, Breyten Van Der Merwe, Vladimir Mikryukov, Marko Peterson, Teresita M. Porter, Sergei Põlme, Alice Retter, Marisol Sanchez-Garcia, Sten Svantesson, Patrik Svedberg, Duong Vu, Martin Ryberg, Kessy Abarenkov, Erik Kristiansson.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Nilsson RH, Jansson AT, Wurzbacher C, Anslan S, Belford P, Corcoll N, Dombrowski A, Ghobad-Nejhad M, Gustavsson M, Gómez-Martínez D, Kalsoom Khan F, Khomich M, Lennartsdotter C, Lund D, Van Der Merwe B, Mikryukov V, Peterson M, Porter TM, Põlme S, Retter A, Sanchez-Garcia M, Svantesson S, Svedberg P, Vu D, Ryberg M, Abarenkov K, Kristiansson E (2024) 20 years of bibliometric data illustrates a lack of concordance between journal impact factor and fungal species discovery in systematic mycology. MycoKeys 110: 273-285. https://doi.org/10.3897/mycokeys.110.136048
|
Journal impact factors were devised to qualify and compare university library holdings but are frequently repurposed for use in ranking applications, research papers, and even individual applicants in mycology and beyond. The widely held assumption that mycological studies published in journals with high impact factors add more to systematic mycology than studies published in journals without high impact factors nevertheless lacks evidential underpinning. The present study uses the species hypothesis system of the UNITE database for molecular identification of fungi and other eukaryotes to trace the publication history and impact factor of sequences uncovering new fungal species hypotheses. The data show that journal impact factors are poor predictors of discovery potential in systematic mycology. There is no clear relationship between journal impact factor and the discovery of new species hypotheses for the years 2000–2021. On the contrary, we found journals with low, and even no, impact factor to account for substantial parts of the species hypothesis landscape, often discovering new fungal taxa that are only later picked up by journals with high impact factors. Funding agencies and hiring committees that insist on upholding journal impact factors as a central funding and recruitment criterion in systematic mycology should consider using indicators such as research quality, productivity, outreach activities, review services for scientific journals, and teaching ability directly rather than using publication in high impact factor journals as a proxy for these indicators.
Bibliometrics, impact factor, mycology, systematics, taxonomy
The concept of journal impact factors (IFs) was introduced in the 1970s as a means to qualify and compare university library holdings (
Over time, parts of the scientific community warmed to the idea that IFs may be an unwarrantably simplistic predictor of past and future research performance (
In the context of systematic mycology, down-prioritizing researchers and research groups without a strong track record of high-IF publications would make sense if, indeed, low-IF publications and no-IF publications do not contribute much, or anything, to systematic mycology. Conversely, if it is the case that also (or even primarily) low- and no-IF publications make substantial contributions to this field, then the usefulness of IFs as a decisive indicator in systematic mycology would be illusory and, in fact, directly counterproductive. Is there data to form some sort of evidential underpinning for the contribution of IFs to systematic mycology? We argue that there is. The UNITE database for molecular identification of eukaryotes clusters all public, full-length fungal barcode (nuclear ribosomal internal transcribed spacer, or ITS) sequences in the International Nucleotide Sequence Database Collaboration (
The full flow of operation behind the UNITE database is described elsewhere (
The 1,258,182 Sanger sequencing-derived sequences of UNITE eukaryotic release 10 were found to be distributed across 182,847 SHs at the default 1.5% sequence dissimilarity level. We targeted sequences submitted to the INSDC in the interval 2000–2021. All 43,057 non-singleton SHs whose first (earliest date of INSDC deposition) sequence was annotated (by default or by subsequent third-party sequence annotation) as fungal were targeted. These SHs comprised a total of 506,103 sequences. All other SHs were considered to represent non-fungal eukaryotes and are not treated any further in this study. We examined all sequences computationally for information on publication of origin. A total of 23,710 (55.1%) fungal SHs were first discovered through a sequence for which a published study of origin was specified in INSDC, leaving 19,347 (44.9%) of the initial-SH-discovery sequences with a publication status of the “Unpublished” or “Direct submission” kind. Some proportion of these seemingly unpublished sequences can be expected to be published but not updated with publication information in INSDC (
Official journal impact factors were compiled from ISI Web of Science for the period 2000–2021 for all journals sporting sequences in all SHs deemed to be fungal. The annual median impact factor for mycology was inferred from the 33 mycological journals in ISI’s journal category “Mycology”. Each sequence was assigned the impact factor of its outlet and the year of publication in the IF window 2000–2021. Two alternative approaches were adopted for sequences published in an outlet without a formal IF for the year of publication. In the “strict median” approach, they were not assigned any IF value and were excluded from estimates of median IFs. In the “relaxed median” approach, they were assigned an IF of 0.0 and were included in estimates of median IFs. IFs were considered down to the three decimal digits supported by ISI Web of Science. As a baseline, we also analyzed all formal fungal species descriptions 2000–2021 for impact factor using GBIF (https://www.gbif.org/) and MycoBank (
We found 43,057 non-singleton UNITE SHs to be fungal. The first (oldest) sequence in each such SH was examined for publication information. More than half (23,710; 55.1%) of these were found to be annotated to publication of origin, leaving 19,347 (44.9%) of the “Unpublished” and “Direct submission” kind. We were able to track down 10,203 (52.7%) of these to a published study of origin, giving us a final dataset of 33,913 published sequences, each representing a first, initial discovery of an SH. These 33,913 sequences were found to come from 6,878 studies. Sequences only released through B.Sc./M.Sc./Ph.D. theses were scored as unpublished. The unpublished sequences are not considered any further in this study. The SH-derived sequences of the study were found to have been published in well over 1,500 different journals and outlets, ranging from top-tier international journals with an IF of over 30 to what seemed to be regional or even local journals without an online presence.
In total, 28,662 (84.5%) of the sequences that were the first to evince a new SH discovery were published in a scientific journal with a formal IF that year, leaving 5,251 (15.5%) of the sequences published in an outlet without. Out of those 5,251 sequences, 2,223 (42.3%) were published in a journal that did not have a formal IF at the time of publication, but that eventually obtained one after an average of 4.7 years, leaving 3,028 (57.7%) of the without-IF-sequences published in an outlet that never had a formal IF (2000–2021). The strict and relaxed median IFs of sequences discovering new SHs over time are displayed in Fig.
a the median impact factor of initial discoveries of UNITE species hypotheses (SHs). For the red curve, only sequences published in a journal with a formal impact factor from the year of publication were included in the calculation. For the blue curve, also sequences published in journals without a formal impact factor from the year of publication were included with their impact factor set to 0.0. The green curve shows the average impact factor of the journals in ISI’s category “Mycology” over time b the median impact factor of initial discoveries of SHs visualized as the difference in IF from the median mycological IF (dashed line) over time. The post-2015 drop in relative impact factor is presumably explained by the trend of increasing IFs in ISI’s category “Mycology” over time and mycologists’ apparent struggle to take advantage of this trend when publishing.
The median IFs of sequences inside SHs over time. SHs were discovered at year 0. The SHs were then inspected for subsequent recoveries in journals with a formal IF, and the value plotted is the median of all such recoveries. To account for the trend of increasing IFs in mycology, the data was normalized by subtracting the median impact factor of the journals in ISI’s category “Mycology”. The unit of the y axis is difference in IF. The error bars show 95% confidence intervals.
11,386 (33.6%) SHs contained sequences that were released through two or more distinct studies, all of which either lacked a formal IF or had an IF below the median mycological IF the year of publication. Similarly, 2,048 (6.0%) SHs were found to be known from two or more distinct studies, all of which had an IF above the median mycological IF the year of publication. 313 (0.9%) SHs were found to be known from two or more distinct studies, none of which were published in an outlet with a formal IF. In analogy, 12,477 (36.8%) SHs were found to be known from two or more distinct studies, all of which were published in an outlet with a formal IF at the year of publication. 1,260 (3.7%) non-singleton SHs were recovered both from journals with and without formal IFs. The results of the analysis of the impact factors of formal fungal species descriptions 2000–2021 are given in Suppl. material
The present study examined the relation of journal IFs to discovery potential in systematic mycology. Our results are largely dispiriting – there seems to be no meaningful correlation between IFs and mycosystematical discovery potential as measured as the discovery of new SHs in UNITE. On the contrary, at least in systematic mycology, journal IFs come across as a concept divested of meaning, or at least the meaning ascribed to it in the committee meetings that many of the present authors regularly attend. For instance, for the last 10 years, the majority of new SHs were first reported from journals with an IF below the median mycological IF in a trend that is accentuated over time (Fig.
In an IF-centred world, important mycological findings would be announced in high-IF journals, and those results would only later trickle down and be subsumed into studies published in journals of lesser, or no, IFs. Our results take umbrage with such a contention (Fig.
Our results do not necessarily suggest that the mycological community should prioritize low/no-IF researchers and research teams, but rather that IFs are a superficially deep, but deeply superficial, measure of mycosystematical discovery potential. If it, indeed, is mycosystematical discovery potential that we wish to promote, then time’s provision of further and better particulars seems to call for abandoning oversimplified shortcuts in the assessment of a researcher’s previous production. Maybe, in fact, there are no shortcuts (
Ranking candidates based on IFs furthermore perpetuates the ‘Matthew effect’ whereby candidates who happen to publish in high impact journals early in their career accrue more recognition and cumulative advantage relative to other candidates (
This study makes the simplification to define “systematic mycology” as the field that discovers and describes new species and groups of fungi – which is what the present study quantifies. We are well aware that systematic mycology covers more than just that, and that the discovery of new SHs in UNITE and formal description of species do not do full justice to the discipline. At the same time, it would seem like a stretch to argue that the discovery and formal description of new species and groups of fungi, unlike all other aspects of systematic mycology, scale poorly to IFs. Instead, we hypothesize that our data speak reasonably well for all of systematic mycology in arguing against the use of IFs as a decisive indicator in systematic mycology. Our study made heavy use of the UNITE SH system, which is based on the formal fungal barcode, the nuclear ribosomal ITS region (
Our approach was to some extent haunted by missing data – at the onset of the project, a full 19,347 (44.9%) of the sequences representing initial discoveries of species hypotheses were not annotated with a study of origin. We spent more than three months trying to restore this information, but we often found ourselves struggling with journals without a digital presence, journals in other languages than the present set of co-authors had access to, special characters, conflicting information, and the sheer magnitude of the task at hand. In the end, we were able to restore the publication information for 10,203 sequences, reducing the share of “unpublished” sequences from 49.7% to 21.2%. We find it remarkable that upwards of half of the public fungal barcode sequences older than two years were un-annotated to begin with in this regard.
Mycologists regularly report feeling compelled to publish in high-IF journals by virtue of professionalism. Our data suggest that if we by professionalism mean keeping the best interest of systematic mycology in mind, then journal IFs are at a particular risk of misinterpretation – and are regularly ascribed a weight that endangers progress in the field. We eagerly anticipate a future where applications and candidates are assessed in a more integrative way than simple summary metrics obtained from journal IFs, and where mycological contributions are quantified in a way agnostic of the very journal in which they happened to be published. Our non-trivial experience of serving in various evaluation committees is dispiriting in this regard, painting a bleak picture for the future of systematic mycology in a time when the understanding of fungal diversity is more important than ever.
GBIF and MycoBank are acknowledged for assistance with bibliographic data for species descriptions.
The authors have declared that no competing interests exist.
No ethical statement was reported.
This work was supported by Vetenskapsrådet (2023-03456) and Artdatabanken (2023.4.3-30).
All authors have contributed equally.
R. Henrik Nilsson https://orcid.org/0000-0002-8052-0107
Arnold Tobias Jansson https://orcid.org/0000-0002-9100-9814
Christian Wurzbacher https://orcid.org/0000-0001-7418-0831
Sten Anslan https://orcid.org/0000-0002-2299-454X
Pauline Belford https://orcid.org/0009-0005-5297-483X
Natàlia Corcoll https://orcid.org/0000-0002-1660-9422
Masoomeh Ghobad-Nejhad https://orcid.org/0000-0002-7807-4187
Mikael Gustavsson https://orcid.org/0000-0003-0635-8321
Daniela Gómez-Martínez https://orcid.org/0000-0001-8834-8930
Faheema Kalsoom Khan https://orcid.org/0000-0002-4891-953X
Maryia Khomich https://orcid.org/0000-0002-6840-5739
Charlotte Lennartsdotter https://orcid.org/0009-0008-2780-4881
David Lund https://orcid.org/0000-0003-4075-7529
Breyten Van Der Merwe https://orcid.org/0000-0003-0546-5619
Vladimir Mikryukov https://orcid.org/0000-0003-2786-2690
Teresita M. Porter https://orcid.org/0000-0002-0227-6874
Sergei Põlme https://orcid.org/0000-0002-9658-1166
Alice Retter https://orcid.org/0009-0004-3758-4123
Marisol Sanchez-Garcia https://orcid.org/0000-0002-0635-6281
Sten Svantesson https://orcid.org/0000-0003-3435-3659
Patrik Svedberg https://orcid.org/0000-0001-6622-5366
Duong Vu https://orcid.org/0000-0001-7960-2765
Martin Ryberg https://orcid.org/0000-0002-6795-4349
Kessy Abarenkov https://orcid.org/0000-0001-5526-4845
Erik Kristiansson https://orcid.org/0000-0002-8609-2414
All of the data that support the findings of this study are available in the main text or Supplementary Information.
Supplementary item
Data type: pdf
Explanation note: a the median impact factor of formal fungal species descriptions 2000-2021. For the red curve, only descriptions published in a journal with a formal impact factor from the year of publication were included in the calculation. For the blue curve, also descriptions published in journals without a formal impact factor from the year of publication were included with their impact factor set to 0.0. The green curve shows the average impact factor of the journals in ISI’s category “Mycology” over time b the median impact factor of formal species descriptions visualized as the difference in IF from the median mycological IF (dashed line) over time c the proportion of formal description of species in journals without a formal IF (red) or with an IF below the mycological median (orange) over time.