Where Does ArXiv Fit Into Open Science? But First, How Do I Pronounce It?

Image courtesy of National Cancer Institute, Unsplash

What is ArXiv?

You may have seen ArXiv pop up in your science-related or open access repository searches. For those of us who are still wondering how to say it, it is pronounced ‘archive’. The X represents the Greek letter chi (χ), which is pronounced ‘ch’ and thus spells out archive.

ArXiv is an open access repository for pre-prints[1] and post-prints[2] that have been moderated but not peer reviewed. It was the first freely available, open access repository, established years before Creative Commons or other mechanisms were available or the Internet became ubiquitous. ArXiv was established in 1991 as a way for physicists and mathematicians to circulate their research for comment before peer review and publication in a journal. It was developed with distribution formats few people use today: File Transfer Protocol (FTP), Gopher, and Mosaic (one of the first widely used web browsers). According to Wikipedia:[3]

In many fields of mathematics and physics, almost all scientific papers are self-archived on the arXiv repository before publication in a peer-reviewed journal. Some publishers also grant permission for authors to archive the peer-reviewed postprint.

ArXiv has become tremendously important for scientists worldwide. On this, its 30th birthday, there are almost 2 million articles posted on the site in physics, mathematics, computer science, quantitative biology, quantitative finance, statistics, electrical engineering, systems science, and economics. 

In the 30 years since ArXiv’s founding, many additional servers for different disciplines and regions have been established, all on the Xiv model. AfricArXiv, for Africa, is discussed in detail below. Two preprint servers in the biomedical sciences have received considerable attention because of the COVID-19 pandemic, with several of their articles widely quoted in the mass media and in research publications.[4] But how much relevance do ArXiv and the other servers have for Africa?

How relevant are preprint servers to Africa?

ArXiv and the disciplinary servers that followed aim to help scientists worldwide share results and get feedback quickly without waiting for publication in journals. Although bibliometric surveys have not been done for all of the servers, two in the biomedical sciences point to the paucity of African research appearing in them. A 2020 survey published in eLife, covering over 67,000 articles posted on bioRxiv, found that international authorship and collaboration by African researchers was scant and that most African researchers were not the principal author. Table 1 shows the 11 African countries that published the most in bioRxiv.[5]

Table 1: Which African countries publish the most in bioRxiv?

Figure 1: The good news[6]

When the researchers dug deeper into subject matter and where the research was carried out, they found:

Figure 2: Digging deeper[7]

This is important research for those of us who care about the contribution of African scientists to the global knowledge pool.  The role of scientists has evolved over the course of the pandemic – they are becoming more public-facing, which can significantly improve dissemination of accurate information in any given country.  Examples include Dr. Anthony Fauci[8] and Dr. John Nkengasong.[9] It is also important to encourage representation so that we can learn from other approaches and develop solutions that cater for diverse populations. 

 

AfricArXiv

In 2018, African scientists established an open access African preprint server, called AfricArXiv, to promote better visibility for African research and enhanced collaboration throughout the continent. AfricArXiv’s African focus is reflected in a special set of objectives, among them:

  • It is an African-owned open scholarly repository, a knowledge commons of African scholarly works to catalyze the African Renaissance.
  • Submissions must be relevant to Africa, with at least one African author.
  • Language is important in Africa, where AfricArXiv estimates that over 2,000 languages are spoken, a number which has been confirmed elsewhere.[10] All submissions must be accompanied by a summary in English and French to ease language gaps. Automated translation is allowed but must be acknowledged, because these translations are not always accurate. AfricArXiv also uses volunteer translators. In addition, AfricArXiv encourages postings in African languages and is partnering with Masakhane to undertake human translation of articles into African languages. AfricArXiv writes about the significance of translation as follows:[11]

We encourage submissions in languages that are commonly used by the scientific community in the respective country, such as English, French, Swahili, Zulu, Afrikaans, Igbo, Akan, or other native African languages. Manuscripts submitted in non-English languages will be held in the moderation queue until we can get them verified. We herewith encourage you to suggest people who could assist in moderating in your language.

For those who want to know more about the significance of preprint servers, we encourage you to read Ten simple rules to consider regarding preprint submission, which was published in May 2017 in PLOS Computational Biology, an open access and highly prestigious journal. The article has been viewed over 44,000 times and cited 61 times.[12]

For those who would like more information on the lack of representation of African science in the journal literature, please see Where there is no local author: a network bibliometric analysis of authorship parasitism among research conducted in sub-Saharan Africa, published on 27 October 2021 in BMJ Global Health.[13]  It demonstrates how few African biomedical researchers receive recognition for research results from their own countries.  See also the journal’s editorial on ‘parachute’ research: Using scientific authorship criteria as a tool for equitable inclusion in global health research.[14]

 

This has been one of OER Africa’s communications on open knowledge, which we will continue to explore in future communications.



 


[1] Wikipedia contributors. (2021b, October 8). Preprint. Wikipedia. Retrieved October 25, 2021, from https://en.wikipedia.org/wiki/Preprint (CC BY-SA)

[2] Wikipedia contributors. (2021b, October 1). Postprint. Wikipedia. Retrieved October 25, 2021, from https://en.wikipedia.org/wiki/Postprint (CC BY-SA)

[3] Wikipedia contributors. (2021, September 7). ArXiv. Wikipedia. Retrieved 13 October 2021 from https://en.wikipedia.org/wiki/ArXiv#Moderation_process_and_endorsement (CC BY)

[4] Ginsparg, P. Lessons from arXiv’s 30 years of information sharing. Nat Rev Phys 3, 602–603 (2021). https://doi.org/10.1038/s42254-021-00360-z (Freely available but copyright protected. Springer Nature has a content-sharing initiative, which does not permit printing; the link for this article is https://rdcu.be/czHnW.)

[5] Abdill RJ, Adamowicz EM, Blekhman R. International authorship and collaboration across bioRxiv preprints. Elife. 2020 Jul 27;9:e58496. doi: 10.7554/eLife.58496. PMID: 32716295; PMCID: PMC7384855. (CC BY)

[6] Guleid, F. H., Oyando, R., Kabia, E., Mumbi, A., Akech, S., & Barasa, E. (2021, March 17). A bibliometric analysis of COVID-19 research in Africa. MedRxiv. Retrieved 19 October 2021 from https://www.medrxiv.org/content/10.1101/2021.03.15.21253589v1 (CC BY)

[7] Guleid FH, Oyando R, Kabia E, et al. A bibliometric analysis of COVID-19 research in Africa. BMJ Global Health 2021. https://gh.bmj.com/content/6/5/e005690 (CC BY)

[10] Wikipedia contributors. (2021d, October 23). Languages of Africa. Wikipedia. Retrieved November 3, 2021, from https://en.wikipedia.org/wiki/Languages_of_Africa (CC BY)

[11] Languages – AfricArXiv. (n.d.). AfricArXiv. Retrieved October 20, 2021, from https://info.africarxiv.org/languages/ (CC BY)

[12] Bourne PE, Polka JK, Vale RD, Kiley R (2017) Ten simple rules to consider regarding preprint submission. PLoS Comput Biol 13(5): e1005473. Retrieved 20 October 2021 from https://doi.org/10.1371/journal.pcbi.1005473 (CC0)

[13] Rees CA, Ali M, Kisenge R, et al. Where there is no local author: a network bibliometric analysis of authorship parasitism among research conducted in sub-Saharan Africa. BMJ Global Health 2021. https://gh.bmj.com/content/6/10/e006982 (CC BY-NC)

[14] Sam-Agudu NA, Abimbola S. Using scientific authorship criteria as a tool for equitable inclusion in global health research. BMJ Global Health 2021. https://gh.bmj.com/content/6/10/e007632 (CC BY-NC)