• Corpus ID: 24164601

Popularity of arXiv.org within Computer Science

Charles Sutton, Linan Gong
It may seem surprising that, out of all areas of science, computer scientists have been slow to post electronic versions of papers on sites like arXiv.org. Instead, computer scientists have tended to place papers on our individual home pages, but this loses the benefits of aggregation, namely notification and browsing. But this is changing. More and more computer scientists are now using the arXiv. At the same time, there is ongoing discussion and controversy about how prepublication affects… 

Citation Count Analysis for Papers with Preprints
It is observed that papers submitted to arXiv before acceptance have, on average, 65% more citations in the following year than papers submitted after; the authors note that this finding is not causal and outline possible next steps.
How many preprints have actually been printed and why: a case study of computer science preprints on arXiv
A case study of computer science preprints submitted to arXiv from 2008 to 2017 is conducted to quantify how many preprints have eventually been printed in peer-reviewed venues, and a semantics-based mapping method that employs Bidirectional Encoder Representations from Transformers (BERT) is introduced.
Analysis of Leading Communities Contributing to arXiv Information Distribution on Twitter
This paper uses the HITS algorithm to analyze the arXiv information diffusion network with users as nodes, extracts communities from the network of information spreaders with positive authority and hub degrees using the Louvain method, and identifies two types of key persons: information spreaders who lead the relevant field in the international community, and information spreaders who bridge the regional and international communities using English and their native language.
Textual analysis of artificial intelligence manuscripts reveals features associated with peer review outcome
The analysis of references included in the manuscripts revealed that accepted submissions were more likely to cite the same publications, and the peer review outcome of manuscripts was predicted from their word content.
A Dataset of Peer Reviews (PeerRead): Collection, Insights and NLP Applications
The first public dataset of scientific peer reviews available for research purposes (PeerRead v1) is presented and it is shown that simple models can predict whether a paper is accepted with up to 21% error reduction compared to the majority baseline.
Deep Learning in Science
The findings suggest that DL does not (yet?) work as an autopilot to navigate complex knowledge landscapes and overthrow their structure, but the 'DL principle' qualifies for its versatility as the nucleus of a general scientific method that advances science in a measurable way.
Back to the Future - Sequential Alignment of Text Representations
Inspired by successes in computer vision, this work argues that, due to its low computational expense, sequential alignment is a practical solution to dealing with language evolution.
Are models getting harder to find?
  • Computer Science, Economics
  • 2020
An R&D-based growth model, estimated on a monthly panel dataset of top performance across 93 machine learning benchmarks and on research input derived from academic publications, indicates modest positive inter-temporal knowledge spillovers but stark diminishing returns to research effort.
A Computational Literature Analysis of Conversational AI Research with a Focus on the Coaching Domain
We conduct a computational analysis of the literature on Conversational AI. We identify the trend based on all publications until the year 2020. We then concentrate on the publications for the last


Autonomous citation matching
This work presents machine learning techniques that identify variant forms of citations to the same paper, and presents a number of algorithms that perform best and are sufficiently accurate for unassisted use in an autonomous citation indexing system.
Open Scholarship and Peer Review: a Time for Experimentation
A vocabulary for describing the landscape of choices regarding open access, formal peer review, and public commentary is introduced, arguing that the opportunities and pitfalls of open peer review warrant experimentation in these dimensions, and discussing desiderata of a flexible system.
CiteSeer: an automatic citation indexing system
CiteSeer has many advantages over traditional citation indexes, including the ability to create more up-to-date databases which are not limited to a preselected set of journals or restricted by journal publication delays, completely autonomous operation with a corresponding reduction in cost, and powerful interactive browsing of the literature using the context of citations.
CoRR: a computing research repository
This paper describes the decisions by which the Association for Computing Machinery integrated good features from the Los Alamos e-print (physics) archive and from Cornell University's Networked
Efficient clustering of high-dimensional data sets with application to reference matching
This work presents a new technique for clustering large datasets, using a cheap, approximate distance measure to efficiently divide the data into overlapping subsets the authors call canopies, and presents experimental results on grouping bibliographic citations from the reference sections of research papers.
Automating the Construction of Internet Portals with Machine Learning
New research in reinforcement learning, information extraction and text classification is described that enables efficient spidering, the identification of informative text segments, and the population of topic hierarchies.
It was twenty years ago today
To mark the 20th anniversary of the commencement of hep-th@xxx.lanl.gov (now arXiv.org), I trace some historical context and early development of the resource, its later trajectory, and close with some thoughts about the future.
The internet and unrefereed scholarly publishing
  • R. Kling
  • Political Science, Computer Science
    Annu. Rev. Inf. Sci. Technol.
  • 2004
The author first recalls the concept of scientific or scholarly communication and the role of preprints in the process, before describing the development of different models for publishing and circulating unrefereed e-scripts in various research fields.
CSRankings. http://csrankings.org/, 2017. Accessed September 2017.