Corpus ID: 24164601

Popularity of arXiv.org within Computer Science

Charles A. Sutton and Linan Gong
It may seem surprising that, out of all areas of science, computer scientists have been slow to post electronic versions of papers on sites like arXiv.org. Instead, computer scientists have tended to place papers on our individual home pages, but this loses the benefits of aggregation, namely notification and browsing. But this is changing. More and more computer scientists are now using the arXiv. At the same time, there is ongoing discussion and controversy about how prepublication affects…
Citation Count Analysis for Papers with Preprints
Papers submitted to arXiv before acceptance are observed to have, on average, 65% more citations in the following year than papers submitted after; the authors note that this finding is not causal and discuss possible next steps.
How many preprints have actually been printed and why: a case study of computer science preprints on arXiv
A case study of computer science preprints submitted to arXiv from 2008 to 2017 quantifies how many preprints have eventually been printed in peer-reviewed venues and introduces a semantics-based mapping method that employs Bidirectional Encoder Representations from Transformers (BERT).
Textual analysis of artificial intelligence manuscripts reveals features associated with peer review outcome
The analysis of references included in the manuscripts revealed that accepted submissions were more likely to cite the same publications, and the peer review outcome of manuscripts was predicted from their word content.
A Dataset of Peer Reviews (PeerRead): Collection, Insights and NLP Applications
The first public dataset of scientific peer reviews available for research purposes (PeerRead v1) is presented and it is shown that simple models can predict whether a paper is accepted with up to 21% error reduction compared to the majority baseline.
Deep Learning in Science
The findings suggest that DL does not (yet?) work as an autopilot to navigate complex knowledge landscapes and overthrow their structure, but the 'DL principle' qualifies for its versatility as the nucleus of a general scientific method that advances science in a measurable way.
Back to the future
Language evolves over time in many ways relevant to natural language processing tasks. For example, recent occurrences of tokens 'BERT' and 'ELMO' in publications refer to neural network…
Back to the Future - Sequential Alignment of Text Representations
Inspired by successes in computer vision, this work argues that, due to its low computational expense, sequential alignment is a practical solution to dealing with language evolution.
Are models getting harder to find?
  • 2020
We estimate an R&D-based growth model using: (1) data on machine learning performance using a monthly panel dataset on the top performance across 93 machine learning benchmarks, and (2) data on…
Machine learning for data-driven discovery in solid Earth geoscience
Solid Earth geoscience is a field with very large sets of observations, which are ideal for analysis with machine-learning methods; how these methods can be applied to solid Earth datasets is reviewed.
Unbalanced Multistage Heat Conduction and Mass Diffusion Algorithm in an Educational Digital Library
A weighted network-based information filtering framework is presented that models user usage as a bipartite user-resource network: users and resources are treated as nodes, each edge from a user to a resource represents usage, and the edge weight represents the accumulation of multiple usage scenarios.


Autonomous citation matching
This work presents machine learning techniques that identify variant forms of citations to the same paper, and presents a number of algorithms that perform best and are sufficiently accurate for unassisted use in an autonomous citation indexing system.
Open Scholarship and Peer Review: a Time for Experimentation
A vocabulary for describing the landscape of choices regarding open access, formal peer review, and public commentary is introduced, arguing that the opportunities and pitfalls of open peer review warrant experimentation in these dimensions, and discussing desiderata of a flexible system.
CiteSeer: an automatic citation indexing system
CiteSeer has many advantages over traditional citation indexes, including the ability to create more up-to-date databases which are not limited to a preselected set of journals or restricted by journal publication delays, completely autonomous operation with a corresponding reduction in cost, and powerful interactive browsing of the literature using the context of citations.
CoRR: a computing research repository
This paper describes the decisions by which the Association for Computing Machinery integrated good features from the Los Alamos e-print (physics) archive and from Cornell University's Networked…
Efficient clustering of high-dimensional data sets with application to reference matching
This work presents a new technique for clustering large datasets, using a cheap, approximate distance measure to efficiently divide the data into overlapping subsets the authors call canopies, and presents experimental results on grouping bibliographic citations from the reference sections of research papers.
Automating the Construction of Internet Portals with Machine Learning
New research in reinforcement learning, information extraction and text classification is described that enables efficient spidering, the identification of informative text segments, and the population of topic hierarchies.
It was twenty years ago today
To mark the 20th anniversary of the commencement of (now, I trace some historical context and early development of the resource, its later trajectory, and close with some thoughts about the future.
The internet and unrefereed scholarly publishing
  • R. Kling
  • Political Science, Computer Science
  • Annu. Rev. Inf. Sci. Technol.
  • 2004
The author first reviews the concept of scientific or scholarly communication and the role of preliminary publications in that process, then describes the development of different models for publishing and circulating unrefereed e-scripts in various research fields.
Effectiveness of anonymization in double-blind review
The effectiveness of anonymization in the review process is assessed, and the need for further studies on this topic is described.
CSRankings, 2017
  • Accessed September 2017