Corpus ID: 202712680

CodeSearchNet Challenge: Evaluating the State of Semantic Code Search

  title={CodeSearchNet Challenge: Evaluating the State of Semantic Code Search},
  author={H. Husain and Ho-Hsiang Wu and Tiferet Gazit and Miltiadis Allamanis and Marc Brockschmidt},
  • H. Husain, Ho-Hsiang Wu, +2 authors Marc Brockschmidt
  • Published 2019
  • Computer Science, Mathematics
  • ArXiv
  • Semantic code search is the task of retrieving relevant code given a natural language query. While related to other information retrieval tasks, it requires bridging the gap between the language used in code (often abbreviated and highly technical) and natural language more suitable to describe vague concepts and ideas. To enable evaluation of progress on code search, we are releasing the CodeSearchNet Corpus and are presenting the CodeSearchNet Challenge, which consists of 99 natural language… CONTINUE READING
    26 Citations

    Figures and Tables from this paper.

    Deep Graph Matching and Searching for Semantic Code Retrieval
    Are the Code Snippets What We Are Searching for? A Benchmark and an Empirical Study on Code Search with Natural-Language Queries
    • 7
    OCoR: An Overlapping-Aware Code Retriever
    OCoR: An Overlapping-Aware Code Retriever.
    Neural Code Search Revisited: Enhancing Code Snippet Retrieval through Natural Language Intent
    Code to Comment "Translation": Data, Metrics, Baselining & Evaluation
    Semantic code search using Code2Vec: A bag-of-paths model
    Hierarchical Embedding for Code Search in Software Q&A Sites
    • R. Li, Gang Hu, Min Peng
    • Computer Science
    • 2020 International Joint Conference on Neural Networks (IJCNN)
    • 2020


    Deep Code Search
    • 141
    • PDF
    CoaCor: Code Annotation for Code Retrieval with Reinforcement Learning
    • 23
    • PDF
    code2seq: Generating Sequences from Structured Representations of Code
    • 125
    • PDF
    When deep learning met code search
    • 28
    • PDF
    Deep API learning
    • 268
    • PDF
    StaQC: A Systematically Mined Question-Code Dataset from Stack Overflow
    • 32
    • PDF
    A Retrieve-and-Edit Framework for Predicting Structured Outputs
    • 51
    • PDF
    Mapping Language to Code in Programmatic Context
    • 41
    • PDF