YouTube AV 50K: An Annotated Corpus for Comments in Autonomous Vehicles

@article{Li2018YouTubeA5,
  title={YouTube AV 50K: An Annotated Corpus for Comments in Autonomous Vehicles},
  author={Tao Li and L. Lin and M. Choi and Kaiming Fu and Siyuan Gong and J. Wang},
  journal={2018 International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)},
  year={2018},
  pages={1-5}
}
  • Tao Li, L. Lin, +3 authors J. Wang
  • Published 2018
  • Computer Science
  • 2018 International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)
  • Ahstract- With one billion monthly viewers, and millions of users discussing and sharing opinions, comments below YouTube videos are rich sources of data for opinion mining and sentiment analysis. We introduce the YouTube AV 50K dataset, a freely-available collections of more than 50,000 YouTube comments and metadata below autonomous vehicle (AV)-related videos. We describe its creation process, its content and data format, and discuss its possible usages. Especially, we do a case study of the… CONTINUE READING
    12 Citations

    Figures, Tables, and Topics from this paper

    Explore Further: Topics Discussed in This Paper

    A regression approach for prediction of Youtube views
    • PDF
    Identifying the Pornographic Video on YouTube Using Vlog Stream
    • 1
    • PDF
    Geo-spatial Clustering of Sentiments on Social Media
    • 1
    Music Sequence Prediction with Mixture Hidden Markov Models
    • 11
    • PDF
    Semi-supervised Text Regression with Conditional Generative Adversarial Networks
    • 7
    • PDF
    Semi-supervised Text Regression with Conditional Generative Adversarial Networks
    • Tao Li, X. Liu, Shihan Su
    • Computer Science, Economics
    • 2018 IEEE International Conference on Big Data (Big Data)
    • 2018
    • 2
    • PDF
    A Statistical Causal Inference Method for Exploring Ultrasonics and Topological Deformations in Biological Systems
    • Weikai He, L. Gao
    • Computer Science
    • 2019 IEEE International Conference on Big Data (Big Data)
    • 2019

    References

    SHOWING 1-10 OF 67 REFERENCES
    Predicting the Future with Social Media
    • 1,247
    Twitter as a Corpus for Sentiment Analysis and Opinion Mining
    • 1,825
    • PDF
    Music Sequence Prediction with Mixture Hidden Markov Models
    • 11
    • PDF
    Opinion Mining and Sentiment Analysis
    • 4,911
    • PDF
    The Million Song Dataset
    • 930
    • PDF
    Sentiment Analysis and Opinion Mining
    • B. Liu
    • Computer Science
    • Synthesis Lectures on Human Language Technologies
    • 2012
    • 1,610
    • PDF