Corpus ID: 11108826

Global topology of word co-occurrence networks: Beyond the two-regime power-law

@inproceedings{Choudhury2010GlobalTO,
  title={Global topology of word co-occurrence networks: Beyond the two-regime power-law},
  author={Monojit Choudhury and Diptesh Chatterjee and Animesh Mukherjee},
  booktitle={COLING},
  year={2010}
}
Word co-occurrence networks are one of the most common linguistic networks studied in the past and they are known to exhibit several interesting topological characteristics. In this article, we investigate the global topological properties of word co-occurrence networks and, in particular, present a detailed study of their spectrum. Our experiments reveal certain universal trends found across the networks for seven different languages from three different language families, which are neither… 

A Complex Network Approach to Distributional Semantic Models
TLDR
It is demonstrated that the created networks generally exhibit the same properties as word association networks, and it is shown that the distribution of the number of connections in these networks follows the truncated power law, which indicates that distributional semantic models can provide a plausible model of lexical knowledge.
Text Authorship Identified Using the Dynamics of Word Co-Occurrence Networks
TLDR
This study introduces a methodology based on the dynamics of word co-occurrence networks representing written texts to classify a corpus of 80 texts by 8 authors, paving the way for a robust description of large texts in terms of small evolving networks.
Preliminary Report on the Structure of Croatian Linguistic Co-occurrence Networks
TLDR
The results suggest that the corpus influence can be reduced by increasing the co-occurrence window size, and the size of texts is reflected in the network properties.
An Empirical Study of the Occurrence and Co-Occurrence of Named Entities in Natural Language Corpora
TLDR
This paper systematically analyzes the patterns of occurrence and co-occurrence of NEs in standard large English news corpora, providing valuable insight into the corpora and paving the way for the development of technologies that rely critically on handling NEs.
Comparison of the language networks from literature and blogs
TLDR
The selectivity-based results point out that there are differences between the structure of the networks constructed from literature and blogs.
Can Greenbergian universals be induced from language networks?
Language networks have been proposed to be the underlying representation for syntactic knowledge (Roelofs, 1992; Pickering and Branigan, 1998). Such networks are known to explain various word order
Structural Analysis on Social Network Constructed from Characters in Literature Texts
TLDR
The study shows that the metric distance among characters in linear text closely matches the intrinsic and semantic relationships described by fiction writers, which implies that the proposed social network built from fiction could serve as another representation of literary fiction.

References

Discovering Global Patterns in Linguistic Networks through Spectral Analysis: A Case Study of the Consonant Inventories
TLDR
Application of spectral analysis to PhoNet, the co-occurrence network of consonants, not only reveals several natural linguistic principles governing the structure of the consonant inventories, but is also able to quantify their relative importance.
Correlations in the Organization of Large-Scale Syntactic Dependency Networks
TLDR
The results indicate the existence of common patterns in the large scale organization of syntactic dependency networks, and indicate disassortative mixing in the sense that links tend to connect vertices of dissimilar degrees.
Language as an evolving word web
TLDR
It follows from the theory of the evolution of language that the size of the core part of language, the ‘kernel lexicon’, does not vary as language evolves, and the two regimes in the distribution naturally emerge from the evolutionary dynamics of the word web.
Vertex Degree Distribution for the Graph of Word Co-Occurrences in Russian
TLDR
Degree distributions of word-form co-occurrences are obtained for large Russian text collections, supporting the Dorogovtsev-Mendes model for Russian and the idea that word forms are important to the human perceptual lexicon.
Using eigenvectors of the bigram graph to infer morpheme identity
TLDR
An eigenvector decomposition of a nearest-neighbor graph is used to produce a two-dimensional rendering of the words of a corpus in which words of the same syntactic category tend to form clusters and neighborhoods.
Spectra of "real-world" graphs: beyond the semicircle law.
TLDR
Methods to determine the eigenvalues of networks comparable in size to real systems are developed, obtaining several surprising results on the spectra of adjacency matrices corresponding to models of real-world graphs.
From Frequency to Meaning: Vector Space Models of Semantics
TLDR
The goal in this survey is to show the breadth of applications of VSMs for semantics, to provide a new perspective on VSMs, and to provide pointers into the literature for those who are less familiar with the field.
Emergence of scaling in random networks
TLDR
A model based on these two ingredients reproduces the observed stationary scale-free distributions, which indicates that the development of large networks is governed by robust self-organizing phenomena that go beyond the particulars of the individual systems.
Linguistics: An Introduction to Language and Communication
TLDR
This popular introductory linguistics text is unique for its integration of themes and provides a sound introduction to linguistic methodology while encouraging students to consider why people are intrinsically interested in language―the ultimate puzzle of the human mind.
Spectral plots and the representation and interpretation of biological data
TLDR
The search for universal properties of biological networks should be complemented by an understanding of more specific features of biological organization principles at different scales, according to Laplacian spectral plots.