Modeling the structure and evolution of discussion cascades

@article{Gmez2011ModelingTS,
  title={Modeling the structure and evolution of discussion cascades},
  author={V. G{\'o}mez and Hilbert J. Kappen and Andreas Kaltenbrunner},
  journal={ArXiv},
  year={2011},
  volume={abs/1011.0673}
}
We analyze the structure and evolution of discussion cascades in four popular websites: Slashdot, Barrapunto, Meneame and Wikipedia. Despite the big heterogeneities between these sites, a preferential attachment (PA) model with bias to the root can capture the temporal evolution of the observed trees and many of their statistical properties, namely, probability distributions of the branching factors (degrees), subtree sizes and certain correlations. The parameters of the model are learned… 

A likelihood-based framework for the analysis of discussion threads

TLDR
A framework based on generative models of growing trees to analyse the structure and evolution of discussion threads is proposed and provides significant insight into understanding how discussion cascades grow and have potential applications in broader contexts such as community management or design of communication platforms.

Role detection in online forums based on growth models for trees

TLDR
A growth model with an Expectation–Maximization algorithm that finds different parameters for different latent groups of users to find the different roles that coexist in the community and shows that predictions are improved for some of the roles when compared with a simple growth model.

Modelling structure and predicting dynamics of discussion threads in online boards

TLDR
This paper first studies a Reddit dataset to show that the structure of the trees resemble those produced by a Galton-Watson process with a special root offspring distribution, and modifies their formation as a self-exciting Hawkes process to predict future commenting activity and the final size of a discussion tree.

Reply trees in Twitter: data analysis and branching process models

TLDR
It is suggested that the in-degree of the tweet that initiates a reply tree may play an important role in forming the global shape of the reply tree.

Differences in structure and dynamics of networks retrieved from dark and public web forums

When the Wikipedians Talk: Network and Tree Structure of Wikipedia Discussion Pages

TLDR
Evidence of a specific assortativity profile which differentiates article discussions from personal conversations is found and an analysis of the tree structure of the article talk pages allows to capture patterns of interaction, and reveals structural differences among the discussions about articles from different semantic areas.

Modeling Morphology of Social Network Cascades

TLDR
A Multi-order Markov Model for the Morphology of Cascades is proposed that can represent and quantitatively characterize the morphology of cascades with arbitrary structures, shapes, and sizes and is applied to an unexplored but important problem in online social networks -- cascade size prediction.

Nonparametric Estimation of the Preferential Attachment Function in Complex Networks: Evidence of Deviations from Log Linearity

TLDR
PAFit is an advance over previous methods in so far as it does not make any assumptions on the functional form of the preferential attachment function, and is surprised to find that hubs are not always the nodes with the highest node fitnesses.

NesTPP: Modeling Thread Dynamics in Online Discussion Forums

TLDR
The proposed model views the entire event space as a nested structure composed of main thread streams and their linked reply streams, and it explicitly models the correlations between these two types of streams through their intensity functions.

Cascade size prediction in online social networks

  • Zubair ShafiqA. Liu
  • Computer Science
    2017 IFIP Networking Conference (IFIP Networking) and Workshops
  • 2017
TLDR
Evaluations using a Twitter data set show that M3 based cascade size prediction scheme outperforms the baseline scheme based on cascade graph features such as edge growth rate, degree distribution, clustering, and diameter.
...

References

SHOWING 1-10 OF 33 REFERENCES

Statistical analysis of the social network and discussion threads in slashdot

We analyze the social network emerging from the user comment activity on the website Slashdot. The network presents common features of traditional social networks such as a giant component, small

Information diffusion through blogspace

TLDR
A macroscopic characterization of topic propagation through the authors' corpus is presented, formalizing the notion of long-running "chatter" topics consisting recursively of "spike" topics generated by outside world events, or more rarely, by resonances within the community.

The structure of political discussion networks: a model for the analysis of online deliberation

This paper shows that online political discussion networks are, on average, wider and deeper than the networks generated by other types of discussions: they engage a larger number of participants and

When the Wikipedians Talk: Network and Tree Structure of Wikipedia Discussion Pages

TLDR
Evidence of a specific assortativity profile which differentiates article discussions from personal conversations is found and an analysis of the tree structure of the article talk pages allows to capture patterns of interaction, and reveals structural differences among the discussions about articles from different semantic areas.

Cascading Behavior in Large Blog Graphs Patterns and a model

TLDR
The goal is to build a model that generates realistic cascades, so that it can help with link prediction and outlier detection, and present a simple model that mimics the spread of information on the blogosphere, and produces information cascades very similar to those found in real life.

Patterns of Cascading Behavior in Large Blog Graphs

TLDR
Some surprising findings of the blog linking and information propagation structure are reported, after one of the largest available datasets, with 45, 000 blogs and ≈ 2.2 million blog-postings is analyzed.

Using selection bias to explain the observed structure of Internet diffusions

TLDR
It is shown that a basic Galton–Watson epidemic model combined with the selection bias of observing only large diffusions suffices to explain the flow of information on the Internet, and selection biases of which data the authors observe can radically change the estimation of classical diffusion processes.

Impact of human activity patterns on the dynamics of information diffusion.

TLDR
It is found that, contrary to traditional models, information travels at an unexpectedly slow pace, and it is shown that the large heterogeneity found in the response time is responsible for the slow dynamics of information at the collective level.

Modeling Blog Dynamics

TLDR
This work proposes a generative model that uses simple and intuitive principles for each individual blog, and yet is able to produce the temporal characteristics of the blogosphere together with global topological network patterns, like power-laws for degree distributions, for inter-posting times, and several more.

Measuring preferential attachment in evolving networks

TLDR
Measurements on four networks indicate that the rate at which nodes acquire links depends on the node's degree, offering direct quantitative support for the presence of preferential attachment.