Using PageRank to Characterize Web Structure
@article{Pandurangan2006UsingPT, title={Using PageRank to Characterize Web Structure}, author={Gopal Pandurangan and Prabhakar Raghavan and Eli Upfal}, journal={Internet Mathematics}, year={2006}, volume={3}, pages={1 - 20} }
Recent work on modeling the web graph has dwelt on capturing the degree distributions observed on the web. Pointing out that this represents a heavy reliance on "local" properties of the web graph, we study the distribution of PageRank values on the web. Our measurements suggest that PageRank values on the web follow a power law. We then develop generative models for the web graph that explain this observation and moreover remain faithful to previously studied degree distributions. We analyze…
149 Citations
Asymptotic analysis for personalized Web search
- Computer Science, MathematicsAdvances in Applied Probability
- 2010
This paper model the PageRank as a solution of a stochastic equation where the R i s are distributed as R and obtains the asymptotic behavior of R under quite general assumptions on the involved random variables.
Stochastic analysis of web page ranking
- Mathematics, Computer Science
- 2009
This thesis presents a new methodology for analyzing the probabilistic behavior of the PageRank distribution and the dependence between various power law parameters of the Web, based on the techniques from the theory of regular variations and the extreme value theory.
PageRank of Scale-Free Growing Networks
- Mathematics, Computer ScienceInternet Math.
- 2007
An analytical expression is found for the expected PageRank value in a scale-free growing network model as a function of the age of the growing network and theAge of a particular node and it is shown that PageRank follows closely a power law in the middle range of its values.
Using SiteRank for P 2 P Web Retrieval
- Computer Science
- 2004
It is concluded that the decomposition of global Web document ranking computation by making use of SiteRank is a very promising approach for computing global document rankings in a decentralized P2P search system.
Simulating the Webgraph: a comparative analysis of models
- Computer Science
- 2004
This work simulated several of these models and compared them against a 300-million-node sample of the Webgraph provided by the Stanford WebBase project, finding that the more random the model, the better the graph.
Structuring the Web to Cope with Dynamic Changes
- Computer ScienceSDWP@ICWS
- 2005
The view of the web structure is formalized in terms of the integer linear programming that converts the web directed graph to the optimal hierarchical structure to represent a high level structure regardless of various measures such as the cosine similarity and tf-idf measure of the vector space model.
The Web as a graph: How far we are
- Computer Science, MathematicsTOIT
- 2007
A large crawl from 2001 of 200M pages and about 1.4 billion edges, made available by the WebBase project at Stanford is studied, as well as several synthetic ones generated according to various models proposed recently, to investigate several topological properties of webgraphs.
The Static Absorbing Model for the Web
- Computer ScienceJ. Web Eng.
- 2005
The Absorbing Model is presented, a hyperlink analysis model based on absorbing Markov chains, where the Web graph is transformed by adding one absorbing state for each document, and it is shown that the latter is more effective than the authority-oriented model.
PageRank Asymptotics on Directed Preferential Attachment Networks
- Mathematics
- 2021
We characterize the tail behavior of the distribution of the PageRank of a uniformly chosen vertex in a directed preferential attachment graph and show that it decays as a power law with an explicit…
Representing and Quantifying Rank - Change for the Web Graph
- Computer ScienceWAW
- 2006
This paper addresses the issue of representing and quantifying web ranking trends as a measure of web pages, and proposes normalized measures of ranking trends that are comparable among web graph snapshots of different sizes.
References
SHOWING 1-10 OF 37 REFERENCES
The Web as a Graph: Measurements, Models, and Methods
- Computer ScienceCOCOON
- 1999
This paper describes two algorithms that operate on the Web graph, addressing problems from Web search and automatic community discovery, and proposes a new family of random graph models that point to a rich new sub-field of the study of random graphs, and raises questions about the analysis of graph algorithms on the Internet.
A stochastic model for the evolution of the Web
- Computer ScienceComput. Networks
- 2002
The PageRank Citation Ranking : Bringing Order to the Web
- Computer Science, MathematicsWWW 1999
- 1999
This paper describes PageRank, a mathod for rating Web pages objectively and mechanically, effectively measuring the human interest and attention devoted to them, and shows how to efficiently compute PageRank for large numbers of pages.
A random graph model for massive graphs
- Computer Science, MathematicsSTOC '00
- 2000
A random graph model is proposed which is a special case of sparse random graphs with given degree sequences which involves only a small number of parameters, called logsize and log-log growth rate, which capture some universal characteristics of massive graphs.
Self-similarity in the web
- Computer ScienceTOIT
- 2002
An extensive characterization of the graph structure of the Web is presented, with a view to enabling high-performance applications that make use of this structure and to optimized algorithm design for Web-scale data analysis.
Scale-free characteristics of random networks: the topology of the world-wide web
- Computer Science
- 2000
Inferring Web communities from link topology
- Computer ScienceHYPERTEXT '98
- 1998
This investigation shows that although the process by which users of the Web create pages and links is very difficult to understand at a “local” level, it results in a much greater degree of orderly high-level structure than has typically been assumed.
The Diameter of a Scale-Free Random
Graph
- Computer ScienceComb.
- 2004
We consider a random graph process in which vertices are
added to the graph one at a time and joined to a fixed number
m of earlier vertices, where
each earlier vertex is chosen with probability…
Deeper Inside PageRank
- MathematicsInternet Math.
- 2003
A comprehensive survey of all issues associated with PageRank, covering the basic PageRank model, available and recommended solution methods, storage issues, existence, uniqueness, and convergence properties, possible alterations to the basic model, and suggested alternatives to the traditional solution methods.