Corpus ID: 2163310

Exploiting the Block Structure of the Web for Computing

@inproceedings{Kamvar2003ExploitingTB,
  title={Exploiting the Block Structure of the Web for Computing},
  author={Sepandar D. Kamvar and Taher H. Haveliwala and Christopher D. Manning and Gene H. Golub},
  year={2003}
}
  • Sepandar D. Kamvar, Taher H. Haveliwala, +1 author Gene H. Golub
  • Published 2003
  • Computer Science
  • The web link graph has a nested block structure: the vast majority of hyperlinks link pages on a host to other pages on the same host, and many of those that do not link pages within the same domain. We show how to exploit this structure to speed up the computation of PageRank by a 3-stage algorithm whereby (1)~the local PageRanks of pages for each host are computed independently using the link structure of that host, (2)~these local PageRanks are then weighted by the ``importance'' of the… CONTINUE READING

    Create an AI-powered research feed to stay up to date with new papers like this posted to ArXiv

    Citations

    Publications citing this paper.
    SHOWING 1-10 OF 291 CITATIONS

    Validation and Verification Policies for Governance of Service Choreographies

    VIEW 4 EXCERPTS
    CITES BACKGROUND & METHODS
    HIGHLY INFLUENCED

    Exploiting the hierarchical structure for link analysis

    VIEW 12 EXCERPTS
    CITES METHODS & BACKGROUND
    HIGHLY INFLUENCED

    Blogel: A Block-Centric Framework for Distributed Computation on Real-World Graphs

    VIEW 5 EXCERPTS
    CITES METHODS & BACKGROUND
    HIGHLY INFLUENCED

    Subgraph Rank: PageRank for Subgraph-Centric Distributed Graph Processing

    VIEW 5 EXCERPTS
    CITES BACKGROUND & METHODS
    HIGHLY INFLUENCED

    An Improved Approach to the PageRank Problems

    VIEW 7 EXCERPTS
    CITES BACKGROUND & METHODS
    HIGHLY INFLUENCED

    Mining the “Voice of the Customer” for Business Prioritization

    VIEW 5 EXCERPTS
    CITES METHODS
    HIGHLY INFLUENCED

    Exploiting links and text structure on the Web: a quantitative approach to improving search quality

    VIEW 5 EXCERPTS
    CITES BACKGROUND, METHODS & RESULTS
    HIGHLY INFLUENCED

    Performances and Characteristics of DIGRank, Ranking in the Incomplete Networks

    VIEW 5 EXCERPTS
    CITES BACKGROUND
    HIGHLY INFLUENCED

    New Algorithms for Mining Network Datasets: Applications to Phenotype and Pathway Modeling

    VIEW 8 EXCERPTS
    CITES BACKGROUND
    HIGHLY INFLUENCED

    DYNA-RANK: Efficient Calculation and Updation of PageRank

    VIEW 4 EXCERPTS
    CITES METHODS & BACKGROUND
    HIGHLY INFLUENCED

    FILTER CITATIONS BY YEAR

    2003
    2020

    CITATION STATISTICS

    • 44 Highly Influenced Citations

    • Averaged 9 Citations per year from 2017 through 2019

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 16 REFERENCES

    Representing Web graphs

    VIEW 1 EXCERPT

    Scaling personalized web search

    VIEW 1 EXCERPT

    Parallel crawlers

    VIEW 1 EXCERPT

    Topic-sensitive PageRank

    VIEW 1 EXCERPT

    Who links to whom: mining linkage between Web sites

    VIEW 1 EXCERPT

    Graph structure in the Web

    VIEW 1 EXCERPT