SNAP/webBerkStan
Web graph of Berkeley and Stanford
Name 
webBerkStan 
Group 
SNAP 
Matrix ID 
2300 
Num Rows

685,230 
Num Cols

685,230 
Nonzeros

7,600,595 
Pattern Entries

7,600,595 
Kind

Directed Graph 
Symmetric

No 
Date

2002 
Author

S. Kamvar 
Editor

J. Leskovec 
Structural Rank 

Structural Rank Full 

Num Dmperm Blocks


Strongly Connect Components

109,406 
Num Explicit Zeros

0 
Pattern Symmetry

25% 
Numeric Symmetry

25% 
Cholesky Candidate

no 
Positive Definite

no 
Type

binary 
Download 
MATLAB
Rutherford Boeing
Matrix Market

Notes 
Networks from SNAP (Stanford Network Analysis Platform) Network Data Sets,
Jure Leskovec http://snap.stanford.edu/data/index.html
email jure at cs.stanford.edu
BerkeleyStanford web graph
NOTE: This is an earlier version (2002) of the data obtained from
Sep Kamvar, Stanford (2003) (the Kamvar/Stanford_Berkeley graph
in the UF collection, matrix ID 980).
Dataset information
Nodes represent pages from berkely.edu and stanford.edu domains and directed
edges represent hyperlinks between them. The data was collected in 2002.
Dataset statistics
Nodes 685230
Edges 7600595
Nodes in largest WCC 654782 (0.956)
Edges in largest WCC 7499425 (0.987)
Nodes in largest SCC 334857 (0.489)
Edges in largest SCC 4523232 (0.595)
Average clustering coefficient 0.6149
Number of triangles 64690980
Fraction of closed triangles 0.08769
Diameter (longest shortest path) 669
90percentile effective diameter 10
Source (citation)
J. Leskovec, K. Lang, A. Dasgupta, M. Mahoney. Community Structure in Large
Networks: Natural Cluster Sizes and the Absence of Large WellDefined Clusters.
arXiv.org:0810.1355, 2008.
Files
File Description
webBerkStan.txt.gz BerkelyStanford web graph from 2002
NOTE: a near duplicate of this problem already appears in the UF Collection:
webBerkStan Kamvar/Stanford_Berkeley
in SNAP/: n: 685,230 nz: 7,600,595
in Kamvar/ n: 683,446 nz: 7,583,376
I obtained the Kamvar/Stanford_Berkeley directly
from Sep Kamvar. It is slightly smaller than the
version in SNAP. It is thus likely that Sep created
multiple versions of the graph.
