SNAP/web-BerkStan

Web graph of Berkeley and Stanford
Name web-BerkStan
Group SNAP
Matrix ID 2300
Num Rows 685,230
Num Cols 685,230
Nonzeros 7,600,595
Pattern Entries 7,600,595
Kind Directed Graph
Symmetric No
Date 2002
Author S. Kamvar
Editor J. Leskovec
Structural Rank
Structural Rank Full
Num Dmperm Blocks
Strongly Connect Components 109,406
Num Explicit Zeros 0
Pattern Symmetry 25%
Numeric Symmetry 25%
Cholesky Candidate no
Positive Definite no
Type binary
Download MATLAB, Rutherford Boeing, Matrix Market
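
As a rough illustration of the Pattern Symmetry entry above, the sketch below computes the fraction of off-diagonal nonzeros (i, j) that have a matching nonzero at (j, i). This is an assumed reading of the statistic; the function name `pattern_symmetry` and the toy edge set are hypothetical and are not taken from the collection's own tooling.

```python
def pattern_symmetry(edges):
    """Fraction of off-diagonal entries (i, j) whose transpose (j, i) is also present.

    Assumed definition for illustration only.
    """
    pattern = set(edges)
    off_diag = [(i, j) for (i, j) in pattern if i != j]
    if not off_diag:
        return 1.0  # convention assumed here for a purely diagonal pattern
    matched = sum(1 for (i, j) in off_diag if (j, i) in pattern)
    return matched / len(off_diag)

# Toy directed graph: only the pair (0, 1) / (1, 0) is reciprocated,
# so 2 of the 8 off-diagonal entries are matched.
toy_edges = [(0, 1), (1, 0), (1, 2), (2, 3), (3, 0),
             (0, 2), (2, 2), (3, 1), (4, 0)]
symmetry = pattern_symmetry(toy_edges)  # 0.25 for this toy pattern
```

Under this reading, a value of 25% would mean that roughly a quarter of the hyperlinks are reciprocated by a link in the opposite direction.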
Notes
Networks from SNAP (Stanford Network Analysis Platform) Network Data Sets,     
Jure Leskovec http://snap.stanford.edu/data/index.html                         
email jure at cs.stanford.edu                                                  
                                                                               
Berkeley-Stanford web graph                                                    
                                                                               
NOTE:  This is an earlier version (2002) of the data obtained from             
Sep Kamvar, Stanford (2003) (the Kamvar/Stanford_Berkeley graph                
in the UF collection, matrix ID 980).                                          
                                                                               
Dataset information                                                            
                                                                               
Nodes represent pages from berkeley.edu and stanford.edu domains and directed
edges represent hyperlinks between them. The data was collected in 2002.       
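
A minimal sketch of reading such a graph, assuming a text layout of one whitespace-separated source/target pair of integer node IDs per line, with `#` marking comment lines. The function name `read_edge_list` and the sample lines are hypothetical.

```python
import io

def read_edge_list(lines):
    """Return (nodes, edges) from an iterable of text lines.

    Assumes '#'-prefixed comment lines and one 'source target' pair per line.
    """
    nodes, edges = set(), set()
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        src, dst = map(int, line.split()[:2])
        nodes.update((src, dst))
        edges.add((src, dst))  # directed: page src links to page dst
    return nodes, edges

# Tiny made-up sample in the assumed layout (not real data).
sample = io.StringIO("# FromNodeId\tToNodeId\n1\t2\n1\t5\n2\t1\n5\t7\n")
nodes, edges = read_edge_list(sample)
```

For the real file, a handle from `gzip.open("web-BerkStan.txt.gz", "rt")` could be passed in place of `sample`, provided the file follows the assumed layout.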
                                                                               
Dataset statistics                                                             
Nodes   685230                                                                 
Edges   7600595                                                                
Nodes in largest WCC    654782 (0.956)                                         
Edges in largest WCC    7499425 (0.987)                                        
Nodes in largest SCC    334857 (0.489)                                         
Edges in largest SCC    4523232 (0.595)                                        
Average clustering coefficient  0.6149                                         
Number of triangles     64690980                                               
Fraction of closed triangles    0.08769                                        
Diameter (longest shortest path)    669                                        
90-percentile effective diameter    10                                         
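
The parenthesized values in the WCC and SCC rows are the shares of all nodes or edges that fall in the largest component. The sketch below recomputes them from the counts listed above; the variable names are illustrative.

```python
# Counts copied from the dataset statistics above.
total_nodes, total_edges = 685230, 7600595

largest = {
    "WCC nodes": (654782, total_nodes),
    "WCC edges": (7499425, total_edges),
    "SCC nodes": (334857, total_nodes),
    "SCC edges": (4523232, total_edges),
}

# Share of the whole graph, formatted to three decimals as in the table.
fractions = {name: f"{part / whole:.3f}" for name, (part, whole) in largest.items()}

for name, value in fractions.items():
    print(f"{name}: {value}")
```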
                                                                               
Source (citation)                                                              
                                                                               
J. Leskovec, K. Lang, A. Dasgupta, M. Mahoney. Community Structure in Large    
Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters.
arXiv.org:0810.1355, 2008.                                                     
                                                                               
Files                                                                          
File    Description                                                            
web-BerkStan.txt.gz     Berkeley-Stanford web graph from 2002
                                                                               
NOTE: a near duplicate of this problem already appears in the UF Collection:   
                                                                               
web-BerkStan        Kamvar/Stanford_Berkeley                                   
                    in SNAP/:       n: 685,230   nz: 7,600,595                 
                    in Kamvar/      n: 683,446   nz: 7,583,376                 
                                                                               
                    I obtained the Kamvar/Stanford_Berkeley graph directly
                    from Sep Kamvar.  It is slightly smaller than the          
                    version in SNAP.  It is thus likely that Sep created       
                    multiple versions of the graph.
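
To put "slightly smaller" in numbers, the sketch below takes the difference of the two sizes listed above; the dictionary names are illustrative.

```python
# Sizes copied from the comparison above.
snap = {"n": 685230, "nz": 7600595}
kamvar = {"n": 683446, "nz": 7583376}

# Absolute gap: 1,784 rows/columns and 17,219 nonzeros.
diff = {key: snap[key] - kamvar[key] for key in snap}

# Gap relative to the SNAP version, as a fraction.
relative = {key: diff[key] / snap[key] for key in snap}

for key in snap:
    print(f"{key}: {diff[key]} fewer in Kamvar/ ({relative[key]:.2%} of SNAP)")
```

Both gaps are under 0.3% of the SNAP version's size.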