SNAP/as-caida

(122 graphs) CAIDA AS Relationships Datasets, from 1/04-11/07

Name	as-caida
Group	SNAP
Matrix ID	2322
Num Rows	31,379
Num Cols	31,379
Nonzeros	106,762
Pattern Entries	106,762
Kind	Directed Weighted Graph Sequence
Symmetric	No
Date	2007
Author	J. Leskovec, J. Kleinberg and C. Faloutsos
Editor	J. Leskovec

Force-Directed Graph Visualization of SNAP/as-caida

Structural Rank
Structural Rank Full
Num Dmperm Blocks
Strongly Connect Components	4,905
Num Explicit Zeros	0
Pattern Symmetry	100%
Numeric Symmetry	7.9%
Cholesky Candidate	no
Positive Definite	no
Type	integer

SVD Statistics
Matrix Norm	1.822487e+02
Minimum Singular Value	0
Condition Number	Inf
Rank	7,360
sprank(A)-rank(A)
Null Space Dimension	24,019
Full Numerical Rank?	no
Download Singular Values	MATLAB

Download	MATLAB Rutherford Boeing Matrix Market
Notes	Networks from SNAP (Stanford Network Analysis Platform) Network Data Sets, Jure Leskovec http://snap.stanford.edu/data/index.html email jure at cs.stanford.edu CAIDA AS Relationships Datasets Dataset information The dataset contains 122 CAIDA AS graphs, from January 2004 to November 2007 - http://www.caida.org/data/active/as-relationships/ . Each file contains a full AS graph derived from a set of RouteViews BGP table snapshots. Dataset statistics are calculated for the graph with the highest number of nodes - dataset from November 5 2007. Dataset statistics for graph with highest number of nodes - 11 5 2007 Nodes 26475 Edges 106762 Nodes in largest WCC 26475 (1.000) Edges in largest WCC 106762 (1.000) Nodes in largest SCC 26475 (1.000) Edges in largest SCC 106762 (1.000) Average clustering coefficient 0.2082 Number of triangles 36365 Fraction of closed triangles 0.007319 Diameter (longest shortest path) 17 90-percentile effective diameter 4.6 Source (citation) J. Leskovec, J. Kleinberg and C. Faloutsos. Graphs over Time: Densification Laws, Shrinking Diameters and Possible Explanations. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2005. Files File Description as-caida20071105.txt.gz CAIDA AS graph from November 5 2007 as-caida.tar.gz 122 CAIDA AS graphs from January 2004 to November 2007 NOTE for UF Sparse Matrix Collection: these graphs are weighted. In the original SNAP data set, the edge weights are in the set {-1, 0, 1, 2}. Note that "0" is an edge weight. This can be handled in the UF collection for the primary sparse matrix in a Problem, but not when the matrices are in a sequence in the Problem.aux MATLAB struct. The entries with zero edge weight would become lost. To correct for this, the weights are modified by adding 2 to each weight. This preserves the structure of the original graphs, so that edges with weight zero are not lost. (A non-edge is not the same as an edge with weight zero in this problem). old new weights: -1 1 0 2 1 3 2 4 So to obtain the original weights, subtract 2 from each entry. The primary sparse matrix for this problem is the as-caida20071105 matrix, or Problem.aux.G{121}, the second-to-the-last graph in the sequence. The nodes are uniform across all graphs in the sequence in the UF collection. That is, nodes do not come and go. A node that is "gone" simply has no edges. This is to allow comparisons across each node in the graphs. Problem.aux.nodenames gives the node numbers of the original problem. So row/column i in the matrix is always node number Problem.aux.nodenames(i) in all the graphs. Problem.aux.G{k} is the kth graph in the sequence. Problem.aux.Gname(k,:) is the name of the kth graph.

Download

MATLAB Rutherford Boeing Matrix Market

Notes

Networks from SNAP (Stanford Network Analysis Platform) Network Data Sets,     
Jure Leskovec http://snap.stanford.edu/data/index.html                         
email jure at cs.stanford.edu                                                  
                                                                               
CAIDA AS Relationships Datasets                                                
Dataset information                                                            
                                                                               
The dataset contains 122 CAIDA AS graphs, from January 2004 to November 2007 - 
http://www.caida.org/data/active/as-relationships/ .  Each file contains a full
AS graph derived from a set of RouteViews BGP table snapshots.                 
                                                                               
Dataset statistics are calculated for the graph with the highest number of     
nodes - dataset from November 5 2007.  Dataset statistics for graph with       
highest number of nodes - 11 5 2007                                            
                                                                               
Nodes   26475                                                                  
Edges   106762                                                                 
Nodes in largest WCC    26475 (1.000)                                          
Edges in largest WCC    106762 (1.000)                                         
Nodes in largest SCC    26475 (1.000)                                          
Edges in largest SCC    106762 (1.000)                                         
Average clustering coefficient  0.2082                                         
Number of triangles     36365                                                  
Fraction of closed triangles    0.007319                                       
Diameter (longest shortest path)    17                                         
90-percentile effective diameter    4.6                                        
                                                                               
Source (citation)                                                              
                                                                               
J. Leskovec, J. Kleinberg and C. Faloutsos. Graphs over Time: Densification    
Laws, Shrinking Diameters and Possible Explanations. ACM SIGKDD International  
Conference on Knowledge Discovery and Data Mining (KDD), 2005.                 
                                                                               
Files                                                                          
File    Description                                                            
as-caida20071105.txt.gz     CAIDA AS graph from November 5 2007                
as-caida.tar.gz     122 CAIDA AS graphs from January 2004 to November 2007     
                                                                               
                                                                               
NOTE for UF Sparse Matrix Collection: these graphs are weighted.  In the       
original SNAP data set, the edge weights are in the set {-1, 0, 1, 2}.  Note   
that "0" is an edge weight.  This can be handled in the UF collection for the  
primary sparse matrix in a Problem, but not when the matrices are in a sequence
in the Problem.aux MATLAB struct.  The entries with zero edge weight would     
become lost.  To correct for this, the weights are modified by adding 2 to each
weight.  This preserves the structure of the original graphs, so that edges    
with weight zero are not lost.  (A non-edge is not the same as an edge with    
weight zero in this problem).                                                  
                                                                               
    old new weights:                                                           
    -1  1                                                                      
    0   2                                                                      
    1   3                                                                      
    2   4                                                                      
                                                                               
So to obtain the original weights, subtract 2 from each entry.                 
                                                                               
The primary sparse matrix for this problem is the as-caida20071105 matrix, or  
Problem.aux.G{121}, the second-to-the-last graph in the sequence.              
                                                                               
The nodes are uniform across all graphs in the sequence in the UF collection.  
That is, nodes do not come and go.  A node that is "gone" simply has no edges. 
This is to allow comparisons across each node in the graphs.                   
Problem.aux.nodenames gives the node numbers of the original problem.  So      
row/column i in the matrix is always node number Problem.aux.nodenames(i) in   
all the graphs.                                                                
                                                                               
Problem.aux.G{k} is the kth graph in the sequence.                             
Problem.aux.Gname(k,:) is the name of the kth graph.