Arxiv High Energy Physics paper citation network
Name cit-HepTh
Group SNAP
Matrix ID 2293
Num Rows 27,770
Num Cols 27,770
Nonzeros 352,807
Pattern Entries 352,807
Kind Directed Graph
Symmetric No
Date 2003
Author J. Gehrke, P. Ginsparg, J. Kleinberg
Editor J. Leskovec
Structural Rank
Structural Rank Full
Num Dmperm Blocks
Strongly Connect Components 20,086
Num Explicit Zeros 0
Pattern Symmetry 0.3%
Numeric Symmetry 0.3%
Cholesky Candidate no
Positive Definite no
Type binary
SVD Statistics
Matrix Norm 8.516069e+01
Minimum Singular Value 6.734437e-66
Condition Number 1.264555e+67
Rank 21,616
Null Space Dimension 6,154
Full Numerical Rank? no
Download Singular Values MATLAB
Download MATLAB Rutherford Boeing Matrix Market
Networks from SNAP (Stanford Network Analysis Platform) Network Data Sets,     
Jure Leskovec                         
email jure at                                                  
High-energy physics theory citation network                                    
Dataset information                                                            
Arxiv HEP-TH (high energy physics theory) citation graph is from the e-print   
arXiv and covers all the citations within a dataset of 27,770 papers with      
352,807 edges. If a paper i cites paper j, the graph contains a directed edge  
from i to j. If a paper cites, or is cited by, a paper outside the dataset, the
graph does not contain any information about this.                             
The data covers papers in the period from January 1993 to April 2003 (124      
months). It begins within a few months of the inception of the arXiv, and thus 
represents essentially the complete history of its HEP-TH section.             
The data was originally released as a part of 2003 KDD Cup.                    
Dataset statistics                                                             
Nodes   27770                                                                  
Edges   352807                                                                 
Nodes in largest WCC    27400 (0.987)                                          
Edges in largest WCC    352542 (0.999)                                         
Nodes in largest SCC    7464 (0.269)                                           
Edges in largest SCC    116268 (0.330)                                         
Average clustering coefficient  0.3295                                         
Number of triangles     1478735                                                
Fraction of closed triangles    0.1196                                         
Diameter (longest shortest path)    14                                         
90-percentile effective diameter    5.4                                        
Source (citation)                                                              
J. Leskovec, J. Kleinberg and C. Faloutsos. Graphs over Time: Densification    
Laws, Shrinking Diameters and Possible Explanations. ACM SIGKDD International  
Conference on Knowledge Discovery and Data Mining (KDD), 2005.                 
J. Gehrke, P. Ginsparg, J. M. Kleinberg. Overview of the 2003 KDD Cup. SIGKDD  
Explorations 5(2): 149-151, 2003.                                              
File    Description                                                            
cit-HepTh.txt.gz    Paper citation network of Arxiv High Energy Physics Theory 
cit-HepTh-dates.txt.gz  Time of nodes (paper submission time to Arxiv)         
cit-HepTh-abstracts.tar.gz  Paper meta information (see below)                 
Example of paper meta information:                                             
    Paper: hep-th/0002031                                                      
    From: Maulik K. Parikh                                                     
    Date: Fri, 4 Feb 2000 17:04:51 GMT (10kb)                                  
    Title: Confinement and the AdS/CFT Correspondence                          
    Authors: D. S. Berman and Maulik K. Parikh                                 
    Comments: 12 pages, 1 figure, RevTeX                                       
    Report-no: SPIN-1999/25, UG-1999/42                                        
    Journal-ref: Phys.Lett. B483 (2000) 271-276                                
      We study the thermodynamics of the confined and unconfined phases of     
    superconformal Yang-Mills in finite volume and at large N using the AdS/CFT
    correspondence. We discuss the necessary conditions for a smooth phase     
    crossover and obtain an N-dependent curve for the phase boundary.          
NOTE: an identical version this problem already appears in the UF Collection:  
cit-HepTh           Pajek/HEP-th-new is identical to the                       
                    SNAP/cit-HeptTh graph.  Since it's small, I have           
                    decided to include both in the collection, to              
                    keep the SNAP/ collection complete.                        
                                    n: 27,770    nz:  352,807