Wikipedia talk (communication) network
Name wiki-Talk
Group SNAP
Matrix ID 2291
Num Rows 2,394,385
Num Cols 2,394,385
Nonzeros 5,021,410
Pattern Entries 5,021,410
Kind Directed Graph
Symmetric No
Date 2008
Author J. Leskovec, D. Huttenlocher, J. Kleinberg
Editor J. Leskovec
Structural Rank
Structural Rank Full
Num Dmperm Blocks
Strongly Connect Components 2,281,879
Num Explicit Zeros 0
Pattern Symmetry 14.4%
Numeric Symmetry 14.4%
Cholesky Candidate no
Positive Definite no
Type binary
Download MATLAB Rutherford Boeing Matrix Market
Networks from SNAP (Stanford Network Analysis Platform) Network Data Sets,     
Jure Leskovec                         
email jure at                                                  
Wikipedia Talk network                                                         
Dataset information                                                            
Wikipedia is a free encyclopedia written collaboratively by volunteers around  
the world. Each registered user has a talk page, that she and other users can  
edit in order to communicate and discuss updates to various articles on        
Wikipedia. Using the latest complete dump of Wikipedia page edit history (from 
January 3 2008) we extracted all user talk page changes and created a network. 
The network contains all the users and discussion from the inception of        
Wikipedia till January 2008. Nodes in the network represent Wikipedia users and
a directed edge from node i to node j represents that user i at least once     
edited a talk page of user j.                                                  
Dataset statistics                                                             
Nodes   2394385                                                                
Edges   5021410                                                                
Nodes in largest WCC    2388953 (0.998)                                        
Edges in largest WCC    5018445 (0.999)                                        
Nodes in largest SCC    111881 (0.047)                                         
Edges in largest SCC    1477893 (0.294)                                        
Average clustering coefficient  0.1958                                         
Number of triangles     9203519                                                
Fraction of closed triangles    -0.09476                                       
Diameter (longest shortest path)    9                                          
90-percentile effective diameter    4                                          
Source (citation)                                                              
J. Leskovec, D. Huttenlocher, J. Kleinberg. Signed Networks in Social Media.   
CHI 2010.                                                                      
J. Leskovec, D. Huttenlocher, J. Kleinberg. Predicting Positive and Negative   
Links in Online Social Networks. WWW 2010.                                     
File    Description                                                            
Wiki-Talk.txt.gz    Wikipedia talk graph till January 2008