SNAP/loc-Gowalla
SNAP network: loc-Gowalla location network
Name |
loc-Gowalla |
Group |
SNAP |
Matrix ID |
2788 |
Num Rows
|
196,591 |
Num Cols
|
196,591 |
Nonzeros
|
1,900,654 |
Pattern Entries
|
1,900,654 |
Kind
|
Undirected Graph |
Symmetric
|
Yes |
Date
|
2011 |
Author
|
E. Cho, S. A. Myers, J. Leskovec |
Editor
|
J. Leskovec |
Structural Rank |
|
Structural Rank Full |
|
Num Dmperm Blocks
|
|
Strongly Connect Components
|
1 |
Num Explicit Zeros
|
0 |
Pattern Symmetry
|
100% |
Numeric Symmetry
|
100% |
Cholesky Candidate
|
no |
Positive Definite
|
no |
Type
|
binary |
Download |
MATLAB
Rutherford Boeing
Matrix Market
|
Notes |
SNAP (Stanford Network Analysis Platform) Large Network Dataset Collection,
Jure Leskovec and Anrej Krevl, http://snap.stanford.edu/data, June 2014.
email: jure at cs.stanford.edu
loc-Gowalla
https://snap.stanford.edu/data/loc-Gowalla.html
Dataset information
Gowalla (http://www.gowalla.com/) is a location-based social networking
website where users share their locations by checking-in. The friendship
network is undirected and was collected using their public API, and
consists of 196,591 nodes and 950,327 edges. We have collected a total of
6,442,890 check-ins of these users over the period of Feb. 2009 - Oct.
2010.
Dataset statistics
Nodes 196,591
Edges 950,327
Nodes in largest WCC 196591 (1.000)
Edges in largest WCC 950327 (1.000)
Nodes in largest SCC 196591 (1.000)
Edges in largest SCC 950327 (1.000)
Average clustering coefficient 0.2367
Number of triangles 2273138
Fraction of closed triangles 0.007952
Diameter (longest shortest path) 14
90-percentile effective diameter 5.7
Check-ins 6,442,890
Source (citation)
E. Cho, S. A. Myers, J. Leskovec. Friendship and Mobility: Friendship and
Mobility: User Movement in Location-Based Social Networks ACM SIGKDD
International Conference on Knowledge Discovery and Data Mining (KDD),
2011. http://cs.stanford.edu/people/jure/pubs/mobile-kdd11.pdf
Files
File Description
loc-gowalla_edges.txt.gz Friendship network of Gowalla users
loc-gowalla_totalCheckins.txt.gz Time and location information
of check-ins made by users
Example of check-in information
[user] [check-in time] [latitude] [longitude] [location id]
196514 2010-07-24T13:45:06Z 53.3648119 -2.2723465833 145064
196514 2010-07-24T13:44:58Z 53.360511233 -2.276369017 1275991
196514 2010-07-24T13:44:46Z 53.3653895945 -2.2754087046 376497
196514 2010-07-24T13:44:38Z 53.3663709833 -2.2700764333 98503
196514 2010-07-24T13:44:26Z 53.3674087524 -2.2783813477 1043431
196514 2010-07-24T13:44:08Z 53.3675663377 -2.278631763 881734
196514 2010-07-24T13:43:18Z 53.3679640626 -2.2792943689 207763
196514 2010-07-24T13:41:10Z 53.364905 -2.270824 1042822
---------------------------------------------------------------------------
Notes on inclusion into the SuiteSparse Matrix Collection, July 2018:
---------------------------------------------------------------------------
The SNAP data set is 0-based, with nodes numbered 0 to 196,590.
In the SuiteSparse Matrix Collection the graph is converted to 1-based.
The Problem.A matrix is the undirected friendship network, where A(i,j)=1
if person 1+i and person 1+j are friends in the SNAP data set.
There are 6,442,892 checkins in the loc-gowalla_totalCheckins.txt
(the SNAP web page states 6,442,890).
In the SuiteSparse Matrix Collection, the checkin data is held in 5 vectors
of length 6,442,892. These are in the Problem.aux component of the MATLAB
struct. The kth entry of each of these vectors holds the data in the kth
line of the loc-gowalla_totalCheckins.txt file.
userid: the SNAP user id is an integer in the range 0 to 196,590.
It has been incremented by one, here, to reflect the
corresponding row and column of the Problem.A matrix.
There are 107,092 unique user id's in the checkins.
checkin_time: a string of length 20
latitude: a double precision number
longitude: a double precision number
location_id: an integer
|