Download CFinder    Manual    Network Data    Publications    WebCFinder


Home

Download

Publications

Support

People

Links

edit SideBar


Download network data

1.  Wikipedia: Network of pages, Page categories, Category hierarchy

Info

Data source: 20071018 dump of the English Wikipedia, see wikimedia downloads
Pre-processing: wikiprep by E. Gabrilovich
Processing: Perl scripts by the authors of the article (see below) available upon request

How to cite

Palla et. al., New J. Phys. 10, 123026 (2008)

Download Format: zipped text files  
  dirLinks.zip [188MB] directed links (hyperlinks) connecting pairs of nodes (pages, with numerical IDs)
undirLinks.zip [175MB] 2,070,486 nodes (pages, with numerical IDs) connected by 42,336,692 undirected links (hyperlinks)
pageNum2Name.zip [25MB] numerical ID of Wikipedia page --> name of page
pageCategList.zip [24MB] numerical ID of Wikipedia page --> numerical IDs of its categories
categNum2Name.zip [3MB] numerical ID of category --> name of category
catHier_allDirLinks.zip [3MB] all directed links (parent category --> child category) in the category hierarchy (265,432 nodes and 543,722 directed links)
catHier_loops.zip [68kB] the subgraph of loops in the category hierarchy (4,980 nodes and 13,164 directed links)
catHier_DAG.zip [3MB] the Directed Acyclic Graph (DAG) of the category hierarchy after eliminating loops with the algorithm in Appendix A.1 of Palla et. al., New J. Phys. 10, 123026 (2008) (265,432 nodes and 539,745 directed links)

Top

2.  MathSciNet: Co-authorship network

Info

391,529 nodes: authors (with numerical IDs) at MathSciNet before January, 2008
873,775 links (weighted, undirected): co-authorship connections (pairs of authors with numerical IDs)
link weight: an article with N authors increases the link weight between each pair of its authors by 1/(N-1)

How to cite

Palla et. al., New J. Phys. 10, 123026 (2008)

Download Format: zipped text files  
paperAuCateg.zip [26MB] detailed information about MathSciNet papers: numerical IDs of papers, authors, and categories
  wCoAuNw.zip [5MB] list of weighted undirected co-authorship links (node: author, link: co-authorsip)

Top

3.  Directed network of Google's own webpages

Info

15,763 nodes: static webpages at www.google.com
171,206 directed links: hyperlinks
file format: list of directed links (<node1> tab <node2> newline)

How to cite

Palla et. al., New J. Phys. 9, 186 (2007)

Download

Network (846kB zipped txt)

Perl script [small zip package]