Dataset Cheatsheet

Note

This dataset statistics table is a work in progress. Please consider helping us fill in its content by providing statistics for individual datasets. See here and here for examples of how to do so.
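As a rough illustration of what "providing statistics" could involve, the sketch below computes one table row from per-graph counts. It is not the project's actual tooling: `GraphRecord`, `format_count`, and `cheatsheet_row` are hypothetical names introduced here. It also assumes that a leading `~` marks a per-graph average in multi-graph datasets, which is inferred from the table rather than stated in it. With PyTorch Geometric installed, the inputs would presumably come from attributes such as `data.num_nodes`, `data.num_edges`, `dataset.num_features`, and `dataset.num_classes`.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass
class GraphRecord:
    """Hypothetical stand-in for one graph: only the counts a row needs."""
    num_nodes: int
    num_edges: int  # assumption: counted as stored, so a two-way edge counts twice


def format_count(values: Sequence[int]) -> str:
    """Exact count for a single graph, '~mean' with one decimal for several."""
    if len(values) == 1:
        return f"{values[0]:,}"
    return f"~{sum(values) / len(values):,.1f}"


def cheatsheet_row(
    name: str,
    graphs: Sequence[GraphRecord],
    num_features: int,
    num_classes: int,
) -> Tuple[str, ...]:
    """Build (Name, #graphs, #nodes, #edges, #features, #classes/#tasks)."""
    return (
        name,
        f"{len(graphs):,}",
        format_count([g.num_nodes for g in graphs]),
        format_count([g.num_edges for g in graphs]),
        f"{num_features:,}",
        f"{num_classes:,}",
    )


# Toy data, not a real dataset.
toy = [GraphRecord(3, 4), GraphRecord(4, 6), GraphRecord(5, 9)]
print(" | ".join(cheatsheet_row("ToyDataset", toy, num_features=7, num_classes=2)))
```

Averaging is applied only when a dataset holds more than one graph, so single-graph datasets keep exact node and edge counts.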

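| Name | #graphs | #nodes | #edges | #features | #classes/#tasks |
|---|---|---|---|---|---|
| KarateClub (Paper) | 1 | 34 | 156 | 34 | 4 |
| TUDataset (Paper) | | | | | |
| └─ MUTAG | 188 | ~17.9 | ~39.6 | 7 | 2 |
| └─ ENZYMES | 600 | ~32.6 | ~124.3 | 3 | 6 |
| └─ PROTEINS | 1,113 | ~39.1 | ~145.6 | 3 | 2 |
| └─ COLLAB | 5,000 | ~74.5 | ~4,914.4 | 0 | 3 |
| └─ IMDB-BINARY | 1,000 | ~19.8 | ~193.1 | 0 | 2 |
| └─ REDDIT-BINARY | 2,000 | ~429.6 | ~995.5 | 0 | 2 |
| └─ … | | | | | |
| GNNBenchmarkDataset (Paper) | | | | | |
| └─ PATTERN | 10,000 | ~118.9 | ~6,098.9 | 3 | 2 |
| └─ CLUSTER | 10,000 | ~117.2 | ~4,303.9 | 7 | 6 |
| └─ MNIST | 55,000 | ~70.6 | ~564.5 | 3 | 10 |
| └─ CIFAR10 | 45,000 | ~117.6 | ~941.2 | 5 | 10 |
| └─ TSP | 10,000 | ~275.4 | ~6,885.0 | 2 | 2 |
| └─ CSL | 150 | ~41.0 | ~164.0 | 0 | 10 |
| Planetoid (Paper) | | | | | |
| └─ Cora | 1 | 2,708 | 10,556 | 1,433 | 7 |
| └─ CiteSeer | 1 | 3,327 | 9,104 | 3,703 | 6 |
| └─ PubMed | 1 | 19,717 | 88,648 | 500 | 3 |
| FakeDataset | | | | | |
| FakeHeteroDataset | | | | | |
| NELL (Paper) | 1 | 65,755 | 251,550 | 61,278 | 186 |
| CitationFull (Paper) | | | | | |
| └─ Cora | 1 | 19,793 | 126,842 | 8,710 | 70 |
| └─ Cora_ML | 1 | 2,995 | 16,316 | 2,879 | 7 |
| └─ CiteSeer | 1 | 4,230 | 10,674 | 602 | 6 |
| └─ DBLP | 1 | 17,716 | 105,734 | 1,639 | 4 |
| └─ PubMed | 1 | 19,717 | 88,648 | 500 | 3 |
| CoraFull | 1 | 19,793 | 126,842 | 8,710 | 70 |
| Coauthor (Paper) | | | | | |
| └─ CS | 1 | 18,333 | 163,788 | 6,805 | 15 |
| └─ Physics | 1 | 34,493 | 495,924 | 8,415 | 5 |
| Amazon (Paper) | | | | | |
| └─ Computers | 1 | 13,752 | 491,722 | 767 | 10 |
| └─ Photo | 1 | 7,650 | 238,162 | 745 | 8 |
| PPI (Paper) | 20 | ~2,245.3 | ~61,318.4 | 50 | 121 |
| Reddit (Paper) | 1 | 232,965 | 114,615,892 | 602 | 41 |
| Reddit2 (Paper) | 1 | 232,965 | 23,213,838 | 602 | 41 |
| Flickr (Paper) | 1 | 89,250 | 899,756 | 500 | 7 |
| Yelp (Paper) | 1 | 716,847 | 13,954,819 | 300 | 100 |
| AmazonProducts (Paper) | 1 | 1,569,960 | 264,339,468 | 200 | 107 |
| QM7b (Paper) | 7,211 | ~15.4 | ~245.0 | 0 | 14 |
| QM9 (Paper) | 130,831 | ~18.0 | ~37.3 | 11 | 19 |
| MD17 (Paper) | | | | | |
| ZINC (Paper) | | | | | |
| AQSOL (Paper) | | | | | |
| MoleculeNet (Paper) | | | | | |
| Entities (Paper) | | | | | |
| RelLinkPredDataset (Paper) | | | | | |
| GEDDataset (Paper) | | | | | |
| AttributedGraphDataset (Paper) | | | | | |
| MNISTSuperpixels (Paper) | | | | | |
| FAUST (Paper) | | | | | |
| DynamicFAUST (Paper) | | | | | |
| ShapeNet (Paper) | | | | | |
| ModelNet (Paper) | | | | | |
| CoMA (Paper) | | | | | |
| SHREC2016 (Paper) | | | | | |
| TOSCA (Paper) | | | | | |
| PCPNetDataset (Paper) | | | | | |
| S3DIS (Paper) | | | | | |
| GeometricShapes | | | | | |
| BitcoinOTC (Paper) | | | | | |
| ICEWS18 (Paper) | | | | | |
| GDELT (Paper) | | | | | |
| DBP15K (Paper) | | | | | |
| WILLOWObjectClass (Paper) | | | | | |
| PascalVOCKeypoints (Paper) | | | | | |
| PascalPF (Paper) | | | | | |
| SNAPDataset (Paper) | | | | | |
| SuiteSparseMatrixCollection (Paper) | | | | | |
| AMiner (Paper) | | | | | |
| WordNet18 (Paper) | | | | | |
| WordNet18RR (Paper) | | | | | |
| WikiCS (Paper) | | | | | |
| WebKB (Paper) | | | | | |
| WikipediaNetwork (Paper) | | | | | |
| Actor (Paper) | | | | | |
| OGB_MAG (Paper) | | | | | |
| DBLP (Paper) | | | | | |
| MovieLens (Paper) | | | | | |
| IMDB (Paper) | | | | | |
| LastFM (Paper) | | | | | |
| HGBDataset (Paper) | | | | | |
| JODIEDataset (Paper) | | | | | |
| MixHopSyntheticDataset (Paper) | | | | | |
| UPFD (Paper) | | | | | |
| GitHub (Paper) | | | | | |
| FacebookPagePage (Paper) | | | | | |
| LastFMAsia (Paper) | | | | | |
| DeezerEurope (Paper) | | | | | |
| GemsecDeezer (Paper) | | | | | |
| Twitch (Paper) | | | | | |
| Airports (Paper) | | | | | |
| BAShapes (Paper) | | | | | |
| MalNetTiny (Paper) | | | | | |
| OMDB (Paper) | | | | | |
| PolBlogs (Paper) | | | | | |
| EmailEUCore (Paper) | | | | | |
| StochasticBlockModelDataset | | | | | |
| RandomPartitionGraphDataset (Paper) | | | | | |
| LINKXDataset (Paper) | | | | | |
| EllipticBitcoinDataset (Paper) | 1 | 203,769 | 234,355 | 165 | 2 |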