Design of GNNs¶

What is GNN?¶

In machine learning, Graph Neural Networks (GNNs) are a class of neural networks specifically designed to process graph-structured data. In a GNN, the input is represented as a graph, where nodes (vertices) correspond to entities and edges represent the relationships or interactions between these entities. A typical GNN architecture consists of an initial Transform followed by multiple Convolution layers, as detailed in Understanding Transforms and Understanding Convolutions.

Construct a GCN¶

In this tutorial, we will describe how to train a simple Graph Convolutional Network (GCN). Since their introduction, Graph Neural Networks (GNNs) have significantly impacted various fields such as social network analysis, recommendation systems, and link prediction. The GCN model, proposed in the paper [Semi-supervised Classification with Graph Convolutional Networks], is one of the most classic models in GNN research. Next, we will build a simple GCN and use it to perform node classification on the Cora citation network dataset.

First, we load the Cora dataset and transform it using GCNTransform. GCNTransform refers to the data transformation process described in the original GCN paper, which includes adding self-loops to the adjacency matrix, applying symmetric normalization, and normalizing node features.

import os.path as osp

import torch

from rllm.datasets import PlanetoidDataset
from rllm.transforms.graph_transforms import GCNTransform

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Load dataset
path = osp.join(osp.dirname(osp.realpath(__file__)), "data")
data = PlanetoidDataset(path, "cora")[0]

# Transform data
transform = GCNTransform()
data = transform(data).to(device)

Next, we can construct a two-layer GCN and use ReLU as the activation function:

import torch.nn.functional as F

from rllm.nn.conv.graph_conv import GCNConv

class GCN(torch.nn.Module):
    def __init__(self, in_dim, hidden_dim, out_dim, dropout):
        super().__init__()
        self.dropout = dropout
        self.conv1 = GCNConv(in_dim, hidden_dim)
        self.conv2 = GCNConv(hidden_dim, out_dim)

    def forward(self, x, adj):
        x = F.dropout(x, p=self.dropout, training=self.training)
        x = F.relu(self.conv1(x, adj))
        x = F.dropout(x, p=self.dropout, training=self.training)
        x = self.conv2(x, adj)
        return x

We can initialize the model, optimizer and loss function.

# Set up model, optimizer and loss function
model = GCN(
    in_dim=data.x.shape[1],
    hidden_dim=16,
    out_dim=data.num_classes,
    dropout=0.5,
).to(device)
optimizer = torch.optim.Adam(
    model.parameters(),
    lr=0.01,
    weight_decay=5e-4,
)
loss_fn = torch.nn.CrossEntropyLoss()

Finally, we train our model and get the classification results on the test set.

for epoch in range(200):
    model.train()
    optimizer.zero_grad()
    out = model(data.x, data.adj)
    loss = loss_fn(out[data.train_mask], data.y[data.train_mask])
    loss.backward()
    optimizer.step()

with torch.no_grad():
    model.eval()
    out = model(data.x, data.adj)
    pred = out.argmax(dim=1)

    correct = float(pred[data.test_mask].eq(data.y[data.test_mask]).sum().item())
    acc = correct / int(data.test_mask.sum())

print(f"Accuracy: {acc:.4f}")
>>> 0.8150