Main Hyper-Parameter Reference¶

Name	Description
`activation`	Activation function used in the model layers.
`batch_size`	Number of samples per batch during training.
`bias`	Whether to include bias terms in the model layers.
`conv_dim`	Dimension of convolution layers when the input and output dimensions must be the same.
`concat`	Whether to concatenate the outputs from multiple heads in multi-head attention.
`dataset`	Dataset to be used for training or evaluation.
`dropout`	Dropout rate applied to the model to prevent overfitting.
`emb_dim`	Dimension of the embedding layer.
`epochs`	Number of training epochs.
`head_dim`	Dimension of each attention head in multi-head attention mechanisms.
`hidden_dim`	Dimension of the hidden layers within the model.
`in_dim`	Dimension of the input data.
`lr`	Learning rate for training.
`metadata`	Metadata of graph or tabular data, including node and edge types, and other related information.
`num_classes`	Number of classes in the classification task.
`num_feats`	Number of features in the dataset.
`num_heads`	Number of attention heads in multi-head attention mechanisms.
`num_layers`	Number of layers in the model.
`out_dim`	Dimension of the model’s final output.
`patience`	Early stopping criterion, specifying the number of epochs to wait for improvement before halting training.
`seed`	Random seed for reproducibility of results.
`wd`	Weight decay parameter to regularize the model.