Hello, I saw you reported the performance of GT baselines on large scale datasets, such as ogbn-arxiv and products. However, I didn't see you use any sampler when train GTs on them. I could not report the results since it introduced scalability Issue with some GTs, such as GPS which requires global attention. I am wondering if you could provide more details of your GT experimental settings on large scale datasets, such as sampler you used and hyperparameters, especially with GPS.
Hello, I saw you reported the performance of GT baselines on large scale datasets, such as ogbn-arxiv and products. However, I didn't see you use any sampler when train GTs on them. I could not report the results since it introduced scalability Issue with some GTs, such as GPS which requires global attention. I am wondering if you could provide more details of your GT experimental settings on large scale datasets, such as sampler you used and hyperparameters, especially with GPS.