I am working on a sequence tagging task where the logits output should have shape [batch size * sequence length]. Does cuBERT support that?