evaluation

Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling

We show empirically that increasing the density of negative samples improves the basic model, and using a global negative queue further improves and stabilizes the model while training with hard negative samples.