MultiHashFormer: Hash-based Generative Language Models Paper • 2606.28057 • Published 6 days ago • 19
An Empirical Study on Preference Tuning Generalization and Diversity Under Domain Shift Paper • 2601.05882 • Published Jan 9 • 21
Enhancing Linguistic Competence of Language Models through Pre-training with Language Learning Tasks Paper • 2601.03448 • Published Jan 6 • 13