Emergence of Episodic Memory in Transformers: Characterizing Changes in Temporal Structure of Attention Scores During Training Paper • 2502.06902 • Published Feb 9, 2025 • 1
A Comparative Study of Sentence Embedding Models for Assessing Semantic Variation Paper • 2308.04625 • Published Aug 8, 2023
deven367/Meta-Llama-3.1-8B-Instruct-Q5_K_M-GGUF Text Generation • 8B • Updated Jul 24, 2024 • 249 • 1