From memories to maps: Mechanisms of in context reinforcement learning in transformers11просмотров6 месяцев назад
What Non-Content Perturbations Reveal About Human and Clinical LLM Decision8просмотров6 месяцев назад
Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework7просмотров6 месяцев назад
LiveCodeBench Pro: How Do Olympiad MedalistsJudge LLMs in Competitive Programming?10просмотров6 месяцев назад