demonstrations专题

论文笔记：Are Human-generated Demonstrations Necessary for In-context Learning?

iclr 2024 reviewer 评分 6668 1 intro 大型语言模型（LLMs）已显示出在上下文中学习的能力给定几个带注释的示例作为演示，LLMs 能够为新的测试输入生成输出然而，现行的上下文学习（ICL）范式仍存在以下明显的缺点：最终性能极度敏感于选定的演示示例，到目前为止，还没有公认的完美演示选择标准制作演示可能是劳动密集型的，麻烦的甚至是禁止性的在许多 ICL 场景中

Watch,Try, Learn: Meta-Learning from Demonstrations and Rewards读书笔记

Abstract \quad Imitation learning 允许 agent 从 demonstrations 中学习复杂的行为。然而学习一个复杂的视觉任务需要很大的 demonstrations。Meta-imitation learning 可以通过学习类似任务的经验，使 agent 从一个或几个 demonstrations 中学习新任务。在 t a s k a m b i