Not all thinking happens in conversation. For some introverts, writing is where thought begins—slowly, privately, and with surprising depth.
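description [ICML 2026][LLM Evaluation][Hierarchical Reinforcement Learning] HiPER restructures flat RL for LLM agents into a two-level Plan-Execute structure, in which a high level plans subgoals and a low level executes atomic actions, and proposes an accompanying Hierarchical Advantage Estimation (HAE) that takes GAE ...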