发布日期: 2025-09-19

更新日期: 2025-10-07

文章字数: 2.5k

阅读时长: 9 分

阅读次数:

⚠️ 以下所有内容总结都来自于大语言模型的能力，如有错误，仅供参考，谨慎使用
🔴 请注意：千万不要用于严肃的学术场景，只能用于论文阅读前的初筛！
💗 如果您觉得我们的项目对您有帮助 ChatPaperFree ，还请您给我们一些鼓励！⭐️ HuggingFace免费体验

2025-09-19 更新

Dynamic Adaptive Legged Locomotion Policy via Decoupling Reaction Force Control and Gait Control

Authors:Renjie Wang, Shangke Lyu, Donglin Wang

While Reinforcement Learning (RL) has achieved remarkable progress in legged locomotion control, it often suffers from performance degradation in out-of-distribution (OOD) conditions and discrepancies between the simulation and the real environments. Instead of mainly relying on domain randomization (DR) to best cover the real environments and thereby close the sim-to-real gap and enhance robustness, this work proposes an emerging decoupled framework that acquires fast online adaptation ability and mitigates the sim-to-real problems in unfamiliar environments by isolating stance-leg control and swing-leg control. Various simulation and real-world experiments demonstrate its effectiveness against horizontal force disturbances, uneven terrains, heavy and biased payloads, and sim-to-real gap.

虽然强化学习（RL）在腿部运动控制方面取得了显著的进步，但在分布外（OOD）条件下以及模拟环境和真实环境之间的差异中，其性能往往会下降。这项工作并没有主要依赖领域随机化（DR）来最好地覆盖真实环境，从而缩小模拟到真实的差距并增强鲁棒性，而是提出了一种新兴的解耦框架。该框架具有快速的在线适应能力，并通过隔离立腿控制和摆动腿控制，减轻了陌生环境中的模拟到真实问题。各种模拟和真实实验表明，该框架在水平力扰动、不平坦地形、重负载和偏向负载以及模拟到真实的差距方面具有很好的效果。

论文及项目相关链接

PDF

Summary

在强化学习（RL）中，尽管腿式运动控制方面取得了显著进展，但在分布外（OOD）条件下性能下降以及模拟与真实环境之间的差异仍然是一个挑战。本研究提出了一种新兴的解耦框架，通过隔离站立腿控制和摆动腿控制，具备快速在线适应能力，并缓解不熟悉环境下的模拟到现实问题。仿真和真实世界的实验证明，该方法对于水平力扰动、不平坦地形、重负载和偏向负载以及模拟到现实的差距具有很好的效果。

Key Takeaways

强化学习在腿式运动控制方面取得显著进展，但在分布外条件下性能下降以及模拟与真实环境之间的差异是挑战。
提出的解耦框架通过隔离站立腿控制和摆动腿控制，增强了机器人的适应能力。
框架具备快速在线适应能力，能缓解模拟到现实的问题。
该方法对于水平力扰动、不平坦地形、重负载和偏向负载具有很好的效果。
仿真和真实世界的实验证明了该框架的有效性。
该方法不仅关注于覆盖真实环境，而且致力于缩小模拟到现实的差距。

Cool Papers

点此查看论文截图

Reaction rates with temperature-dependent cross sections: A quantum dynamical microscopic model for the neutron capture reaction on the $^{188}$Os target

Authors:N. Lightfoot, A. Diaz-Torres, P. Stevenson

The neutron capture process plays a vital role in creating the heavy elements in the universe. The environments involved in these processes are, in general, high in temperature and are characterized by two distinct reaction mechanisms: the slow and rapid neutron capture processes. In this work, the slow neutron capture process is described with the time-dependent coupled channels wave-packet (TDCCWP) method that uses both a many-body nuclear potential and an initial temperature-dependent state to account for the thermal environment. To evaluate the role of a mixed and entangled initial state in the temperature-dependent neutron capture cross section, TDCCWP calculations are compared with those from the coupled-channels density matrix (CCDM) method based on the Lindblad equation. The importance of the temperature of the environment is then explored in the n+$^{188}$Os reaction with a decrease of cross section with increasing temperature, along with a decrease of $10%$ in reaction rates for the highest incident energies studied, which are important in the rapid neutron capture process.

中子捕获过程在宇宙中重元素的产生中起着至关重要的作用。这些过程所涉及的环境通常温度较高，并表现出两种截然不同的反应机制：慢化中子捕获过程和快速中子捕获过程。在这项工作中，采用时变耦合通道波包（TDCCWP）方法描述了慢化中子捕获过程，该方法使用多体核势和初始温度相关状态来考虑热环境。为了评估混合和纠缠初始状态在温度相关中子捕获截面中的作用，将TDCCWP的计算结果与基于Lindblad方程的耦合通道密度矩阵（CCDM）方法的计算结果进行了比较。然后，在n+$^{188}$Os反应中探索了环境温度的重要性，随着温度的升高，截面减小，对于所研究的最高的入射能量，反应速率降低了1,具有快速中子捕获过程的重要意义。

论文及项目相关链接

PDF 9 pages, 7 figures

Summary
中子俘获过程在宇宙中创造重元素时起到关键作用。本文描述了慢化中子俘获过程的时间相关耦合通道波包（TDCCWP）方法，该方法使用多体核势和初始温度相关状态来模拟热环境。同时，通过与基于Lindblad方程的耦合通道密度矩阵（CCDM）方法比较，评估了初始混合态对温度相关中子俘获截面的影响。同时，还探索了环境温度在n+$^{188}$Os反应中的重要性。

Key Takeaways

中子俘获过程在宇宙创造重元素中起重要作用。
存在慢化和快速中子俘获两种过程，本文关注慢化中子俘获。
TDCCWP方法被用来描述慢化中子俘获过程，考虑了多体核势和初始温度相关状态。
通过与CCDM方法比较，评估了初始混合态对温度相关中子俘获截面的影响。
环境温度在n+$^{188}$Os反应中很重要，随着温度升高，截面和反应速率都会下降。
在研究的最高入射能量下，反应速率下降了10%，这对快速中子俘获过程有影响。

Cool Papers

点此查看论文截图

MOOM: Maintenance, Organization and Optimization of Memory in Ultra-Long Role-Playing Dialogues

Authors:Weishu Chen, Jinyi Tang, Zhouhui Hou, Shihao Han, Mingjie Zhan, Zhiyuan Huang, Delong Liu, Jiawei Guo, Zhicheng Zhao, Fei Su

Memory extraction is crucial for maintaining coherent ultra-long dialogues in human-robot role-playing scenarios. However, existing methods often exhibit uncontrolled memory growth. To address this, we propose MOOM, the first dual-branch memory plugin that leverages literary theory by modeling plot development and character portrayal as core storytelling elements. Specifically, one branch summarizes plot conflicts across multiple time scales, while the other extracts the user’s character profile. MOOM further integrates a forgetting mechanism, inspired by the ``competition-inhibition’’ memory theory, to constrain memory capacity and mitigate uncontrolled growth. Furthermore, we present ZH-4O, a Chinese ultra-long dialogue dataset specifically designed for role-playing, featuring dialogues that average 600 turns and include manually annotated memory information. Experimental results demonstrate that MOOM outperforms all state-of-the-art memory extraction methods, requiring fewer large language model invocations while maintaining a controllable memory capacity.

在人与机器人角色扮演的场景中，记忆提取对于维持连贯的超长对话至关重要。然而，现有的方法常常会出现不可控制的内存增长问题。为了解决这个问题，我们提出了MOOM，这是第一个利用文学理论对情节发展和人物刻画进行建模的双分支记忆插件，将其作为核心叙事元素。具体来说，一个分支总结了跨多个时间尺度的情节冲突，而另一个分支则提取了用户的人物特征。MOOM进一步融合了一种遗忘机制，该机制受到“竞争抑制”记忆理论的启发，以限制内存容量并缓解不受控制的增长。此外，我们还推出了ZH-4O，这是一个专门针对角色扮演设计的中文超长对话数据集，包含平均600轮对话，并包含手动标注的记忆信息。实验结果表明，MOOM优于所有先进的记忆提取方法，在保持可控内存容量的情况下，减少了大型语言模型的调用次数。

论文及项目相关链接

PDF

Summary

本文提出了MOOM，一种双分支记忆插件，用于在人机角色扮演场景中维护连贯的超长对话。它通过模拟情节发展和人物刻画来解决现有方法中的不可控内存增长问题。一分支用于跨多个时间尺度总结情节冲突，另一分支用于提取用户角色特征。此外，MOOM还引入了竞争抑制记忆理论中的遗忘机制来限制内存容量并控制其增长。实验结果表明，MOOM在记忆提取方面优于所有现有方法，使用较少的大型语言模型调用次数即可维持可控的内存容量。

Key Takeaways