%AYang, S.%AFeng, Y.%AZhang, S.%AZhou, M.%D2022%I %K %MOSTI ID: 10340487 %PMedium: X %TRegularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning %X Country unknown/Code not availableOSTI-MSA