Towards Generalist Agents Through Scaling Offline Reinforcement Learning