Posts by Tags

AI

AI Agents

CoRL

[Paper Notes] DayDreamer: World Models for Physical Robot Learning - CoRL 2022

1 minute read

Published:

Key information

  • The paper learns two components: a world model, trained with supervised learning on off-policy sequences, and an actor-critic that learns behaviors from trajectories predicted by the world model.
  • Data collection and learning updates are decoupled, so training proceeds without waiting for the environment: a learner thread continuously trains the world model and the actor-critic, while an actor thread computes actions for environment interaction in parallel.
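
To make the decoupling concrete, here is a minimal sketch of the actor/learner split using Python threads and a shared replay buffer. It is not the paper's implementation: the toy one-dimensional environment, the random policy, and the names `ReplayBuffer`, `actor_loop`, `learner_loop`, and `run` are all assumptions made for illustration, and the learner's update is only a placeholder counter.

```python
import random
import threading
import time
from collections import deque


class ReplayBuffer:
    """Thread-safe buffer shared by the actor and learner (hypothetical helper)."""

    def __init__(self, capacity=10_000):
        self._data = deque(maxlen=capacity)
        self._lock = threading.Lock()

    def add(self, transition):
        with self._lock:
            self._data.append(transition)

    def sample(self, batch_size):
        # Returns None until enough experience has been collected.
        with self._lock:
            if len(self._data) < batch_size:
                return None
            return random.sample(list(self._data), batch_size)

    def __len__(self):
        with self._lock:
            return len(self._data)


def actor_loop(buffer, stop, env_steps):
    """Acts in a toy environment and stores transitions; never waits on the learner."""
    obs = 0.0
    for _ in range(env_steps):
        if stop.is_set():
            break
        action = random.uniform(-1.0, 1.0)  # assumption: stand-in for the learned policy
        next_obs = obs + action             # assumption: toy dynamics
        reward = -abs(next_obs)
        buffer.add((obs, action, reward, next_obs))
        obs = next_obs
        time.sleep(0.001)                   # stand-in for real-robot step latency


def learner_loop(buffer, stats, stop, batch_size):
    """Trains continuously on replayed data; never waits on the environment."""
    while not stop.is_set():
        batch = buffer.sample(batch_size)
        if batch is None:
            time.sleep(0.001)
            continue
        # Placeholder for the two updates described above:
        # 1) supervised world-model update on the replayed batch,
        # 2) actor-critic update on trajectories predicted by the world model.
        stats["updates"] += 1


def run(env_steps=200, batch_size=16):
    buffer, stats, stop = ReplayBuffer(), {"updates": 0}, threading.Event()
    actor = threading.Thread(target=actor_loop, args=(buffer, stop, env_steps))
    learner = threading.Thread(target=learner_loop, args=(buffer, stats, stop, batch_size))
    actor.start()
    learner.start()
    actor.join()   # the actor stops after env_steps environment steps
    stop.set()     # then signal the learner to stop
    learner.join()
    return len(buffer), stats["updates"]


if __name__ == "__main__":
    transitions, updates = run()
    print(f"collected {transitions} transitions, ran {updates} learner updates")
```

In this sketch the learner typically performs many more updates than the actor takes environment steps, which is the point of the decoupling: slow environment interaction does not throttle training. A fuller version would also have the actor periodically pull fresh policy weights from the learner, which is omitted here.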

Cognition

Communication

Dexterous Manipulation

Dynamics Models

Egocentric Data

Embodied AI

Human-to-Robot Transfer

Humanoid

Imitation Learning

Information Retrieval

Intelligence

Manipulation

Model-Based Control

Paper Notes

Personal Thoughts

Productivity

RSS

Reinforcement Learning

[Paper Notes] DayDreamer: World Models for Physical Robot Learning - CoRL 2022

Robotics

[Paper Notes] DayDreamer: World Models for Physical Robot Learning - CoRL 2022

Scaling Laws

Sim2Real

Systems

Unsupervised RL

VLA

Vision-Language-Action

World Models

update

Updating website

less than 1 minute read

Published:

I just completed a major update to my personal website.

website

Updating website
