Inspired by prior works on self-supervised learning in time (e.g, TimeCycle by
@xiaolonw
, Allan Jabri et al., TCC by Dwibedi et al.), we form a time cycle for self-supervision. The key difference is how to identify the **soft target** in the adjacent frame.