Scene Understanding

I think this is really what we want to achieve: training a model on video so that it understands what is happening in the scene.

What they do at sportLoGiQ is actually pretty cool: they need to classify the actions in the video. However, they do this offline rather than in real time, so it seems kind of OK.

  • CenterNet
  • SlowFast