This is the C3D model used with a fork of Caffe to the Sports1M dataset migrated to Keras. Details about the network architecture can be found in the following arXiv paper:
Tran, Du, et al. "Learning Spatiotemporal Features With 3D Convolutional Networks." Proceedings of the IEEE International Conference on Computer Vision. 2015.
Download: weights
Hi Alberto,
Great job ! Thanks for sharing !
How do you know that
ZeroPadding3D (zeropadding3d) (None, 512, 2, 9, 9) 0
layer is used ? I was not able to find it in the original Caffe model ... Could you please point the place where it is ?