##Sequence to Sequence -- Video to Text
Paper : ICCV 2015 PDF
Download Model: S2VT_VGG_RGB_MODEL (333MB)
| ''' | |
| Here's the snippet from my code. | |
| Also modify the .yaml config file as needed. | |
| Hope this helps! | |
| ''' | |
| from slowfast.models import model_builder, head_helper | |
| from slowfast.config.defaults import get_cfg | |
| import slowfast.utils.checkpoint as cu | |
| # slowfast net |
| # Note – this is not a bash script (some of the steps require reboot) | |
| # I named it .sh just so Github does correct syntax highlighting. | |
| # | |
| # This is also available as an AMI in us-east-1 (virginia): ami-cf5028a5 | |
| # | |
| # The CUDA part is mostly based on this excellent blog post: | |
| # http://tleyden.github.io/blog/2014/10/25/cuda-6-dot-5-on-aws-gpu-instance-running-ubuntu-14-dot-04/ | |
| # Install various packages | |
| sudo apt-get update |
##Sequence to Sequence -- Video to Text
Paper : ICCV 2015 PDF
Download Model: S2VT_VGG_RGB_MODEL (333MB)
| """Short and sweet LSTM implementation in Tensorflow. | |
| Motivation: | |
| When Tensorflow was released, adding RNNs was a bit of a hack - it required | |
| building separate graphs for every number of timesteps and was a bit obscure | |
| to use. Since then TF devs added things like `dynamic_rnn`, `scan` and `map_fn`. | |
| Currently the APIs are decent, but all the tutorials that I am aware of are not | |
| making the best use of the new APIs. | |
| Advantages of this implementation: |
| name: "GoogleNet" | |
| input: "data" | |
| input_dim: 10 | |
| input_dim: 3 | |
| input_dim: 224 | |
| input_dim: 224 | |
| # hierarchy 1 | |
| # conv -> relu -> pool -> lrn |