|
data:image/s3,"s3://crabby-images/04e83/04e83bf87be89ba71035c697f088a059ecc6cc0d" alt="No Maintenance Intended" |
|
data:image/s3,"s3://crabby-images/9d7ec/9d7ecdcc66ad9d93954827a18daf2d57394ef168" alt="TensorFlow Requirement: 1.x" |
|
data:image/s3,"s3://crabby-images/63659/636590da8b81e0a256865448b3517acd86c50e5b" alt="TensorFlow 2 Not Supported" |
|
|
|
<font size=4><b>Visual Dynamics: Probabilistic Future Frame Synthesis via Cross Convolutional Networks.</b></font> |
|
|
|
<b>Introduction</b> |
|
|
|
https://arxiv.org/pdf/1607.02586v1.pdf |
|
|
|
This is an implementation based on my understanding, with small |
|
variations. It doesn't necessarily represents the paper published |
|
by the original authors. |
|
|
|
Authors: Xin Pan, Anelia Angelova |
|
|
|
<b>Results:</b> |
|
|
|
data:image/s3,"s3://crabby-images/d1fc0/d1fc0193d3ced2a04b0161b7e3a7dfd3f94373fd" alt="Sample1" |
|
|
|
data:image/s3,"s3://crabby-images/03997/03997575a8023f019a7bdc7190ed09dea4d27587" alt="Sample2" |
|
|
|
data:image/s3,"s3://crabby-images/14958/14958f835b9a4e59f6b14c89b76a10e6dc9b2d18" alt="Loss" |
|
|
|
<b>Prerequisite:</b> |
|
|
|
1. Install TensorFlow (r0.12), Bazel. |
|
|
|
2. Download the Sprites dataset or generate moving object dataset. |
|
|
|
Sprites data is located here: |
|
|
|
http://www.scottreed.info/files/nips2015-analogy-data.tar.gz |
|
|
|
Convert .mat files into images and use sprites_gen.py to convert them |
|
to tf.SequenceExample. |
|
|
|
<b>How to run:</b> |
|
|
|
```shell |
|
$ ls -R |
|
.: |
|
data next_frame_prediction WORKSPACE |
|
|
|
./data: |
|
tfrecords tfrecords_test |
|
|
|
./next_frame_prediction: |
|
cross_conv g3doc README.md |
|
|
|
./next_frame_prediction/cross_conv: |
|
BUILD eval.py objects_gen.py model.py reader.py sprites_gen.py train.py |
|
|
|
./next_frame_prediction/g3doc: |
|
cross_conv2.png cross_conv3.png cross_conv.png |
|
|
|
|
|
# Build everything. |
|
$ bazel build -c opt next_frame_prediction/... |
|
|
|
# The following example runs the generated 2d objects. |
|
# For Sprites dataset, image_size should be 60, norm_scale should be 255.0. |
|
# Batch size is normally 16~64, depending on your memory size. |
|
|
|
# Run training. |
|
$ bazel-bin/next_frame_prediction/cross_conv/train \ |
|
--batch_size=1 \ |
|
--data_filepattern=data/tfrecords \ |
|
--image_size=64 \ |
|
--log_root=/tmp/predict |
|
|
|
step: 1, loss: 24.428671 |
|
step: 2, loss: 19.211605 |
|
step: 3, loss: 5.543143 |
|
step: 4, loss: 3.035339 |
|
step: 5, loss: 1.771392 |
|
step: 6, loss: 2.099824 |
|
step: 7, loss: 1.747665 |
|
step: 8, loss: 1.572436 |
|
step: 9, loss: 1.586816 |
|
step: 10, loss: 1.434191 |
|
|
|
# Run eval. |
|
$ bazel-bin/next_frame_prediction/cross_conv/eval \ |
|
--batch_size=1 \ |
|
--data_filepattern=data/tfrecords_test \ |
|
--image_size=64 \ |
|
--log_root=/tmp/predict |
|
``` |
|
|