shelhamer/readme.md Secret

Last active August 10, 2017 10:42

Star () You must be signed in to star a gist
Fork () You must be signed in to fork a gist

Learn more about clone URLs
Clone this repository at <script src="https://gist.github.com/shelhamer/91eece041c19ff8968ee.js"></script>
Save shelhamer/91eece041c19ff8968ee to your computer and use it in GitHub Desktop.

Download ZIP

FCN-8s Fully Convolutional Semantic Segmentation on PASCAL-Context

Raw

See the reference implementation at http://fcn.berkeleyvision.org. This pre-release is deprecated.

abidrahmank commented Dec 23, 2015

First convolution layer padding is given as 100. Can anyone explain the reason for high padding values?

ch977 commented Jan 15, 2016

@abidrahmank
later layers do some crop, I think it is the reason why the pad be 100 in the first layer.
For example, pool5 layer is (512,22,22),and fc6 layer is (4096,16,16). So pool5 layer has been cropped.
If you don't use 100 pad , you won't get last layer.

xavierpuigf commented Jan 22, 2016

Hi, I am trying to train fcn-8 on pascal-context dataset to replicate results but so far my loss doesn't seem to be decreasing. I have also tried finetuning from fcn-8-pascalcontext model, but my loss at iteration 0 is 1.7788e+06.

Considering that iteration 0 is just an evaluation of the model, shouldn't I get a much lower loss? Note that here I am not doing the net surgery step.

abidrahmank commented Jan 27, 2016

What is the functionality of crop Layer and Eltwise Fusion in this network?

xjtuljy commented Feb 4, 2016

Hi all, I now try to export the prediction from the FCN using C++ for my project. But I end up with some very strange images.

In my problem I tried to build my own fcn and there are only two classes to segment (0 and 1)

Here is my code to read the output of the forward pass:
////////////////////////////////////////////////////////////////
const vector<Blob>& result = caffe_net.Forward(bottom_vec, &iter_loss); // forward pass
const float result_vec = result[0]->cpu_data();

// generate prediction from the output vector and store it in Mat
cv::Mat srcC = cv::Mat::zeros(cv::Size(512,384), CV_32FC1);
int nl= srcC.rows;
int nc= srcC.cols;

for (int j=0; j<nl; j++) {
float* data= srcC.ptr(j);
for (int i=0; i<nc; i++) {
if (result_vec[i+j*nc+datum.height()*datum.width()] > result_vec[i+j_nc]);
// compare the value from different classes and generate the prediction
data[i] = 255;
}
}
//////////////////////////////////////////////////////////////////
The output of CNN is in format of const float_, but I don't know whether the data is arranged in c_h_w or w_h_c order for fcn, and due to the wired output Mat I got, I think maybe I did it in the wrong way. Can somebody help with this please?

arasharchor commented Mar 11, 2016

@shelhamer I want to fine tune fcn8 on a different dataset for predicting flow magnitude map which is gray scale image by doing regression by changing the loss to euclidean and num_outputs of last layer to 1. is that right?
The loss is too high. basically for how many iterations should I set it in this case?
should I first do some changes in solver around your parameters or in this case I should decrease e.g. base_lr because now we are finetuning.

thanks a lot for your help

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment