How to persist non-trainable variables in tf.Estimator checkpoint?

I'm trying to include a Dense layer that is not trainable and initialized as an identity matrix, as part of my tensorflow Estimator. The intuition is for this Dense layer to pass through its inputs during standard training and a fine tuning step afterward. The catch is that I don't want these weights updated at all during the initial round, only during fine tuning.

I can do several things to make these weights non-trainable, including using the trainable argument in the Dense constructor or by filtering out anything with dense in its name before passing to MomentumOptimizer.compute_gradients().

But in either case (make dense non-trainable or just don't pass it to optimizer), tf will throw an error saying that it cannot find a key related to the dense layer.

I understand that since on the first run, where dense is non-trainable, that it won't be persisted in the checkpoint file. Likewise, if it's filtered out from being passed to compute_gradients, then the same issue occurs.

Is there any method to just persist untrained variables, even with just their initialized values, across runs?

NotFoundError (see above for traceback): Key dense/kernel/Momentum not
found in checkpoint

asked Nov 24 '18 at 16:14

Joey Carson

1,26631845

add a comment |

But in either case (make dense non-trainable or just don't pass it to optimizer), tf will throw an error saying that it cannot find a key related to the dense layer.

Is there any method to just persist untrained variables, even with just their initialized values, across runs?

NotFoundError (see above for traceback): Key dense/kernel/Momentum not
found in checkpoint

asked Nov 24 '18 at 16:14

Joey Carson

1,26631845

add a comment |

But in either case (make dense non-trainable or just don't pass it to optimizer), tf will throw an error saying that it cannot find a key related to the dense layer.

Is there any method to just persist untrained variables, even with just their initialized values, across runs?

NotFoundError (see above for traceback): Key dense/kernel/Momentum not
found in checkpoint

asked Nov 24 '18 at 16:14

Joey Carson

1,26631845

But in either case (make dense non-trainable or just don't pass it to optimizer), tf will throw an error saying that it cannot find a key related to the dense layer.

Is there any method to just persist untrained variables, even with just their initialized values, across runs?

NotFoundError (see above for traceback): Key dense/kernel/Momentum not
found in checkpoint

python tensorflow tensorflow-estimator finetunning

asked Nov 24 '18 at 16:14

Joey Carson

1,26631845

asked Nov 24 '18 at 16:14

Joey Carson

1,26631845

asked Nov 24 '18 at 16:14

Joey Carson

1,26631845

asked Nov 24 '18 at 16:14

Joey Carson

1,26631845

asked Nov 24 '18 at 16:14

Joey Carson

1,26631845

add a comment |

1 Answer
1

active

oldest

votes

I will answer my own question here because it wasn't immediately obvious to me, as tf documentation doesn't seem to make it clear. If you want to introduce a new trainable variable, then it will need to fundamentally be a different model in later ones. Thus in order to handle fine-tuning of existing weights, those existing weights in the new model must be resolved from the warm start settings.

So train a model, conditionally not including the fine-tune layer when your Estimator's model function runs. Train the existing model and then create another model that is separate. Technically this just means you need to use a fresh model directory, but warm start settings should point to the model you trained beforehand.

On fine-tuning runs, your model function should conditionally include the fine-tuning layers, but it should restore the weights from the previous run setting the warm start settings to look at previous model directory.

answered Nov 27 '18 at 14:35

Joey Carson

1,26631845

add a comment |

Your Answer

StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});

}
});

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53460026%2fhow-to-persist-non-trainable-variables-in-tf-estimator-checkpoint%23new-answer', 'question_page');
}
);

Post as a guest

Name

Required, but never shown

1 Answer
1

active

oldest

votes

1 Answer
1

active

oldest

votes

answered Nov 27 '18 at 14:35

Joey Carson

1,26631845

add a comment |

answered Nov 27 '18 at 14:35

Joey Carson

1,26631845

add a comment |

answered Nov 27 '18 at 14:35

Joey Carson

1,26631845

answered Nov 27 '18 at 14:35

Joey Carson

1,26631845

answered Nov 27 '18 at 14:35

Joey Carson

1,26631845

answered Nov 27 '18 at 14:35

Joey Carson

1,26631845

answered Nov 27 '18 at 14:35

Joey Carson

1,26631845

add a comment |

draft saved

draft discarded

Thanks for contributing an answer to Stack Overflow!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

搜尋此網誌

Ytukyg