We are having a lot of discussion around LoRA/quantization and other PEFT strategies that require replacing layers (often dense layers) with parameter-efficient replacements.
In torch, an `nn.Module` will track submodules by name. So you can run `module.sub_module = new_sub_module` without issue. The old child will be booted out for the new. Same for `tf.Module`.
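For illustration, here is the torch behavior in a minimal sketch (toy layer sizes, but the reassignment semantics are the real `nn.Module` behavior):

```python
from torch import nn

class Block(nn.Module):
    def __init__(self):
        super().__init__()
        self.dense = nn.Linear(16, 16)

block = Block()
# nn.Module registers submodules by attribute name, so plain reassignment
# replaces the old child; it drops out of parameters() and named_modules().
block.dense = nn.Linear(16, 8)
assert block.dense.out_features == 8
assert len(list(block.parameters())) == 2  # only the new Linear's weight + bias
```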
This is not the case with Keras layers. Tracking is currently "append only," not by attr name, and locked after `build()`. To work around this for a LoRA implementation, we currently do the following. We should consider whether we want to allow this for Keras layers with public APIs, potentially by extending the tracker to track by attr name.
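For context, the mismatch looks roughly like this (a sketch of the behavior described above; the exact error message and tracking internals vary by Keras version):

```python
import numpy as np
from keras import layers

class Block(layers.Layer):
    def __init__(self, **kwargs):
        super().__init__(**kwargs)
        self.dense = layers.Dense(16)

    def call(self, x):
        return self.dense(x)

block = Block()
block(np.zeros((1, 16), dtype="float32"))  # building the layer locks the tracker

# Unlike torch / tf.Module, this is not a clean swap: tracking is append only
# (not keyed by attr name) and locked after build(), so the assignment is
# rejected or the old layer's state lingers, depending on version.
try:
    block.dense = layers.Dense(16)
except Exception as e:
    print(e)
```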
My thoughts are that since both torch and tf allow variable and submodule reassignment, we should probably do the same. Consistency here will cause fewer headaches. This could also come up for code like this...
```python
def build(self, input_shape):
    self.bias = self.add_weight(...)
    ...
    if some_complex_case:
        self.bias = None  # Never mind, no bias!
```
I am less sure about locking the tracker after `build()`. We could...
- Allow reassignment only.
- Not lock tracking at all.
- Force an explicit unlock, e.g. `layer.unlock()` (see the sketch after this list).
- Have no public API support for this. Only possible via `_tracker` shenanigans.
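For the explicit-unlock option, user code might look like the sketch below. This is purely hypothetical: `layer.unlock()` does not exist today, hence the `hasattr` guard that keeps the snippet runnable.

```python
import keras

layer = keras.layers.Dense(4)
layer.build((None, 4))  # builds kernel + bias

# Hypothetical option-3 API: explicitly unlock a built layer before mutating
# its tracked state, then reassign or remove variables/sublayers.
if hasattr(layer, "unlock"):
    layer.unlock()
    layer.bias = None  # e.g. decide after the fact that there is no bias
```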
This is tricky, because this kind of mutation after `fit`/`predict`/`evaluate` could land you in hot water. Optimizer state will be invalid. Compiled functions too.
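To make that concrete: after `fit()`, the optimizer has built per-variable state (e.g. Adam moments) against the model's current variable list, and the train step has been compiled against it. Any post-swap recovery would need at least a recompile (a sketch with toy data):

```python
import numpy as np
import keras

x = np.random.rand(32, 16).astype("float32")
y = np.random.rand(32, 1).astype("float32")

model = keras.Sequential([keras.layers.Dense(8), keras.layers.Dense(1)])
model.compile(optimizer="adam", loss="mse")
model.fit(x, y, epochs=1, verbose=0)  # optimizer builds slots for current variables

# If a sublayer were swapped here, the Adam moments and the traced train
# function would still point at the old variables. Recompiling rebuilds
# optimizer state and forces retracing:
model.compile(optimizer="adam", loss="mse")
```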