Konstantin Kobs
02/13/2024, 11:24 AM
`_call_node_run` function, the node run itself took only about 5% of the overall function runtime (filtering some datasets), while the `after_node_run` hook took 95% of the runtime, even though I have no functions defined for this hook. I suspect that hooks that get the input and output data as parameters are adding a large overhead, since the `before_dataset_loaded` hook does not add any overhead. I think this might be a problem with Pluggy, but maybe someone here has an idea of how to reduce this overhead.
Thanks!
datajoely
02/13/2024, 11:24 AM
Konstantin Kobs
02/13/2024, 11:36 AM
`run_node` function takes around 11s.
datajoely
02/13/2024, 12:50 PM
`kedro-mlflow`?
Konstantin Kobs
02/13/2024, 1:29 PM
`kedro-mlflow`, but we have a Great Expectations plugin, which, however, I disabled. When I debug into the code, the hook manager does not have any plugins registered. Are there any "hidden" plugins that could be in there?
datajoely
02/13/2024, 1:30 PM
`pip freeze | grep kedro` to see if there are any other plug-ins installed?
Nok Lam Chan
02/13/2024, 1:42 PM
`pluggy` plugin so they should show up. It may be helpful if you can show all `debug` level logging since I think we have the hook trace enabled. For now I don't think this is a pluggy bug.
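As a complement to `pip freeze | grep kedro`, here is a minimal sketch for checking which installed packages advertise hooks. It assumes that Kedro discovers plugin hooks through the `kedro.hooks` entry-point group, which is worth verifying against your Kedro version. `list_hook_entry_points` is a hypothetical helper name, not part of Kedro or pluggy.

```python
from importlib.metadata import entry_points


def list_hook_entry_points(group="kedro.hooks"):
    """Return sorted (name, value) pairs for entry points registered under `group`.

    The default group name is an assumption about how Kedro discovers plugins.
    """
    eps = entry_points()
    if hasattr(eps, "select"):
        # Python 3.10+: entry points can be filtered by group directly.
        selected = eps.select(group=group)
    else:
        # Older Pythons return a dict mapping group name to entry points.
        selected = eps.get(group, [])
    return sorted((ep.name, ep.value) for ep in selected)


if __name__ == "__main__":
    for name, value in list_hook_entry_points():
        print(f"{name} -> {value}")
```

An empty result would suggest that no installed package registers hooks this way, which would be consistent with the hook manager showing no plugins.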
Is 11 seconds long based on your understanding? The most significant time that a node spends is on:
Nok Lam Chan
02/13/2024, 1:42 PM