Олег Литвинов
11/25/2024, 7:13 PMBaseRegressor
and lots of ancestors like LGBMRegressor
LinearRegressor
. And all these wrappers do not only use sklearn.model.predict or lgbm.model.predict but also incorporate quite long list of data preparations.
So my first question is: how this paradigm of "advanced and abstract ML development" is compatible with Kedro which is mostly (to the best of my understanding) for pipelines? In the basic examples I see that there may be any number of preprocessing steps like load, filter, enrich, fillna, etc, etc, and then just train step. This is compatible with the pipeline logic perfectly. But, probably, doesn't work well if you keep some methods in the model class and also use some internal states and so on. Maybe you know some good practices or have any ideas?
The second question is similar to the first one but covers mostly the inference part. Please, correct me if I'm wrong, but I mostly see Kedro as a framework for the preprocessing and training ML routines. What is recommended to do if I want to reuse some of my logic (already defined as data_processing nodes) for model inference?
Thank you very much!Hall
11/25/2024, 7:13 PMRavi Kumar Pilla
11/25/2024, 8:41 PMОлег Литвинов
11/25/2024, 11:00 PMYolan Honoré-Rougé
11/26/2024, 8:10 AMYolan Honoré-Rougé
11/26/2024, 8:10 AMYolan Honoré-Rougé
11/26/2024, 8:11 AMYolan Honoré-Rougé
11/26/2024, 8:12 AMОлег Литвинов
11/26/2024, 9:30 AMОлег Литвинов
11/26/2024, 8:09 PMYolan Honoré-Rougé
11/26/2024, 9:28 PMОлег Литвинов
11/27/2024, 1:07 PMYolan Honoré-Rougé
11/27/2024, 1:15 PMYolan Honoré-Rougé
11/27/2024, 1:28 PMYolan Honoré-Rougé
11/27/2024, 1:28 PMОлег Литвинов
11/27/2024, 5:47 PMRavi Kumar Pilla
11/27/2024, 6:52 PM