# resources
Another fine piece of writing by Jim Dowling from Hopsworks: https://www.hopsworks.ai/post/a-taxonomy-for-data-transformations-in-ai-systems
This article introduces a taxonomy for data transformations in AI applications, one that is fundamental for any AI system that wants to reuse feature data in more than one model. The taxonomy consists of:

- model-independent transformations, which produce reusable features (for example, total customer spend in the last week);
- model-dependent transformations, which produce features specific to one model (for example, normalizing that customer spend value using the mean and standard deviation of all customer spends in the model’s training dataset);
- on-demand transformations, found in real-time AI systems, which require data only available at request time (for example, transforming longitude/latitude to a zipcode).
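To make the three categories concrete, here is a minimal Python sketch that follows the article's own examples. It is not taken from the article or from any Hopsworks API: the purchase records, the function names (`weekly_spend`, `fit_standardizer`, `zipcode_for`), and the bounding-box lookup with made-up zipcodes are all assumptions for illustration.

```python
from statistics import mean, pstdev

# Hypothetical purchase records: (customer_id, amount, days_ago).
PURCHASES = [
    ("alice", 30.0, 1),
    ("alice", 20.0, 3),
    ("alice", 99.0, 12),  # older than a week, so excluded from the feature
    ("bob", 10.0, 2),
    ("carol", 70.0, 5),
]


def weekly_spend(purchases, window_days=7):
    """Model-independent: total spend per customer over the last week.

    The result does not depend on any model, so it can be stored once
    and reused by every model that needs it.
    """
    totals = {}
    for customer, amount, days_ago in purchases:
        if days_ago < window_days:
            totals[customer] = totals.get(customer, 0.0) + amount
    return totals


def fit_standardizer(training_values):
    """Model-dependent: standardize with statistics of ONE training dataset.

    The mean and standard deviation belong to a specific model's training
    data, so the transformed feature is not reusable across models.
    """
    mu = mean(training_values)
    sigma = pstdev(training_values)

    def transform(value):
        # Guard against a constant feature (zero standard deviation).
        return (value - mu) / sigma if sigma else 0.0

    return transform


# Hypothetical lookup table: (lat_min, lat_max, lon_min, lon_max) -> zipcode.
# The boxes and codes are invented; a real system would use a geocoder.
ZIP_BOXES = [
    ((0.0, 1.0, 0.0, 1.0), "00001"),
    ((1.0, 2.0, 0.0, 1.0), "00002"),
]


def zipcode_for(lat, lon):
    """On-demand: needs coordinates that only arrive with the request."""
    for (lat_min, lat_max, lon_min, lon_max), code in ZIP_BOXES:
        if lat_min <= lat < lat_max and lon_min <= lon < lon_max:
            return code
    return None


# Reusable feature, computed once from historical data.
spend = weekly_spend(PURCHASES)

# Model-specific scaling, fitted on this model's training values only.
scale = fit_standardizer(list(spend.values()))
scaled_spend = {customer: scale(value) for customer, value in spend.items()}

# Request-time feature, computed from data in the incoming request.
request_zip = zipcode_for(0.5, 0.5)
```

The split matters for reuse: `spend` could be written to shared storage as-is, while `scale` has to be kept alongside the one model it was fitted for, and `zipcode_for` has to run inside the serving path because its inputs do not exist beforehand.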