Deepyaman Datta
01/18/2023, 3:17 PMmodel_input = (
spine
.join(number_of_visitors, how="left", on="store_id")
.join(price_per_pound, how="left", on=["store_id", "item_id"])
)
Does this process resonate with your experience, or does it look different in your organization?
Does the terminology also resonate? E.g.
• spine dataframe vs target dataframe vs something else?
• unit of analysis columns vs something else?Antony Milne
01/18/2023, 3:31 PMDeepyaman Datta
01/18/2023, 3:35 PMdate
+ item_id
+ store_id
in this example) be the index? I totally agree with making those into a multi-index, and same with making date + join key for each feature into a multi-index, because then your syntax is simplified to spine.join(number_of_visitors).join(price_per_pound)
. I'd say the label shouldn't be part of the index though (I think?).Antony Milne
01/18/2023, 4:07 PMPedro Abreu
01/19/2023, 10:12 AMmarrrcin
01/19/2023, 10:21 AMDeepyaman Datta
01/19/2023, 12:12 PMmarrrcin
01/19/2023, 12:30 PM