RaifHack: most loved solution
This repository showcases a solution for the RaifHack competition. The goal of the competition was to build a machine learning model that accurately predicts commercial real estate prices. The training dataset included both residential and commercial properties, but only commercial property prices had to be predicted.
The breakdown of the solution is:
- For a given commercial property object, find similar residential properties based on a hand-crafted set of tabular features including geographical location, area, etc. FAISS was used for efficient vector search.
- Run all samples through TabNet to produce a vector representation of each one. TabNet is an attention-based neural network designed for tabular data.
- For all pairs of residential-commercial samples, run the embeddings through a `Bilinear` layer.
- Concatenate the known residential price and run the result through a `Linear` head to obtain the commercial price prediction.
- Average the predictions from all similar residential objects found to obtain the final price (at evaluation time only, not during training).
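
The steps above can be sketched in a few lines of NumPy. This is an illustrative sketch under stated assumptions, not the repository's code: a brute-force nearest-neighbour search stands in for the FAISS index, random vectors stand in for TabNet embeddings, and the bilinear layer and linear head are written out by hand with random weights. All names (`nearest_residential`, `pair_head`, `predict_commercial`) and sizes (`EMB_DIM`, `HIDDEN`, `K`) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for illustration.
EMB_DIM, HIDDEN, K = 8, 4, 3


def nearest_residential(query_feats, residential_feats, k):
    """Brute-force k-nearest-neighbour search (a stand-in for a FAISS index)."""
    # Squared Euclidean distance from every query to every residential object.
    diff = query_feats[:, None, :] - residential_feats[None, :, :]
    dists = np.einsum("qrd,qrd->qr", diff, diff)
    # Indices of the k closest residential objects per query, nearest first.
    return np.argsort(dists, axis=1)[:, :k]


def pair_head(com_emb, res_emb, res_price, params):
    """Bilinear interaction of two embeddings, then a linear head on [hidden, price]."""
    W, b, v, c = params
    # hidden[n, h] = sum_ij com_emb[n, i] * W[h, i, j] * res_emb[n, j] + b[h]
    hidden = np.einsum("ni,hij,nj->nh", com_emb, W, res_emb) + b
    # Concatenate the known residential price as one extra input feature.
    head_in = np.concatenate([hidden, res_price[:, None]], axis=1)
    return head_in @ v + c


def predict_commercial(com_feats, com_emb, res_feats, res_emb, res_price, params, k=K):
    """Evaluation-time prediction: average the head output over the k neighbours."""
    idx = nearest_residential(com_feats, res_feats, k)  # shape (n_com, k)
    n_com = com_emb.shape[0]
    com_rep = np.repeat(com_emb, k, axis=0)             # each row repeated k times
    flat = idx.reshape(-1)                              # neighbours in matching order
    preds = pair_head(com_rep, res_emb[flat], res_price[flat], params)
    return preds.reshape(n_com, k).mean(axis=1)


# Random toy data; in the described pipeline the embeddings would come from TabNet.
res_feats = rng.normal(size=(20, 5))
com_feats = rng.normal(size=(4, 5))
res_emb = rng.normal(size=(20, EMB_DIM))
com_emb = rng.normal(size=(4, EMB_DIM))
res_price = rng.uniform(50.0, 150.0, size=20)
params = (
    rng.normal(size=(HIDDEN, EMB_DIM, EMB_DIM)) * 0.1,  # bilinear weights
    np.zeros(HIDDEN),                                   # bilinear bias
    rng.normal(size=HIDDEN + 1) * 0.1,                  # linear head weights
    0.0,                                                # linear head bias
)
prices = predict_commercial(com_feats, com_emb, res_feats, res_emb, res_price, params)
print(prices.shape)  # (4,)
```

Because averaging is applied only at evaluation time, a training loop would presumably use the per-pair output of `pair_head` directly, with one loss term per residential-commercial pair.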
This method was not used in our final submission, which did win the competition, but it received the “most loved solution” award.