Maskininlärning i fastighetsbranschen - prediktion av felanmälningar gällande inomhusklimat baserat på sensordata
This thesis investigates the prerequisites needed for the Swedish real estate company Fabege to create useful machine learning models for classification and prediction of error reportings from tenants. These error reportings are regarding cold indoor climates and bad indoor air quality. By analyzing the available data, that consist of error reporting data, weather data and indoor climate data, the thesis investigates the different correlations between the sensor data and the error reports. By using an algorithm called decision jungles, two machine learning models have been trained in Microsoft Azure Machine Learning Studio. The main model, trained on error reporting data and weather data, shows the possibilities to classify data instances as a part of different error reporting classes. The model proves that it is possible to predict the emergence of future error reports of the different classes with an average accuracy of 78%. The complementary model, trained on a small but more richly annotated dataset consisting of one year of indoor sensor data as well as the above-mentioned data, shows that there is a possibility to improve the main model by using indoor climate data. The thesis has shown that for Fabege to expand and improve these models, the amount of data collected from the indoor sensors needs to be largely increased. Fabege also needs to improve the quality of the error reporting data, which could be achieved by improving the error reporting form used by the tenants.