Monitoring Of Apartment Prices In The Czech Republic Through Parsing A Web Advertising Server
Result identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216275%3A25530%2F20%3A39916689" target="_blank" >RIV/00216275:25530/20:39916689 - isvavai.cz</a>
Result on the web
<a href="http://www.aei.tuke.sk/papers/2020/1/2_Pozdilkova.pdf" target="_blank" >http://www.aei.tuke.sk/papers/2020/1/2_Pozdilkova.pdf</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.15546/aeei-2020-0002" target="_blank" >10.15546/aeei-2020-0002</a>
Alternative languages
Result language
English
Title in original language
Monitoring Of Apartment Prices In The Czech Republic Through Parsing A Web Advertising Server
Result description in original language
Time series of apartment prices in the Czech Republic are available only in partial statistics from the Statistical Office. Apartment prices are reported mainly in articles and commentary by real estate agents. This lack of data means that few statistically oriented publications on the real estate market exist. The main aim of our paper is therefore to introduce a software solution for parsing real estate websites. We can retrieve only the asking prices stated in advertisements; actual sale prices are not available. Automatic polling yields the floor area of each advertised apartment and its asking purchase price. A Python script was written to retrieve data from sreality.cz, and a MongoDB database is used to store the ads. New ads are saved directly to the database. Then the daily average apartment price per square meter is calculated for each municipality. The filtered data can be displayed or exported to a file via the web interface. In the statistical analyses, we present graphs showing the development of apartment prices and the number of advertisements in various municipalities of the Czech Republic from 09/2018 to 12/2019. Finally, we address the clustering of municipalities according to the similarity of their relative price changes.
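The aggregation step described in the abstract (a daily average price per square meter for each municipality) could look roughly like the sketch below. It is an illustration only, not the authors' script: the field names `municipality`, `date`, `price_czk`, and `area_m2` are hypothetical, the ads are held in a plain list instead of MongoDB, and the average is assumed to be the mean of per-ad price-to-area ratios, which the abstract does not specify.

```python
from collections import defaultdict
from statistics import mean


def daily_avg_price_per_m2(ads):
    """Return {(municipality, date): mean price per square meter}.

    Each ad is a dict with hypothetical keys: municipality, date,
    price_czk (asking price) and area_m2 (floor area).
    """
    ratios = defaultdict(list)
    for ad in ads:
        price = ad.get("price_czk")
        area = ad.get("area_m2")
        # Skip ads that lack a usable asking price or floor area.
        if not price or not area or price <= 0 or area <= 0:
            continue
        ratios[(ad["municipality"], ad["date"])].append(price / area)
    # Assumption: average the per-ad ratios within each group.
    return {key: mean(values) for key, values in ratios.items()}


# Made-up sample data, for illustration only.
sample_ads = [
    {"municipality": "Pardubice", "date": "2019-05-01", "price_czk": 3_000_000, "area_m2": 60},
    {"municipality": "Pardubice", "date": "2019-05-01", "price_czk": 2_000_000, "area_m2": 50},
    {"municipality": "Brno", "date": "2019-05-01", "price_czk": 4_500_000, "area_m2": 75},
    {"municipality": "Brno", "date": "2019-05-01", "price_czk": 1_000_000, "area_m2": None},
]

averages = daily_avg_price_per_m2(sample_ads)
for (municipality, date), value in sorted(averages.items()):
    print(f"{date} {municipality}: {value:.0f} CZK/m2")
```

Dividing the summed prices by the summed floor areas would be an equally plausible reading of "average price per square meter"; it weights large apartments more heavily and can give a different figure.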
Classification
Type
J<sub>ost</sub> - Other articles in peer-reviewed periodicals
CEP field
—
OECD FORD field
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
S - Specific research at universities
Others
Publication year
2020
Data confidentiality code
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific to the result type
Name of the periodical
Acta Electrotechnica et Informatica
ISSN
1335-8243
e-ISSN
—
Volume of the periodical
20
Issue of the periodical within the volume
1
Country of the periodical's publisher
SK - Slovak Republic
Number of pages
6
Pages from-to
9-14
UT WoS code of the article
—
EID of the result in the Scopus database
—