BRISE-plandok: a German legal corpus of building regulations
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F25%3AIXRYJIU9" target="_blank" >RIV/00216208:11320/25:IXRYJIU9 - isvavai.cz</a>
Result on the web
<a href="https://www.scopus.com/inward/record.uri?eid=2-s2.0-85197671843&doi=10.1007%2fs10579-024-09747-7&partnerID=40&md5=796d2e2f4215e1ff3ecd9a32628b1ef6" target="_blank" >https://www.scopus.com/inward/record.uri?eid=2-s2.0-85197671843&doi=10.1007%2fs10579-024-09747-7&partnerID=40&md5=796d2e2f4215e1ff3ecd9a32628b1ef6</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/s10579-024-09747-7" target="_blank" >10.1007/s10579-024-09747-7</a>
Alternative languages
Result language
English
Original language name
BRISE-plandok: a German legal corpus of building regulations
Original language description
We present the BRISE-Plandok corpus, a collection of 250 text documents with a total of over 7000 sentences from the Zoning Map of the City of Vienna, annotated manually with formal representations of the rules they convey. The generic rule format used by the corpus enables automated compliance checking of building plans, a process developed as part of the BRISE (https://smartcity.wien.gv.at/en/brise/) project. The format also allows for conversion to multiple logic formalisms, including dyadic deontic logic, enabling automated reasoning. Annotation guidelines were developed in collaboration with experts of the city’s building inspection office, describing nearly 100 domain-specific attributes with examples. Each document was annotated independently by two trained annotators and subsequently reviewed by the authors. A rule-based system for the automatic extraction of rules from text was developed and used in the annotation process to provide suggestions. The reviewed dataset was also used to train a set of baseline machine learning models for the task of attribute extraction, the main step in the rule extraction process. Both the rule-based system and the ML baselines are evaluated on the annotated dataset and released as open-source software. We also describe and release the framework used for generating and parsing the interactive xlsx spreadsheets used by annotators. © The Author(s) 2024.
Czech name
—
Czech description
—
Classification
Type
J<sub>SC</sub> - Article in a specialist periodical, which is included in the SCOPUS database
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
—
Others
Publication year
2024
Confidentiality
S - Complete and true data on the project are not subject to protection according to special legal regulations
Data specific for result type
Name of the periodical
Language Resources and Evaluation
ISSN
1574-020X
e-ISSN
—
Volume of the periodical
2024
Issue of the periodical within the volume
2024
Country of publishing house
US - UNITED STATES
Number of pages
40
Pages from-to
1-40
UT code for WoS article
—
EID of the result in the Scopus database
2-s2.0-85197671843