BRISE-plandok: a German legal corpus of building regulations

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F25%3AIXRYJIU9" target="_blank" >RIV/00216208:11320/25:IXRYJIU9 - isvavai.cz</a>
Nalezeny alternativní kódy
RIV/00216208:11320/26:WJV7HU5W
Výsledek na webu
<a href="https://www.scopus.com/inward/record.uri?eid=2-s2.0-85197671843&doi=10.1007%2fs10579-024-09747-7&partnerID=40&md5=796d2e2f4215e1ff3ecd9a32628b1ef6" target="_blank" >https://www.scopus.com/inward/record.uri?eid=2-s2.0-85197671843&doi=10.1007%2fs10579-024-09747-7&partnerID=40&md5=796d2e2f4215e1ff3ecd9a32628b1ef6</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/s10579-024-09747-7" target="_blank" >10.1007/s10579-024-09747-7</a>

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
BRISE-plandok: a German legal corpus of building regulations
Popis výsledku v původním jazyce
We present the BRISE-Plandok corpus, a collection of 250 text documents with a total of over 7000 sentences from the Zoning Map of the City of Vienna, annotated manually with formal representations of the rules they convey. The generic rule format used by the corpus enables automated compliance checking of building plans, a process developed as part of the BRISE (https://smartcity.wien.gv.at/en/brise/) project. The format also allows for conversion to multiple logic formalisms, including dyadic deontic logic, enabling automated reasoning. Annotation guidelines were developed in collaboration with experts of the city’s building inspection office, describing nearly 100 domain-specific attributes with examples. Each document was annotated independently by two trained annotators and subsequently reviewed by the authors. A rule-based system for the automatic extraction of rules from text was developed and used in the annotation process to provide suggestions. The reviewed dataset was also used to train a set of baseline machine learning models for the task of attribute extraction, the main step in the rule extraction process. Both the rule-based system and the ML baselines are evaluated on the annotated dataset and released as open-source software. We also describe and release the framework used for generating and parsing the interactive xlsx spreadsheets used by annotators. © The Author(s) 2024.
Název v anglickém jazyce
BRISE-plandok: a German legal corpus of building regulations
Popis výsledku anglicky
We present the BRISE-Plandok corpus, a collection of 250 text documents with a total of over 7000 sentences from the Zoning Map of the City of Vienna, annotated manually with formal representations of the rules they convey. The generic rule format used by the corpus enables automated compliance checking of building plans, a process developed as part of the BRISE (https://smartcity.wien.gv.at/en/brise/) project. The format also allows for conversion to multiple logic formalisms, including dyadic deontic logic, enabling automated reasoning. Annotation guidelines were developed in collaboration with experts of the city’s building inspection office, describing nearly 100 domain-specific attributes with examples. Each document was annotated independently by two trained annotators and subsequently reviewed by the authors. A rule-based system for the automatic extraction of rules from text was developed and used in the annotation process to provide suggestions. The reviewed dataset was also used to train a set of baseline machine learning models for the task of attribute extraction, the main step in the rule extraction process. Both the rule-based system and the ML baselines are evaluated on the annotated dataset and released as open-source software. We also describe and release the framework used for generating and parsing the interactive xlsx spreadsheets used by annotators. © The Author(s) 2024.

Klasifikace

Druh
J<sub>SC</sub> - Článek v periodiku v databázi SCOPUS
CEP obor
—
OECD FORD obor
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Návaznosti výsledku

Projekt
—
Návaznosti
—

Ostatní

Rok uplatnění
2024
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název periodika
Language Resources and Evaluation
ISSN
1574-020X
e-ISSN
—
Svazek periodika
2024
Číslo periodika v rámci svazku
2024
Stát vydavatele periodika
US - Spojené státy americké
Počet stran výsledku
40
Strana od-do
1-40
Kód UT WoS článku
—
EID výsledku v databázi Scopus
2-s2.0-85197671843

Podobné výsledky(10)

BRISE-Plandok: a German legal corpus of building regulations Explainable rule extraction via semantic graphs Annotation Protocol for Textbook Enrichment with Prerequisite Knowledge Graph

Co hledáte?

Rychlé hledání

Chytré vyhledávání

BRISE-plandok: a German legal corpus of building regulations

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)