Naturalistic Causal Probing for Morpho-Syntax
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F23%3AJ3XGHLXI" target="_blank" >RIV/00216208:11320/23:J3XGHLXI - isvavai.cz</a>
Result on the web
<a href="https://direct.mit.edu/tacl/article-abstract/doi/10.1162/tacl_a_00554/115895" target="_blank" >https://direct.mit.edu/tacl/article-abstract/doi/10.1162/tacl_a_00554/115895</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1162/tacl_a_00554" target="_blank" >10.1162/tacl_a_00554</a>
Alternative languages
Result language
angličtina
Original language name
Naturalistic Causal Probing for Morpho-Syntax
Original language description
"Probing has become a go-to methodology for interpreting and analyzing deep neural models in natural language processing. However, there is still a lack of understanding of the limitations and weaknesses of various types of probes. In this work, we suggest a strategy for input-level intervention on naturalistic sentences. Using our approach, we intervene on the morpho-syntactic features of a sentence, while keeping the rest of the sentence unchanged. Such an intervention allows us to causally probe pre-trained models. We apply our naturalistic causal probing framework to analyze the effects of grammatical gender and number on contextualized representations extracted from three pre-trained models in Spanish, the multilingual versions of BERT, RoBERTa, and GPT-2. Our experiments suggest that naturalistic interventions lead to stable estimates of the causal effects of various linguistic properties. Moreover, our experiments demonstrate the importance of naturalistic causal probing when analyzing pre-trained models."
Czech name
—
Czech description
—
Classification
Type
J<sub>ost</sub> - Miscellaneous article in a specialist periodical
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
—
Others
Publication year
2023
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
"Transactions of the Association for Computational Linguistics"
ISSN
2307-387X
e-ISSN
—
Volume of the periodical
11
Issue of the periodical within the volume
2023
Country of publishing house
US - UNITED STATES
Number of pages
20
Pages from-to
384-403
UT code for WoS article
—
EID of the result in the Scopus database
—