Baan, Joris scite author profile

Baan, Joris

5Publications

28Citation Statements Received

37Citation Statements Given

How they've been cited

How they cite others

Affiliations

Publications

Order By: Most citations

On the Realization of Compositionality in Neural Networks

Joris¹,

Leible²,

Nikolaus³

et al. 2019

View full text Add to dashboard Cite

We present a detailed comparison of two types of sequence to sequence models trained to conduct a compositional task. The models are architecturally identical at inference time, but differ in the way that they are trained: our baseline model is trained with a task-success signal only, while the other model receives additional supervision on its attention mechanism (Attentive Guidance), which has shown to be an effective method for encouraging more compositional solutions . We first confirm that the models with attentive guidance indeed infer more compositional solutions than the baseline, by training them on the lookup table task presented by Liška et al. (2019). We then do an in-depth analysis of the structural differences between the two model types, focusing in particular on the organisation of the parameter space and the hidden layer activations and find noticeable differences in both these aspects. Guided networks focus more on the components of the input rather than the sequence as a whole and develop small functional groups of neurons with specific purposes that use their gates more selectively. Results from parameter heat maps, component swapping and graph analysis also indicate that guided networks exhibit a more modular structure with a small number of specialized, strongly connected neurons.

show abstract

Do Transformer Attention Heads Provide Transparency in Abstractive Summarization?

Joris¹,

Hoeve²,

Wees³

et al. 2019

Preprint

View full text Add to dashboard Cite

Learning algorithms become more powerful, often at the cost of increased complexity. In response, the demand for algorithms to be transparent is growing. In NLP tasks, attention distributions learned by attention-based deep learning models are used to gain insights in the models' behavior. To which extent is this perspective valid for all NLP tasks? We investigate whether distributions calculated by different attention heads in a transformer architecture can be used to improve transparency in the task of abstractive summarization. To this end, we present both a qualitative and quantitative analysis to investigate the behavior of the attention heads. We show that some attention heads indeed specialize towards syntactically and semantically distinct input. We propose an approach to evaluate to which extent the Transformer model relies on specifically learned attention distributions. We also discuss what this implies for using attention distributions as a means of transparency.

show abstract

Facts-Ir

Olteanu¹,

Garcia-Gathright²,

Rijke³

et al. 2019

SIGIR Forum

View full text Add to dashboard Cite

The purpose of the SIGIR 2019 workshop on Fairness, Accountability, Confidentiality, Transparency, and Safety (FACTS-IR) was to explore challenges in responsible information retrieval system development and deployment. To this end, the workshop aimed to crowd-source from the larger SIGIR community and draft an actionable research agenda on five key dimensions of responsible information retrieval: fairness, accountability, confidentiality, transparency, and safety. Such an agenda can guide others in the community that are interested in pursuing FACTS-IR research, as well as inform potential funders about relevant research avenues. The workshop brought together a diverse set of researchers and practitioners interested in contributing to the development of a technical research agenda for responsible information retrieval.

show abstract

Understanding Multi-Head Attention in Abstractive Summarization

Joris¹,

Hoeve²,

Wees³

et al. 2019

Preprint

View full text Add to dashboard Cite

On the Realization of Compositionality in Neural Networks

Joris¹,

Leible²,

Nikolaus³

et al. 2019

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Baan, Joris

On the Realization of Compositionality in Neural Networks

Do Transformer Attention Heads Provide Transparency in Abstractive Summarization?

Facts-Ir

Understanding Multi-Head Attention in Abstractive Summarization

On the Realization of Compositionality in Neural Networks

Contact Info

Product

Resources

About