Translating and Evolving: Towards a Model of Language Change in DisCoCat

Tai-Danae Bradley
(Graduate Center, CUNY)
Martha Lewis
(ILLC, University of Amsterdam)
Jade Master
(Dept. Mathematics, UC Riverside)
Brad Theilman
(Gentner Lab, UC San Diego)

The categorical compositional distributional (DisCoCat) model of meaning developed by Coecke et al. (2010) has been successful in modeling various aspects of meaning. However, it fails to model the fact that language can change. We give an approach to DisCoCat that allows us to represent language models and translations between them, enabling us to describe translations from one language to another, or changes within the same language. We unify the product space representation given in (Coecke et al., 2010) and the functorial description in (Kartsaklis et al., 2013), in a way that allows us to view a language as a catalogue of meanings. We formalize the notion of a lexicon in DisCoCat, and define a dictionary of meanings between two lexicons. All this is done within the framework of monoidal categories. We give examples of how to apply our methods, and give a concrete suggestion for compositional translation in corpora.

In Martha Lewis, Bob Coecke, Jules Hedges, Dimitri Kartsaklis and Dan Marsden: Proceedings of the 2018 Workshop on Compositional Approaches in Physics, NLP, and Social Sciences (CAPNS 2018), Nice, France, 2-3rd September 2018, Electronic Proceedings in Theoretical Computer Science 283, pp. 50–61.
Published: 8th November 2018.

ArXived at: https://dx.doi.org/10.4204/EPTCS.283.4 bibtex PDF
References in reconstructed bibtex, XML and HTML format (approximated).
Comments and questions to: eptcs@eptcs.org
For website issues: webmaster@eptcs.org