An international team of researchers is using computer science to explore the knowledge foundation of word meaning in both child language development and the evolution of word meanings across languages.
Through a computational framework they developed, the researchers show how patterns of children’s language innovation can be used to predict patterns of language evolution, and vice versa.
The interdisciplinary work by University of Toronto computer science researcher Yang Xu and computational linguistics and cognitive science researchers from Universitat Pompeu Fabra and ICREA in Spain, was recently published in the journal Science.
In the paper, the team investigates what is known as word meaning extension, which is the creative use of known words to express novel meanings.
The research aimed to look at how this type of human lexical creativity observed in both children and adult language users can be understood in a unified framework, says Xu, a senior author on the paper and an associate professor in the department of computer science in the Faculty of Arts & Science and the cognitive science program at University College.
“A common strategy of human lexical creativity is to use words we know to express something new, so that we can save the effort of creating new words. Our paper offers a unified view of the various processes of word meaning extension observed at different timescales, across populations and within individuals.”
Word meaning extension is often observed in the historical change or evolution of language, Xu adds. For example, the word “mouse” in English originally meant a type of rodent, but now also refers to a computer device.
On the other hand, word meaning extension is also observed in children as early as two years of age. For example, children sometimes use the word “ball” to refer to “balloon,” presumably because they haven’t yet acquired the right word to describe the latter object, so they overextend a known word to express what they want to say.
“In this study, we ask whether processes of word meaning extension at two very different timescales, in language evolution, which takes place over hundreds or thousands of years, and in child’s language development, which typically occurs in the order of months or years, have something in common with each other,” Xu says. “Our hypothesis is that these processes are fundamentally the same.
“We find, indeed, there’s a unified foundation underlying these processes. There is a shared repertoire of knowledge types that underlies word meaning extension in both language evolution and language development.”
To test their hypothesis and figure out what is in common between products of language learning and language evolution, the team built a computational model that takes pairs of meanings or concepts as input, such as “ball” versus “balloon,” “door” versus “key” or “fire” versus “flame,” and makes a prediction about how likely these concepts can be co-expressed under the same word.
In building their model, the researchers constructed a knowledge base that helps identify similarity relations between concepts as they “believe it is the key that makes people relate meanings in word meaning extension,” Xu says.
The knowledge base consists of four primary knowledge types grounded in human experience: visual perception, associative knowledge, taxonomic knowledge and affective knowledge. Pairs of concepts score high if they are measured to be similar in one or some of these knowledge types.
“The pair of concepts like ‘ball’ and ‘balloon’ would score high due to their visual similarity, whereas ‘key’ and ‘door’ would score high because they are thematically related or often co-occur together in daily scenarios,” Xu explains. “On the contrary, for a pair of concepts such as ‘water’ and ‘pencil,’ they would have little similarity measured in any of the four knowledge types, so that pair would receive a low score. As a result, the model would predict they can’t, or they are unlikely to, extend to each other.”
Xu notes the team found all four knowledge types contributed to word meaning extension and a model that incorporates these types tends to better predict data than alternative models that rely on fewer or individual knowledge types.
“This finding tells us that word meaning extension relies on multifaceted and grounded knowledge based on people’s perceptual, affective and common-sense knowledge,” he says.
Built exclusively from children’s word meaning extension data, the model can successfully predict word meaning extension patterns from language evolution and can also make predictions in the reverse direction on children’s overextension when trained on language evolution data.
“This cross-predictive analysis suggests that there are shared knowledge types between children’s word meaning extension and the products of language evolution, despite the fact that they occur at very different timescales. These processes both rely on a common core knowledge foundation – together these findings help us understand word meaning extension in a unified way,” Xu says.
Xu stresses that existing research on child overextension has been typically discussed in the context of developmental psychology whereas word meaning extension in history is typically discussed in historical and computational linguistics, so this project aims to build a tighter connection between the two fields of research.
The researchers hope that additional computational modelling will shed light on other potential lines of inquiry, including the basic mechanisms at play in the historical evolution of word meanings and the emergence of word meanings in child development, as well as the origins of different semantic knowledge types and how they are acquired.