Skip to main content

working memory

Working memory, expertise & retrieval structures

In a 1987 experiment (1), readers were presented with a text that included one or other of these sentences:

or

Both texts went on to say:

After reading the text, readers were asked if the word sweatshirt had appeared in the story. Now here is the fascinating and highly significant result: those who read that John had put on a sweatshirt responded “yes” more quickly than those who had read that he had taken off his sweatshirt.

Why is this so significant? Because it tells us something important about the reading process, at least in the minds of skilled readers. They construct mental models. If it was just a matter of the mechanical lower-order processing of letters and words, why would there be a difference in responses? Neither text was odd — John could as well have put on a sweatshirt before going out for a jog as taken it off — so there shouldn’t be a surprise effect. So what is it? Why is the word sweatshirt not as tightly / strongly linked in the second case as it is in the first? If they were purely textbase links (links generated by the textbase itself), the links should be equivalent. The difference in responses implies that the readers are making links with something outside the textbase, with a mental model.

Mental models, or as they are sometimes called in this context, situation models, are sometimes represented as lists of propositions, but in most cases it seems likely that they are actually analogue in nature. Thus the real world should be better represented by the situation model than by the text. Moreover, a spatial situation model will be similar in many ways to an image, with all the advantages that that entails.

All of this has relevance to two very important concepts: working memory and expertise.

Now, I’m always talking about working memory. This time I want to discuss not so much the limited attentional capacity that is what we chiefly mean by working memory, but another, more theoretical concept: the idea of long-term working memory.

Think about reading. To make sense of the text you need to remember what’s gone before — this is why working memory is so important for the reading process. But we know how limited working memory is; it can only hold a very small amount — is it really possible to hold all the information we need to make sense of what we’re reading? Shouldn’t there be constant delays as we access needed information from long-term memory? But there aren’t.

It’s suggested that the answer lies in the use of long-term working memory, a retrieval structure that keeps a network of linked propositions readily available.

Think about when you are studying / reading a difficult text in a subject you know well. Compare this to studying a difficult text in a subject you don’t know well. In the latter case, you may have to painfully backtrack, checking earlier statements, trying to remember what was said before, trying to relate what you are reading to things you already know. In the former case, you seem to have a vastly expanded amount of readily accessible relevant information, from the text itself and from your long-term memory.

The connection between long-term working memory and expertise is obvious. And expertise has already been conceptualised in terms of retrieval structures (see for example my article on expertise). In other words, you can increase your working memory in a particular domain by developing expertise, and the shortest route to developing expertise is to concentrate on building effective retrieval structures.

One of the areas where this is particularly crucial is that of reading scientific texts. Now we all know that scientific texts are much harder to process than, for example, stories. And there are several reasons for that. One is the issue of language: any science has its own technical vocabulary and you won't get far without knowing it. But another reason, far less obvious to the untutored, concerns the differences in structure — what may be termed differences of genre.

Now it might seem self-evident that stories are far simpler than science, than any non-fiction texts, and indeed a major distinction is usually made between narrative texts and expository texts, but it’s rather like the issue of faces and other objects. Are we specially good at faces because we're 'designed' to be (i.e., we have special 'expert' modules for processing faces)? Or is it simply that we have an awful lot of practice at it, because we are programmed to focus on human faces almost as soon as we are born?

In the same way, we are programmed for stories: right from infancy, we are told stories, we pay attention to stories, we enjoy stories. Stories have a particular structure (and within the broad structure, a set of sub-structures), and we have a lot of practice in that structure. Expository texts, on the other hand, don't get nearly the same level of practice, to the extent that many college students do not know how to handle them — and more importantly, don't even realize that that is what they're missing: a retrieval structure for the type of text they're studying.

References

Glenberg, A.M., Meyer, M. & Lindem, K. 1987. Mental models contribute to foregrounding during text comprehension. Journal of Memory and Language, 26, 69-83.

Effect of working memory capacity on new language learning

  • Vocabulary acquisition in children is significantly affected by the child's ability to repeat back words.
  • This limitation becomes less as the individual gains a large vocabulary, and thus develops a greater ability to make semantic (meaningful) associations.
  • When learning a new language, your ability to repeat back unfamiliar words is only a factor where you are unable to form a meaningful association to a familiar word.
  • In such cases, the keyword mnemonic can be especially useful to those with limited ability to repeat back words.

Research with children has demonstrated that the ability to learn new words is greatly affected by working memory span - specifically, by how much information they can hold in that part of working memory called "phonological short-term memory". The constraining effect of working memory capacity on the ability to learn new words appears to continue into adolescence.

But, as you grow in experience, building a vocabulary, this constraint becomes less important. Because working memory capacity is measured in "chunks" - and the amount of information contained in a chunk is extremely malleable. To a large extent, developing chunking strategies is what memory improvement is all about.

In terms of learning another language, there are essentially four possible classes of word:

  • words that are already familiar because they are the same in your native language (or another known language)
  • words that are already familiar because they involve words that you already know in that language (e.g., learning a related verb form, or learning a word made up of two words you already know, such as sweat-shirt)
  • words that resemble a known word with similar or related meaning (e.g., Russian garlo means throat, and the word garlo resembles the word gargle)
  • words that have no ready association to known words

It appears that in these first three cases, the size of your phonological short-term memory is of no significant relevance. It is only in the last case - where the word cannot utilize any meaningful associations - that your phonological short-term memory capacity becomes important.

Fairly obviously, as your knowledge of language (your own and others) grows, the more meaningful associations you will be able to make, and the fewer new words will fall into this last, difficult, category.

This suggests, of course, the usefulness of a mnemonic strategy (specifically, the keyword strategy) in the last, difficult case.

The importance of phonological short-term memory is also greater for productive learning (learning to produce a language, i.e., speak or write it) than in receptive learning (learning to read or understand a language). For productive learning, the pronounceability of the new words is very important. The more easily pronounced, the more easily learnt.

References
  • Nation, I.S.P. 2001. Learning vocabulary in another language. Cambridge: Cambridge University Press.
  • Ellis, N.C. & Beaton, A. 1993. Factors affecting foreign language vocabulary: imagery keyword mediators and phonological short-term memory. Quarterly Journal of Experimental Psychology, 46A, 533-58.
  • Papagno, C., Valentine, T. & Baddeley, A. 1991. Phonological short-term memory and foreign-language vocabulary learning. Journal of Memory and Language, 30, 331-47.

Short-Term Memory Problems

  • Short-term memory problems are, by and large, attention problems.
  • Attention involves both the ability to keep focused on the information you want to keep active, and the ability to not be distracted by competing and irrelevant stimuli.
  • You need to actively attend to keep information active, particularly as you get older.
  • Many of us over-estimate how much information we can keep active at one time.

Many people, particularly as they get older, have concerns about short-term memory problems: going to another room to do something and then forgetting why you’re there; deciding to do something, becoming distracted by another task, and then forgetting the original intention; uncertainty about whether you have just performed a routine task; forgetting things you’ve said or done seconds after having said or done them; thinking of something you want to say during a conversation, then forgetting what it was by the time it’s your turn to speak, and so on.

This is clearly an issue for many of us. Part of the reason, I believe, is simply that we expect too much from ourselves. For example, research has shown that even a very, very short delay between recalling an intention and being able to carry it out is sufficient to dramatically reduce the likelihood that you will remember to do the intended action — we are talking about a delay of only 10 seconds!

The problem is exacerbated by age (I’m not talking about advanced age — I’m afraid certain aspects of cognitive processing begin to decline as early as the 30s).

Part of the problem is also that we tend to believe that we don’t need to do anything to maintain a thought, particularly when it has “popped” into our minds easily. But current estimates are that unrehearsed information lingers in working memory for less than two seconds!

Some of these problems are dealt with in my article on action slips (these problems are not, strictly speaking, a failure of memory, but a failure in attention), and in my book on Remembering intentions.

But in this article I want to talk about another aspect: the relationship between working memory, and attention (and, as it happens, intelligence!).

In my article on working memory and intelligence I talk about the difference between crystallized and fluid intelligence — that fluid intelligence is probably a better measure of what we think of as “intelligence”, and that working memory capacity is often used synonymously with fluid intelligence. A new theory is that the relationship between working memory and fluid intelligence is due to the ability to control attention.

This theory emphasizes the role of attention in keeping information active (i.e. in working memory), and argues that working memory capacity is not, as usually thought, about the number of items or amount of information that can be held at one time. Instead, it reflects the extent to which a person can control attention, particularly in situations where there is competing information / demands.

I have to say that this makes an awful lot of sense to me. I can’t, in the space I have here, go into all the evidence for and against the theory, but here’s one situation which is interesting. The “cocktail party phenomenon” is a well-known method in psychology, whereby people are given two streams of audio, one for each ear, and instructed to listen only to one. At some point, the person’s name is spoken into the unattended stream, and about a third of people pick that up. In a recent take of that classic study, researchers compared the performance of people as a function of their working memory capacity. Only 20% of those with a high capacity heard their name in the unattended channel compared to 65% of low-capacity people. The point being that a critical aspect of good attentional control is the ability to block our irrelevant information.

This ability is one that we already know is worsened by increasing age.

The message from all this, I guess, is that:

  • short-term memory problems are, by and large, attention problems.
  • attention involves both the ability to keep focused on the information you want to keep active, and the ability to not be distracted by competing and irrelevant stimuli.
  • you need to actively attend to keep information active, particularly as you get older.
  • many of us over-estimate how much information we can keep active at one time.

And if you want strategies to help you keep more information active, I suggest you look at improving your ability to chunk, condense and label information. If you can reduce a chunk of information to a single label quickly, all you need to do is remember the label. (I explain all this at length in my book The Memory Key, but I’m afraid it needs far too much explanation to go into here).

Anyway, I hope this helps those of you (most of us!) with short-term memory problems.

This article originally appeared in the April 2005 newsletter.

Planning to Remember

References

Heitz, R.P., Unsworth, N. & Engle, R.W. 2004. Working memory capacity, attention control, and fluid intelligence. In O. Wilhelm & R,W. Engle (eds.) Handbook of Understanding and Measuring Intelligence. London: Sage Publications.

Working Memory and Intelligence

  • Intelligence tends nowadays to be separated into 2 components: fluid intelligence and crystallized intelligence.
  • Fluid intelligence refers to general reasoning and problem-solving functions, and is often described as executive function, or working memory capacity.
  • Crystallized intelligence refers to cognitive functions associated with knowledge.
  • Different IQ tests measure fluid intelligence and crystallized intelligence to varying extents, but the most common disproportionately measures crystallized intelligence.
  • Increasing evidence suggests that even fluid intelligence is significantly affected by environmental factors and emotions.

You may have heard of “g”. It’s the closest we’ve come to that elusive attribute known as “intelligence”, but it is in fact a psychometric construct, that is, we surmise its presence from the way in which scores on various cognitive tests positively correlate.

In other words, we don’t really know what it is (hence the fact it is called “g”, rather than something more intelligible), and in fact, it is wrong to think of it as a thing. What it is, is a manifestation of some property or properties of the brain — and we don’t know what these are.

Various properties have been suggested, of course. Speed of processing; synaptic plasticity; fluid cognition. These are all plausible, but experimental studies have failed to provide clear evidence for any of them. The closest has been fluid cognition, or fluid intelligence, which is paired with crystallized intelligence. These two terms point to a useful distinction.

Fluid intelligence refers to cognitive functions associated with general reasoning and problem-solving, and is often described as executive function, or working memory capacity.

Crystallized intelligence, on the other hand, refers to cognitive functions associated with previously acquired knowledge in long-term store.

There is of course some interplay between these functions, but for the most part they are experimentally separable.

There are a couple of points worth noting.

For a start, different IQ tests measure fluid intelligence and crystallized intelligence to varying extents – the Raven’s Progressive Matrices Test, for example, predominantly measures fluid intelligence, while the WAIS disproportionately measures crystallized intelligence. An analysis of the most widely used intelligence test batteries for children found that about 1/3 of the subtests measure crystallized intelligence, an additional ¼ measure knowledge and reading/writing skills, while only 7% directly measure fluid intelligence, with perhaps another 10% measuring skills that have a fluid intelligence component – and nearly all the fluid subtests were found in one particular test battery, the W-J-R.

The so-called Flynn effect – the rapid rise in IQ over the past century – is for the most part an increase in fluid intelligence, not crystallized intelligence. While it has been hypothesized that fluid intelligence paves the way for the development of crystallized intelligence, it should be noted that the distinction between fluid and crystallized intelligence is present from a very early age, and the two functions have quite different growth patterns over the life of an individual.

So, what we’re saying is that most IQ tests provide little measure of fluid intelligence, although fluid intelligence appears to reflect “g” more closely than any other attribute, and that although crystallized intelligence is assumed to reflect environment (e.g., education) far more than fluid intelligence, it is fluid intelligence that has been rising, not crystallized intelligence.

In fact, for this and other reasons, it seems that fluid intelligence is far more affected by environment than has been considered.

I’ll leave you to ponder on the implications of this. Let me make just one more point.

The brain areas known to be important for fluid cognition are part of an interconnected system associated with emotion and stress response, and it is hypothesized that functions heretofore considered distinct from emotional arousal, such as reasoning and planning, are in fact very much part of a system in which emotional response is involved.

We’re not saying here that emotions can disrupt your reasoning processes, we all know that. What is being suggested is more radical – that emotions are part and parcel of the reasoning process. Okay, I always knew this, but it’s nice to see science coming along and providing some evidence.

The point about the close interaction between emotional reactivity and fluid intelligence is that stress may have a significant effect on fluid intelligence.

And I’ll leave you to ponder the implications of that.

References

Miyake, A., Friedman, N.P., Rettinger, D.A., Shah, P., & Hegarty, M. 2001. How are Visuospatial Working Memory, Executive Functioning, and Spatial Abilities Related? A Latent-Variable Analysis. Journal of Experimental Psychology – General, 130(4).

Working memory

Working memory is one of the most important concepts in understanding and improving your memory.

Your working memory capacity is a critical factor in determining your ability to :

  • take good notes,
  • read efficiently,
  • understand complex issues,
  • reason.

Indeed it may be that it is your working memory capacity that best ‘measures’ your intelligence.

Short-term vs long-term memory

Working memory is a relatively recent term, a refinement of an older concept - that of short-term memory. Short-term memory was called thus to distinguish it from "long-term memory" - your memory store.

One important difference between the idea of short-term memory and working memory, is that short-term memory was conceived of as a thing. Different from long-term memory (variously analogized as a library, a filing system, a computer) chiefly in the duration of the records it held. But working memory, as its name suggests, is now conceived more as a process than a thing. A state of mind. A pattern of activation.

Working memory contains the information of which you are immediately aware.

To put information into our memory store, it must ... be worked on - i.e., be held in working memory. To get information out of the memory store - to “remember” something - it must again be in an active state - be in working memory. How can we know what we remember if we're not conscious of it?

However, you can only keep something "active" for a very short time without your conscious attention. It is this which so limits working memory capacity.

The magic number seven

Probably the most widely known fact about working memory is that it can only hold around seven chunks of information (between 5 and 9). However, this tells us little about the limits of working memory because the size of a chunk is indeterminate.

1 2 3 4 5 6 7 are seven different chunks - if you remember each digit separately (as you would, for example, if you were not familiar with the digits - as a young child isn't). But for those of us who are only too well-versed in our numbers, 1 through to 7 could be a single chunk.

Recent research suggests however, that it is not so much the number of chunks that is important. What may be important may be how long it takes you to say the words (information is usually held in working memory in the form of an acoustic - sound-based - code). It appears that you can only hold in working memory what you can say in 1.5 — 2 seconds. Slow speakers are therefore penalized.

Your working memory capacity

What we term "working memory" contains several functions, including the "central executive" which coordinates and manages the various tasks needed. The extent to which working memory is domain-specific (different "working memories", if you like, for different sensory and cognitive systems, such as language, spatial memory, number) is still very much debated. However, at a practical level, we may think of working memory as containing several different components, for which you have different "capacities". Thus, your capacity for numbers may well be quite different from your capacity for words, and both from your capacity for visual images.