In recent years there has been an increased focus on
the role of education and training, and on the effectiveness and efficiency of
various instructional design strategies. Some of the most important
breakthroughs in this regard have come from the discipline of Cognitive Science,
which deals with the mental processes of learning, memory and problem
solving.
Cognitive load theory (e.g. Sweller, 1988; 1994) is an
instructional theory generated by this field of research. It describes learning
structures in terms of an information processing system involving long term
memory, which effectively stores all of our knowledge and skills on a
more-or-less permanent basis and working memory, which performs the intellectual
tasks associated with consciousness. Information may only be stored in long term
memory after first being attended to, and processed by, working memory. Working
memory, however, is extremely limited in both capacity and duration. These
limitations will, under some conditions, impede learning.
The fundamental
tenet of cognitive load theory is that the quality of instructional design will
be raised if greater consideration is given to the role and limitations, of
working memory. Since its conception in the early 1980's, cognitive load theory
has been used to develop several instructional strategies which have been
demonstrated empirically to be superior to those used
conventionally.
This paper outlines some of the basic principles of
cognitive load theory. Examples of the instructional design strategies generated
by cognitive load theory are also provided.
2: Memory
2.1 Remembering information
Some people believe that we remember
information by 'capturing' it on something like a video tape in our minds. This
is not the case. What we see and remember depends more on what we already know,
than on what is actually presented.
Look at each of the following, and
note what you see.
In the first example most
people read 'THE CAT', even though the centre symbol in each word is the same.
The context of reading provides information which we use to help interpret the
symbols.
In the second example most people will read each symbol as an
example of the letter "a", even though no two symbols are identical. We can read
an infinite range of symbols as the letter "a", even most peoples' hand writing,
although we have never seen their handwriting before. We are able to do
so because of our knowledge of what constitutes the letter
"a".
Similarly, we are also able to recognise literally millions of
different trees, as trees, even though no two are identical.
These
examples demonstrate that we cannot help but to impose meaning on things that we
sense. Humans are able to behave and think in 'intelligent' ways because of
their ability to quickly identify meaning in presented stimuli.
Our
knowledge and skills in activities as diverse as reading, driving, mathematics
and gardening all derive from the knowledge base which we hold more-or-less
permanently in long-term memory.
2.2 Chunking information
When presented a "large" set of elements
to remember, it is often helpful to combine the elements to form a smaller
number of groups. Each of the groups is referred to as a "chunk" of
information.
For example, it is common practice to combine the digits of
a phone number into two or three chunks of several digits each, rather than
listing all digits in one long sequence. The phone number 3476 - 2980 may be
easier to remember than the sequence 3 4 7 6 2 9 8 0.
Chunking does not
need to be based upon any underlying meaning or logic that can be identified
within the elements of the to-be-learned information. However, if an underlying
meaning or logic can be identified and is used to define the chunks,
then remembering is greatly enhanced.
For example, remembering a shopping
list where elements are chunked into like groups, such as:
is much easier to remember
than a list of identical elements which are chunked into groups without any
underlying structure, such as:
Look at each of the following
statements in turn for just a few seconds, and try to memorise the sequence of
letters and spaces.
The first statement is
difficult to memorise. The series of letters and spaces appears to be random. If
we are unable to identify any form of pattern or meaning then we are reduced to
a strategy of memorising individual letters in turn. If, however, we are able to
identify the "scrambled" meaning, then our strategy for remembering becomes one
of trying to remember the location of the spaces.
The second statement is
easy to memorise because the spaces are located in a way that promotes meaning.
Consequently we need only memorise a few ideas (All fish, enjoy, clean
water).
When what we already know enables us to identify or impose
meaning on a new piece of information because it connects with
information held in long-term memory, then it is relatively easy for us to
remember it because we can "build it into" our existing knowledge base in a way
that makes sense for us. The new information becomes an integral part of our
overall knowledge, held in long-term memory.
2.3 The modal model of memory
It is now widely accepted that we
have, and use, more than one type of memory.
A modal model of memory
distinguishes between three distinct memory types (modes). These are sensory
memory, working memory and long term memory.
Each mode has its own
characteristics and limitations.
These three modes are integrated to
define an information processing model of human cognitive
architecture.
2.4 Sensory memory
Sensory memory deals with incoming stimuli from
our senses. These are sights, sounds, smells, tastes and touches. A separate
partition of sensory memory exists for each of the
senses.
Sensory memories extinguish extremely quickly.
(About half a second for visual information, 3 seconds for auditory
information). In that time, we must identify, classify and assign meaning to the
new information or it will be gone forever.
While looking at the picture
below, quickly shut your eyes, and keep them shut for a few seconds. Repeat this
several times.
As
soon as you shut your eyes you may have noticed an image of the picture
remaining for a split second "somewhere in your mind". This demonstrates the
operation of the partition of your sensory memory that deals with visual
perceptions. This is not restricted to blinking at pictures. Look at anything
around you and it will still work.
2.5 Long term memory
Long term memory refers to the immense body of
knowledge and skills that we hold in a more-or-less permanently accessible
form.
Our name, date of birth, the letters of the alphabet, how to read,
how to write, how to drive, swim, play chess, catch a ball and everything else
that we "know" is all held in our long term memory awaiting
activation.
Activation will occur as a direct result of our working
memory querying long term memory for specific factual information (through our
consciousness). Once a query has been made activation (and the 'answer') is
effectively instantaneous.
Knowledge and skills that are activated with
extremely high regularity, such as walking and talking, may be activated
'automatically' without the need for high levels of conscious attention, even
though the task itself may be a complex one. (Automation is discussed further in
Section 3.3.)
Consider each of the following questions.
Question 1: What is your
name?
You will be able to answer this quickly. It's no surprise since it
is referred to frequently and consists of only a few words. Note how quickly you
can provide the answer.
Question 2: What are the letters of the
alphabet?
Again, you will be able to answer this quickly but this is a
more interesting question than the first. Here there are 26 items in the answer
and virtually everyone presents the 26 items in the same order. Our long term
memory holds the letters of the alphabet in alphabetical sequence. If you try to
say the letters of the alphabet in a random order, then you will find it an
extremely difficult, probably impossible task.
Question 3: Who won the
lottery in 1992 at Wattle St., Sydney, Australia?
Most people will
quickly realise that they do not know the answer to this question. They
recognise almost immediately that this is information that is not currently held
in their long term memory. Generally, people "know that they don't
know".
2.6 Working memory
Working memory is the part of our mind that
provides our consciousness. It is the vehicle which enables us to think (both
logically and creatively), to solve problems and to be
expressive.
Working memory is intimately related to where and how we
direct our attention to "think about something", or to process
information.
The biggest limitation of working memory is its capacity to
deal with no more than about seven elements of information simultaneously
(Miller, 1956).
Working memory capacity may be expanded slightly by
mixing the senses used to present information. That is, it is easier to attend
to a body of information when some of the information is presented visually and
the remainder of the information is presented auditorily than it is when all of
the information is presented through a single sense (either all visually or all
auditorily).
If the capacity of working memory is exceeded while
processing a body of information then some, if not all, of that information will
be lost.
Consider answering both of the following questions without using
pencil and paper.
For most people
Question 1 is quick and easy to solve as an example of mental
arithmetic.
In many ways Question 2 is nothing more than a 'larger'
version of Question 1, yet it is almost impossible to solve mentally.
The
role of long term memory is effectively the same for these two questions (to
recall the rules of addition).
The difference is that in Question 2 our
working memory capacity is exceeded. It cannot cope with the large number of
elements (in this case the numerals) that need to be attended to simultaneously
in order to solve this problem.
The use of pen and paper aids solution to
Question 2 because it effectively relieves the burden placed upon working memory
by giving us a means of recording elements in a 'permanent' form once we have
finished processing them.
3: Learning
3.1 Definition of learning
Learning may be defined as the encoding
(storage) of knowledge and/or skills into long term memory in such a way that
the knowledge and skills may be recalled and applied at a later time on
demand.
Humans have a great capacity for learning and tend to spend their
lives doing so. They learn not only how to walk upright, but also how to talk,
read and write. Many people today learn how to drive a car, operate a microwave
oven, and use a computer. Some even learn how to perform a heart transplant
operation. For all of these tasks (and just about every other task you care to
mention) the role, capacity and qualities of sensory memory and working memory
remain effectively unchanged. The driving force behind all skilled performance
is the knowledge base that has been acquired within long term memory.
The
capacity of our long term memory to acquire knowledge appears to be unlimited.
No-one ever "runs out of space", although with age there may be an overall
deterioration in the performance of our memory system.
It should also be
noted that virtually everyone can learn how to drive a car, operate a
microwave oven, use a computer or even perform a heart transplant operation,
provided that they are given sufficient time and training to enable them to
acquire the necessary knowledge and skills.
The next diagram
presents part of an information network for cars for you to complete, or at
least think about. There are no right or wrong answers.
Spend a few
minutes writing down in point form some information about cars. Some ideas have
been included for you to work from but you are free to add anything you like.
For example, details about their use, cost, construction, road rules, impact on
the environment, history of development, principles of combustion engines, how
to change gears, how to replace spark plugs.....and so on.
Work quickly,
writing down ideas as soon as they come to you. If you spend more than a few
seconds "stuck", then begin another branch.
Everyone living in modern
society holds an enormous amount of knowledge regarding cars, their use, road
rules, and so on. This knowledge base is held in a well structured information
network which is itself connected to other networks. Networks such as those for
'transport' or 'modern society' are higher order concepts, while networks for
'seat belt', 'spark plugs' and 'accelerator' are lower order concepts. Knowledge
about procedures is also held (for example, how to park and how to change
gears).
These hierarchical information networks are referred to as
"schemas". Schemas build in detail and complexity as more extensive knowledge is
acquired in a content area. The network in the diagram above is part of your
schema for cars held in your long-term
memory.
Individual differences exist in the nature and
details of schemas. Someone who is employed as a mechanic and spends their
pastime rebuilding vintage cars will have more detailed and complex schemas for
cars than most people.
Schemas that are well learnt may be recalled and
applied with relative ease. For example, someone learning to drive a manual car
needs to concentrate intently on the knowledge and skills required to coordinate
the movements of the clutch, gear stick and accelerator, in order to change
gears smoothly. After several years of driving, however, most people are able to
change gears "automatically". As automation develops, there is a reduction in
the need for concentration.
3.2 Process of learning
The previous section (Section 3.1) argued
that when we say that "something has been learnt", we mean that is has been
successfully encoded into long term memory and can later be recalled on
demand.
The next question to be considered is 'how does information
become encoded into long term memory?' While the factors which contribute to
encoding may vary from one situation to another, there is one factor that is
always present. To be encoded, information must first be attended to, and
processed by, working memory . If for any reason, working memory is unable
to attend to a body of to-be-learnt information, then learning will be
ineffective.
This has important implications for instructional design
because the limitations of working memory may impede the learning process. This
forms the basis of cognitive load theory.
Try to learn the following
rhyme without paying attention to it.
..........Twinkle twinkle little
star, how I wonder what you are.
In all likelihood you had not proceeded
past the word 'star' before you became aware that you already knew this rhyme.
Indeed, you could probably add a few more lines of the rhyme without
difficulty.
You were instructed to learn this rhyme without paying
attention to it . The fact that you "recognised" the rhyme as one that you
already know, shows, however, that you did attend to the information. Perhaps
you feel that this is due to the fact that a well known rhyme was used. Try the
next one.
Try to learn the following rhyme without paying attention to
it:
..........Emus and elephants into the stew, rub turns your until it
tummy blue.
If you were aware of any grammatical problems with this
statement, then you have again been paying attention to it. Once again, you
could not help yourself.
Working memory, as the embodiment of our
consciousness, cannot be "turned off" or "by passed" while we are
conscious.
3.3 What a novice needs to learn to become an expert
For any given
cognitive domain (algebra, crosswords, astrophysics, chess, electronics) we
think of novices in that area as not knowing much and for their performance to
be slow and error prone. In contrast we view experts as knowing almost
everything and assume their performance to be quick and error
free.
Contrary to popular belief, expertise does not appear to be due to
anything as robust as "intelligence". Nor does it appear that experts are more
"thoughtful" than novices.
The only two distinguishing features of
expertise are:
1 . the expansive schemas (information
networks) that experts hold, and
2 . the high level of
automation (ability to perform tasks without concentrating) that experts
exhibit.
Schemas and automation appear to explain
all other expert/ novice differences.
Experts, because of their
expansive set of schemas, have effectively seen almost every possible situation
in the content domain before. Moreover, they have learnt what response is
required for each situation and can carry out the required responses
automatically, without the need for high levels of concentration. Experts are
effectively just going through a set of routine exercises. It is no surprise
then that experts are so fast and accurate in their
performances.
Novices, on the other hand, have relatively few schemas.
They have trouble recognising anything but the most basic and common situations
as ones that they have encountered previously. Novices are presented with a
"problem" almost every time they venture into the content domain (problem being
defined as not knowing what to do or how to do it). Novices must "solve" almost
every situation presented to them. To make matters worse, even when they realise
what response is required, they may have difficulties in performing the
response. They need to concentrate intently if they are to avoid making
errors.
Consider the task of reading this page of printed material.
Presumably you may do so with little effort. While you need to concentrate on
the arguments being presented, it is likely that you do not need to
concentrate on the actual task of reading, that is, on the interpretation of all
these squiggles which represent letters of the alphabet, which are sequenced to
form words, which are sequenced to form sentences, and so on.
The task of
reading is incredibly complex yet we use written documents as an easy and
efficient way to communicate ideas.
All this changes, of course, if the
reader is young (say five years old).
And what chance would a typical
three year old have of reading even one line of this page? None.
Many
three year olds can recite the letters of the alphabet. They can also identify
many, if not all, of the letters in written form. However, lower case letters
may present some difficulties, and running writing may be virtually impossible
for them to handle. If a three year old can spell his or her name (or simple
words like 'cat' or 'dog'), then the adults around the child express praise and
encouragement.
By the age of five a child is likely to have developed
more refined schemas for letter recognition, and perhaps even for recognition of
some words (their name for example). However, there is a general absence of
automation in reading. Reading is slow, error prone, and needs high levels of
concentration (mental effort). It is likely that in "reading" the child will
sometimes sound out the letters of each word in a sentence, but not actually
comprehend the sentence as a whole. This is because their attention needs to be
fully focussed on each word in isolation.
Contrast this to your reading
skills. You no longer need to attend to individual letters or individual words.
It is likely that you can process the text as quickly, if not faster, than you
can say the material aloud. The only times that you need to slow your reading
speed will be when reading becomes "difficult".
One source of difficulty
lies in physical factors such as tiredness (low levels of attention), loud music
(distractions), tiny text or poor lighting (inability to
discriminate).
The more interesting difficulties, however, arise when
something presented on paper fails to fit into your schemas and/or level of
automation. Uncommon or technical words such as 'einstellung' or 'xanthoma', or
misspelt words such es thiis werrd may cause your attention to be directed to
the individual word, perhaps even the individual letters.
The irony about
tasks such as walking, talking and reading is that they are among the most
difficult that humans ever master, yet we are able to perform each of these with
extremely low levels of mental effort. Our schemas in these areas have become so
complete, and our level of automation so high, that we now find each of these
tasks to be almost trivially easy.
A well known proverb states that
"familiarity breeds contempt". In the context of education and training this
should perhaps be modified to read "familiarity breeds
expertise".
4: Cognitive Load Theory
4.1 Definition of cognitive load
Cognitive load refers to the total
amount of mental activity imposed on working memory at an instance in
time.
The major factor that contributes to cognitive load is the number
of elements that need to be attended to.
Look at each of the following
statements in turn for just a few seconds, and try to memorise the sequence of
digits. Note that you do not need to remember all statements at once. Give all
of your attention to each statement in turn.
For this activity we may use
the number of digits (the elements) to be remembered as a simple measure of
cognitive load. Consequently:
Note that the measure used
for cognitive load does not equate mathematically to task difficulty. That is,
even though statement 2 has twice the number of digits as statement 1, it is
almost as easy to remember.
In contrast, statement 4 has twice the number
of digits as statement 3, yet seems more than twice as difficult to remember.
While statement 3 can be remembered with effort, statement 4 is impossible for
most people to remember without some form of practice or memory aid.
4.2 Reasons why some material is difficult to learn
The previous
activity (Activity 4.1) used a digit span task to demonstrate that human working
memory has a threshold of somewhere between 4 and 10
elements.
For the previous activity this means that:
1.....when
the total number of digits to be remembered is four or less then the task is
trivially easy for most people.
2..... when the total number of digits to
be remembered is between five and nine then the task is achievable for most
people if they exert 'some' mental effort.
3.....when the total number of
digits to be remembered is ten or more then the task is difficult for most
people.
In many ways, however, this task is artificial. People are rarely
required to memorise sequences of random digits. After all, even telephone
numbers and post codes may have an underlying logic.
Most of the
information that we are required to learn in our lifetime is far more complex
than a simple sequence of objects (whether they be digits in a telephone number,
or items on a shopping list). Content areas such as mathematical calculus,
biochemistry and computer programming are considered to be "difficult" to
master. One of the reasons for this is undoubtedly the sheer volume of
information that must be acquired (and built into schemas) before an expert
knowledge base is held in the area. But there is another critically important
quality that is evident in these content areas: that of 'high element
interactivity'.
Element interactivity is defined as the degree to which
the elements of some to-be-learned information can, or cannot, be understood in
isolation. While the nature of element interactivity is difficult (and often
subtle) to comprehend, a simple example may assist in describing this
concept.
Example 4.2 - Element
Interactivity
Consider the task of learning a
foreign language. Most people can quickly learn some simple, everyday words, but
will have difficulty in generating grammatically correct sentences, even when
all of the words used in the sentence are known.
Vocabulary is an example
of low element interactive material. Although there may be literally thousands
of words to be learnt, most words may be learnt in isolation to all of the other
words.
To build sentences that are grammatically correct, however, one
must attend to all of the words within the sentence at once while also
considering syntax, tense, verb endings and so on. Grammar is an example of high
element interactive material because to learn it, many elements must be
considered simultaneously.
Determine if either of the following
statements could be true.
.....1. My fathers' brothers' grandfather is my
grandfathers' brothers' son.
.....2. My fathers' brothers' grandfather is
my grandfathers' brothers' father.
Although each of these statements
requires only a few elements (people) to be considered, the activity is
extremely difficult because there is a need to also attend to the relationships
between the elements. This is an example of "complex" information where
elements interact with each other. As a consequence of the high element
interactivity, the cognitive load induced exceeds the resources of working
memory.
The cognitive load associated with this material can be greatly
reduced if the information is presented pictorially. Elements which interact
with each other often have the potential to be presented in pictorial form,
where the picture itself holds (and conveys) some of the information, reducing
the need for it to be held in working memory.
The partial family tree
presented below shows that statement 2 is logically possible.
4.3 Elements held in working memory are schemas
This paper has
argued that the limited resources of working memory mean that only a few
elements of information may be attended to at any given time.
The
previous section (Section 4.2) demonstrated that to-be-learned information which
has a high level of element interactivity imposes a cognitive load over and
above that imposed by the elements themselves, due to the need to attend also to
the relationships between elements. Consequently, high element interactive
material exacerbates the difficulties which result from working memory
limitations.
All of this begs the question "what is an element?" The
short answer is "that it depends". It depends on the schemas held by the person
who is required to attend to some body of to-be-learned information because
generally, elements are schemas. What is a single element consisting of a single
schema for an expert may be several elements consisting of sub-schemas for a
novice.
Consider again the contrast between statements of the type
represented by 1. and 2. below.
The first statement presents
itself as a random sequence of letters and spaces. It is without meaning and
consequently each letter and each space is a separate element which working
memory needs to attend to.
In contrast, the second statement contains
obvious meaning. Each cluster of letters forms a meaningful word, and the words
combine to form a meaningful sentence. Here the number of elements for an expert
reader, who knows a little about the behaviour of dogs and cats, may be as few
as one. After all, it is a grammatically correct sentence, and it is well known
that dogs do chase cats.
Schemas not only provide the ability to combine
'many elements' into a single element. They also have the capacity to
incorporate the interactions between elements. This means that information which
consists of several elements, all of which interact with one another, may be
embodied into a single schema.
For example, a professional fibre glasser
holds a schema for 'mixing resin' which takes into account not only the ideal
ratio of resin and catalyst that need to be mixed, but also, automatically,
considers interacting factors such as the air temperature, air moisture, and
purpose of the mixture. It is likely that a novice in this area would not even
know that if environmental factors such as temperature and moisture are not
taken into account, then a defective mixture may result.
4.4 Intrinsic and extraneous cognitive load
Intrinsic cognitive
load Intrinsic cognitive load is due solely to the intrinsic nature
(difficulty) of some to-be-learned content. Intrinsic cognitive load cannot be
modified by instructional design. For example, content which is high in element
interactivity remains high in element interactivity regardless of how it is
presented.
Extraneous cognitive load Extraneous
cognitive load is due to the instructional materials used to present information
to students. Teaching materials addressing a concept such as continental drift,
for example, will be more effective if it makes an appropriate use of graphics
rather than a text only presentation.
By changing the instructional
materials presented to students, the level of extraneous cognitive load may be
modified. This may facilitate learning.
Demonstration
4.4
1. When intrinsic cognitive load is
low (simple content) sufficient mental resources may remain to enable a learner
to learn from "any" type of instructional material, even that which imposes a
high level of extraneous cognitive load.
2. If the
intrinsic cognitive load is high (difficult content) and the extraneous
cognitive load is also high, then total cognitive load will exceed mental
resources and learning may fail to occur.
3. Modifying
the instructional materials to engineer a lower level of extraneous cognitive
load will facilitate learning if the resulting total cognitive load falls to a
level that is within the bounds of mental resources.
4.5 Principles of cognitive load theory
Cognitive load theory
focuses on the role of working memory in the learning process.
The
fundamental principles of cognitive load theory rest upon the following
argument.
1. Working memory is extremely
limited.
2. Long term memory is essentially
unlimited.
3. The process of learning requires working
memory to be actively engaged in the comprehension (and processing) of
instructional material to encode to-be-learned information into long term
memory.
4. If the resources of working memory are
exceeded then learning will be ineffective.
4.6 Applying cognitive load theory to instructional design
The
fundamental principles of applying cognitive load theory to instructional design
rest upon the following argument.
1. Excessively high
levels of cognitive load may result directly from the instructional materials
presented to students.
2. Redesigning instructional
materials to reduce the levels of extraneous cognitive load may enhance
learning.
3. Content areas that are most likely to
demonstrate beneficial results from improved instructional design are those that
deal with "complex" information where the elements of to-be-learned information
interact with one another (therefore imposing a high level of intrinsic
cognitive load).
Summary 4.6 - Applying cognitive
load theory to instructional design
Cognitive load
theory states that learning will be maximised by ensuring that as much of a
learners' working memory as possible is free to attend solely to encoding
to-be-learned information.
5: Effects Generated by Cognitive Load Theory
5.1 The different effects
Cognitive load theory has been used
successfully to develop several instructional techniques which facilitate
learning.
These include:
.....the goal free effect
.....the
worked example and problem completion effect
.....the split attention
effect
.....the redundancy effect
.....the modality
effect.
5.2 Benefits for learning
Each of the effects listed above in
Section 5.1 has been shown empirically to provide strong benefits to learners
when used appropriately. In each case the benefits include all of the
following:
.....reduced training time
.....enhanced performance*
on test problems (similar to those seen during training)
.....enhanced
performance* on transfer problems (those which are dissimilar to problems seen
during training but requiring the same rules for
solution).
Note : Enhanced performance* means both
shorter times to complete problems, and fewer errors.
The fact that
students spend less time learning, yet return superior performances when tested,
is a powerful finding that has considerable implications for education and
training.
Of special importance is the increased performance on transfer
problems. This shows that the learning which results from each of these effects
is at a level of true understanding that enables students to solve a wider range
of problems than those students taught using "conventional" instructional
materials.
5.3 Generating a measurable effect
Each effect has been developed
by the argument that engineering a cognitive load which falls within the
limitations of working memory facilitates learning.
The specifics which
determine when and how each of the effects operate for a given set of learners
on a given set of to-be-learned content, may be found in the original research
papers.
Of particular importance to the successful generation of these
effects is the expertise of the learner relative to the to-be-learned
information.
When learners hold high levels of expertise in the content
area then the elements which their working memory may attend to are each, in
and of themselves , large complex knowledge networks (high level schemas).
Consequently, their working memory need only consider a few elements in order to
hold all of the to-be-learned information in mind. Ample cognitive resources
thus remain for the process of learning. Instructional design manipulations for
this group of learners will be ineffective because their working memory capacity
is not being exceeded.
In contrast, when learners hold a low level of
expertise in the content area then only simple elements (low level schemas) have
been acquired (perhaps almost none in the case of true novices). Consequently,
working memory needs to attend to many elements in order to hold all of the
to-be-learned information in mind. Here, cognitive resources are stretched
beyond their capacity and insufficient cognitive resources remain for the
process of learning. Instructional design manipulations for this group of
learners will be effective if the reduction of cognitive load results in a level
that is within the capacity of working memory.
The dynamics of generating
any of the effects thus depends on obtaining a group of students whose relative
level of expertise to content difficulty is ripe for instructional design
manipulations. (See Cooper & Sweller, 1987, for details on how student
ability impacts upon the generation of a measurable effect.)
5.4 Conventional problems
Before presenting information detailing
the effects generated by cognitive load theory a brief overview describing
conventional problems, and the process by which novices solve conventional
problems, will be presented. This is because both the goal free effect and the
worked example effect are based upon the finding that the method employed by
novices to solve conventional problems (means-ends analysis, which is discussed
in the next section) imposes a relatively high level of cognitive load (Sweller,
1988).
Conventional problems are those which present students with a set
of given data (the known information) and a well defined goal (specifies what
needs to be found). Moreover, the answer may be objectively determined to be
correct or incorrect by applying rules (such as formulae) in an algorithm based
sequence.
Conventional problems are typically found in all topic areas of
mathematics and science, and in all subject areas that make use of mathematical
principles (for example engineering, accountancy and computer
programming).
Example 5.4.a
If y = x +
6, x = z + 3, and z = 6, find the value of y
.
Example 5.4.b
A particle starts from
rest and is accelerated at 12 m/s2 for 4.5 seconds. What is its
terminal velocity?
Example 5.4.c
For the right triangle shown, determine the length of the
hypotenuse .
5.5 Using means-ends analysis to solve problems
Means-ends analysis
is a problem solving heuristic (strategy) which is widely used to solve
conventional problems by people who are not highly familiar with the specific
problem type (Larkin, McDermott, Simon & Simon, 1980; Simon & Simon,
1978).
Means-ends analysis is based upon the principle of reducing
differences between the current problem state (which begins at the problem
givens) and the goal state. In practice, this procedure often results in a
problem solver working backwards from the goal to the problem givens, before
then working forwards from the givens to the goal.
While this strategy is
very effective in obtaining answers (assigning a value to a goal state) it has a
necessary consequence of inducing very high levels of cognitive load. This is
because the nature of the strategy requires attention to be directed
simultaneously to the current state, the goal state, differences between them,
procedures to reduce those differences and any possible subgoals that may lead
to solution. Full details of how means-ends analysis operates, and its
consequences for working memory, are presented in Sweller
(1988).
Example 5.5
If y = x + 6, x = z
+ 3, and z = 6, find the value of y .
A novice problem
solver (using means-ends analysis) would first focus on the goal state (find the
value of y).
Rereading the question s/he would note that the value of "y"
is provided by the equation "y = x + 6", so finding the value of "x" becomes a
subgoal.
Similarly, a further rereading of the question would show that
the value of "x" is provided by the equation "x = z + 3", so finding the value
of "z" becomes a subgoal also.
Rereading the question yet again s/he
would identify that the value of z is provided as given information (z = 6).
This value may now be substituted into the equation "x = z + 3" to obtain the
value "x = 9".
A true novice at this point may forget why the value of
"x" was required. After all, their working memory has been heavily taxed
attending to many elements of the problem.
Nevertheless, he or she will
eventually identify that the value of "x" was calculated so that it could be
substituted into the equation "y = x + 6". Doing so yields the value of "y =
15", which is the goal state.
As can be seen by this example (and this is
just a simple problem), means-ends analysis is very cumbersome, and requires
large amounts of cognitive resources for the strategy to be implemented
successfully. Problem solvers using means-ends analysis may successfully solve
"many" problems of an identical type, yet effectively learn nothing from the
activity (Sweller & Levine, 1982)
5.6 The goal free effect
Means-ends analysis operates on the
principle of reducing differences between the goal state and problem givens.
Consequently, means-ends analysis may be rendered inoperable by redefining the
problem goal so that no obvious goal exists (for example, "find what you can").
This is the principle behind the generation of goal free problems.
If
problems are "goal free" then a problem solver has little option but to focus on
the information provided (the given data) and to use it where ever possible.
This automatically induces a forwards working solution path similar to that
generated by expert problem solvers. Such forward working solutions impose very
low levels of cognitive load and facilitate learning (Owen and Sweller, 1985;
Ayres, 1993)
Example 5.6.a
If y = x +
6, x = z + 3, and z = 6, find what you can.
Attention would
focus on "z = 6" as this is the only variable specified as a numerical
value.
Rereading the question it would be identified that the value of "z
= 6" can be substituted into the equation "x = z + 3". Doing so provides "x =
9".
Rereading the question it would now be identified that the value of
"x =9" can be substituted into the equation "y = x + 6". Doing so provides "y =
15".
Nothing else remains to be found.
It can be seen that this
solution path is far simpler than that generated by means-ends analysis in
Example 5.5.
Example 5.6.b
A particle
starts from rest and is accelerated at 12 m/s2 for 4.5 seconds. Find
what you can.
Example 5.6.c
For
the right triangle shown, Find what you can .
5.7 The worked example and problem completion effect
Historically
subjects such as mathematics and science have been taught using the following
general technique:
Step 1 :Introduce a new topic.
Present background knowledge, principles and rules.
Step
2 :Demonstrate, using a few worked examples, how to apply the
principles and rules.
Step 3 :Have the students
"practice" how to apply the principles and rules by solving many ,
conventional, goal specific problems.
Section 5.5 described how the use
of means-ends analysis to solve conventional problems imposes high levels of
cognitive load, and thus impedes learning. It is therefore likely that the
emphasis given to "practice problems" described above will not result in
efficient learning.
While the use of goal free problems
provides an effective alternative to conventional problem solving its
application is limited to situations where the problem space is "small". As the
size of the problem space becomes "large" the increasing number of alternatives
faced at each step in a solution render the technique impractical for teaching
purposes.
An alternative technique may be found in reconsidering the
nature and purpose of worked examples. Worked examples are presented to students
to show them directly, step by step, the procedures required to solve different
problem types. Worked examples contain explicit information that equates to
schemas and automation.
That is, worked examples promote the acquisition
of knowledge and skills required to: .....identify problems as being of a
particular type, .....recall the steps (in sequence) needed to solve each
particular type, and .....perform each step without error.
Studying
worked examples imposes a low level of cognitive load because attention need
only be given to two problem states at a time and the transformation (rule
operator) that links them.
A successful method for placing emphasis on
worked examples is to present them with conventional problems in an alternating
sequence (example type A, problem type A, example type B, problem type B and so
on). Students are informed of the paired nature of the material and instructed
to study each example closely because they will not be allowed to look back at
it once they begin the associated problem.
Students thus focus their
attention on the problem type and the associated steps to solution (the
schemas). In solving the associated conventional problem they are testing
themselves to determine if they have learnt the procedure. This may be a more
genuine form of "practice problem solving".
Example 5.7
- A worked example format for teaching algebra.
Following
the numbered sequence, first study the worked example, then cover it, and
attempt to solve the associated problem.
For each of the following, solve
for 'a'.
The
problem completion procedure has a similar rationale and effect
to the use of worked examples (see Paas, 1992; Van Merrienboer and Krammer,
1987). Instead of providing an entire worked example followed by a problem,
students are just provided with partially completed worked examples. For
instance, in example 1 above, they may be provided with the first two lines and
required to complete the third line themselves.
Discussion
5.7
The specific details regarding the number of
example-problem pairings or completion problems to present, the range of
examples to present, the rate at which the orbit of problem type is increased
and so on, depends on the complexity of the material relative to the expertise
of the learners. The greater the relative expertise, the quicker the pace of
increase in problem types.
Worked example techniques have been
demonstrated to be highly effective at facilitating learning across a wide range
of mathematically based content (see Cooper and Sweller, 1987; Zhu and
Simon,1987; Pass and Van Merrienboer, 1994).
5.8 The split attention effect
Many instructional materials require
both a pictorial component and a textual component of information.
Conventionally a graphic has been presented with the associated text above,
below, or at the side. Such instructional presentations introduce a split
attention effect where the student needs to attend to both the graphic and the
text. Neither the graphic, nor the text, alone, provide sufficient information
to enable understanding. The instructional material can only be understood after
the student has mentally integrated the multiple sources of information. The
portion of working memory that needs to be used in integrating the graphic and
text is unavailable for the learning process. Consequently learning is
ineffective.
Consider this conventional mathematics based example, taken
from Sweller, Chandler, Tierney & Cooper (1990).
Example
5.8.a - Split instructional format for teaching co-ordinate
geometry.
The presentation may be
restructured to improve learning by physically integrating the solution into
the graphic to produce a single source of instructional information. This
eliminates the need to split attention between the graphic and the text. The
association between the text and the graphic is clearly
indicated.
Example 5.8.b - Integrated instructional
format for co-ordinate geometry.
The split attention
effect is not limited to worked examples in mathematics. It is demonstratable in
all contexts where a graphical and a textual presentation are both necessary to
impart meaning.
Consider the instructional material presented below
dealing with electrical testing. (Taken from Chandler & Sweller,
1991)
Example 5.8.c -Split instructional format for
teaching a procedure. INSULATION RESISTANCE
TESTS a) CONDUCTORS IN PERMANENT
WIRING
Test : To test Insulation Resistance from conductors to
earth.
How conducted : i ) Disconnect appliances and busways during these
tests. Make sure mainswitch is "on" and all fuses are "in". Remove main earth
from neutral bar and set meter to read insulation. Connect one lead to earth
wire at MEN bar and take first measure by connecting the other lead to the
active. Take next measure by connecting the lead to the neutral.ii) If
resistance is not high enough in either of the two tests in i) then measure each
circuit separately.
Results required : i) At least One Megaohm ii)
Same result as i) above
Again, by reformatting
the material so that the instructions are integrated into the graphic, learning
is enhanced. In fact, in this study, evidence indicated better performance
resulted on both theoretical and practical
tests.
Example 5.8.d - Integrated instructional
format for teaching a procedure.
INSULATION RESISTANCE
TESTS a) CONDUCTORS IN PERMANENT
WIRING
5.9 Sources of split attention
The examples presented in Section
5.8 (the split attention effect) focussed on the need to eliminate split
attention effects which result from separate textual and graphical components of
instructional materials. Appropriately integrating the text into the graphic
facilitates learning.
Split attention, however, will result
whenever a learner needs to simultaneously attend to two or more
sources of instruction or activities.
Multiple sources of purely text
based instructional materials will induce a split attention effect if two or
more sources must be considered simultaneously. For example, this is likely to
occur when cross referencing documents, or even cross referencing within a
single document.
Chandler and Sweller (1992) provided evidence that a
split attention effect occurs when reading conventional experimental papers
because the results section and the discussion section are reported separately,
yet need to be considered simultaneously to understand the complex of results
and their implications. Here the split attention effect may be eliminated and
intelligibility increased, by restructuring experimental papers to integrate the
results and discussion sections.
A split attention effect may also result
from mixing activities. For example, when learning to use a software package it
is common practice for the learner to simultaneously refer to a hard copy
tutorial (or manual) and the computer. The tutorial provides
step-by-step instructions for performing each task and the learner attempts to
carry out each step on the computer. While this may seem to be an obvious way of
learning a software package, experimental investigations have shown that far
more effective learning strategies are available.
The simplest
modification is to eliminate the use of the computer in the learning phase and
replace it by appropriate pictures and diagrams. Provided the manual contains
all of the relevant information, then students who study the manual alone
outperform students who perform each step in sequence on the computer based upon
the manual instructions. The irony here is that the manual-only-group complete
their "training" without ever having used the software package, yet in testing,
on a computer with the real software, they perform better than the group who has
already spent time using the software package. See Chandler and Sweller (1996)
for details.
Another alternative is to develop a computer based training
package which integrates text based instructions into a computer simulation of
the target computer package. When this is done the manual may be eliminated from
the training process, leaving students to focus their attention wholly on the
computer screen. This eliminates split attention and facilitates learning. See
Cerpa, Chandler & Sweller (1996).
Summary 5.9 -
Sources of split attention
Split attention occurs whenever a
learner needs to attend to more than one source of information, or more than one
activity. A common source of split attention is the need for a learner to
perform a search. Searching a graphic to locate a component, searching a
document to find a reference and searching software pull-down menus to find a
function referred to in a manual are all examples of split
attention.
Redesigning instructional materials to eliminate search and
other sources of split attention facilitates learning.
5.10 The redundancy effect
Sections 5.8 and 5.9 described the
benefits which result from integrating mutually referring textual and graphical
sources of instruction.
Caution needs to be exercised, however, to ensure
that both sources of instruction truly are necessary for the to-be-learned
information to be intelligible.
In situations where a source of textual
instruction, or a source of graphical instruction alone provides full
intelligibility then only one source of instruction should be used (either the
textual or the graphical), and the other source, which is redundant, should be
removed completely from the instructional materials. In these contexts a single
source of instruction returns higher levels of learning than either an
integrated format (text integrated into the graphic), or a dual format (both
text and graphic presented in parallel).
Cognitive load theory explains
this result by focussing on the levels of cognitive load imposed upon the
learner who needs to process the varying instructional
materials.
Attending to both textual and graphical sources of instruction
requires more mental resources than attending to a single source. Attending to
both textual and graphical sources of instruction, therefore, results in a
reduced portion of working memory being available for the process of
learning.
Maps, whether their purpose is to locate countries (an atlas),
indicate the steepness of terrain (a topographic map) or to show the way to get
from A to B (a street directory) are examples of graphically based sources of
instruction that are fully self contained. Provided the user has the skills to
read and interpret a map, then there is no need for any associated body of
textual information.
Similarly, many instances of textual instruction
have no need for graphics. Arguments of litigation, analysis of history and the
use of a dictionary or thesaurus are fully intelligible in a text-only format.
The use of graphics in these situations actually reduces the level of learning
that results from the use of these documents.
Example
5.10.a - Redundant textual information in a dual
format
Example
5.10.b - Redundant textual information in an integrated
format
This example, dealing with the functions of the heart, is
taken from Chandler and Sweller (1991). Note
The graphic contains labels to indicate parts of the heart, and arrows
to indicate the flow of blood.
The textual statements which are
integrated into the graphic do nothing other than restate the parts and the flow
of blood. On this basis the textual statements are redundant and should be
deleted from the instructional material.
Students presented a graphic
only instructional format learn more than students presented either an
integrated format or a dual format.
5.11 The modality effect
All of the effects discussed so far in
this paper have emphasised the need to reduce cognitive load because of the
limitations of working memory.
While information processing models of
learning have historically emphasised the "fixed" limits of working memory,
there is evidence (Pavio, 1990; Baddeley, 1992) that under some conditions, an
expansion of working memory may be achieved.
Consequently, rather than
attempting to reduce cognitive load, an alternative strategy, that of expanding
working memory, may be pursued as a means of facilitating learning.
The
work by Pavio and Baddeley indicates that at least some portions of working
memory appear to be sensory mode specific. That is, some portion of working
memory is dedicated to attending to visual information only (especially
diagrammatic information) and some other portion of working memory is dedicated
to attending to aural information only (especially verbal information). (Note,
however, that the majority of working memory appears to be in the form of a
central resource which may be allocated to any type of sensory
information.)
Partitioning to-be-learned information so that some
information, such as graphics, is presented visually, while other information,
such as text, is presented auditorily enhances learning (see Mousavi, Low and
Sweller, 1995; Jeung, Chandler and Sweller, 1997; Tindall-Ford, Chandler and
Sweller, 1997). The modality effect holds the potential to impact upon the multi
media industry.
Example
5.11 - Mixed mode instructional format
This example is
taken from Jeung, Chandler and Sweller
(1997). Notes: 1. The graphic is presented
visually but the text is only presented auditorily. 2. Screen
highlights (flashing) were used to identify the components of the graphic
referred to by each auditory statement to eliminate screen search.
When
two parallel lines intersect with a third line, four pairs of corresponding
angles are equal. In the diagram, two parallel lines, AB and CD, intersect with
a third line, XY. The following four pairs of angles are corresponding
angles:
Section 6: Summary and Discussion
Cognitive load theory displays
strong consistencies with current knowledge regarding memory, thought, learning
and problem solving.
It is a theory which views the limitations of
working memory to be the primary impediment to learning. Reducing total
cognitive load imposed by a body of to-be-learned information increases the
portion of working memory which is available to attend to the learning process.
This may only be achieved by engineering reduced levels of extraneous cognitive
load through instructional design.
It is interesting (and important) to
note that the effects generated by cognitive load theory often "fly in the face"
of standard practices. This attests to the strength of the theory. The table
below outlines this observation.
The effects generated by
cognitive load theory should be viewed as "rules of thumb" rather than absolute
"laws of instruction". The bottom line, according to cognitive load theory, will
always be the need to reduce total cognitive load, and the need to maximise
cognitive resources available to be utilised in the learning process. If for
some reason cognitive load increases rather than decreases, then learning will
be inhibited.
For example, the worked example effect will not occur if
the examples used actually increase, rather than decrease, extraneous cognitive
load. This is the case for examples which impose a split attention effect.
Redesigning the format of the examples to eliminate split attention returns the
educational benefit of the use of the worked examples.
The success of
cognitive load theory in developing strategies and techniques which result in
both reduced training times and enhanced performance is of paramount importance
to the education and training industries.
Any fears that the application
of instructional design techniques generated by cognitive load theory may result
in a "poorer" quality of student or worker who is less able to think and act
independently in unusual or unforeseen situations are totally unfounded. Over
the last ten years a large body of evidence has been acquired to show that
students taught using cognitive load generated materials are actually more able
to deal with such unusual or unforeseen situations as attested to by their
superior performances on transfer problems (those that differ to problems seen
during training, but requiring similar rules for their solution).
It
should also be noted that current research projects have provided preliminary
evidence for four additional effects generated by the application of cognitive
load theory. These are (1) the procedural learning effect, (2) the imagination
effect, (3) the colour coding effect and (4) the interaction effect. These
effects are not discussed in the current paper as the results are not yet
published (at December 1998). However, the effects appear to be real and promise
to deliver further strategies for instructional
design.
Suggested readings
If you
wish to pursue further readings in cognitive load theory then try: 1. Sweller
(1991), a very short, non technical description of how educational practice is
often based upon myths rather than empirical research, and then 2. Sweller
(1994) which presents a more detailed (though still non-technical) review of
cognitive load theory and the effects generated. 3. Sweller, Van Merrienboer
& Paas (1998) which presents a detailed (and partially technical) summary of
human cognitive architecture and the implications for instructional
design.
After that you may wish to go to the original journal papers.
Happy readings.
References
Ayres, P. (1993).Why goal free problems can facilitate
learning. Contemporary Educational Psychology, 18 ,
376-381.
Baddeley, A.D. (1992). Working memory. Science, 255,
556-559.
Cerpa, N., Chandler, P., & Sweller, J. (1996). Some
conditions under which integrated computer-based training software can
facilitate learning. Journal of Educational Computing Research, 15,
345-367.
Chandler, P., & Sweller, J. (1991). Cognitive load theory
and the format of instruction. Cognition and Instruction,8,
293-332.
Chandler, P., & Sweller, J. (1992). The split attention
effect as a factor in the design of instruction. British Journal of
Education Psychology, 62, 233-246.
Chandler, P., & Sweller, J.
(1996). Cognitive load while learning to use a computer program. Applied
Cognitive Psychology, 10, 151-170.
Cooper, G., & Sweller, J.
(1987). Effects of schema acquisition and rule automation on mathematical
problem solving transfer. Journal of Educational Psychology ,79,
347-362.
Jeung, H. J., Chandler, P., & Sweller, J. (1997). The role
of visual indicators in dual sensory mode instruction. Educational
Psychology, 17, 329-343.
Larkin, H., McDermott, J.,Simon, D., &
Simon, H. (1980). Models of competence in solving physics problems.
Cognitive Science, 11 ,65-99.
Miller, G. A. (1956). The magical
number seven plus or minus two : Some limits on our capacity for processing
information. Psychological Review, 63 , 81-97.
Mousavi, S., Low,
R., & Sweller, J. (1995). Reducing cognitive load by mixing auditory and
visual presentation modes. Journal of Educational Psychology, 87,
319-334.
Owen, E., & Sweller, J. (1985). What do students learn while
solving mathematics problems. Journal of Educational Psychology,77,
272-284.
Paas, F. (1992). Training strategies for attaining transfer of
problem-solving skill in statistics: A cognitive load approach. Journal of
Educational Psychology, 84 , 429-434.
Paas, F., & Van
Merrienboer, J. (1994). Variability of worked examples and transfer of geometric
problem-solving skills: A cognitive load approach. Journal of Educational
Psychology, 86, 122-133.
Pavio, A. (1990). Mental
representations : A dual coding approach. New York : Oxford University
Press.
Simon, D. P., & Simon, H. A. (1978). Individual differences in
solving physics problems. In R.S. Seigler (Ed.), Children's thinking : What
develops ? Hillsdale, NJ : Lawerence Erlbaum Associates.
Sweller, J.
(1988). Cognitive load during problem solving : Effects on learning.
Cognitive Science, 12, 257-285.
Sweller, J. (1991). Some modern
myths of cognition and instruction. In J. B. Biggs (Ed.), Teaching for
Learning: The view from cognitive psychology : ACER, Radford House, Vic.,
Australia.
Sweller, J. (1994). Cognitive load theory, learning difficulty
and instructional design. Learning and Instruction, 4 ,
295-312. Sweller, J., Chandler, P., Tierner,P., &
Cooper, M. (1990). Cognitive load in the structuring of technical material.
Journal of Experimental Psychology; General, 119,
176-192.
Sweller, J., & Levine, M. (1982). Effects of sub-goal
density on means-ends analysis and learning. Journal of Experimental
Psychology : Human Learning and Memory, 8, 463-474.
Sweller, J., Van
Merrienboer, J., & Paas, F. (1998). Cognitive Architecture and Instructional
Design. Educational Psychology Review, 10 (3),
251-296.
Tindall-Ford, S., Chandler, P., & Sweller, J. (1997). When
two sensory modes are better than one. Journal of Experimental Psychology:
Applied, 3, 257-287.
Van Merrienboer, J., & Krammer, H. (1987).
Instructional strategies and tactics for the design of introductory computer
programming courses in high school. Instructional Science, 16 ,
251-285.
Zhu, X., & Simon, H.A. (1987). Learning mathematics by
examples and doing. Cognition and Instruction, 4,
137-166.