|
|
||
|
||
Phil RaceQuality of assessmentPhil Race AbstractWhat our learners really pay for (or what society pays us for) are their Degrees or their Diplomas or their Certificates. Yet we put much more emphasis on trying to teach our students than we normally put into designing assessment - particularly the 'formal assessment' that counts towards final qualifications. Assessment is where learners often get a raw deal. In this chapter, I will propose the following ten 'worries' about assessment, and explore ways of leading to a better quality of assessment. Ten worries about assessment
Is Competence enough?In 1991, the Association of Educational and Training Technology (AETT) International Conference had the theme 'Developing and Measuring Competence'. As one of the editors of the proceedings of that conference (Saunders and Race, 1992), I became well acquainted with the many views on competence which were aired by the various contributors. It was heartening to see that many of the approaches to the development of competence focused on defining the evidence which would be sufficient to demonstrate competence. However, I was left with some concerns about whether competence would often end up as minimum competence rather than high competence. Also, I already hinted at some of my concerns towards the close of the previous chapter. The following two figures pose some questions about 'shades' of competence, and about a range of descriptors which may be needed to provide more information than simply 'can do'.
Figure 1. Shades of Competence
Figure 2. Some competence descriptors Towards mass higher educationHigher education in the United Kingdom is being moved 'in fits and starts' towards greater participation rates, despite apparent U-turns in Government fiscal policies on expansion. In the long term, for an sorts of reasons - and at whatever cost - the pressures for higher participation rates in higher and further education will be irresistible. This means not only rapidly increasing numbers of school leavers, but also many more non-traditional entrants. Already, many lecturers and tutors are seeking help in handling large classes. It is clear that the workload of staff in higher education is increasing, as they become responsible for more and more learners. In some ways, teaching can be 'done to' large numbers almost as easily as to small numbers - it's almost as easy to lecture to 300 as to lecture to 100. However, teaching greater numbers may have little connection with learning by greater numbers. And when it comes to assessment, there are no short cuts. It usually takes three times the amount of time to assess 300 learners as it would to assess 100 learners. The main challenge we face as we move towards a mass higher education system is merely maintaining the quality of assessment we presently have - not to mention improving it. Assessment and learning: how do they mix?In Chapter 1 I introduced the simple model of learning summarised again below:
How does assessment usually relate to these steps in the learning process? Let's take the most common form of assessment - the written examination.
Perhaps traditional forms of assessment have only one real contribution towards learning - people are frightened (shamed) into doing some learning so that they may minimise their chances of being shown to be 'lacking'. Much intensive learning is done just before exams - but most of it is of a superficial nature and soon forgotten again. Ten worries - and some suggestionsIn my Abstract at the start of this Chapter, I listed ten worries I have about assessment. I would like now to repeat them, giving a little more detail about my concerns, and offering some suggestions about how the problems may be minimised. Many of my suggestions point towards involving learners in their own assessment. This can be achieved by letting learners get their hands on the assessment criteria. It can be achieved even better by letting learners apply the assessment criteria, in self-assessment and peer-assessment. It is achieved best by helping learners to formulate the criteria, then apply them. An illustrated discussion of uses of self- and peer-assessment is given later in this chapter. 1 Assessment is often done in a rush, to meet exam board deadlines. It's rarely done under the best of conditions! This is because assessment tends to be done to learners - not by them. Assessment tends to be done at the end of learning something, rather than as a means to help the learning processes. In public exams, examiners often face piles of some hundreds of scripts, which all need to be finished within only a week or two. Suggestions:
2 Assessment is often done by bored people, tired of reading the same answers to the same questions (and seeing the same mistakes). Examiners get thoroughly fed-up as they wade through hundreds of scripts. They get discouraged when they see things they hoped their learners would have mastered, only to find that messages have not got across . Any tedious or repetitive task causes people to change their mood. If assessors' moods plunge, the objectivity of assessment is likely to be affected accordingly. Suggestions:
3 Assessment tends to be governed by 'what is easy to assess' . Therefore, traditional written exams (relatively straightforward to assess) are used. These measure learners' skills at tackling traditional written exams. There is still not enough attention being paid to what should constitute the evidence upon which to base awards. Many important competences are simply not assessable by traditional methods. While it is perfectly possible to use traditional methods to measure recall of facts and information, it is not-at-all easy to use such methods to measure innovation, judgment, or personality. Fig.3 shows an overhead transparency I use to alert students to the various agendas that may be served by traditional exams.
Figure 3. What exams measure? Suggestions:
4 Learners rarely know the intimate detail of the assessment criteria, and how we interpret them. There really is no excuse for this. The reason may be sinister - that those who design the assessment criteria are not sufficiently confident about them to show them to the learners! Assessors often fear that learners may demand to know 'why did I get 65% for this, when my friend got 75%?' Surely, they have every right to ask this sort of question - and to learn from the feedback they should be given by way of a response to the question. Suggestions:
5 How should we develop learners' unassessable qualities? Should we refrain from developing them because we can' t measure them? 'Don't bother to learn anything, when you can't see how they can ask you about it at the end of the day': this is a perfectly rational view taken by learners, deciding what to learn and what not to learn. Suggestions:
6 Almost all assessment processes in common use foster learner competition rather than collaboration. No wonder our educated people are so bad at working in teams. Learners preparing for exams are often quite secretive about the work they do. No-one likes to be thought of as 'a swot'! However, it's more sinister than this: we actually compound the competition by using norm-referenced assessment far too much. In other words, only a certain proportion of learners are allowed to receive 'A' grades, or lst-class Honours degree classifications. Therefore, learners are in competition. Suggestions:
7 What competences are measured by assessment anyway? Are they 'can do' competences? Or are they simply 'did do, once' ghosts? Exams tend to measure 'did once' competences. At their worst, they still measure 'knew once' competences rather than 'did once'! Suggestions:
8 If we were to introduce 'Quality in Assessment BS5750A what should the criteria look like? What evidence of competence should assessors demonstrate? At present, it is automatically assumed that anyone appointed to a post involving teaching or lecturing is blessed with all the skills needed to design assessment schemes and implement fair assessment. People are appointed to teaching (and assessing) posts not on the basis of how well they can do either task, but often on the record of their own academic performance. Suggestions:
9 'If you can' t measure it, it doesn't exist. If you can measure it, it isn't it' . What should we be trying to measure? It has been said that one of the main faults of our education and training systems is that we tend to teach people things that are already understood, instead of equipping them to understand new things. Assessment reflects this. Suggestions:
10 Where stops the buck? Whose fault is it that assessment is so artificial? Heads of Department? Employers? Assessors? Validators? The Government? Yours? Mine? If you imply that there is something suspect about people's abilities to assess, it is badly received! Assessment is something that is usually done privately rather than publicly and people go to great lengths to ensure that they retain privacy. Is such privacy really needed mainly because of the 'put down the number you first thought of' syndrome? Suggestions:
Learning through assessingIn much of my discussion so far, I've been focusing on the dangers when assessment is 'done to' people, and hinting at the benefits which can result when learners themselves are allowed to be intimately involved in their own - and each other's - assessment. Self-assessment and peer-assessment may lack some of the precision of the best of 'formal' assessment - where (some) assessors have a great deal of experience and (sometimes) assess fairly and conscientiously. However, what may be lacked in terms of precision is more than compensated for by the benefits of deeper learning, which go hand in hand with the act of learners themselves assessing. Close encounters with assessment criteriaThis is the crucial difference between formal assessment and self-assessment or peer-assessment. Learners find out a lot about any subject simply by applying assessment criteria to examples of work in that subject (whether the examples are self generated, made by other learners or issued by a teacher). Previously, assessment criteria have seemed to learners to be the property of examiners. There has been a tendency for teachers to regard assessment criteria as quite private. For many years, I marked O-level and A-level scripts for two of England's public examinations boards. The marking schemes were invariably labelled 'Secret' and sometimes I had to sign a declaration to the effect that after the marking was completed I would destroy all information relating to the marking scheme. In Higher Education, it is usual to have some sort of external moderation of assessment, usually involving sending draft examination papers to lecturers in other institutions for feedback or approval. Even where model answers and marking schemes have been required to be sent to external examiners or moderators, the vital information in such schemes has seldom been shared with learners, and until relatively recently hardly ever applied by learners themselves. Yet when learners have the chance to get their hands on assessment criteria, they seem to develop a thirst for the information they can derive from them - leading to much deeper learning. Self-Assessment and peer-assessment are not just self-testingThese forms of assessment when well-developed involve several processes:
In other words, self-assessment and peer-assessment have a tangible learning payoff, not least that associated with the quantity and quality of feedback which learners gain about their own performances and (in peer assessment) about each other's performances. Assessment criteria: black and white or shades of grey?In subjects like maths, science and engineering, things are often either right or wrong - and it is relatively easy to devise assessment criteria for tests and exercises. However, even in subjects such as law or social studies, there are identifiable hallmarks of a good answer or an unsatisfactory answer to a question. Such hallmarks can be turned into checklists of a flexible kind, which enable the characteristics of good and less-good answers to be compared and contrasted. Students can benefit by learning in the act of applying assessment criteria to their own and each others' work. Benefits to learners of close encounters with assessment criteriaLearners can quickly find out about incorrect assumptions they have been making. They are able to find out more and more detail about the answer to the crucial question: 'What am I expected to become able to do?' There are, of course, many more benefits, depending on how we involve learners in using assessment criteria - including helping learners themselves formulate the criteria (when this is possible or appropriate) - leading to the most obvious form of ownership of assessment. Some examples of self- and peer-assessment mechanismsSelf assessment is not confined to the variety that is widely used in open and distance learning (though of course that is one powerful form of it). Self- and peer-assessment processes can include any of the following:
There are further combinations of these. There is also the additional matter of whether the grades or scores contribute in a formal way to the performance records of learners. Facilitating learners' ownership of assessment criteriaWhere it is possible to draw assessment criteria from learners themselves, especially in group situations, the sense of ownership which learners develop is very powerful, and leads to them using the criteria with considerable enthusiasm and commitment when self- or peer-assessing. It is of course important to explain to learners the benefits they can draw from taking part in assessing their own or each other's work before proceeding further. I have found that the following approach gives useful results for groups of 10-20. I have spelled out the process in some detail below, but you will often be able to take short cuts and simplify the task in the light of your own experience. Nonetheless, I have found that the whole of the sequence below can be completed in less than an hour with groups of around 25 learners. Suppose learners are about to do task 'x' (where 'x' could be to write an essay, give a presentation, prepare a project, write a report, design a handout, and so on). After a general introduction to clarify the nature of the task, the time available and the general format of the final product, proceed as follows.
Peer Assessment Grid
Table 1. Peer assessment data Allowing for individual differencesThe sort of peer-assessment described above is suitable for tasks such as presentations, where many people can assess the same piece of evidence, and where scores can then be compared and discussed by the group. For individual tasks such as essays, reports, projects, dissertations and so on, it is likely that each piece of work will reflect slightly different criteria (or even very different criteria) and then it is often best to allow for some 'agreed' criteria, and some 'idiosyncratic' criteria so that each learner can exercise more ownership of the assessment criteria. An example of a grid that can be adapted for such purposes is shown in Fig.6. Self Assessment Grid
Figure 6. Self assessment grid The most important outcome of involving learners in the formulation of self-assessment or peer-assessment criteria is that learners address the task with criteria in their minds and the quality of their work seems to be much higher than it may otherwise have been. In other words, there is no longer the 'hidden agenda' of the criteria in the mind of the assessor. Instead, when learners use self-assessment and peer-assessment, they are already in possession of the 'rules of the game' and will strive particularly hard to live up to these rules, particularly when they have helped to make them in the first place. Furthermore, being involved in devising and applying assessment criteria gives learners a useful insight into how other parts of their work (for example, formal examinations) are likely to be assessed, and helps them to work out more about the 'rules' (or 'wiles'?) of exam marking too. Throw away the numbers or grades?I've often suggested to learners after a peer assessment exercise that the numbers or grades they awarded (if not contributing to their overall assessment) were only a vehicle to help them do learning of a higher productivity. However, I've found they usually want to hang on to the numbers - good or bad. Perhaps this is evidence of the sort of ownership we're aiming for? How well can students assess themselves, and each other?In general, students are quite accurate in their assessing. I have found that when students are asked to 'guess' their own performance scores just after completing an exam, around 90% of students 'guess' within 5% of their actual scores. It is useful to identify the 10% who had an inaccurate perception of how they had done - they usually benefit from a discussion to probe the causes. Those 10% may be over-anxious and underestimate their achievements, or over-confident and over-estimate their achievements. When discrepancies in self-assessment occur, they are usually due to one of the following causes:
Peer-assessment and self-assessment can be usefully combined. Peer-assessment can be conducted 'blind' so that 'arranged' scoring is avoided. If the peer-assessment mark or grade is equal to the corresponding self-assessment mark (e.g. within 5%) then the self-assessment marks go forward into the assessment system - possibly with a staff 'scan' to ensure that fair play is in operation. (It is far quicker to scan a piece of work to check whether the assessment is fair, than it is to mark the work from scratch). When self- and peer-scores differ, negotiation or staff intervention maybe necessary (but this happens surprisingly rarely in practice). ![]() But can't it all go wrong?There are several things that can go wrong with self-assessment and peer-assessment and to use either process successfully, it's worth knowing the potential dangers. 'You're paid to assess me - why should I assess myself?'There will always be some learners who regard self-assessment or peer-assessment as an abdication from duty by tutors. My own reply is along the lines 'Certainly, I'll assess your work if you wish. But the real aim of involving you in assessing is not to save me work, but to allow you to gain much greater understanding not only of your subject material, but also of how assessment works'. 'But we haven't the expert knowledge to use to assess'This, of course, is a real concern of many students faced for the first time with the task of assessing their own - or their colleagues - work. Tutors are regarded as 'expert witnesses' blessed with the experience with which to make authoritative and valid judgments. The key to solving problems of this nature is clarification of assessment criteria. When the criteria are phrased in language that learners can readily understand, their reservations about being able to make judgments based on the criteria are rapidly dispelled. In addition, with peer assessment, it is wise to spell out the advantages of a multiple judgment rather than the single one that might be given by a tutor. The average of several people's opinions is always likely to be more meaningful than a single opinion - even when the single opinion comes from a 'figure of authority' i.e. a tutor. 'What about passengers in team work?'Often, when the products of team work are being self- or peer-assessed, the matter of 'contribution' comes up. One way is to award a score or grade to the overall product of the work of the team, whether it is a presentation, a report, an exhibition, or so on. Then, the team can be asked to 'split' the overall score between its members, in terms of the contribution each member made to the final product. Obviously, it's best if this can be done 'publicly' with all team members reaching agreement about the split of the award. Alternatively, however, it can be done by 'secret ballot', with the possibility of tutor intervention if necessary. Surprisingly perhaps, it is seldom necessary to invoke such extremes, especially if the issue of 'passengers' has been addressed by the whole group in advance and some ground rules on 'contribution' drawn up with the approval (better still, ownership) of the group as a whole. 'Is it the novelty that makes it work?'Usually at present, when self-assessment or peer-assessment are introduced into a course, they have 'novelty value'. Because they are different, students tend to take them seriously and engage fully in the right spirit with their increased responsibilities. If such methods of assessment become commonplace, they would be regarded as part of the normal system, and human nature dictates that students would seek ways of 'playing the system' as in many other aspects of life. So, to some extent, it has to be admitted that the genuine enthusiasm and objectivity that students display for well-arranged self-assessment or peer-assessment has something to do with the novelty-value. 'What if the group just does not gel?'Especially when assessing the product of group-work, there is always the possibility of a particular group having problems, possibly due to a single member. One way round this is to allow new groups to be formed for successive tasks, so that the effect of any disruptive or uncooperative individual is spread rather than concentrated. However, this can lead to what a colleague of mine referred to as 'sieving' of students, ending with the weakest all being left in a single group. 'But won't students be far too kind to themselves when they self-assess?'Tutors are genuinely concerned, sometimes, that left to their own devices, students win an award themselves first-rate scores or grades. However, it is often the case that students' assessments are 'harder' than staff assessments. Admittedly, there are a few students who will self-assess their own efforts unrealistically 'high', but such instances are rare, and stand out clearly. If we are honest about it, students are in a better position to self-assess their own efforts than anyone else is. Even if they rate themselves too high, they almost certainly know they are doing just that. The real payoff is in the reflection that they engage in when making their judgments about their own work, whether rightly or wrongly. In other words, self-assessment is not a means to an end, but an end to a means - i.e. a way of helping students to reflect deeply. In the model of learning I use throughout this book, self-assessment is a very useful process which aids 'digesting'. The discussion above shows that self-assessment and peer-assessment can certainly 'go wrong'. If either process is introduced by tutors who do not believe in the benefits that can be achieved, or to learners who are reluctant to engage fully with the processes, it is not hard to see how either process will be doomed to failure. It is important not to 'impose' innovations such as self- or peer-assessment against the will of tutors. Tutors are highly intelligent professionals and if forced to do something they don't trust will ensure that it demonstrably fails to work! Self-Assessment, Peer-Assessment and how people learnThroughout this chapter, I've advocated the benefits of helping learners become intimately involved in processes of assessing and I've pointed to the hazards of traditional assessment procedures. To conclude, let's look once again at the four main processes of learning I introduced in Chapter 1 and mentioned briefly earlier in this chapter and see how the processes of self- and peer-assessment relate to how people learn.
Reflecting on assessmentIt can be argued that people who need a 'tester' are inadequately prepared to be sent out into the world outside. Self-assessment and peer-assessment can both be important parts of the learning process. The learning experience resulting from such forms of assessment is more important than the result of the assessment. Self-assessment or peer-assessment do not necessarily have to lead to any 'formal' (recorded) assessment. The aim can be purely as a learning experience, with the 'marks 'simply part of the process through which that experience is facilitated. Self-assessment and peer-assessment are skills, and become more reliable with practice. Receiving feedback on the quality of these forms of assessment is vital if learners are to derive the maximum benefit from engaging in them. Self-assessment and peer-assessment should be introduced early -for example during the first term rather than being left till the final year. Late in a course students may see little point in embracing new ways of learning. 'Ownership' is the most crucial aspect of successful learning and both self-assessment and peer-assessment are closely connected to the development of ownership of learning. Not all students warm to the 'exposure' of self- or peer-assessment. They may begin their studies with expectations that they will be assessed by professionals. 'What's in it for me?' they naturally may ask. They need to be convinced that self- and peer-assessment have direct benefits for themselves, and do not represent an abdication from duties on the part of tutors. Some tutors, however, feel it is dangerous to 'lose control' of assessment. If such tutors try to employ self- or peer-assessment, but constantly safeguard their right to step in 'should things go wrong', the whole concept of such forms of assessment is undermined. So what about teaching? Admittedly, our students learn from us. But they probably learn more on their own, and they probably learn even more from each other. Much of their learning occurs in the immediate run-up to assessment of one kind or another - so the role of assessment is an important factor in the circumstances which accelerate learning. Therefore, perhaps our biggest contribution to our students' learning is directly associated with the quality of the assessment our students encounter - and has less than we'd like to think to do with our teaching activities. I believe there is a strong case for using self- and peer-assessment not primarily to assess, but as processes to enhance learning. Finally, if and when we must resort to traditional, formal assessment, I believe that there is a great deal of room for improvement. The 'ten worries' I expressed at the beginning of this chapter may help to set an agenda for improving the quality of assessment. I end this chapter with ten recommendations. What can we do to improve assessment?
ReferenceSaunders D and Race P (1992) (eds) Developing and Measuring Competence: Aspects of Educational and Training Technology XXV Kogan Page, London.
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||