Extinction, reinforcers and punishers

The strength of associations can also be judged by the resistance to extinction - that is, how long an animal will keep responding in the presence of an SD whithout reiforcement. One of the main features of all schedules of reinforcement which are partial - that is where not every single response is reinforced, is that they produce learning which is much more resistant to extinction than continuous reinforcement.

In operant conditioning appetitive and aversive events produce different patterns of earning. One can distinguish between four different consequences of responding in operant conditioning:

Positive reinforcement: a positive or appetitive event is contingent on responding - e.g., responding might lead to being fed.
Negative reinforcement: cessation of negative or aversive events is contingent on responding - e.g., responding might cause a series of shocks to stop.
Punishment: a negative or aversive event is contingent on responding - e.g., moving off a platform might lead to shock.
Negative punishment: the cessation of positive events is contingent on responding - e.g., moving into the centre of a test box may cause pleasant brain stimulation to be halted.

Although animals can learn all of these contingencies it is very clear that they have quite different consequences in extinction. When a contingency fails to apply to a behavior actively produced by the animal it is clear that the contingency is not longer in operation. On the other hand, if behaving leads to aversive consequences then, in extinction the animal is unlikely to produce the behavior and hence to discover that the contingency no longer applies.

This document has been restructured from a lecture kindly provided by R.W.Kentridge.