BEHL 2012 Lecture Notes - Lecture 2: Reinforcement, Operant Conditioning, Habituation
Week 2a Workshop
Simple Learning: Habituation
Occurs when a stimulus is presented/encountered repeatedly, and the response to it weakens
Associative Learning
-Classical (or Pavlovian) conditioning
-Operant (or instrumental) conditioning
Unconditioned= Automatic
Conditioned= Learned
Pavolv's Dog Experiment
Before conditioning: Food= Salivation Buzzer= no salivation
During conditioning: Food +buzzer= salivation
After conditioning: Buzzer= salivation
Basic Processes in Classical Conditioning
Reinforcement- pairing of neutral stim and conditioned stim
Acquisition- initial stage of learning
Extinction- only neutral stim (no conditioned stim)
Spontaneous recovery- response comes back after a period of time
Generalisation- similar stim triggers same response
Discrimination- exact stim triggers response exclusively
High-order conditioning- other small things affect response (??)
Principles of operant conditioning
three terms of contingency
reinforcement/ punishment
Positive= add (positive punishment--> add punishment)
Negative=take (negative reinforcement--> take nice thing away)
operant conditioning
Schedules of reinforcement
Fixed-ratio (FR30) (eg. Rat presses bar 29 times and gets nothing.
Presses 30 times and receives reward)
Variable-ratio (VR30) (eg. Rat presses bar 29 times and gets
nothing. Presses 30 times and receives reward)
Fixed-interval (FI 1min)
Variable-interval (VI 1min)
VR on average is 30
(see schedules of reinforcement graph)
In the real world
Shaping via successive approximations
Specify goal/desired behaviour
Identify a response to use as a starting point
Reinforce starting response, then require successively closer
approximations, until the desired response eventually occurs
Behaviour modification- central principle is:
Behaviour that is positively reinforced is likely to be
repeated
find more resources at oneclass.com
find more resources at oneclass.com