never seen those specific pictures before but you can still identify the objects
e.g Bruce’s idea of the structural code in face recognition
AI and viewpoint invarience
Marr
origin of the viewpoint invariance view comes from AI
Marr believed that vision operated in an information-processing manner
A grey level description is an image depicting light intensity changes
Each point (pixel) on the image indicates how much light was received at that point
This is computer vision, so, unlike Bruce’s Pictorial Code in face recognition, it is a computer image made up of pixels
looking at greyscale to find the parts that stick out
Raw Primal Sketch
Find important parts of the image
done by looking for large intensity changes
Description of the boundaries between objects
achieved by several types of blurring
blobs, edge segments and bars can be identified
Proximity
These blobs, edge segments and bars need to be put together
Gestalt laws propose ways in which this might happen
report they see a square made of circles for A
but then saw they see 5 rows/columns of 5 circles for the other 2
square background made of squares and a square made of circles
X in front of a square
similar things go together
People will see the circle as closed on first glance
if it has the chance to carry on it will
could turn anyway
so not good continuation
people will see AOB and COD
most people are asymmetrical
those who have average symmetry faces are usually ranked more attractive
Putting the bits together
Similarity
Closure
Good continuation
Symmetry
The Full Primal Sketch
uses Gestalt principles to determine important regions or lines
contain 3 different levels of depth
closest, medium, furthest away
shown in separate images
3 images on top of each other
2.5D sketch
As the name implies, is somewhere between 2D and 3D
A 2D image is flat and contains no information about depth
A 3D image shows full depth
The 2.5D image contains information about the layout of object surfaces but is still viewpoint-dependent
That is, the appearance can still change with a shift in viewpoint
Object recognition
turning objects in a 2.5D sketch into 3D objects
Marr and Mishihara 1978
proposed they could use structures called primitives to active this
everything is made of cylinders
Recognition by components
Recognition by Components (RBC)
Developed by Biederman (1987)
Objects are also comprised of features
An ‘alphabet’ of 36 geons was proposed
All objects can be constructed from this set of geons
The geons are like to parts of a letters in a feature net
An object is recognised by the geons activating parts of the network
A key concept of RBC is viewpoint-independence
That is, objects can be recognised in any viewpoint
Recognition by Components (RBC)
Developed by Biederman (1987)
Objects are also comprised of features
An ‘alphabet’ of 36 geons was proposed
All objects can be constructed from this set of geons
The geons are like to parts of a letters in a feature net
An object is recognised by the geons activating parts of the network
A key concept of RBC is viewpoint-independence
That is, objects can be recognised in any viewpoint
Biederman 1987
used a flashlight and a more complicated penguin geon
recognition getting worse the more geons you have in the shape
viewpoint invariance
In summary, work by Marr, Marr and Nishihara, and Biederman propose that object recognition is viewpoint-invariant
Objects can be successfully recognised from any viewpoint
There is no cost of changing viewpoint between study and test
This is achieved by a group of set shapes known as geons
viewpoint dependency
Object recognition is dependent upon having a stored mental representation of an object that is the same/similar to a currently presented view
Recognition to a previously unseen view requires generalisation
Already encountered one form of viewpoint-dependence in PSYC412
(Bruce’s idea of the pictorial code in face recognition)
Longmore et al 2008
rotating head
shows profile view then other rotations
gets worse the more you rotate away from the original image
Canonical Views
most people choose 3
canonical viewpoint
shows the most info in one image
Problem for RBS
gets worse with the greater angle of difference
demonstrates viewpoint dependency
Liu 1996
made less mistakes with more symmetry
Reaction times and recognition accuracy is a function of how similar the study and test images are.
There is a steady increase in reaction times and a decrease in recognition accuracy with increasing viewpoint changes
Suggests that object recognition is mediated by mental representations based upon view-dependent descriptors
The viewpoint-dependency debate was at its peak in the 1990s and has now largely subsided
It moved away from the idea of a view-dependent vs. view-invariant mental representation to what is the actual process used for generalisation
Use both Dependent and INvariant processes
Burgund et al 2000
Image of objects tested in the same or different orientation as studied
images presented to the left or right visual field
processed by the right or left hemisphere respectively
mental rotation
Long before the viewpoint-dependency debate some element of viewpoint-dependency was known
Studies on mental rotation seemed to suggest this
Shepard and Meztler 1971
abstract objects (Rubiks snake) presented in pairs
could be same or different
angular difference between 2 objects could vary
Ps task to determine whether the ibjects are the same or different
the time participants took to judge whether two rotated abstract block figures were identical increased with the figures' relative angular disparity
Are there dedicated networks that are innate for recognising faces.
It is difficult to define faces as special as in face recognition faces are special because they convey important information(as stated by bruce and young)
special=dedicated neural networks dedicated to processing specific stimuli
Evidence for faces being special
Some evidence for faces being special is the N170. this is an ERP(event related potential). When people are shown different stimuli the brain processes the information on different levels. It also has negative potential meaning down is good.
FFA specificity:
Fusiform face area(fusiform gyrus) responds largely to faces. kanwisher(1997) believes this is because the FFA was designed to process faces while gauthier believes that it is a general area for expertise(anything you are good at)
Developmental evidence:fantz(1961)
Fantz was interested in what babies could actually see(because of this we know babies have blurry vision) fantz narrowed bars on a stimulus until the baby couldn't see the difference and no longer paid attention. He then decided to see how long babies looked at different stimuli for lengths of time
black bar= under 3 months white bar= over 3 months
This shows that babies looked at the face the most at any age, even though the top 3 stimuli all had the same contrast(black and white) the face was still on top, suggesting that they favoured the face.
Johnson et al:1991
24 infants less than an hour old
camera at the top records the babies reactions, the mother holds a paddle with either a face(or part of a face/shapes of a face) or nothing on and the mother would move the paddle over the baby to see if the baby reacted.
0= the baby looking directly up
The face lead to the baby moving its head and eyes more than anything else. This suggests that babies are born with what faces look like and it is an innate process.
This then leads to 2 systems being presented by Johnson and Morton(1991)- Conspec and Conlearn
CONSPEC:
An innate subcortical system
It drives an infants attention to faces to aid with survival(drawn to other people)
It is necessary to drive the shaping of face processing(it understands face shape but are not able to recognise a face yet)
All this suggests that there are innate schematics for face processing
However, there is alot of critics
CONLEARN:
Cortical, face specific mechanism that takes over from CONSPEC at 1-2 months
It is more flexible and therefore more accepted
When there are visual issues from birth, face learning is impaired as it is not able to act
One of the problems with CONSPEC was shown in a study by simion et al(2003)
Infants were shown top heavy or bottom heavy
stimuli.
It was found that infants preferred the top heavy pattern because there is the most contrast there and there is the most contrast at the ‘top part’ of the human face
indicating that it is not the face they are focusing on but the contrast
Another issue with CONSPEC was shown by pascalis(1994),
they reported that 3-4 day old infants will habituate to a strangers face(they would ignore it),
their argument that CONSPEC is supposed to draw the infant to the face and CONLEARN is not developed yet so the infant cannot have learned the face.
But the infant was shown the face alot so it might not be interesting anymore
Perceptual narrowing:
Perceptual narrowing(developed by nelson, 2001) has lots in common with CONLEARN.
This is the gradual learning of human faces, even if you don't recognise human faces, you can tell them apart.
As you start to develop you are better able to define the difference of different faces(pascalis et al 2002).
Kelly et al(2005)
showed that 3 month old infants prefer faces of the area that they grow up race to faces of other races as that is the race that they see more often, but this preference was not present at birth.
Meaning the most commonly seen race is prefered by the infant.
Neuropsychology:
Neuropsychology studies people with brain injuries and tests what they can do vs what they cant do and then compare that to the general public to see what damage does what.
Visual agnosia = when presented with an object he is unable to say what the object is
Sheep recognition!!!!!!!
Sheep farmer who cant recognise his wife but he can recognise his sheep(in rather face like ways)
he knew they were different
WJ the sheep farmer
mcneill and warringon (1993)
WJ had a stroke and became a sheep farmer
recognise the identity of his sheep but not his friends/family.
Results:
he recognised his sheep and other sheep.
evidences the fact that there must be a face-specific impairment as face recognition is down to chance while sheep recognition is high
Mr W prosopagnosia
Developed face recognition problems as a result of heart problems.
He had a number of abilities-
Normal IQ
Could identify faces from other objects
Could copy faces
Could state the sex of the face
Could state the expression of the face
Could match unfamiliar faces
He could recognise people from their voice(he hasn't forgotten about people)
However he could not recognise the faces(he couldn't identify himself or doctors or friends) therefore it is face recognition, not everything facial, just recognising them.