Current aetiological models of auditory verbal hallucinations (AVHs) tend to focus on the mechanisms underlying their occurrence, but often fail to address more fine-grained aspects of the content of the auditory experience. In other words, they tend to ask why there are AVHs at all, instead of asking, given that there are AVHs, why they have the properties that they have. One such property, which has been somewhat overlooked, is why the voices are often experienced as coming from (or being the voices of) agents, and often specific agents.
One reason why this has been overlooked (more by theorists than by clinicians) is perhaps the – explicit or implicit – view that, if we can account for the auditory experience, then the agency will follow. Thus Cho and Wu (2013, p.2) claim that ‘it is simple to explain why the patient misattributes the event to another person: that is what it sounds like’. According to such a view, I hear the voice of, e.g., my stepfather because my auditory experience has the properties that resemble those of my stepfather’s voice.
Of course, one problem with this is that, even if this is an accurate account, we need to explain why the experience has those properties and not others. If, instead, we see that what needs explaining is why it is the stepfather who is represented, and not some other agent, then we can potentially provide illuminating explanations in terms of the stepfather’s role in the subject’s life. Of course, these only form a relatively small subset (e.g. 4 out of 24 in Fowler et al. 2006) of AVHs, but I mean this as an illustration of how, if you change your explanatory focus, your explanations start to look very different. If you move the focus away from auditory properties, to the properties of the agent represented, whether it’s a specific individual from the subject’s past, a richly represented imaginary tormentor or friend, or just a particular type of agent, then the explanatory payoffs could be great.
Why might this shift in focus, away from the auditory, be warranted? There are a number of reasons for this, but here I mention just two.
The first is that some voice-hearers describe an experience of ‘soundless voices’. For example, one participant in a recent study told us: “It’s hard to describe how I could ‘hear’ a voice that wasn’t auditory; but the words used and the emotions they contained were completely clear, distinct and unmistakeable, maybe even more so than if I had heard them aurally”. This is in keeping with Chris Frith’s claim that AVHs can involve ‘an experience of receiving a communication without any sensory component’ (1992, p.73).
The second is that ‘voices’ are also ‘heard’ by congenitally deaf people. Jo Atkinson, a researcher in London, has done very important work correcting the ‘audio-centrism’ of mainstream clinical perspectives (Atkinson 2006). She has shown that deaf voice-hearers experience vague visual imagery like being addressed in sign-language, or of disembodied lips. But they do not, as was previously thought, have auditory experiences at all.
A wealth of literature in evolutionary and developmental psychology, as well as in cognitive anthropology, points towards the representation of agency as something that is “trip-wired” (Gurthrie 1980; Barrett 2000), developmentally early and potentially innate (Bates et al. 1975; Butterworth and Grover 1988; Morton and Johnson 1991), and can plausibly enter into the very content of an experience (Johnson 2003). In other words humans can experience “illusions of agency”, even if they don’t go on to believe that agency is present.
Perhaps we should start viewing AVHs as primarily communicative experiences from agents, which happen to often be auditory or reported as auditory, rather than seeing them as primarily auditory experiences that happen to have communicative and agentive aspects.
Atkinson, J. R. 2006. The perceptual characteristics of voice hallucinations in deaf people: Insights into the nature of subvocal thought and sensory feedback loops. Schizophrenia Bulletin 32(4), 701-708.
Barrett, J. L. 2000. Exploring the natural foundations of religion. Trends in Cognitive Science 4, 29–34.
Bates, E., Camaioni, L. & Volterra, V. 1975. The acquisition of performatives prior to speech. Merrill-Palmer Q. 21, 205–226.
Butterworth, G. E. & Grover, L. 1988. The origins of referential communication in human infancy. In L. Weiskrantz (ed.) Thought without Language, pp. 5–24. Oxford University Press.
Cho, R. and Wu, W. 2013. Mechanisms of auditory verbal hallucination in schizophrenia. Frontiers in Schizophrenia 4.
Fowler, D., Freeman, D., Steel, C., Hardy, A., Smith, B., Hackman, C. & Kuipers, E. 2006. The catastrophic interaction hypothesis: how do stress, trauma, emotion and information processing abnormalities lead to psychosis? In W. Larkin and A.P. Morrison (eds.) Trauma and Psychosis: New Directions for Theory and Therapy. London: Routledge; 101–124.
Frith, C. 1992. The Cognitive Neuropsychology of Schizophrenia. Psychology Press.
Guthrie, S. G. 1980 A cognitive theory of religion. Current Anthropology 21(2), 181–203.
Johnson, S. 2003. Detecting Agents. Phil. Trans. R. Soc. Lond. B. 358, 549-559.
Morton, J. & Johnson, M. M. 1991. CONSPEC and CONLERN: a two-process theory of infant face recognition. Psychol. Rev. 98, 164–181.