M is for Mind

22 04 2012

Words come out of the mouth and go into the ear. But they’re stored in the mind. And retrieved from the mind. And understood in the mind. They’re also learned in the mind.

That, at least, is the conventional wisdom – especially from the point of view of cognitive psychology. ‘Language is instantiated in the minds and therefore the brains of language users, so that linguistics is to be regarded as a branch of psychology’. Thus argues Ray Jackendoff (2002: xiv). Chomsky, of course, took this view to an extreme: the observable messiness of language in use (or performance) ‘surely cannot constitute the actual subject matter of linguistics’ (1965: 4). Rather, ‘linguistic theory is mentalistic, since it is concerned with discovering a mental reality [or competence] underlying such behaviour’ (ibid.).

Theories of second language acquisition follow suit: ‘Second language acquisition is first and foremost a mental process – one that occurs in a behavioural and social context, to be sure, but fundamentally a matter of acquiring a new knowledge system.  Cognition and cognitive factors, therefore, are central to any account of how and why SLA works’ (Long & Richards 2001, p.vii) . Anything else, such as the social contexts in which language is used, or the physical stuff of the brain itself, or even the body in which the mind/brain is housed, are considered marginal, messy, uninteresting – mere noise.

The earliest example I could find of a computer in a coursebook: Headway Intermediate (1986)

Not only is language a mental phenomenon, according to this view, but the ‘mind’ of which it is a product is construed as a kind of computer (or as Pinker [1997: 92] charmingly puts it ‘the on-board computer of a robot made of tissue’). Hence, ‘mental life can be explained in terms of a computational process’ (Johnson-Laird, 1988: 26). Or, put another way, cognition – and, by extension, learning – is basically information-processing.  Furthermore, because of the limitations on the amount of attention that humans can allocate to any particular cognitive task at any one time, this processing is necessarily controlled before it is automatic. In short, humans are ‘limited capacity processors of information’.

This applies equally to language learning, both first and other. As McLaughlin (1987: 133) puts it:

Within this framework, second-language learning is viewed as the acquisition of complex cognitive skill.  To learn a second language is to learn a skill, because various aspects of the task must be practised and integrated into fluent performance.  This requires the automatization of component sub-skills.  Learning is a cognitive process, because it is thought to involve internal representations that regulate and guide performance.

Because learning is a cognitive process, this ‘information processing’ view of learning is known as a cognitivist one, and the metaphor that best captures it is MIND IS COMPUTER.  Associated with this model, therefore, we find a host of information-processing terms like input, intake, output, feedback, automatization, filters, as well as the term processing itself. And, because cognition is implicated, we find a further set of terms like noticing, attention, consciousness-raising, and restructuring.

from Reward (1994)

How does this actually impact on current methodology?  On the one hand, you could argue that all these various models of mind and language operate at a level far removed from actual classroom practice, and that teachers carry on doing what they’ve always done – that is, teaching effectively.  On the other hand, you could also argue that the ‘mind is a computer’ metaphor has percolated down (or up?) and underpins many of our methodological practices and materials, including the idea that language learning is systematic, linear, incremental, enclosed, uniform, dependent on input and practice, independent of its social context, de-humanized, disembodied,  … and so on.

It is a model of language learning that, arguably, turns the learner into an automaton -  ‘a robot made of tissue’.  As David Block (2003: 97) notes, ‘in the ideal world of cognitive scientists, the human mind is still conceived of as dependent on external stimuli to which it responds…The adoption of the computer metaphor of input-output does not disguise the fact that there is still a view of mental behaviour as systematic and mechanistic’.

Is there an alternative model – an alternative metaphor, even?

Block (2003: 93) goes on to argue that there are ‘a growing number of scholars who subscribe to the view that mental processes are as social as they are individual and external as they are internal’. (Some of these approaches I’ve referenced in previous posts, such as E is for Ecology, A is for Affordance and B is for Body). Contrasting cognitive with what they loosely call sociocultural approaches, Foster and Ohta (2005:  403) note that, for the latter

Language development is essentially a social process.  These approaches view mind as distributed and learning as something inter-mental, embedded in social interaction.  This means that individuals and environments mutually constitute one another and persons are not considered to be separable from the environments and interactions through which language development occurs.  In this view, knowledge is not owned solely by the learner, but is also a property of social settings and the interface between person and social context.

Elementary Matters (1997)

The distributed nature of mind is a core tenet of theories of ‘situated cognition’, neatly captured here by Clark (2011: 70):

Extended systems theorists… reject the image of mind as a kind of input-output sandwich with cognition as the filling….  Instead, we confront an image of the local mechanisms of human cognition quite literally bleeding out into body and world.

What, I wonder, would be the characteristics of a methodology that subscribed to this distributed, ‘leaky’, and co-adaptive view of mind? And, specifically, what are the correlates of input and of noticing, in this alternative to a computational, information-processing model of language learning?

References:

Block, D.  (2003) The Social Turn In Second Language Acquisition, Edinburgh: Edinburgh University Press.

Clark, A. (2011) Supersizing the Mind: Embodiment, Action, and Cognitive Extension, Oxford: Oxford University Press.

Chomsky, N. (1965) Aspects of the Theory of Syntax, Cambridge, MA: The M.I.T. Press.

Foster, P. and Ohta, A. (2005) ‘Negotiation for meaning and peer assistance in second language classrooms’, Applied Linguistics, 26, 3,

Jackendoff, R. (2002) Foundations of Language: Brain, Meaning, Grammar, Evolution, Oxford: Oxford University Press.

Johnson-Laird, P.  N.  (1988) The Computer and the Mind: An Introduction to Cognitive Science, Cambridge, MA:  Harvard University Press.

Long, M. and Richards, J. (2001) ‘Series editors’ preface’, in Robinson, P.  (Ed.)  Cognition and Second Language Instruction, Cambridge: Cambridge University Press.

McLaughlin, B. (1987) Theories of Second-and Language Learning, London: Edward Arnold.

Pinker, S. (1997) How The Mind Works, London: Penguin.





I is for Imitation

25 03 2012

Listen!  Repeat! Understand! The sequence below comes from an advert for a self-study language course – an advert that I have used countless times on training sessions to (gently) mock the folk theory that language acquisition (both first and second) is primarily a process of imitation – and imitation in advance of understanding, no less. The text of the advert spells it out: ‘You probably can’t remember, but at that time [i.e. when you were a child]  you first reproduced sounds, then words, and then entire phrases without really understanding anything. Very quickly you were able to speak, understand and make yourself understood’.  And of course they add, ‘This is the best way to learn any language’.

It’s amazing how this notion has resisted the hatchet-job that Chomsky and his followers inflicted upon it so long ago. Mindless reproduction of the type described cannot of course account for the almost limitless creativity that even quite young children allegedly exhibit. Summing up the evidence, Lightbown and Spada (2006: 14) confidently declare that ‘imitation and practice alone cannot explain some of the forms created by children’, citing a figure of less than 10 per cent of children’s output as being directly imitative.

So, if, in Chomsky’s terms, language use is rule-based creativity, and if performance is contingent upon competence, then it follows that we should teach (or have learners figure out) the rules of the language, so that they can generate their own meanings, rather than have them simply imitate a model. The learning sequence might better be summed up as Listen! Understand! Figure it out! Create!

It’s something of a shock, therefore, to come up against this sentence in Vygotsky’s Thought and Language ([1934], 1986: 188, emphasis added): ‘In learning to speak, as in learning school subjects, imitation is indispensable’. Or, as Lantolf and Thorne (2006: 166) gloss it, ‘Imitation is the process through which socioculturally constructed forms of mediation are internalised’. That is to say, the transition from skills (including linguistic ones) that are initially other-regulated to those that are self-regulated is engineered by – hold your breath – imitation.

In fairness, and as Swain et al (2011: 58) point out, Vygotsky’s notion of imitation was a far cry from mindless parroting: ‘Vygotsky differentiated imitation from automatic copying.  In Vygotsky’s view, imitation is a potentially transformative mechanism that is applied consciously and is goal-directed. Intentionality of the imitation, the reflection and examination of the results, and the subsequent revisions differentiates the action from simple mimicry’. This is reminiscent of Bakhtin’s (1981: 428) claim that, to make an utterance is to ‘appropriate the words of others and populate them with one’s own intentions’.

Imitation, then, is like a benign form of plagiarism, in which the child cobbles utterances together, in a kind of cut-and-paste fashion, using whatever linguistic affordances are available in order to achieve their immediate communicative purposes.  These linguistic affordances include, not only words, but multi-word chunks, such as lemme-see, I-wanna-do-it, etc, that, initially at least, are unanalysed into their component parts (Tomasello 2003). In this sense, they constitute what one scholar (Clark 1974: 1) has called performance without competence: ‘The important question is no longer whether imitation can help children to acquire syntax, but precisely how a child gradually extracts grammatical information from the repertoire of imitated sequences at his [or her] disposal’.

So, to tweak our learning sequence yet again, maybe what’s happening is more like Listen! Imitate! Understand! Figure it out! – not a million miles from the Listen! Repeat! Understand! formula that I habitually mock.

The question then is (as ever): how does this apply to the learning of a second language? How does one ‘populate the words of others with one’s own intentions’?  Eva Hoffman (1998: 220), a Polish teenager learning English in the United States, describes the process of appropriation:  ‘Since I lack a voice of my own, the voices of others invade me… By assuming them, I gradually make them mine. I am being remade, fragment by fragment, like a patchwork quilt’.  In a similar, patchwork fashion, a student of academic writing will selectively imitate (or copy) features, both micro- and macro-, of a model text as a first step in discovering her own academic ‘voice’.

If imitation is fundamental to first language acquisition, should we be integrating more imitation-type activities into our second language classrooms? And how can we ensure that, in order to be ‘transformative’, imitation meets the criteria that Swain et al. establish (2011: 59), i.e. that it is ‘deliberate, reflective, and accompanied by some kind of instruction’?

References:

Bakhtin, M.M. (1981) The Dialogic Imagination: Four Essays, Austin: University of Texas Press.

Clark, R. (1974) ‘Performing without competence’, Journal of Child Language, 1, 1.

Hoffman, E. (1998) Lost in Translation: A Life in a  New Language, London: Vintage.

Lantolf, J.P., and Thorne, S. (2006) Sociocultural Theory and the Genesis of Second Language Development, Oxford: Oxford University Press.

Lightbown, P.M. and Spada, N. (2006) How Languages are Learned (3rd edn.), Oxford: Oxford University Press.

Swain, M., Kinnear, P., and Steinman, L. (2011) Sociocultural Theory in Second Language Education: An Introduction through Narratives, Bristol: Multilingual Matters.

Tomasello, M. (2003) Constructing a Language: A Usage-based Theory of Language Acquisition, London: Continuum.

Vygotsky, L. ([1934] 1986), Thought and Language, edited by Kozulin, A., Cambridge, Mass: The MIT Press.





P is for Primate language

24 07 2011

I’ve just seen this somewhat dispiriting documentary about Nim, one of a number of primates who have been sequestered, domesticated, scrutinised, feted, and ultimately abandoned in the name of linguistic research.  Even the shots of the Columbia University forecourt that I walk through every day failed to enliven a story of wanton cruelty, institutional pettiness, dodgy science and bad hair.

The film charts a succession of sudden, traumatic abductions, starting when baby Nim was snatched screaming from his mother’s arms. Over a period of several years, with only humans to interact with, the young chimp was taught to sign, using an adapted form of  American Sign Language, and acquiring a working vocabulary of several hundred words. When he outgrew his cute and cuddly stage, and/or when the funding ran out, he was packed off to a sort of primate Guantánamo Bay. The story is only slightly redeemed by the efforts of one of his former minders to track him down. Even in his hoary old age, Nim still retains a trace of his former competence, pathetically signing ‘play’ from within the bars of his prison.

Columbia University

Frustratingly, the film hardly touches on the linguistic controversies that fuelled this research. In the 1970s, when this unhappy story took place, the debate as to whether only humans are innately equipped with a modular language acquisition device (LAD) was still a fairly hot issue. Not for nothing was Nim named Nim Chimpsky, in (cute) recognition of Noam Chomsky’s role as the leading protagonist of the debate.

What was at stake was this: if highly intelligent apes, exposed to a similar linguistic environment as human children, could acquire an extensive lexicon, but fail to develop even the rudiments of a ‘grammar’, this would go some way towards supporting the view that humans are uniquely hard-wired for language acquisition. On the other hand, if evidence of syntax, however primitive, could be demonstrated, Chomsky’s notion of a ‘Universal Grammar’ (UG) would either need to be extended to nonhuman primates, or it would need to be re-evaluated entirely.

And the findings? Nim’s vocabulary was impressive, but more impressive still was his ability to form two-sign, three-sign, and even longer strings: MORE EAT, HUG NIM, BANANA EAT ME NIM, etc. Moreover, a superficial analysis of the data would suggest that Nim was operating according to some kind of embryonic grammar, producing word order patterns not dissimilar to those of human children’s first utterances. For example, he consistently placed the sign for MORE in front of the sign it modified:  MORE TICKLE, MORE DRINK, etc. But, as Jean Aitchison (1983) notes, “a closer analysis showed that the appearance of order was an illusion. Nim simply had a statistical preference for placing certain words in certain places, while other words showed no such preference” (p.55).

However, as Roger Brown(1973) argued, with regard to similar results for Washoe, an earlier case study of primate signing, “While appropriate order can be used as evidence for the intention to express semantic relations, the lack of such order does not establish the absence of such intentions” (p. 41). This is because the use of appropriate word order, of the verb-object type, for example, as in GIVE BALL, is not strictly necessary, since the context in which the utterances are generated usually resolves any ambiguity. That is to say, the pragmatics of the situation renders syntax redundant. But if that is the case, why do (human) children show evidence of a proto-syntax right from the start?

In the end, we don’t seem to be much the wiser as to whether the higher primates have a rudimentary LAD, despite all the anguish that was inflicted in trying to find one. Nor, for that matter, do we really know whether humans have an LAD either, or whether their faculty for language acquisition isn’t just a spin-off of their vastly more developed cognitive capacities.

What we do know is that the chimpanzees who have been studied do not use their linguistic capacities in the same way as humans, even very young ones, do. Nim, for example, rarely initiated a conversation, and was unable to grasp the basics of turn-taking. As Aitchison (1983, p. 57) concludes, “Nim did not use his signs in the structured, creative, social way that is characteristic of human children”.

In fact, Nim’s ‘language’ was simply a more elaborated version of the way chimpanzees use gestures and vocalizations in the wild: to regulate two-way social interactions such as grooming, feeding, and play. As Tomasello (2003, p. 11) puts it, nonhuman primate communication functions “almost exclusively for imperative motives, to request a behavior of others, not to share attention or information with others in a disinterested manner”.

As someone once said, “Your dog can tell you he is hungry, but not that his father was poor but happy”.

References:

Aitchison, J. (1983). The Articulate Mammal: An Introduction to Psycholinguistics (2nd edn). New York: Universe Books.

Brown, R. (1973). A First Language: The Early Stages. Cambridge, MA.: Harvard University Press.

Tomasello, M. (2003) Constructing a Language: A Usage-based Theory of Language Acquisition. Cambridge, MA.: Harvard University Press.





X is for X-bar Theory

6 01 2010

There’s no entry for X in the A-Z. If there were, one of the few candidates would be X-bar theory, which is in fact a (1981) refinement of Chomsky’s theory of language known as Transformational-generative (TG) grammar. (For what it’s worth, X-bar theory argues that all phrases – whether noun phrases, verb phrases etc – have the same structure, and that this structure is a linguistic universal, i.e. it’s common to English, Japanese, probably even Klingon).

Noam Chomsky

As I say, there’s no entry for X-bar theory. There’s no entry for TG grammar in the A-Z, either. Nor for its predecessor, generative grammar. Nor for Government and Binding theory, nor the Principles and Parameters theory, nor the Minimalist program. In fact, there’s no mention of Chomsky or any of his theories in the entry on Grammar at all.

This might strike some readers as odd, even perverse. At best, negligent. After all, the study of TG grammar (or any of its offshoots) is a key component of  any self-respecting linguistics course on any MA TESOL program in the US. It is often the only theory of grammar that is studied. In fact, in many of the standard texts, such as Fromkin et al. (2007) An Introduction to Language, or the Ohio State University Language Files (ed. Stewart and Vaillette, 2001) it’s not even called TG Grammar, nor ascribed to Chomsky by name. It’s simply the grammar that is. The one and only. And, just in case you don’t know which one I’m talking about, it’s the one that involves the endless “tree-diagramming” of (invariably invented) sentences, like The child found a puppy and Where has Pete put his bone? (both from Fromkin et al.).

Simple tree diagram

So why did I ignore it?

Well, to be perfectly honest, I’m not sure that I really understand it. I can get my head around basic Phrase-structure grammar, even X-bar theory, and just about understand what Theta-theory is on about. But, as much as I want to get to grips with the Minimalist program (not least because it seems to be arguing a central role for lexis in determining syntactic structure), I’m struggling. In the end, all those upside-down trees leave me cross-eyed.

But there are more cogent reasons – both linguistic and pedagogical – for treating TG grammar cautiously, it seems to me. On linguistic grounds the theory seems flawed since it is based entirely on invented sentences in their written form. Try to apply the descriptive framework to spoken data – e.g. authentic utterances like: But the spa, you might want to use it, you know  – and it just doesn’t fit. By the same token, TG grammarians will reject forms as being ungrammatical even when they are commonly attested (one of the texts I consulted disallows the sentence John bought what? for example). Chomsky’s dogged insistence on making the “well-formed” sentence the centrepiece of his theory of language seems to undermine the whole enterprise – this, along with the misguided notion that all sentences are generated from the word up, and are hence all entirely original. (Chomsky’s acolyte, Stephen Pinker, woefully betrays his ignorance of developments in corpus linguistics by claiming – in The Language Instinct - that “virtually every sentence that a person utters or understands is a brand-new combination of words, appearing for the first time in the history of the universe” (1994, p. 22) Compare this to the corpus linguist, John Sinclair’s (1991) claim that “by far the majority of text is made of the occurrence of common words in common patterns”).

The blinkered disavowal of the validity of performance data is, of course, a side-effect of their (Chomsky’s, Pinker’s etc)  mentalist agenda, which is to demonstrate both the universality and innateness of their grammar. This means cherry-picking your examples (and re-configuring your theory fairly regularly so as to accommodate new, potentially disruptive, evidence), while consigning anything that doesn’t fit to the damaged goods bin – the one labelled “performance”.

But, more importantly for me, is the lack of pedagogical applicability. For a start, the idealised nature of Chomskyan grammar seems to bear only an accidental relationship to the way language is actually stored and generated in the brain. Chomsky (to his credit) was quick to acknowledge this.  Way back in 1965, he wrote: “When we say that a sentence has a certain derivation with respect to a particular generative grammar, we say nothing about how the speaker or hearer might proceed, in some practical or efficient way, to construct such a derivation.” (Aspects of the Theory of Syntax. Cambridge, Mass.: MIT Press, p. 9.)  Quite right: the tree diagram describes the structure “after the event”, but it ignores the fact that (as David Brazil put it) “discourse [is] something that is now-happening, bit by bit, in time, with the language being assembled as the speaker goes along” (1995, p. 37). If the TG representation of grammar has little or no psychological foundation, it would seem to be fairly useless for teaching purposes.  I take heart, therefore, from a statement made by Bernard Spolsky, after a think-tank on the applications of Chomskyan grammar, held in the late sixties. He concluded:

Linguistics and its hyphenated fields have a great deal to offer to language teachers, but the fullest benefit can only come when their implications are integrated and formed into a sound theory of language pedagogy. Because linguistics is only indirectly applicable to language teaching, changes in linguistic theory or arguments amongst linguists should not disturb language teachers

(Spolsky, B. (1970) Linguistics and language pedagogy – applications or implications? [emphasis added]) 

So, why does the teaching of TG grammar (including X-bar theory) persist in the US academic context? And was I wrong to ignore it in the A-Z?








Follow

Get every new post delivered to your Inbox.

Join 1,646 other followers