Idiolect
Idiolect is an individual's unique use of language, including speech. This unique usage encompasses
vocabulary, grammar, and pronunciation. This differs from a dialect, a common set of linguistic
characteristics shared among a group of people.
The term is etymologically related to the Greek prefix idio- (meaning "own, personal, private, peculiar,
separate, distinct") and -lect, abstracted from dialect,[1] and ultimately from Ancient Greek λέγω, légō, 'I
speak'.
Language
Language consists of sentence constructs, word choices, and expressions of style, and an idiolect
comprises an individual's uses of these facets. Every person has a unique idiolect influenced by their
language, socioeconomic status, and geographical location. Forensic linguistics psychologically analyzes
idiolects.[2]
The notion of language is used as an abstract description of the language use, and of the abilities of
individual speakers and listeners. According to this view, a language is an "ensemble of idiolects... rather
than an entity per se".[3] Linguists study particular languages by examining the utterances produced by
native speakers.
This contrasts with a view among non-linguists, at least in the United States, that languages as ideal
systems exist outside the actual practice of language users. Based on work done in the US, Nancy
Niedzielski and Dennis Preston describe a language ideology seemingly common among American
English speakers. According to Niedzielski and Preston, many of their subjects believe that there is one
"correct" pattern of grammar and vocabulary that underlies Standard English, and that individual usage
comes from this external system.[4]
Linguists who understand particular languages as a composite of unique, individual idiolects must
nonetheless account for the fact that members of large speech communities, and even speakers of
different dialects of the same language, can understand one another. All human beings seem to produce
language in essentially the same way.[5] This has led to searches for universal grammar, as well as
attempts to further define the nature of particular languages.
Forensic linguistics
Forensic linguistics includes attempts to identify whether a person produced a given text by comparing
the style of the text with the idiolect of the individual in question. The forensic linguist may conclude that
the text is consistent with the individual, rule out the individual as the author, or deem the comparison
inconclusive.[6]
In 1995, Max Appedole relied in part on an analysis of Rafael Sebastián Guillén Vicente's writing style to
identify him as Subcomandante Marcos, a leader of the Zapatista movement. Although the Mexican
government regarded Subcomandante Marcos as a dangerous guerrilla, Appedole convinced the
government that Guillén was a pacifist. Appedole's analysis is considered an early success in the
application of forensic linguistics to criminal profiling in law enforcement.[7][8]
In 1998, Ted Kaczynski was identified as the "Unabomber" by means of forensic linguistics. The FBI and
Attorney General Janet Reno pushed for the publication of an essay of Kaczynski's, which led to a tip-off
from Kaczynski's brother, who recognized the writing style, his idiolect.[9]
In 1978, four men were convicted of murdering Carl Bridgewater. No forensic linguistics was involved in
their case at the time. Today, forensic linguistics reflects that the idiolect used in the interview of one of
the men was very similar to that man's reported statement. Since idiolects are unique to an individual,
forensic linguistics reflects that it is very unlikely that one of these files was not created by using the
other.[10]
Detecting idiolect with corpora
Idiolect analysis is different for an individual depending on whether the data being analyzed is from a
corpus made up entirely from texts or audio files, since written work is more thought out in planning and
precise in wording than in spontaneous speech, which is full of informal language and conversation
fillers, e.g. "umm..." and "you know". Corpora with large amounts of input data allow for the generation
of word frequency and synonym lists, normally through the use of the top ten bigrams created from it. In
such a situation, the context of word usage is considered, particularly when determining the legitimacy of
a given bigram.[11]
Whether a word or phrase is part of an idiolect is determined by the word's location compared with the
window's head word, the edge of the window. This window is kept to 7-10 words, with a sample that is
being considered as a feature of the idiolect being possibly +5/-5 words away from the "head" word of the
window (which is normally in the middle). Data in corpus pertaining to idiolect get sorted into three
categories: irrelevant, personal discourse marker(s), and informal vocabulary. Samples at the end of the
frame and far from this head word are often deemed superfluous. Superfluous and non-superfluous data
are then run through different functions to see if given words or phrases are a part of an individual's
idiolect.[11]
See also
      Linguistics portal
    Dialect
    Gollum
    Yoda
    Idioglossia
    Private language argument
    Referential indeterminacy
    Sociolect
References
 1. Harper, Douglas. "-lect" (https://www.etymonline.com/word/-lect). Etymology Online.
    Retrieved 2019-09-02. "word-forming element abstracted 20c. from dialect and in words
    meaning a regional or social variety of a language."
 2. Gerard, Christophe (2010). "The Individual and His Language: Idiolect, Idiosemy, Style".
    Philologie Im Netz. 51: 1–40.
 3. Zuckermann, Ghil'ad (2006), "A New Vision for 'Israeli Hebrew': Theoretical and Practical
    Implications of Analysing Israel's Main Language as a Semi-Engineered Semito-European
    Hybrid Language." Journal of Modern Jewish Studies 5 (1):57–71
 4. Niedzielski, Nancy & Dennis Preston (2000) Folk Linguistics. Berlin: Mouton de Gruyter.
 5. Gleitman, Lila (1993) "A human universal: the capacity to learn a language." Modern
    Philology 90:S13-S33.
 6. McMenamin, Gerald R. & Dongdoo Choi (2002) Forensic Linguistics: Advances in Forensic
    Stylistics. London: CRC Press.
 7. "Marcos, en la mira de Zedillo - Proceso" (https://web.archive.org/web/20131017152025/htt
    p://www.proceso.com.mx/?p=243839) (in Spanish). 5 August 2002. Archived from the
    original (http://www.proceso.com.mx/?p=243839) on 17 October 2013. Retrieved
    25 January 2018.
 8. "Maestros y condiscípulos de Tampico recuerdan a Rafael Guillén" (https://web.archive.org/
    web/20131017150935/http://hemeroteca.proceso.com.mx/?page_id=278958&a51dc26366d
    99bb5fa29cea4747565fec=169802&rl=wh) (in Spanish). Archived from the original (http://he
    meroteca.proceso.com.mx/?page_id=278958&a51dc26366d99bb5fa29cea4747565fec=169
    802&rl=wh) on 2013-10-17. Retrieved 2018-01-25.
 9. Crain, Caleb (1998). "The Bard's fingerprints" (http://linguafranca.mirror.theinfo.org/9807/crai
    n.html). Lingua Franca: 29–39.
10. Coulthard, Malcolm (2004). "Author Identification, Idiolect, and Linguistic Uniqueness" (http
    s://publications.aston.ac.uk/id/eprint/1928/1/A_AppLing.art.final.doc). Applied Linguistics. 25
    (4): 431–447. doi:10.1093/applin/25.4.431 (https://doi.org/10.1093%2Fapplin%2F25.4.431).
11. Wu, Chung-Hsien; Lee, Chung-Han; Liang, Chung-Hau (2009). "Idiolect Extraction and
    Generation for Personalized Speaking Style Modeling". IEEE Transactions on Audio,
    Speech, and Language Processing. 17: 127–137. doi:10.1109/TASL.2008.2006578 (https://
    doi.org/10.1109%2FTASL.2008.2006578). S2CID 788251 (https://api.semanticscholar.org/C
    orpusID:788251).
External links
    Stanford Encyclopedia of Philosophy entry (http://plato.stanford.edu/entries/idiolects/)
    The Online Dictionary of Language Terminology (http://www.odlt.org)
Retrieved from "https://en.wikipedia.org/w/index.php?title=Idiolect&oldid=1254087973"