Assignment 2 Corpus 80
Assignment 2 Corpus 80
Clarity of
Integration of other Quality of linguistic Syllabus Specification organisation and In-text citation &
Aspect ➔
research analysis and reporting 15% expression bibliography
Grade Scale
20% 40% 15% 10%
Assessment #2: Identify spoken or written text(s) exemplifying language learners' needs,
analyse relevant linguistic features of the text and show how those features could inform
teaching practice
B. Theoretical background
Even though corpus linguistics, the study of a collection of texts for a particular research purpose
(Cheng, 2011), is one of the most rapidly developing areas of applied linguistics, and corpus
analysis has been utilised to generate word lists for a long time, the application of corpus
linguistics to investigate vocabulary in the IELTS reading test is arguably not an area of research
interest (Mitchell, 2022). For this section, I will first examine the structure of the IELTS test,
focusing particularly on the Academic reading component. Second, I will elaborate on the
importance of vocabulary and word lists in English learning before eventually associating the use
of corpus analysis to lexical acquisition.
Unlike Cambridge English exams, IELTS test takers will be awarded a band score from 1 to
9, with 1 being awarded to those who have no ability to use the language except for some
isolated utterances and 9 being awarded to exper users of English. Table 2 gives a detailed
description of how the IELTS bands are interpreted (retrieved from https://www.ielts.org/).
Table 2: IELTS scores scale
The IELTS Academic reading test comprises three passages with 40 questions of various types
that are designed to test a range of reading subskills (e.g. reading for gist, reading for details,
reading for inference). Test takers are given 60 minutes to complete the test including time to
transfer the answers from the question booklet to an answer sheet. Compared with the General
Training reading test, the Academic one features more advanced vocabulary and a greater
complexity of styles; however, the test is constructed for non-specialist candidates, which means
test takers do not need to have specialised knowledge to achieve a good band score. The level of
difficulty increases towards the end of the test.
Table 3 below shows the number of correct answers required to obtain a particular band score
in the test (retrieved from https://www.ielts.org/). Scores are given in whole bands and half
bands.
Table 3: How the IELTS reading scores are caculated
IELTS Reading band scores
Raw score out of 40 Band Score
15 5
23 6
30 7
35 8
B. Data collection
For this corpus-based project, I used two computer programs AntWordProfiler and AntConc. The
former program, which is embedded with the Academic Word List (570 words) developed by
Coxhead (2000), was used to produce a list of the most common academic words used in IELTS
Academic reading papers based on their frequency whilst the latter was employed to identify
collocates of words from the list.
177 academic reading passages were collected from a total of 59 reading tests derived from
books published by Cambridge University Press. These books include the Cambridge IELTS series
(5-15), The Official Cambridge Guide to IELTS, and IELTS Trainer Academic 2. These publications
purport to be the only authentic testing materials from Cambridge ESOL Examinations. The
selection of these up-to-date official test papers guarantees the credibility and reliability of the
research findings.
I managed to obtain a corpus of 158,092 tokens after removing all the instructions and questions
following each reading passage. That is, the corpus only contains words from the reading texts.
My rationale for the removal of instructions and questions primarily depends on my experience
with IELTS test papers – the vast majority of instructional words and words used in questions do
not belong to the academic category to ensure that test takers, no matter where they are on the
1-to-9 scale, can easily understand the instructions and questions.
Table 4: Example of the generated word list (See Appendix 1 for the full list)
No. Headword Frequency Range
1 research 280 92
2 create 158 67
3 environment 147 67
4 process 138 71
5 area 127 70
6 individual 111 58
7 transport 109 24
8 design 105 42
9 involve 102 65
10 evident 100 55
11 major 99 63
12 energy 98 36
13 technology 84 52
14 theory 84 36
15 approach 81 50
16 culture 81 35
17 respond 80 39
18 significant 79 57
19 economy 76 37
20 evolve 73 30
Stage 2: Word list tagging with the Common European Framework of Reference for Languages
(CEFR)
Subsequently, I categorized those 318 words in the list according to the CEFR, which is an
international benchmark for describing language proficiency (Milton, 2013). According to the
Cambridge English Website (http://cambridgeenglish.org/), language ability is described in a
hierarchy from A1 for beginners up to C2 for those who have mastered a language, and this
division enables anyone involved in language teaching or testing including teachers, learners, and
administrators to have a common ground to benchmark language proficiency. The rationale for
using CEFR in vocabulary instruction is so teachers can ensure that they teach the right words to
their students.
As stated earlier, the target students in this study are currently at the level of B2, and they are
aiming for 6.5-7.0 in the reading test component. Therefore, words classified as B2 and C1 were
chosen for the final word list. That is, words at the levels of A1, A2, B1, C2, and off-list words (a
total of 117 words) were removed from the original list.
For this stage of the procedure, https://www.vocabkitchen.com/profile is used to categorize
words according to CEFR levels. Table 5 shows the word list after CEFR levels are tagged to each
item, and Table 6 illustrates a list of words at the levels of B2 (151 words) and C1 (50 words).
These 201 words total a coverage of 3.72% of the 177 passages.
Table 5: Example of the word list with tagged CEFR levels (See Appendix 1 for the full list)
No. Headword Frequency Range CEFR
1 research 280 92 B1
2 create 158 67 B1
3 environment 147 67 B1
4 process 138 71 B2
5 area 127 70 A2
6 individual 111 58 B1
7 transport 109 24 B1
8 design 105 42 B1
9 involve 102 65 B1
10 evident 100 55 B2
11 major 99 63 B2
12 energy 98 36 B1
13 technology 84 52 B2
14 theory 84 36 B1
15 approach 81 50 B1
16 culture 81 35 B1
17 respond 80 39 B2
18 significant 79 57 B2
19 economy 76 37 B2
20 evolve 73 30 C1
Table 6: Example of the list of words at B2 and C1 levels (See Appendix 2 for the full list)
No. Headword Frequency Range CEFR
1 process 138 71 B2
2 evident 100 55 B2
3 major 99 63 B2
4 respond 80 39 B2
5 significant 79 57 B2
6 economy 76 37 B2
7 evolve 73 30 C1
8 factor 71 38 B2
9 impact 68 49 B2
10 psychology 66 25 B2
11 source 66 45 B2
12 focus 65 46 B2
13 identify 65 45 B2
14 survive 64 43 B2
15 construct 60 38 B2
16 intelligence 57 17 B2
17 consume 56 26 B2
18 function 56 34 B2
19 affect 54 34 B2
20 participate 54 23 B2
D. Discussion
It is widely agreed that while a corpus analysis of texts enables materials writers to develop
learning materials with specific lexical knowledge for English for Academic Purposes (EAP)
courses, it informs teachers of what to teach to meet their students’ specific needs (Hyland &
Tse, 2007). In this case, my students’ needs are to achieve 6.5-7.0 in the IELTS Academic reading
test. The generated list of 201 most frequent words and their collocations in 177 IELTS Academic
reading passages are extremely helpful in equipping them with sufficient lexical knowledge to
obtain the score they want.
However, Hyland and Tse (2007) raised a concern over the creation of an academic word list,
suggesting that words in the list often exhibit “a considerable amount of semantic variation” (p.
245). That is, one individual word may behave differently in different contexts, so researchers or
teachers who produce the word list should inform their students of the importance of contextual
environments in which the words are positioned. For instance, the word intelligence has two
distinct meanings: (1) the ability to learn, understand and think quickly and logically and (2)
collected confidential information about a foreign country, especially one that is an enemy (as in
military intelligence and intelligence agent). Moreover, language learners ought to be made
aware of all the different derivations and inflections of a lexical item (Hoey, 2010). For example,
students need to be made aware that economy is a noun, economic is an adjective, economics is
a noun, economical is an adjective, economically is an adverb and economist is a noun. Given the
two mentioned aspects, it stands to argue that it is possible to generate a list of common
academic words by utilising a corpus analysis, but the word list should be used with careful
consideration as there is no context attached to the meanings of the words. This view is
supported by Coxhead & Nation (2001) who claimed that vocabulary is beyond single words
acting separately in a discourse and that vocabulary should not be taught or learnt out of context.
Nevertheless, this issue can be mitigated to a certain degree with the aid of the collocation list
following the decontextualised word list, which indicates parts of speech of each item and its
common collocates. Lewis (1997) stressed the significance of collocations by stating that “instead
of words, we consciously try to think of collocations, and to present these in expressions. Rather
than trying to break things into ever smaller pieces, there is a conscious effort to see things in
larger, more holistic, ways” (p. 204)
E. Conclusion
For many students, the IELTS Academic reading component is a significant challenge due to its
high density of academic lexis. From a pedagogical perspective, this aspect of the IELTS test is
also challenging for teachers who are assigned to teach the exam classes because many teachers,
especially novice ones or those new to the concept of teaching to the test, tend to struggle to
identify which lexical items they should focus on while teaching their students. Failing to find out
those words, the teachers are less likely to help their students succeed in their IELTS test
attempts. However, the introduction of corpus linguistics and advances in computer technology
have enabled materials developers and teachers alike to produce word lists dedicated to the
IELTS test in general and the Academic reading component in particular.
Although the findings of this corpus-based project are fruitful in a sense that a list of 201
academic words at B2 and C1 levels and a table of collocations could inform teachers of what
words they should teach and provide students with a repertoire of vocabulary to draw on, it is
important for teachers to make their students aware of the fact that mastery of the word list and
the collocation table generated from this study will not guarantee them a 6.5-7.0 score as fluent
reading does not depend solely on lexical range, and that such factors as grammatical fluency,
reading speed and reading skills have a crucial role to play (Khalifa & Weir, 2009). Moreover, it is
vital for teachers to avoid the pitfall of teaching vocabulary out of context; otherwise, prospective
IELTS test takers may find the test preparation even more arduous.
Software
• Anthony, L. (2022). AntWordProfiler (Version 2.0.0) [MacOS 10/11]. Tokyo, Japan:
Waseda University. Available from https://www.laurenceanthony.net/software
• Anthony, L. (2022). AntConc (Version 4.0.10) [MacOS 10/11]. Tokyo, Japan: Waseda
University. Available from https://www.laurenceanthony.net/software
References
• Biber, D., Conrad S., & Cortes, V. (2004). If you look at...: Lexical Bundles in University
Teaching and Textbooks. Applied Linguistics, 25(3), 371-405.
• Cheng, W. (2011). Exploring Corpus Linguistics. New York, NY: Routledge.
• Coxhead, A. (2000). A new academic word list. TESOL Quarterly, 34(2), 213-238.
• Coxhead, A., & Nation, I. S. P. (2001). The specialized vocabulary of English
foracademic purposes. In J. Flowerdew & M. Peacock (Eds.), Research perspectives on
English for academic purposes (pp. 252–267). Cambridge: Cambridge UniversityPress
• Davies, M., & Gardner, D. (2010). A Frequency Dictionary of Contemporary American
English. New York, NY: Routledge.
• Day, R., & Bamford, J. (1998). Extensive reading in the second language classroom.
Cambridge: Cambridge University Press.
• Durrant, P. (2014). Discipline and Level Specificity in University Students’ Written
Vocabulary. Applied Linguistics. 35(3), 328-356.
• Durrant, P. (2016). To what extent is the Academic Vocabulary List relevant to university
student writing? English for Specific Purposes. 43, 49-61. doi: 10.1016/j.esp.2016.01.00.
• Gabrielatos, C. (2005). Corpora and Language Teaching: Just a fling or wedding bells?
TESL-EJ, 8(4). Retrieved from http://tesl-ej.org/ej32/a1.html
• Gardner, D., & Davies, M. (2013). A New Academic Vocabulary List. Applied Linguistics,
35(3), 305–327. doi: 10.1093/applin/amt015
• Hoey, M. (1991). Patterns of lexis in text. Oxford: Oxford University Press.
• Hoey, M. (2010). Lexical priming: a new theory of words and language. London:
Routledge.
• Hsu, J.T. (2010). The Effects of Collocation Instruction on the Reading Comprehension
and Vocabulary Learning of College English Majors. The Asian EFL Journal Quarterly, 12
(1), 47-87.
• Hu, M. H. H., & Nation, I. S. P. (2000). Unknown vocabulary density and reading
comprehension. Reading in a Foreign Language, 13(1), 403–430.
• Hyland, K., & Tse, P. (2007). Is there an ‘Academic vocabulary’? TESOL Quarterly, 41(2),
235-253. http://dx.doi.org/10.2307/40264352
• Johns, T. (1988). Whence and whither classroom concordancing? In T. Bongaerts, P. de
Haan, S. Lobbe., & H. Wekker (Eds.) Computer applications in language Learning (pp. 1-
27). Dordrecht: Foris Publications. Journal of Public Health, 16(5), 369-374.
• Khalifa, & Weir, C. J. (2009). Examining reading : research and practice in assessing
second language reading. Cambridge, UK: Cambridge University Press.
• Kiaee, S.S., Moghaddam, N.H., & Hosseini, E.M. (2013). The Effect of Teaching
Collocations on Enhancing Iranian EFL Learners' Reading Comprehension. English
Language Teaching, 1, 1-11.
• Leech, G., Rayson, P., & Wilson, A. (2001). Word Frequencies in Written and Spoken
English: Based on the British National Corpus. London, UK: Longman.
• Lewis, M. (1997). Implementing the lexical approach: Putting theory into practice. Hove,
England: Language Teaching Publications.
• Martínez, I., Beck, S., & Panza, C. (2009). Academic vocabulary in agricultural research
Articles: A corpus-based study. English for Specific Purposes, 28(3), 183-198.
• Michell, C. (2022). Using Corpus Linguistics to Better Prepare Students for the IELTS
Reading Exam. In Coombe, C., Hiasat, L., Daleure, G. (Eds.), English Language and
General Studies Education in the United Arab Emirates. English Language Teaching:
Theory, Research and Pedagogy. Springer, Singapore. https://doi.org/10.1007/978-981-
16-8888-1_23
• Miller, D.G., & Biber, D. (2015). Evaluating reliability in quantitative vocabulary studies:
The influence of corpus design and composition. International Journal of Corpus
Linguistics, 20(1), 30-53.
• Miller, N., & Budgell, B. (2008). The language of public health: A corpus-based analysis.
• Milton, J. (2013). Measuring the contribution of vocabulary knowledge to proficiency in
the four skills. In C. Bardel, C. Lindqvist, & B. Laufer (Eds.), L2 vocabulary acquisition,
knowledge and use: New perspectives on assessment and corpus analysis (pp. 57-78).
Euro SLA.
• Nation, I. S. P. (1990). Teaching and learning vocabulary. Massachusetts: Newbury
House.
• Nation, P., & Waring, R. (1997). Vocabulary size, text coverage and word lists. In N.
Schmitt, & M. McCarthy (Eds.), Vocabulary: Description, acquisition and pedagogy (pp.
6-19). Cambridge: Cambridge University Press.
• Nattinger, J. R., & DeCarrico, J. S. (1992). Lexical phrases and language teaching. Oxford:
Oxford University Press.
• Pawley, A., & Syder, F. H. (1983). Two puzzles for linguistic theory: Nativelike selection
and nativelike fluency. In J. C. Richards & R. W. Schmidt (Eds.), Language and
communication (pp. 191–226). New York: Longman.
• Sinclair, J. (1991). Corpus, Concordance, Collocation. Oxford: OUP.
• Stubbs, M. (2001). Words and Phrases: Corpus Studies of Lexical Semantics. Oxford:
Blackwell.
• Thorndike, D. L., & Lorge, I. (1944). The Teacher’s Word Book of 30,000 Words. New
York, NY: Bureau of Publications, Teachers College, Columbia University.
• Vu, P. A., Nguyen, Q. T., & Nguyen, T. T. (2016). Identifying the criteria for selecting
language outcomes assessment tools: The case of TOEIC and IELTS. Paper presented at
4th British Council New Directions in English Language Assessment: Standardised
Testing and Proficiency scales. Ha Noi, 13–14 October 2016.
• Wang, J., Liang, S., & Ge, G. (2008). Establishment of a medical academic word list.
English for Specific Purposes, 27(4), 442-458
• Ward, J. (2009). A basic engineering English word list for less proficient foundation
engineering undergraduates. English for Specific Purposes, 28(3), 170-182
• West, M. (1953). A General Service List of English words. London, UK: Longman, Green &
Co.
Appendices
Appendix 1: Full list of 318 most frequent academic words in the IELTS Academic reading
papers with CEFR levels
No. Headword Frequency Range CEFR
1 research 280 92 B1
2 create 158 67 B1
3 environment 147 67 B1
4 process 138 71 B2
5 area 127 70 A2
6 individual 111 58 B1
7 transport 109 24 B1
8 design 105 42 B1
9 involve 102 65 B1
10 evident 100 55 B2
11 major 99 63 B2
12 energy 98 36 B1
13 technology 84 52 B1
14 theory 84 36 B1
15 approach 81 50 B1
16 culture 81 35 B1
17 respond 80 39 B2
18 significant 79 57 B2
19 economy 76 37 B2
20 evolve 73 30 C1
21 require 73 53 B1
22 achieve 71 45 B1
23 factor 71 38 B2
24 project 71 38 A2
25 region 71 41 B1
26 similar 70 53 B1
27 benefit 69 39 B1
28 impact 68 49 B2
29 psychology 66 25 B2
30 range 66 54 B1
31 source 66 45 B2
32 focus 65 46 B2
33 identify 65 45 B2
34 role 65 48 B1
35 survive 64 43 B2
36 structure 62 31 B1
37 construct 60 38 B2
38 globe 59 36 Off list
39 predict 59 35 B1
40 site 59 27 A1
41 intelligence 57 17 B2
42 consume 56 26 B2
43 function 56 34 B2
44 affect 54 34 B2
45 participate 54 23 B2
46 task 54 29 B2
47 tradition 54 34 B2
48 analyse 53 28 B2
49 expert 53 33 B1
50 period 52 36 B1
51 vary 52 37 B2
52 decade 51 41 B2
53 institute 51 38 B2
54 potential 51 35 B2
55 team 51 35 A2
56 final 50 39 A2
57 adapt 49 32 B2
58 method 49 32 B1
59 perceive 49 25 C1
60 resource 49 37 B2
61 occur 48 32 B2
62 establish 47 33 B2
63 facilitate 47 18 C1
64 image 47 25 B2
65 innovate 47 14 Off list
66 physical 47 31 B2
67 adult 46 21 A1
68 policy 46 27 B2
69 challenge 45 35 B1
70 community 45 29 B2
71 chemical 44 21 B2
72 contribute 44 35 B2
73 investigate 44 34 B2
74 maintain 44 38 B2
75 dominate 43 20 B2
76 motive 42 20 B2
77 strategy 42 26 B2
78 available 41 29 A2
79 compute 41 15 Off list
80 feature 41 27 B2
81 goal 41 23 A2
82 network 41 20 B2
83 data 40 22 B2
84 job 40 23 A1
85 estimate 39 26 B2
86 intense 39 27 C1
87 invest 39 24 B2
88 locate 39 31 B2
89 principle 39 25 C1
90 revolution 39 19 B2
91 vehicle 39 10 B1
92 aspect 38 29 B2
93 communicate 38 20 B1
94 define 38 23 B2
95 specific 38 29 B2
96 trend 38 16 B1
97 interact 37 27 B2
98 reveal 37 29 B2
99 mental 36 17 B2
100 publish 36 21 B1
101 restrict 36 19 C1
102 aware 35 25 B2
103 eventual 35 29 C2
104 instance 35 28 B1
105 select 35 17 B1
106 emerge 34 27 B2
107 rely 34 26 B2
108 complex 33 25 B2
109 obvious 33 21 B1
110 author 32 18 B1
111 diverse 32 26 B2
112 issue 32 22 B1
113 link 32 24 B1
114 migrate 32 10 Off list
115 consist 31 27 B1
116 demonstrate 31 24 B2
117 detect 31 14 C1
118 previous 31 26 B1
119 sustain 31 18 C2
120 access 30 24 B1
121 decline 30 19 B2
122 distinct 30 21 C1
123 expand 30 24 B2
124 phenomenon 30 19 C1
125 status 30 11 C1
126 survey 30 16 B2
127 accurate 29 21 B1
128 colleague 29 25 A2
129 finance 29 19 B2
130 indicate 29 25 B2
131 positive 29 17 B1
132 apparent 28 24 B2
133 assist 28 18 B2
134 automate 28 9 Off list
135 compound 28 8 Off list
136 concept 28 24 B2
137 shift 28 18 B2
138 technique 28 24 B1
139 unique 28 24 B2
140 ensure 27 25 B2
141 scheme 27 11 B2
142 abandon 27 14 B2
143 abstract 26 7 B2
144 academy 26 17 Off list
145 accompany 26 12 B1
146 acquire 25 10 B2
147 adequate 25 15 B2
148 adjust 25 10 B2
149 aid 25 13 C1
150 alter 25 16 B2
151 alternative 25 19 B2
152 annual 24 13 B1
153 appreciate 24 11 B2
154 appropriate 24 18 B2
155 approximate 24 13 B2
156 assess 24 15 B2
157 assign 24 7 C1
158 assume 24 20 B2
159 attach 24 12 B1
160 attain 24 5 C1
161 attitude 23 18 B1
162 attribute 23 9 C2
163 authority 23 9 B2
164 capable 23 11 B2
165 capacity 23 10 B2
166 category 23 6 B2
167 channel 23 7 A2
168 circumstance 22 9 B2
169 cite 22 9 Off list
170 classic 22 18 B2
171 code 22 8 B2
172 comment 22 10 B1
173 component 22 12 C1
174 concentrate 22 18 B1
175 conclude 22 22 C1
176 conduct 22 21 B2
177 confer 21 9 Off list
178 conflict 21 11 B2
179 consequent 21 21 Off list
180 considerable 21 13 B1
181 constant 21 15 B2
182 consult 21 10 C1
183 contact 21 10 A2
184 context 21 16 B2
185 contrast 20 13 B2
186 convene 20 12 Off list
187 convince 20 18 B1
188 core 20 10 C2
189 corporate 20 7 C1
190 correspond 20 9 B2
191 couple 20 10 B1
192 crucial 20 17 B2
193 cycle 19 11 B1
194 debate 19 18 B2
195 derive 19 8 C1
196 despite 19 20 B1
197 device 19 11 B2
198 devote 19 8 B2
199 display 19 17 B1
200 distribute 19 13 B2
201 drama 19 26 B1
202 duration 19 8 C1
203 dynamic 19 7 B2
204 element 19 20 B2
205 eliminate 18 9 C1
206 emphasis 18 13 B2
207 enable 18 22 B2
208 encounter 18 10 B2
209 enormous 18 12 B1
210 equip 18 17 B2
211 equivalent 18 6 C1
212 erode 18 9 C2
213 error 18 9 B2
214 ethic 18 8 Off list
215 evaluate 18 13 C1
216 exclude 18 10 C1
217 exhibit 17 12 C1
218 explicit 17 6 C2
219 exploit 17 12 B2
220 export 17 10 B2
221 expose 17 15 B2
222 flexible 17 7 B2
223 foundation 17 8 C1
224 founded 17 17 B2
225 fund 17 15 C1
226 fundamental 17 21 C2
227 furthermore 16 12 B2
228 generate 16 17 B2
229 generation 16 18 B1
230 hence 16 9 C1
231 hypothesis 16 15 C2
232 identical 16 5 B2
233 ignorant 16 18 C2
234 illustrate 16 9 B2
235 implement 16 9 B2
236 implicate 16 10 Off list
237 implicit 16 2 C2
238 income 16 12 B2
239 inevitable 16 12 B2
240 initial 15 17 B1
241 initiate 15 12 C2
242 input 15 8 B2
243 insight 15 15 C1
244 instruct 15 11 C1
245 integrate 15 8 C1
246 internal 15 11 B2
247 interpret 15 17 B2
248 intervene 15 8 C2
249 isolate 15 12 Off list
250 labour 15 13 C1
251 layer 15 11 B2
252 logic 15 9 C1
253 manipulate 14 10 Off list
254 maximise 14 9 Off list
255 mechanism 14 11 C1
256 media 14 7 B2
257 medical 14 14 B2
258 medium 14 10 B1
259 minor 14 9 B2
260 mode 14 2 C1
261 monitor 14 12 B2
262 negate 14 14 Off list
263 nevertheless 13 16 B2
264 normal 13 16 A2
265 notion 13 12 C1
266 objective 13 17 B2
267 obtain 13 15 B2
268 occupy 13 12 B2
269 option 13 13 B1
270 outcome 13 12 C1
271 output 13 9 C2
272 overall 13 11 B2
273 partner 13 12 A2
274 percent 13 13 B1
275 perspective 13 10 C1
276 phase 13 9 B2
277 philosophy 13 12 B2
278 precede 13 10 C2
279 precise 13 19 B2
280 primary 13 17 B2
281 priority 12 9 B2
282 proceed 12 15 C1
283 professional 12 8 B1
284 promote 12 17 B1
285 proportion 12 16 C1
286 prospect 12 12 B2
287 pursue 12 9 C1
288 radical 12 12 C1
289 random 12 11 C1
290 react 12 21 B2
291 recover 12 8 B1
292 regulate 12 12 C1
293 reject 12 12 B2
294 relax 12 8 B1
295 release 12 17 B2
296 relevant 12 14 B2
297 remove 12 14 B1
298 reside 12 5 Off list
299 resolve 11 10 C1
300 retain 11 9 C2
301 revenue 11 3 C1
302 reverse 11 10 B2
303 route 11 14 B1
304 section 11 10 B1
305 sector 11 15 C1
306 secure 11 17 B2
307 seek 11 19 B2
308 series 11 19 B1
309 sole 11 10 C1
310 stable 11 13 C1
311 statistic 11 7 C1
312 stress 11 21 B1
313 style 11 15 B1
314 subsequent 11 10 C1
315 sufficient 11 18 B2
316 symbol 11 10 B2
317 target 11 15 B2
318 technical 11 18 B2
Appendix 2: Full list of 201 most frequent academic words in the IELTS Academic reading
papers at B2 and C1 levels
No. Headword Frequency Range CEFR
1 process 138 71 B2
2 evident 100 55 B2
3 major 99 63 B2
4 respond 80 39 B2
5 significant 79 57 B2
6 economy 76 37 B2
7 evolve 73 30 C1
8 factor 71 38 B2
9 impact 68 49 B2
10 psychology 66 25 B2
11 source 66 45 B2
12 focus 65 46 B2
13 identify 65 45 B2
14 survive 64 43 B2
15 construct 60 38 B2
16 intelligence 57 17 B2
17 consume 56 26 B2
18 function 56 34 B2
19 affect 54 34 B2
20 participate 54 23 B2
21 task 54 29 B2
22 tradition 54 34 B2
23 analyse 53 28 B2
24 vary 52 37 B2
25 decade 51 41 B2
26 institute 51 38 B2
27 potential 51 35 B2
28 adapt 49 32 B2
29 perceive 49 25 C1
30 resource 49 37 B2
31 occur 48 32 B2
32 establish 47 33 B2
33 facilitate 47 18 C1
34 image 47 25 B2
35 physical 47 31 B2
36 policy 46 27 B2
37 community 45 29 B2
38 chemical 44 21 B2
39 contribute 44 35 B2
40 investigate 44 34 B2
41 maintain 44 38 B2
42 dominate 43 20 B2
43 motive 42 20 B2
44 strategy 42 26 B2
45 feature 41 27 B2
46 network 41 20 B2
47 data 40 22 B2
48 estimate 39 26 B2
49 intense 39 27 C1
50 invest 39 24 B2
51 locate 39 31 B2
52 principle 39 25 C1
53 revolution 39 19 B2
54 aspect 38 29 B2
55 define 38 23 B2
56 specific 38 29 B2
57 interact 37 27 B2
58 reveal 37 29 B2
59 mental 36 17 B2
60 restrict 36 19 C1
61 aware 35 25 B2
62 emerge 34 27 B2
63 rely 34 26 B2
64 complex 33 25 B2
65 diverse 32 26 B2
66 demonstrate 31 24 B2
67 detect 31 14 C1
68 decline 30 19 B2
69 distinct 30 21 C1
70 expand 30 24 B2
71 phenomenon 30 19 C1
72 status 30 11 C1
73 survey 30 16 B2
74 finance 29 19 B2
75 indicate 29 25 B2
76 apparent 28 24 B2
77 assist 28 18 B2
78 concept 28 24 B2
79 shift 28 18 B2
80 unique 28 24 B2
81 ensure 27 25 B2
82 scheme 27 11 B2
83 abandon 27 14 B2
84 abstract 26 7 B2
85 acquire 25 10 B2
86 adequate 25 15 B2
87 adjust 25 10 B2
88 aid 25 13 C1
89 alter 25 16 B2
90 alternative 25 19 B2
91 appreciate 24 11 B2
92 appropriate 24 18 B2
93 approximate 24 13 B2
94 assess 24 15 B2
95 assign 24 7 C1
96 assume 24 20 B2
97 attain 24 5 C1
98 authority 23 9 B2
99 capable 23 11 B2
100 capacity 23 10 B2
101 category 23 6 B2
102 circumstance 22 9 B2
103 classic 22 18 B2
104 code 22 8 B2
105 component 22 12 C1
106 conclude 22 22 C1
107 conduct 22 21 B2
108 conflict 21 11 B2
109 constant 21 15 B2
110 consult 21 10 C1
111 context 21 16 B2
112 contrast 20 13 B2
113 corporate 20 7 C1
114 correspond 20 9 B2
115 crucial 20 17 B2
116 debate 19 18 B2
117 derive 19 8 C1
118 device 19 11 B2
119 devote 19 8 B2
120 distribute 19 13 B2
121 duration 19 8 C1
122 dynamic 19 7 B2
123 element 19 20 B2
124 eliminate 18 9 C1
125 emphasis 18 13 B2
126 enable 18 22 B2
127 encounter 18 10 B2
128 equip 18 17 B2
129 equivalent 18 6 C1
130 error 18 9 B2
131 evaluate 18 13 C1
132 exclude 18 10 C1
133 exhibit 17 12 C1
134 exploit 17 12 B2
135 export 17 10 B2
136 expose 17 15 B2
137 flexible 17 7 B2
138 foundation 17 8 C1
139 founded 17 17 B2
140 fund 17 15 C1
141 furthermore 16 12 B2
142 generate 16 17 B2
143 hence 16 9 C1
144 identical 16 5 B2
145 illustrate 16 9 B2
146 implement 16 9 B2
147 income 16 12 B2
148 inevitable 16 12 B2
149 input 15 8 B2
150 insight 15 15 C1
151 instruct 15 11 C1
152 integrate 15 8 C1
153 internal 15 11 B2
154 interpret 15 17 B2
155 labour 15 13 C1
156 layer 15 11 B2
157 logic 15 9 C1
158 mechanism 14 11 C1
159 media 14 7 B2
160 medical 14 14 B2
161 minor 14 9 B2
162 mode 14 2 C1
163 monitor 14 12 B2
164 nevertheless 13 16 B2
165 notion 13 12 C1
166 objective 13 17 B2
167 obtain 13 15 B2
168 occupy 13 12 B2
169 outcome 13 12 C1
170 overall 13 11 B2
171 perspective 13 10 C1
172 phase 13 9 B2
173 philosophy 13 12 B2
174 precise 13 19 B2
175 primary 13 17 B2
176 priority 12 9 B2
177 proceed 12 15 C1
178 proportion 12 16 C1
179 prospect 12 12 B2
180 pursue 12 9 C1
181 radical 12 12 C1
182 random 12 11 C1
183 react 12 21 B2
184 regulate 12 12 C1
185 reject 12 12 B2
186 release 12 17 B2
187 relevant 12 14 B2
188 resolve 11 10 C1
189 revenue 11 3 C1
190 reverse 11 10 B2
191 sector 11 15 C1
192 secure 11 17 B2
193 seek 11 19 B2
194 sole 11 10 C1
195 stable 11 13 C1
196 statistic 11 7 C1
197 subsequent 11 10 C1
198 sufficient 11 18 B2
199 symbol 11 10 B2
200 target 11 15 B2
201 technical 11 18 B2
Appendix 3: Full list of collocations in the IELTS Academic reading papers
Word Collocations
• adjective + process (noun): automatic, experimental, complicated,
alternative
• noun + process (noun): participation, evaluation, ageing,
process manufacturing, thinking
• adverb + evident (adjective): particularly, clealry
evident • verb + evident (adjective): become
• major (adjective) + noun: impact, challenge, contributors, impediment,
milestones, asset, strides, drawbacks, breakthroughs, distraction,
worry, outcomes, criticism, argument, components, distinctions,
contributions, difficulties, consequences, risks, threats, policies,
major developments, collection, trend, worry
• respond (verb) + preposition: to
respond • respond (verb) + adverb: appropriately
• significant (adjective) + noun: advances, downsides, role,
contributions, percentage, proportion, insights, barries, steps,
differences, features, aspects, evidence
• adverb + significant (adjective): vitally, statistically, potentially,
significant extremely, sspecially
identify • identify (verb) + noun: factors, features, (a) problem, (an) issue, (a) way
survive No proper collocations found in the corpus
construct No proper collocations found in the corpus
intelligence • adjective + intelligence: artificial, emotional
consume No proper collocations found in the corpus
• adjective + function (noun): basic, essential, main, primary, social,
specific
function verb + function (noun): perform, serve