BlogSupportDownload

Language

All posts

Grammar vs Vocabulary: What Research Says Actually Moves Your English Score

Nation, Laufer, Coxhead, and Krashen all point in the same direction: above B2, vocabulary (not grammar) is the bottleneck. Here's the research, the examiner data, and a 3-level study protocol.


TL;DR: 3 Research Findings Worth Bookmarking

  1. Nation (2001): You need at least 8,000 word families to comprehend authentic English text without constant dictionary interruption. Most test-takers plateau far below this threshold.
  2. Grammar vs vocabulary in communication failure: Research on error analysis consistently shows that grammar mistakes cause readers to note a stylistic imperfection; vocabulary gaps cause readers to stop understanding entirely.
  3. Above B2/C1: Grammar is largely acquired through exposure. At advanced levels, the gap between a Band 6 and Band 7 IELTS score, or a TOEFL score in the mid-80s vs low 100s, is almost always lexical, not syntactic.

The Grammar Trap

Here is a pattern that plays out tens of thousands of times every year.

A student studies English for two, three, sometimes four years. They work through grammar textbooks cover to cover. They can explain the difference between the present perfect and the simple past. They know when to use "which" versus "that" in a relative clause. They have memorized conditional sentence structures. On a grammar quiz, they score well.

Then they sit down to write a TOEFL essay. The rater reads the essay and assigns a score of 4 out of 5.

The feedback is almost always the same: "Your grammar is adequate, but your writing lacks lexical precision. You overuse high-frequency vocabulary. You do not demonstrate awareness of academic register. Your word choices are sometimes imprecise."

In other words: the grammar was fine. The vocabulary held them back.

This article is about why that happens so consistently, and what the research says you should actually be spending your study time on.

A note on intellectual fairness before we begin: grammar is not irrelevant. At beginner and intermediate levels, it is genuinely the bottleneck. And even at advanced levels, sloppy grammar signals carelessness to examiners. But once you reach approximately B2 on the CEFR scale, the research points overwhelmingly in one direction. The remaining variable, the one that separates good from excellent, is vocabulary.


Section 1: What the Research Says

Nation (2001): The Vocabulary Breadth Threshold

Paul Nation's Learning Vocabulary in Another Language (Cambridge University Press, 2001) is the most cited work in vocabulary acquisition research. One of its central findings is what researchers call the vocabulary breadth threshold for independent reading comprehension.

Nation's analysis of text coverage showed that reading authentic English (newspaper articles, academic papers, novels) without constant dictionary interruption requires knowledge of approximately 8,000–9,000 word families. Below this level, unfamiliar words appear frequently enough to disrupt comprehension. At 5,000 word families, readers encounter an unknown word roughly every 20–30 words in authentic text, too frequently for fluent processing.

For comparison: most intermediate English learners who have completed a standard coursebook series possess around 2,000–3,000 word families. Even confident B2-level speakers often plateau at 4,000–5,000.

The gap between where most learners stop and where authentic comprehension begins is almost entirely lexical. Grammar rules, once learned, do not compound this gap. Vocabulary does.

Laufer (1989, 2010): The Lexical Threshold

Batia Laufer's work on vocabulary and reading comprehension identified what she called the lexical threshold: a minimum vocabulary size below which reading comprehension cannot proceed meaningfully, regardless of how strong a reader's grammar knowledge is.

Laufer (1989) first reported that reading comprehension increases more rapidly when learners know at least 95% of the words in a text. Laufer and Ravenhorst-Kalovski (2010) refined this further, identifying two thresholds: 95% text coverage as the minimum for rough comprehension, and 98% coverage as the threshold at which most readers can infer unfamiliar words from context rather than needing a dictionary. Reaching 98% coverage requires knowledge of approximately 8,000 word families.

The implication is direct: if a learner's grammar is at B2 level but their vocabulary is at A2 level, their grammar knowledge provides almost no compensating benefit. They will still fail to understand the text. The inverse is not symmetrical: a learner with strong vocabulary but imperfect grammar will typically succeed at comprehending meaning, even if their production is marked by grammatical errors.

Coxhead (2000): The Academic Word List

Averil Coxhead's A New Academic Word List (TESOL Quarterly, 2000) analyzed a 3.5-million-word corpus of academic English and identified 570 word families that appear consistently across academic disciplines but are not among the 2,000 most common general English words.

The significance of this finding for TOEFL and IELTS test-takers: these 570 word families (words like constitute, significant, contrast, derive, emphasis, indicate, maintain, obtain, occur, perceive) account for approximately 10% of the words in any given academic text.

A learner who knows the 2,000 most common English words plus Coxhead's Academic Word List covers roughly 86% of academic text. Every additional word family from this list increases reading and listening comprehension in measurable, documentable ways.

Grammar improvement at an already-functional level does not have this kind of compounding return.

Krashen (1985): The Input Hypothesis

Stephen Krashen's Input Hypothesis (The Input Hypothesis, Longman, 1985) proposed that language acquisition occurs when learners receive comprehensible input: language that is just slightly above their current level, which he called "i+1."

The key insight for the grammar vs vocabulary debate: comprehensible input is primarily a vocabulary problem, not a grammar problem. When a piece of English becomes incomprehensible to a learner, the cause is almost always an unknown word, not a grammatical structure they have not yet studied.

Krashen's model suggests that grammar is largely acquired implicitly through sufficient exposure to comprehensible input. The learner who builds their vocabulary to the point where they can consume authentic content will acquire the grammar of that content as a byproduct. The reverse (learning grammar explicitly in order to understand authentic content) is far less efficient.

What Error Analysis Studies Show

Research in error analysis, the systematic study of what types of mistakes actually cause communication failure, consistently finds the same pattern.

A grammar error (incorrect tense, missing article, wrong preposition) typically causes a reader to note a surface imperfection. In most cases, the meaning still comes through. "Yesterday I have gone to the library" is grammatically wrong but communicatively transparent.

A vocabulary error (using a word that does not mean what you intend, or not knowing the word that names the concept you are trying to express) frequently causes complete communication breakdown. "The professor made a strong argument" and "The professor made a strong argumentation" are grammatically similar constructions. The vocabulary choice in the second sentence signals a non-native speaker who does not control the register distinction between the two words. More critically, if a learner wants to say "The data corroborates the hypothesis" but does not know the word corroborate, there is no grammar rule that can rescue them.

The asymmetry is fundamental: grammar errors miss style points; vocabulary gaps miss meaning entirely.


Section 2: The Grammar Ceiling Effect

A1–B1: Grammar Is the Real Bottleneck

It is important to be precise about where grammar matters most. At beginner and lower-intermediate levels (roughly A1 through B1 on the CEFR scale), grammar is genuinely the primary constraint on communication.

At this level, learners are still acquiring fundamental sentence structure: subject-verb agreement, basic tense forms, question formation, negation, article use. These are not optional refinements; they are load-bearing elements. Without them, sentences collapse. "She go store yesterday buy bread" is recognizable to a sympathetic listener, but it places the entire communicative burden on the listener's charity.

For A1–B1 learners, grammar study has direct, observable payoff. Each new structure unlocks the ability to produce entire new classes of sentences.

B2–C1: Grammar Is ~80% Acquired

The picture changes significantly at B2 and above.

Research on adult language acquisition suggests that by the time a learner reaches B2, they have acquired the core grammatical structures of English through the cumulative effect of their exposure: reading, listening, writing, classroom instruction. They still make errors (article use remains non-intuitive for speakers of article-free L1s like Chinese, Japanese, and Korean; some preposition patterns take years to fully consolidate), but these errors are at the margins, not the core.

More importantly: at B2+, additional grammar study produces diminishing returns. A student who already knows present perfect, passive voice, conditionals, and relative clauses will not materially improve their reading comprehension or listening ability by adding another grammar chapter. The structures they need to understand authentic text are already in place.

Vocabulary, by contrast, does not show this ceiling effect. The gap between 4,000 and 8,000 word families is enormous. So is the gap between knowing a word's dictionary definition and knowing its collocations, its register, its typical academic usage contexts, its derivations.

The Illusion of Productive Grammar Study

One reason learners continue to study grammar long past the point of diminishing returns is that grammar study feels productive.

Grammar has rules. Rules have right and wrong answers. You study a rule, do exercises, get feedback, see measurable improvement. The feedback loop is satisfying and immediate.

Vocabulary learning does not feel like this. Words do not have simple right/wrong answers; they have shades of meaning, collocational constraints, register, connotation. The learning is gradual and the progress is harder to perceive from session to session.

This difference in cognitive feel creates a systematic bias: learners over-invest in grammar because it gives clear feedback, and under-invest in vocabulary because the payoff is diffuse and slow.

The research does not share this confusion. The payoff from vocabulary is not just equal to grammar at advanced levels; it is substantially larger.

The Compound Interest Metaphor

Every new word family a learner acquires is not just one piece of information. It is a combinatorial multiplier.

Consider what happens when a learner acquires the word mitigate. They do not just learn a single word. They learn:

  • A synonym network (alleviate, reduce, lessen, diminish)
  • A collocation pattern (mitigate risk, mitigate the effects of, mitigate damage)
  • A register signal (formal, academic, professional writing)
  • A derivational family (mitigation, mitigating, unmitigated)
  • A slot in thousands of possible sentences they can now produce and understand

One word family, acquired deeply, opens access to dozens of new sentence patterns. Grammar rules do not compound this way. The ability to form a passive voice sentence does not unlock new vocabulary; each new word family expands the semantic and collocational network in all directions.


Section 3: How Examiners Actually Score

TOEFL Writing: Vocabulary Range Matters

TOEFL Writing tasks are scored on a holistic 0–5 rubric. While vocabulary is not scored as a separate band (unlike IELTS), the rubric at higher score levels explicitly rewards "a range of vocabulary" and penalizes "limited vocabulary." At score level 4, the rubric notes "some variety in syntactic structures and a range of vocabulary," while a score of 5 requires "almost no lexical or grammatical errors" and demonstrates effective use of language. In practice, essays that rely on high-frequency words like good, bad, important, show, get consistently score lower than essays that deploy precise, academic-register vocabulary, even when grammar is comparable.

The jump from a TOEFL Writing score of 4 to 5 is, in the majority of cases, a vocabulary upgrade, not a grammar correction.

IELTS Writing: Bands 6 to 7 Require Vocabulary Range

The IELTS Writing Lexical Resource descriptors are explicit about what separates a Band 6 from a Band 7 response.

Band 6: "Uses an adequate range of vocabulary for the task. Attempts to use less common vocabulary but with some inaccuracy."

Band 7: "Uses a sufficient range of vocabulary to allow some flexibility and precision. Uses less common lexical items with some awareness of style and collocation."

The operative distinction is not "more correct grammar." It is less common vocabulary and collocation awareness. An examiner reading a Band 6 and a Band 7 response is primarily noting that the Band 7 writer chose fluctuated dramatically where the Band 6 writer wrote changed a lot, or attribute the decline to where the Band 6 writer wrote the reason for the decrease is.

GRE Verbal: Vocabulary Is Literally Half the Score

The GRE Verbal section is composed of three question types: Reading Comprehension, Text Completion, and Sentence Equivalence. Text Completion and Sentence Equivalence together constitute approximately 50% of the Verbal section score, and both are pure vocabulary tests.

Text Completion questions present a passage with blanks; the test-taker must select the word or phrase that best fits each blank from a set of options. Sentence Equivalence questions require identifying two answer choices that complete a sentence with equivalent meanings. Neither question type can be answered through grammar knowledge. Both require an extensive, precise understanding of vocabulary at the C1–C2 level.

The target vocabulary for GRE (words like laconic, sanguine, perfidious, recondite, ephemeral, enervate) does not appear in any grammar textbook.

GMAT: Academic Vocabulary Drives Reading Comprehension Performance

GMAT Reading Comprehension passages are drawn from business, economics, science, and social sciences. The passages use dense academic vocabulary. A test-taker who encounters an unfamiliar word in a GMAT passage cannot ask the meaning; they must infer it from context or work around it. Research on GMAT Reading Comprehension consistently shows that vocabulary knowledge, specifically academic register vocabulary, is among the strongest predictors of passage comprehension.

Grammar knowledge, again, does not provide compensating benefit. Understanding the syntactic structure of a sentence containing predicated on, efficacious, or circumscribed by does not help a test-taker who does not know what those words mean.

Score Impact Comparison Table

Exam Grammar Error (minor) Vocabulary Gap (moderate)
TOEFL Writing Minor deduction in holistic score if fluency maintained Limits score ceiling; essays with limited vocabulary rarely score 5
TOEFL Speaking Noted but rarely penalized if fluency maintained Directly limits delivery score; pauses/repairs increase
IELTS Writing Deduction in Grammatical Range (~-0.5 band) Deduction in Lexical Resource (~-1 band); affects Task Achievement
IELTS Speaking Minor deduction if not systematic Immediately reduces Lexical Resource; affects Fluency band
GRE Verbal Not assessed Direct question failure in TC/SE; inference failure in RC
GMAT Verbal Not directly assessed Comprehension failure; wrong answers on RC and CR

Section 4: The Right Balance, A 3-Level Protocol

The research does not say "ignore grammar." It says: calibrate your investment based on your current level. Here is a practical protocol.

Level-by-Level Recommendation

CEFR Level Grammar Focus Vocabulary Focus Rationale
A1–B1 60% 40% Core syntax still being acquired; grammar errors block communication
B1–B2 40% 60% Core grammar mostly in place; vocabulary becomes the differentiator
B2–C2 20% 80% Grammar largely implicit; 8,000+ word family target dominates progress

Daily Minutes by Level

Level Total Daily Study Grammar (min) Vocabulary (min) Notes
A1–B1 60 min 36 min 24 min Grammar: sentence structures, tenses, basic syntax
B1–B2 60 min 24 min 36 min Grammar: article use, prepositions, complex sentences
B2–C2 60 min 12 min 48 min Grammar: review only; vocabulary: academic register, collocations

What Grammar Review Looks Like at B2+

At B2 and above, grammar study should not mean working through another grammar textbook. It should mean:

  • Targeted error correction: identify your personal recurring errors (articles? prepositions? subject-verb agreement with complex subjects?) and address those specifically.
  • Grammar through reading: encountering structures in context, noting patterns, not drilling isolated exercises.
  • Collocation as grammar: many "grammar errors" at the advanced level are actually vocabulary errors, such as using the wrong preposition with a verb (depend on not depend about) or the wrong article with a noun phrase. These are solved by vocabulary work, not grammar rules.

Section 5: Why Vocabulary Learning Requires a System

Grammar Rules vs Vocabulary Acquisition

Grammar can be partially learned through rules. "Use the present perfect for actions connected to the present moment" is a rule you can memorize and apply. It is imperfect (natural language use is always more nuanced), but it gets you substantially correct most of the time.

Vocabulary cannot be learned this way. There is no rule that tells you that alleviate collocates with pain, suffering, and burden but not typically with problem in formal academic writing (where address or mitigate is preferred). There is no rule that tells you that ephemeral carries a connotation of beauty in its transience that temporary does not. These things must be encountered, in context, multiple times.

Research on vocabulary acquisition suggests that 8–12 or more meaningful encounters with a word are required before it moves from recognition into active control, the ability to use it accurately and spontaneously in production. This is not a problem you can solve with a list. It requires a system.

What a Vocabulary Learning System Requires

An effective vocabulary learning system, based on the research, needs three components:

1. Spaced repetition — reviewing words at intervals calibrated to your memory curve, so review time is concentrated on words you are about to forget rather than words you already know.

2. Contextual sentences — encountering words in meaningful sentences, not just definitions. Context provides the collocation data, register data, and semantic range data that a dictionary entry cannot.

3. Active production — moving beyond passive recognition to active use: generating sentences, completing cloze exercises, using words in writing and speaking tasks.

How Rhythm Word Operationalizes This

Rhythm Word is built around exactly this framework. Its core learning features address each dimension of deep word knowledge:

  • Real-time sentence generation: every session delivers fresh, level-appropriate sentences, not static dictionary examples
  • FSRS spaced repetition: review scheduling calibrated to your personal memory curve
  • Custom scenarios (Business, Travel, Campus, Custom): vocabulary contextualized in the register you need
  • Voice playback: phonological form training alongside meaning
  • Card interaction: target words appear bold (remembered); tap to mark orange (fuzzy recall) or red (forgotten), giving you honest self-assessment without artificial button choices

The personalized sentences adapt to your current level. The same word is contextualized differently depending on where you are in your learning journey.

Example: "ephemeral" at Two Levels

A learner at B2 level sees:

"The popularity of some social media trends is ephemeral; they dominate feeds for a week and then vanish."

A learner at C1 level sees:

"The artist was preoccupied with the ephemeral quality of human connection, a theme that recurs throughout the exhibition in the use of dissolving light and incomplete forms."

Same word. Different syntactic environment, different collocational company, different register. Both encounters contribute to deep acquisition in ways that a definition card ("ephemeral: adj. lasting for a very short time") cannot.

This is what Nation means by meaningful encounters. And it is why vocabulary learning requires an adaptive system, not just a list.


5 Frequently Asked Questions

Q: Is vocabulary or grammar more important for TOEFL?

At any level above B1, vocabulary is the more important variable for TOEFL performance. The TOEFL Writing rubric explicitly rewards vocabulary range and precision at higher score levels, and penalizes reliance on basic, repetitive word choices. Grammar is also assessed, but research on TOEFL score improvement consistently finds that lexical development produces larger score gains at the B2+ level. If your goal is to move from a TOEFL score in the mid-80s into the 100+ range, vocabulary development should be the primary focus of your study plan.

Q: Can you speak fluent English with bad grammar?

Yes, and this is not a controversial position among linguists. Fluency refers primarily to the ability to communicate meaning smoothly and spontaneously. Many highly fluent speakers of English as a second language have persistent grammar errors (non-native-like article use is extremely common among Chinese, Japanese, and Korean L1 speakers, for example) but communicate with complete effectiveness in academic, professional, and social contexts. What prevents fluency is almost always vocabulary limitations: not knowing a word, pausing to search for it, using a vague approximate term. Grammar errors rarely stop communication; vocabulary gaps frequently do.

Q: How many words do I need to know to be fluent in English?

Research gives a fairly precise answer: 8,000–9,000 word families for reading authentic English independently; approximately 5,000 word families for 95% text coverage in general texts; roughly 3,000 word families plus the Academic Word List for solid academic reading comprehension. For natural conversation, the threshold is somewhat lower, around 3,000–4,000 word families covers the vast majority of spoken English. "Fluency" is not a single target; it depends on the context. But if you are targeting TOEFL, IELTS, or GRE, aiming for 6,000–8,000 word families is a research-grounded goal.

Q: Why do I know all the grammar rules but still make mistakes?

Because knowing a rule and automatizing a rule are different cognitive processes. You can recite the present perfect rule on a test and still default to the simple past in spontaneous speech because the simple past is more deeply automatized. This is not a grammar knowledge problem; it is a grammar automatization problem, which is solved by massive comprehensible input and output practice, not by studying more grammar rules. Additionally, many persistent "grammar mistakes" are actually vocabulary or collocation errors: using the wrong preposition with a verb, or the wrong article with a noun phrase, are often learned as part of the word's collocational profile, not as grammar rules.

Q: How do I study vocabulary systematically?

The research points to four principles: (1) prioritize high-frequency words first (the Academic Word List's 570 families give you 10% of academic text); (2) use spaced repetition to distribute your review time according to your forgetting curve; (3) encounter words in context, not just as definition lists (sentence-level encounters provide collocational and register information); (4) aim for active production, not just recognition (you need to use words in sentences to move them from passive to active knowledge). Apps like Rhythm Word implement all four principles through personalized contextual sentences and FSRS-based spaced repetition.


Conclusion: The Productive Reframe

The grammar vs vocabulary debate is not about declaring one side the winner. It is about calibrating your attention correctly at each stage of your learning.

Grammar is the scaffold. You need it to be functional before you can build anything on top of it. At A1–B1, it deserves the majority of your study time.

But vocabulary is the material. It is the stuff your sentences are made of. Above B2, the learner who has 7,000 word families and solid but imperfect grammar will consistently outperform the learner who has 3,000 word families and perfect grammar, on TOEFL, IELTS, GRE, in the classroom, and in every professional and academic context where English matters.

Nation, Laufer, Coxhead, and Krashen agree on this. TOEFL and IELTS examiners score it this way. The test data supports it.

The question is whether your study plan reflects it.

If you are at B2 or above and still spending the majority of your study time on grammar exercises, you are working on the wrong constraint. The path forward is vocabulary, contextualized, spaced, and systematically acquired.

Download Rhythm Word (free to download) and start building toward the 8,000-word-family threshold with personalized sentences adapted to your level.


Continue Reading


References

  • Nation, P. (2001). Learning Vocabulary in Another Language. Cambridge University Press.
  • Laufer, B. (1989). What percentage of text-lexis is essential for comprehension? In C. Lauren & M. Nordman (Eds.), Special Language: From Humans Thinking to Thinking Machines (pp. 316–323). Multilingual Matters.
  • Laufer, B. (1998). The development of passive and active vocabulary in a second language: Same or different? Applied Linguistics, 19(2), 255–271.
  • Laufer, B., & Ravenhorst-Kalovski, G. C. (2010). Lexical threshold revisited: Lexical text coverage, learners' vocabulary size and reading comprehension. Reading in a Foreign Language, 22(1), 15–30.
  • Coxhead, A. (2000). A new academic word list. TESOL Quarterly, 34(2), 213–238.
  • Krashen, S. (1985). The Input Hypothesis: Issues and Implications. Longman.
  • Laufer, B., & Nation, P. (1999). A vocabulary-size test of controlled productive ability. Language Testing, 16(1), 33–51.
  • Swain, M. (1985). Communicative competence: Some roles of comprehensible input and comprehensible output in its development. In S. Gass & C. Madden (Eds.), Input in Second Language Acquisition (pp. 235–253). Newbury House.
vocabularygrammarTOEFLIELTSGREresearchlanguage-learning

Rhythm Word is available on iOS. If the way we think about vocabulary learning resonates with you, we would love for you to try it.

Download on the App Store

Related Articles

Grammar vs Vocabulary: What Research Says Actually Moves Your English Score | Rhythm Word