The distributional hypothesis in linguistics is the idea that words that appear in similar contexts tend to have similar meanings.
In essence:
“You shall know a word by the company it keeps.”
— J.R. Firth (a key figure in this idea)
What does it mean practically?
If two words often show up in similar places within sentences — like in the same syntactic structures or next to the same kinds of words — they probably have related meanings. For example:
- "cat" and "dog" often appear in similar contexts:
- "The ___ chased the ball."
- "I took my ___ to the vet."
This tells us they are semantically similar (both are pets, animals, etc.).
Why is it important?
It laid the foundation for computational linguistics and natural language processing (NLP) — especially for m…