Most of the
information at this website deals with data from the COCA
corpus. You might also be interested in the
collocates data from the 14
billion word
iWeb
corpus. |
Collocates are words that occur near a given word (the node word),
and they can provide very useful insight into the
meaning and usage of the words near which they occur.This site contains the
largest and most accurate lists of collocates of
English -- about 13.5 million node/collocate pairs. Feel free to take a look at
the some samples (expanded:
1.34 million entries).
Remember that any list of collocates is
only as good as the corpus (collection of texts) that it is based
on. The 13.5 million node/collocate pairs are based on the only large, genre-balanced,
up-to-date corpus of English -- the one billion word
Corpus of
Contemporary American English (COCA).
Sample (see more)
nodeID |
node |
nodePoS |
collocate |
collPoS |
freq |
MutInfo |
% preNode |
17491 | smolder | v | still |
r | 150 | 3.95 |
0.93 |
17491 | smolder | v | fire |
n | 97 | 5.78 |
0.62 |
17491 | smolder | v | eye |
n | 62 | 3.89 |
0.58 |
17491 | smolder | v | ash |
n | 35 | 7.88 |
0.23 |
17491 | smolder | v | cigarette |
n | 29 | 6.34 |
0.69 |
17491 | smolder | v | ember |
n | 28 | 10.67 |
0.29 |
17491 | smolder | v | burn |
v | 27 | 4.98 |
0.56 |
17491 | smolder | v | ruin |
n | 24 | 7.76 |
0.33 |
|