Kizspy | Question: 37
(Choose 1 answer)
Zipf's law tells us:
A. head words take major portion in English vocabulary;
B. in a given corpus, if the most frequent word's frequency is 1, then the second frequent word's frequency is around 0.5;
C. comparing to tail words, removing head words helps more to reduce the storage of documents represented by a vector space model when using a dense matrix data structure;
D. smoothing is not necessary.