Index Compression: Collection Statistics

Zipf consequences
▪ If the most frequent term (the) occurs cf_1 times,
▪ then the second most frequent term (of) occurs cf_1/2 times,
▪ and the third most frequent term (and) occurs cf_1/3 times, and so on.
▪ Equivalently: cf_i = K/i, where K is a normalizing factor, so
▪ log cf_i = log K - log i
▪ This is a linear relationship between log cf_i and log i.
▪ Another power law relationship.
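The linear relationship between log cf_i and log i can be checked numerically. The following is a minimal sketch, not part of the original slide: the function names `zipf_frequencies` and `loglog_slope` and the value cf_1 = 1,000,000 are illustrative assumptions. It generates ideal Zipfian frequencies cf_i = K/i (with K = cf_1, which follows from setting i = 1) and fits a least-squares line to the log-log points.

```python
import math


def zipf_frequencies(cf1, n):
    """Ideal Zipfian collection frequencies cf_i = K / i for ranks 1..n.

    K equals cf_1 here, because cf_1 = K / 1.
    """
    return [cf1 / i for i in range(1, n + 1)]


def loglog_slope(freqs):
    """Least-squares slope of log cf_i plotted against log i."""
    xs = [math.log(i) for i in range(1, len(freqs) + 1)]
    ys = [math.log(f) for f in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = sum((x - mean_x) ** 2 for x in xs)
    return num / den


# Hypothetical collection: the top term occurs one million times.
freqs = zipf_frequencies(1_000_000, 1000)
slope = loglog_slope(freqs)
print(f"cf_2 = {freqs[1]:.0f}, cf_3 = {freqs[2]:.0f}, slope = {slope:.3f}")
```

For ideal Zipfian data the fitted slope is -1, matching log cf_i = log K - log i. Frequencies measured on a real collection would only approximate this line.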