11-point average, 99, 131
Abiteboul, S., 29
abstracting, automatic, 14
accumulator, 98, 100, 101
accuracy classification. See classification, accuracy
ACM, 218
active verb. See verb, active / stative
ad hoc retrieval. See retrieval, ad hoc
adaptive, 1, 113, 252, 280, 283, 284
adjacency matrix. See matrix, adjacency
adjective
    descriptive, 216
    relational / reference-modifying, 216
agents, 111, 280, 296, 316
agglomerative cluster. See cluster, agglomerative
aging citation. See citation, aging
Agosti, M., 200
alienation, point, 134, 142, 146
Allan, J., 114, 266
Allen, B., 119
Allen, L., 22
alphabet, marked, 16
analysis, level of, 40, 61, 101, 203, 210
ancestor, 278, 284
Anderson, J., 160
annotation, 185, 196, 200, 245, 246, 311
antonymy, 215
argument, 18, 184, 185, 187, 189, 194, 205, 234, 244, 299, 305, 310
    rhetorical, 200, 306
    structure, 193, 237, 244, 305, 306
Armstrong, R., 280
artificial intelligence (AI), 182, 221
    Good old-fashioned (GOFAI), 252
    thesis (AIT) corpus, 76
Ashley, K. D., 242
association
    measure. See measure, association
    associative memory, 160
    associative relation, 215, 225, 233
attentional focus, 24, 303, 313
attribute, 19, 30, 106, 242, 259, 297
    structured, 19, 30, 49, 183, 288
audience, 4, 12, 13, 193, 279, 296, 304, 305, 308, 312
author, 4, 12, 17, 21, 27, 50, 60, 64, 67, 74, 82, 101, 184, 187, 189, 195, 199, 205, 207, 220, 279, 296–8, 300, 303, 304, 306, 308, 312
authority, 32, 196, 240, 312
    citation. See citation, authority
    list. See list, authority

Baddeley, A., 160
bag-of-words, 213, 305
Baker, L. D., 266
Bakhmutova, I.V., 306
Balabanovic, M., 280
Bar-Hillel, Y., 189
Barry, C. L., 107
Bartell, B. T., 135, 157, 163, 165, 275
base pairs (genetic, nucleic acid), 68, 69
batch, 264
Bates, M. J., 110, 111
Bayerman, C., 22
Bayes
    naive, 270
Bein, J., 226
Belkin, N. J., 9, 105, 277
Bernoulli, 74, 270–2
Berry, M., 159
berry picking, 110
bibliometrics. See citation
bigrams, 28
binary, 14, 170, 172, 255
binary independence model (BIM), 170
Bing, J., 244
Blair, D. C., 33, 65, 108, 121, 301
Bonhoeffer, S., 69
Bono, P., 286
bookmark list, 281, 306
Bookstein, A., 74, 140
boosting, 274
Borg, I., 161
Borko, H., 160
bottleneck, knowledge acquisition, 247
Bowman, C., 280
Brachman, R., 219, 239
Brauen, T., 113, 232, 264
Brenner, S. W., 243
broader term. See term, broad
Brower, D., 58
Bruza, P. D., 200
Buckley, C., 92, 98, 113
Bush, V., 18, 200

Callan, J., 177, 275
call numbers, 86
captions, 26, 201
Carlin, B., 267
case, 44
    folded, 44, 239, 294
categories, 238, 254, 258, 286
Cawsey, A., 110
centroid, 88, 111
Chakrabarti, S., 196, 280
Chomsky, N., 213
chorus effect, 276
Church, K. W., 213
citation, 18, 20, 57, 118, 153, 186, 187, 189, 198, 210, 225, 247, 316
    aging, 190
    authority, 187, 196
    co-, 189
    graph, 187, 192, 259, 280
    hub, 196
    lag, 190
    legal, 191, 192, 195, 243
    resolution, 186
classic papers, 187
classification, 158, 267
    accuracy, 133
    algorithms, 263
    assignment, 255
    automatic, 254
    automatic document, 254, 255, 267
    collaborative, 219, 286
    hierarchic, 278
    keyword, 133, 259, 261
    methods, 256, 262
Cleverdon, C., 119, 135
clique, 189
closed vocabulary. See vocabulary, open / closed
cluster, 165
    agglomerative / divisive, 166
    document, 111, 165, 166, 211, 280
    hierarchic, 211
    hypothesis, 165
    keyword, 211
codons (genetic), 68
Cohen, M. L., 244, 288
Cohen, P., 226
Cohen, W. W., 273
Cole, J., 226
collaborative classification. See classification, collaborative
collaborative filter. See filter, collaborative
collateral knowledge, 30
collection, 4, 9, 21, 29, 52, 80, 120, 135, 255
    fusion, 276
colleges, invisible, 189
Collins, A., 225
combination function, 195
community, 182, 254, 187, 196, 219
component, connected, 284
composite measure. See measure, composite
computational learning theory (COLT). See learning
concept drift, 266
conceptual match. See match, conceptual
Conklin, J., 200
connectionist. See network, neural
consensual, 106, 118, 141
containment, 201, 204
content, 12, 26, 50, 154, 297, 301, 306, 312
    analysis, 201
    information, 221, 312
context, 13, 17, 22, 32, 81, 82, 182, 184, 214, 235, 279, 280, 284, 304, 305
context sensitive (grammar), 45
contingency table, 122, 133
conversational threads, 7, 185
convex hull, 240
co-occurrence
    keyword, 77, 211
cookies, 294, 306
Cooper, W., 137, 168, 174
Cooperative principle, 303
coordination level, 94, 124
corpus, 4, 6, 39, 41, 50, 81, 94, 117, 170, 182, 187, 223
    Cornell, 135
    test, 135
corpus-based linguistics. See linguistics, corpus-based
correlation measure. See measure, correlation
cosine, 95, 203
cost, 66, 70, 75, 140, 175, 247, 296, 312
    function, 283
Cost, S., 273
Cranfield collection, 119, 135
Craven, M., 184, 286
crawling. See World Wide Web, crawling
Crestani, F., 262
Croft, W., 9, 12, 44, 74, 85, 177, 273, 277
cross-validation, 269
Cruse, D., 213
curse of dimensionality. See dimensionality, curse of
Cutting, D. R., 101, 167

Dagan, I., 258
Daniels, P., 110
data model, 30
    logical / physical, 30
data retrieval, 32, 245
data structure, 52, 54, 97, 98
decision rule, 101
Deerwester, S., 157, 159
Dennis, S. F., 83
deontic logic. See logic, deontic
dependence
    term, 171, 178
derivation, 44, 64
describes, 12, 26, 75, 245, 298, 308
Dewey, J., 63
Dices coefficient, 95
dimensionality, 86, 94, 160, 238
    curse of, 154
    reduction, 154, 156, 157, 159
      aggressive, 261, 262
direct manipulation (interface), 240
directory, 40, 41, 42, 50
    WWW, 219
discourse, domain of, xxii, 11, 12, 13, 26, 157, 185
discriminant, 75, 82, 262
    linear / nonlinear, 262
discrimination
    power, 82
    threshold, 173
    value, 99
disjunctive normal form, 273
dissimilarity, 192
distribution
    joint probability, 176
    Poisson, 74
    prior, 266, 268
    stationary, 196
    Zipfian, 62, 67, 81, 294
divisive cluster. See cluster, agglomerative / divisive
document, 3, 6, 16, 17, 22, 32, 200, 210, 254
    frequency weighting, 83
    inverse frequency. See inverse document
      frequency weight
    length. See length, document
    modification, 115, 255, 263
    spoken, 21, 24
    typical, 22, 40, 201
domain, 118, 160, 182
Domingos, P., 270
Doszkocs, T. E., 226
Doyle, L. B., 225
drift, 159, 266, 288
Drosnin, M., 62
Duda, R., 262, 267
Dumais, S., 157, 158, 159, 160, 237

E-measure. See measure, E-
editor
    editorial enhancement, 254, 312
    workbench, 254
effectiveness, 131, 132, 296
Egan, D. E., 200
Eichmann, D., 296
eigenfactor analysis, 156
Einstein, A., 292
electronic artifacts, 4
elite, 74
entropy, 69, 260
    conditional, 260
    post-y, 260, 261
error measure. See measure, error
evaluation, 33, 36, 55, 62, 118, 135, 141
event space, 168, 269
exhaustive index. See index, exhaustive
expected mutual information measure. See measure, expected mutual information
exploitation / exploration, 266
exposition, spiral, 270

F-measure. See measure, F-
fact extraction, 183, 286
fallout, 123, 139
features, 2, 9, 12, 27, 40, 50, 51, 94, 105, 168, 174, 207, 255, 259, 260, 289
feedback, 65, 252, 256, 284
    relevant. See relevance, feedback
Fellbaum, C., 214, 215, 216
Fidel, R., 183
field, 12, 21, 140, 187, 191, 312
    textual, 40, 49, 253
    See also discourse, domain of
file
    inverted, 53, 54, 97
    stream, 42
filter, 41, 98, 114, 137, 264, 266
    collaborative, 286
finding out about (FOA), 1, 20, 107
Findler, N. V., 225
finite state machine, 42, 43
Foote, J. T., 306
Fox, C., 43, 45, 47, 58, 183
Fox, E. A., 293
Frakes, W. B., xxiv
Francis, W., 61
frequentist, 271
Friedman, S., 232
Froehlich, T.J., 118
Fuhr, N., 173, 177
Fujii, H., 44
function word. See negative dictionary
function, weighting, 91
Furnas, G., 28
fusion collection. See collection, fusion

Gallant, S. I., 262
Garfield, E., 189
generalization, inductive. See learning
genre, 21, 243
geographical information systems (GIS), 240
Ghias, A., 306
Giuliano, V. E., 225
Glasgow, J., 247
global positioning system (GPS), 240
Goodrich, P., 22, 243
Gordon, M. D., 237, 286
gradient descent (search), 113, 264
Grice, H. P., 303
Grice’s maxims, 303
Griffiths, A., 166
Guntzer, U. G., 211
Guttman, L., 134

Hafner, C., 242, 244
Hanson, R., 183
Harman, D., 96, 120, 121, 135, 238
Harnad, S., 248
Harper, J. D., 101
Harvard, 22, 195
Hawkes, T., 301
header, 30, 41, 50
Hearst, M., 201, 203, 238
Hebb, D., 160
Hersh, W., 136
Hill, B. M., 149
Hinton, G., 227
hitlist, 35, 49, 101, 124, 137, 173, 175, 240, 314, 315
    rank, 124, 126, 133, 135, 174
Hoffman, T., 158
hold out set, 269
homologous, 245
Howe, A., 286
hub citation. See citation, hub
Huberman, B. A., 69, 70, 294
hull, convex, 240
Hull, D. L., 122, 213, 224
Hutchins, W., 4
hyper-footnotes, 201
hypernymy, 11, 215
hypertext, 18, 189, 199, 200, 207, 247, 280, 284, 306, 309
hyphenation, 27, 43

immediacy effect, 190
impact, 188, 196, 218, 223, 243
in-degree, 188, 196
independence
    conditional, 178
    data, 245
    order, 168, 270
    stochastic, 173
index, 12, 20, 21, 26
    exhaustive, 11, 78, 79
    inverted, 53, 97
    latent semantic. See latent semantic indexing
    term weighting, 280
induction. See learning
    manual / automatic, 26
inflectional morphology, 44
information
    measure. See measure, information
    mutual, 221, 260
    need, 5, 7, 13, 23, 29, 78, 84, 107, 111, 176, 256, 296, 308
    publication, 19, 183, 259
information retrieval (IR), 8, 118, 213, 226, 247
    history of, 182
inheritance (attribute / value), 215
inner product, 87, 88, 89, 156, 165
inter-subject reliability. See reliability, inter-subject
intermediary, search, 14, 293
interpolation, 130
inverse document frequency (IDF) weight, 84, 89, 96
inverted index. See index, inverted
invisible colleges. See colleges, invisible
iterative longest match. See match, iterative longest

Jain, A., 166
James, W., 160
Jardine, N., 132
jargon. See term
Joachims, T., 275
joint probability distribution. See distribution, joint probability
Jones, W. P., 160
Joyce, T., 211

Karlgren, J., 117
Katzer, J., 183
Kearns, M. J., 269
Keen, E., 122
Keller, E. F., 310
Kent, A., 124
kernel, 275
Kessler, M. M., 198
keyword, 10
    classification. See classification, keyword
    frequency, 54, 211, 261
    internal / external, 71
Kleinberg, J., 196
Klinkenberg, R., 266
knowledge
    acquisition bottleneck, 247
    collateral, 30
    engineer, 219, 247, 252
    mutual, 235, 304
    network, 1, 234, 309
    public, 236, 303
    representation, 182, 183, 219, 247, 289, 309
    structure, 219
Knuth, D. E., 62
Kochen, M., 183
Koenemann, J., 110
Koll, M., 160
Korfhage, R., xxiv, 238
Krovetz, R., 47, 213
Kruskal, J. B., 109
Kuhn, T., 220
Kwok, K., 262

Lancaster, F., 120
Landauer, T., 158, 160
Langdell, C. C., 244
language
    game. See word, game
    index, 48, 221
    natural, 10, 16, 26, 30, 36, 47, 62, 183, 194, 224, 248, 299, 303, 310
    oral / written, 16, 302, 303
    query. See query, language
Larkey, L., 273, 278, 316
Larkey, L. S., 277
latent semantic indexing (LSI), 154, 159
Latour, B., 224, 309, 310
law. See legal
Lawrence, S., 101, 295, 296
learning, 224, 244, 262, 269
    active, 316
    distance, 315
    error correction, 113, 265
    mixed initiative, 316
    rate, 113, 264
    reinforcement, 256, 266
    supervised, 254, 256, 262
    theory, 269
least mean squared (LMS), 265
Lee, J., 277
legal
    brief, 21, 185, 194, 244
    citation. See citation, legal
    domain, 33, 184, 186, 191, 194, 195, 205, 220, 242, 244, 288, 305
    See also litigation support
length
    document, 17, 40, 53, 74, 89, 91, 97, 99, 101, 271
    normalization pivot, 91
    normalization slope, 91
Letsche, T. A., 159
level of treatment, 18, 40, 204
Levi, J. N., 22, 243
Lewis, D. D., 133, 213, 262, 267, 272, 275
lexical analyzer generator, 42
lexicographic trees, 149
lexicon, 12, 213
Li, W., 149, 151
library, 3, 20, 86, 110, 218, 293, 302
    digital, 292
Lieberman, H., 280
linear media, 200, 306
linguistics, 4, 8, 25, 28, 36, 44, 47, 102, 213, 301
    corpus-based, 183, 213
list
    authority, 240
    hit. See hitlist
    stop (word). See negative dictionary
literacy, 302
litigation support, 121, 244
Littlestone, N., 259, 274
Littman, M.L., 159
logarithmic, 83, 174
logic, 30, 176, 205, 209, 215
    deontic, 244
    relevance, 225
loss function, 265
Lovins, J.B., 46
Lowe, D., 244
Luenberger, D.G., 155
Luhn, H. P., 47, 76, 82

m-estimate, 272
magic bullet query. See query, magic bullet
magnesium, 235
Mandelbrot, B., 64, 66, 67, 152
Mantegna, R.N., 68, 69
Marchianoni, G., 238
Mark of Zero, 272
marked linguistic feature, 44
Markov process, 195
Maron, M., 168, 225
match
    conceptual, 178
    function, 124, 132, 169, 266
    iterative longest, 45
matrix
    adjacency, 195, 197
    similarity, 156
May, R. M., 189
McCallum, A., 267, 268, 270, 272, 278
McCarty, L., 244
McCorduck, P., 223
McCune, B., 183
McFadden, F., 29
McMath, C. F., 211
measure
    association, 94
    composite, 132
    correlation, 188, 247
    E-, 132, 265
    effectiveness, 131
    error, 265
    expected mutual information, 196
    F-, 132, 133
    information, 66, 83
MEDLARS, 119
MEDLINE, 235, 246, 254
memes, 23
Menaud, L., 60
Menczer, F., 281
meronymy, 215
Merryman, J. H., 243
Merton, R., 189
meta-data, 19, 183, 259, 308
metatags, 297
metric, 94, 108, 134, 155
Miller, G., 64, 68, 149, 153
Mitchell, T., 263, 272, 278
mixtures, 75, 268
monotonic, 162, 174
morphological, 27, 46, 60, 297
    transformation, 44, 61, 239
Moukas, A., 280
Moulinier, I., 258
Mozer, M. C., 226
multi-criterial optimization, 132
multi-dimensional scaling (MDS), 161
mutual information. See information, mutual

name-tagging, 239, 259
narrative, 306
nearest neighbor, 158, 167
negative dictionary, 47, 71. See also noise words
Nelson, T. H., 200
neologisms, 67
Nerhot, P., 22, 244
network
    knowledge. See knowledge, network
    neural, 113, 173, 226, 264
      activity, 225
      pre- / post-synaptic, 230
    semantic, 225
Newell, A., 306
Newton, I., 187
noise, 83
    words, 73
nonlinear, 306
nonmetric, 108, 134, 161
norm of scholarship, 189
normalization, 94, 138, 178
    normalized association measure. See measure, association
of text, 89
Norman, D., 238

Occam’s razor, 263
O’Day, V. L., 110
Oddy, R., 105, 110
Ogden, P., 243
Ong, W. J., 16, 303
on point (legal case). See citation, legal
online, 173, 264
open source, 312, 318
orality, 2, 16, 302, 303
ordering
    simple, 125
    total, 125
    weak, 125
orthogonal, 154, 156
orthonormal, 156
over-fit, 263

Paepcke, A., 293
Pao, M. L., 185
Papadimitriou, C. H., 158
paper, classic, 187
Papoulis, A., 260, 261
parameter
    control, 75
    estimation of, 267, 268
partitional, 166
passage. See document
Pearl, J., 176, 178
peer review, 190, 191, 297, 308, 312
Persin, M., 99
phrase, 10, 27, 43, 72, 213, 221, 236, 253, 259
Pirolli, P., 111
pivot, normalization length. See length, normalization pivot
place, physical / political, 239
plural, 27, 44, 46
pointer, 32, 42, 186
Poisson
    distribution. See distribution, Poisson
    process, 64, 73, 75
    two- model. See two-Poisson model
polarity, 194
Pollack, S. M., 134
pooling, 120, 142
portal, 295, 307
Porter, M., 46, 58
post-verbal, 245
posting, 51, 52, 97
pragmatics, 213, 304
precision, 35, 123, 130, 140, 195
precision recall. See recall, precision curve
Preece, S., 226
preference, 27, 109, 262, 316
prerequisite, 18, 205
Price, D. J., 189, 190
Price’s index, 190
Principle of Least Effort, 64
prior distribution. See distribution, prior
priors, 271
privacy, 300
probability
    of error, 168
    marginal, 170
    posterior, 267, 268
    prior, 174, 176, 268, 271
    Ranking Principle (PRP), 125, 168, 177, 255
    See also relevance
proper names, 44, 152, 195, 239, 259
proximity, 55, 161
    (query) operator, 48
proxy, 20, 49, 297
publication information, 19, 183, 259
Pugh, W., 101

query, 6
    drawing, 74, 240
    by example, 13, 97
    language, 6, 10, 30, 304
      operator, 13, 48, 55
    magic bullet, 79, 110, 293
    oriented, 79
    session, 109, 178, 257, 294
    simple, 13, 293
Quinlan, J. R., 263, 274

Rada, R., 211
rank-plus-shift, 157
ranking, partial, 98, 99
Rau, L. F., 239
recall, 34, 122
    high, 35, 124
    normalized, 128, 133
    and precision, 78, 123, 132
    precision (RePre) curve, 125, 130
receiver / operator characteristic (ROC) curve, 139
recognition, object, 107
record, 30
regression, 92, 173
reinforcement, 256
relevance, 3, 7, 106
    assessment, 7, 36, 106, 109, 116, 117, 135, 281, 283
    description, 177
    feedback (RelFbk), 8, 105, 107
    logical, 225, 304
    objective, 225
    principle of, 304
    weight, 174
reliability, inter-subject, 117
representation
    bias, 79
    knowledge. See knowledge, representation
research front, 189, 190
resolution, 40, 55, 203, 219
resolving power, 76
retrieval, 6
    ad hoc, 137
    effectiveness, 99, 131, 168
    information. See information retrieval
    probabilistic, 157
    status value (RSV), 174
review articles, 187
rhetoric, 17, 200, 306
Ribeiro, B., 178
risk, 70
Robertson, S. E., 75, 85, 89, 91, 140, 170
Rocchio’s algorithm, 265
Rocchio, J. J., 113, 128
root, 44, 45, 61, 62
Rosch, E., 107
Rose, D., 167, 191, 229, 233, 238, 241, 244, 289
routing, 137, 263, 266, 286
rubric, 22, 201
Rudwick, M., 310
Russel, D. M., 110, 175
Russell, S., 223, 252

Sahami, M., 259
Salton, G., 56, 83, 86, 92, 96, 98, 101, 113, 114, 115, 120, 131, 186, 201, 226, 233
sampling, stratified, 62
Saracevic, T., 110, 124, 303
Schapire, R., 274
Schneiderman, B., 238
Schutze, H., 153, 264, 277
scope, 12, 13, 89, 219, 315
search
    engine, 6, 30, 96, 101, 293
    history, 306
    intermediary. See intermediary, search
    length, 133, 137
      reduction factor, 138
    spreading activation. See spreading activation search
    strategies, 281
Searle, J., 239
security, 55, 300
semantics, 4, 23, 32, 37, 47, 67, 86, 195, 211, 224, 289, 299
    lexical, 213, 215
semiotics, 25, 301, 313
Sereno, M., 68
Shapiro, F. R., 243
Shardaanand, U., 286
Shepard, R., 109, 240
Shepardizing. See citation, legal
Sheridan, P., 117, 213
signal, 83, 138, 157
significance, statistical, 116
Silverstein, C., 96, 293
similarity
    average, 88
    matrix. See matrix, similarity
Simon, H., 64, 67, 68
simple matching coefficient, 95
Singhal, A., 91
singular term. See term, singular / general
singular value decomposition (SVD), 156
Sitter, S., 205
skimming effect, 276
Sleator, D. D. K., 58
sliding ratio, 134
Small, H., 189
SMART, 92, 253, 263
Smeaton, A. F., 213
Smith, L., 183
Snow, C. P., xix
Soergel, D., 211
Sparck Jones, K., 25, 85, 120, 124, 304
sparse, 86, 113, 154, 159, 173, 176, 259, 260
specific, 11
    index, 78, 79, 218, 298
    specificity weighting, 78
Sperber, D., 304
spiral exposition. See exposition, spiral
spoof, 297
spreading activation search (SAS), 225
Sprowl, J., 288
Srinivasan, P., 211
Stanfill, C., 51, 101, 114
stationary distribution. See distribution, stationary
Stefik, M., 312
Steier, A. M., 221, 280, 284
stemming, 28, 44, 52, 57, 58, 62, 72
    weak, 46
stop (word). See negative dictionary
stream file. See file, stream
stress, 162
Strunk, W. Jr., 303
Strzalkowski, T., 213
suffix, 27, 45, 46
support vector machines, 275, 278
surface markings, 44
surfing. See World Wide Web, surfing
Sutton, R.S., 267
Svenonius, E., 12
Swanson, D., 189, 234
Swets, J. A., 138
Swets model, 140
symbolic / subsymbolic, 289
synonym set, 214
synonymy, 214

Tague-Sutcliffe, J., 122
Tapper, C., 243
temporal dimension, 266
term
    of art, 12, 288
    broad / narrow, 11, 210, 211
    classes. See classification
    frequency, 74, 78, 96, 279, 286
    general, 11, 210, 211, 238
    index, 47, 76, 159, 279
    narrower, 11, 211
    related, 210, 228, 246
    singular / general, 44, 130, 238, 239
    See also keyword
terminal node, 149
test set, 269
TF-IDF weighting, 96, 203
thesaurus, 158, 210, 211, 213, 221, 246
Thompson, P., 276
Thorndike, E. L., 63
time line, 240
token, 27, 28, 43, 101, 200, 238, 261, 289
tokenize, 43
topic, 203
    topical tiling, 203, 204
    tracking, 266, 288
Towell, G., 276
training
    instance, positive / negative, 258
    set, 256, 258, 264, 265, 267, 269, 271
    See also learning
transcript, 26
transitive, 15
tree
    dependence, 261
    lexicographic, 53, 149
triangle inequality, 155
Turtle, H. R., 177
    minimum spanning (MST), 166, 261
    splay, 52, 58
two-Poisson model, 75

Unger, R. M., 244
urn of words, 271

van Rijsbergen, C. J., xxiii, 132, 169, 170, 211, 225
Valiant, L., 269
Vapnik, V. N., 275
vector space, 86
Veerasamy, A., 238
verb, active / stative, 215
verbosity, 89
vocabulary, 10
    balance, 65, 101
    controlled / uncontrolled, 12
    mismatch, 79
    open / closed, 12
    size, 12, 14, 44, 86, 94, 154, 214, 272
Vogt, C. C., 121, 277
Voorhees, E., 101, 120, 217

Walker, S., 46
Waltz, D., 219, 223
Warmuth, M., 274
Watson, J. D., 26
Wells, H. G., 4, 293
Welsh, D., 61
White, H., 265
white-space, 27, 42
Widrow, B., 265
Widrow-Hoff, 265
word. See keyword
Wilbur, W., 119
Willett, P., 166
Williams, J. H., 171
Wilson, P., 109
Wittgenstein, L., 2, 4, 299
Witztum, D., 62
Wold, E., 306
Wong, S., 178, 262
Woodworth, R., 161
word
    common, 69, 103, 294
    content, 73, 75
    frequency, 62, 66, 67
    function, 47, 73
    game (Sprachspiele), 2, 4, 296, 299, 305, 313
    noise, 47
    sense, 46
    urn of. See urn of words
    See also keyword
World Wide Web (WWW), 3
    consortium (W3C), 294
    crawling, 22, 294, 296
    directory, 219
    surfing, 69

Yager, R. R., 276
Yang, Y., 262, 272
Yule, G. U., 63

Zamir, O., 166
Zha, H., 158
Zipf, H., 62
Zipf’s law, xx
Zipfian distribution. See distribution, Zipfian