access indexes 12

accuracy 2, 9, 11, 13, 17, 19, 24–25, 33–34, 43, 47, 50, 55–57, 60–63, 67, 71, 73, 75, 76, 78–80, 89

adjacency matrix 88

analysis window 37

analytical resolution 10, 25, 67

attribution 11, 26, 33, 35

authorship 11, 26, 33, 35

automatic text categorization 13, 62, 68, 71, 74, 77–78, 80–85

bag of words 69, 72–73

bibliometrics 87

blackouts 7, 16

Boolean 13

categorization 1–3, 13, 25, 44–45, 48, 56, 60, 62, 66, 68, 71–85

Central Intelligence Agency 21

centrality 89–91, 96

chronemics 30, 32

citation indexes 12

cleaning 7, 10, 24–25

closed captioning 19

clustering 2, 11, 43, 55–56, 60, 71–85, 92, 96

Coleman-Liau Index 27

collections 7, 10–11, 13–14, 18, 21, 30–31, 34, 41, 44, 59, 69, 71–72, 74–75, 80, 82–83, 87, 90, 97

collocates 36–38

completeness 9, 25

concordance 1, 6, 39, 42

confusion matrix 76

consistency 2, 45, 50, 56, 61, 77

contingency table 83–85

convenience samples 17, 18

co-occurrence 1–2, 6, 36–42, 68, 88–89, 96

correlation 4, 29, 31–32, 36–42, 45, 47, 56, 63, 69, 87–88


data collection 7, 67

data preparation 7, 11

decision tree 74–75

DICTION 3, 27–29, 32, 35, 44, 62, 66–68, 97–99

Dictionary of Affect in Language 66–67, 98, 99

digitize 11, 14, 20, 23, 24

digitized 11, 14, 20, 24

directionality 38

distance matrix 88

document extraction 23

duplication 10, 15, 25

edge 86–92, 96

editions 14–16, 21, 25, 41

emoticons 28, 29

emotion 11, 19, 20, 28, 44, 65–70

entity extraction 38, 43–56, 87, 96

evaluate 14, 70, 80, 92, 98

evolution 11, 30–32, 65, 91, 96

extraction 1–2, 6, 23–25, 29, 38, 43–64, 72, 84, 87, 89, 96

false negative 17, 76

false positive 17, 25, 76, 77

feature reduction 73–74, 78, 85

feature selection 72–73, 77

filtering 24

Flesch-Kincaid Index 27

Foreign Broadcast Information Service 21

Freedom of Information Act 13

Fulltext Sources Online 13

Gale Directory of Databases 98

gazetteer 43, 51–54, 56

gender 26, 30, 33–35, 41, 45, 72–73, 98

General Architecture for Text Engineering 5, 50

geocoding 2, 43, 44–56

Geographic Names Information System 52

GEOnet Names Server 53

Google 8, 12, 21–22, 28, 51, 56, 62, 68, 80

Gunning-Fog Index 27

hapax legomena 31

hierarchical clustering 81–82

histogram 26–29, 32, 34, 72

incompleteness 14, 16, 20

integrated suites 3, 4, 6

Internet Archive 12, 21

isolate 26, 28–29, 30, 73, 86

keyword 7, 8, 11, 13, 17, 22, 24–25, 29, 31, 39, 42–43, 47, 50, 61–63, 67, 71, 78, 80

keyword generation 61, 63

Keyword In Context 29, 39

k-nearest-neighbor 74–75, 84

learning algorithm 74–75, 78–79, 84–85

lemma 4, 33–34

lemmatization 33–34

lexeme 33

lexicon 1, 5, 6, 39–56, 65–68, 70, 85, 99

Lexis Academic 8

LexisNexis 8, 12, 15–16, 21, 49

Library of Congress 9

licensing restrictions 7, 14, 16, 25

metadata 20, 30, 39–42, 45, 80, 87, 96

Minnesota Contextual Content Analysis 44

Moby Pronunciator 32, 99

morphology 33–35

multimedia 7, 19

naïve Bayes 74–75

National Center for Supercomputing Applications 6, 50

network 13, 22, 23, 40, 68, 74–75, 86–98

neural network 74–75

New York Times 14, 16–17, 20, 24, 53, 98

newspapers 8, 11, 13–16, 20–21, 24, 65

n-gram 73

node 86–92, 96

normalization 61, 74

normative 26, 28, 34

noun phrase 33, 59–63, 72

OCR 7, 11, 14, 23, 24, 73

Orange Book 48

overlap 45, 46, 55, 60, 81, 88

part of speech tagging 58, 63

partitional clustering 82

pendant 86

Porter stemmer 34

Practical Extraction and Reporting Language 6

presence/absence 36–38, 42–44

progressiveness 26

ProQuest Historical Newspapers 8, 11

prosody 19, 20

qualitative 3, 29

quantitative 3, 65, 80, 89

random sample 18, 78

randomization 18

readability 26–27, 29, 32, 34

real-time 15, 19

regular expression 49, 56

rejection rate 76

reliability 2, 67, 68, 77, 79, 99

reproducibility 2

reshaping 25

result counts 8, 25

SAS 4, 5, 40

scale 1–2, 5, 8, 17, 24, 44, 49, 50, 57, 66–69, 77, 89, 96

search 6–9, 11–14, 16–18, 20–25, 31, 33, 40, 42, 46–51, 53, 55, 58, 71, 80, 85, 97

semantic 11, 36, 37, 38, 40, 44, 47, 57, 58–63, 67–69, 73, 79, 89, 97, 98

sentiment analysis 1, 3, 5–6, 29, 43, 65–70

signature 30–31, 80

similarity 11, 33, 45, 47, 60, 71–75

site mirroring 22

slang 29

Soundex 32–33

source stability 8

Sourcebook to Public Record Information 13

spatial analysis 1, 6, 51, 55

specialized tools 40608



stemming 7, 33–34

structured 58

summarization 60–61, 63, 90

Summary of World Broadcasts 15, 21

support vector machine 7475

surface statistics 8

syllable 27–29, 32

terms of use 22

thesauri 40, 47, 56, 62, 68

topic extraction 1–2, 6, 29, 38, 57–64

translation 10, 15, 17, 62–64

triad census 90–91, 96

true negative 76

true positive 76–77

uncertainty 15, 54

understanding a source 15

unintended uses 9

University of Illinois 6, 97–98

unstructured 58, 97

validate 44

Vanderbilt Television News Archive 12

vector space 71–73, 82–84

verb phrase 59, 63

viewership 17

visualization 50, 82, 92, 96

vocabulary 1–6, 11, 22, 24, 26–35, 43, 45, 47, 56–57, 62–65, 67, 69–70, 72, 87, 91

window 29–30, 37, 70

word birth 31

word class 26, 33, 41

word death 31

WordHoard 4–5, 34, 99

WordNet 47, 48, 59, 61–62, 68

World Bank 13

WorldCat 9, 10, 98

WorldCom 21, 98

XML 58

Yahoo 8
