-
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathlanguage_technology.csv
We can make this file beautiful and searchable if this error is corrected: Illegal quoting in line 465.
570 lines (570 loc) · 19.2 KB
/
language_technology.csv
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
English,Icelandic
accuracy,hittni
accuracy,nákvæmni
acoustic analysis,hljóðgreining
advanced discourse processing,þróuð orðræðuaðferð
aligned,samröðun
alignment,samskipun
alignment,samröðun orða
alignment,samröðun setninga
ambiguity,margræðni
annotated corpus,mörkuð málheild
answer extraction,útdráttur svara
API,forritaskil
API,API
application,verkbúnaður
application,forrit
application area,verkbúnaðarsvið
application environment,verkbúnaðarumhverfi
application program interface,forritaskil
application program interface,api
argument definitions,rökliðagreining
attribute,eigind
attribute,eiginleiki
audio file,hljóskrá
authoring support,ritstoð
authoring support software,ritstoðarforrit
authoring support systems,ritstoðarkerfi
authoring tools,ritstoðartól
automatic continuous speech recognition ,sjálvirk orðaflaumskennsl
automatic hyperlinking,sjálfvirk tenglasetning
automatic language identification,sjálfvirk tungumálskennsl
automatic rule-based parser,sjálfvirkum regluþáttara
automatic text summarization,vélrænn útdráttur
auto-correlation,sjálfvirku fylgni
background noise,bakgrunnshávaða
backoff,íhörfun
bactracking,rakning
backward chaining,afturkeðjun
backward chaining,bakkeðjun
baseline,grunnlína
baseline,beislínu
bidirectional tagger,tvístefnumarkari
bigram,tvístæða
bigram,tvennd
bilingual text corpus,tvímála málheild
bootstrapping approach,gagnkvæm hringrás
bootstrapping approach,sjálfkvæm nálgun
bottom-up,neðansækinn
data-directed,neðansækinn
data analysis,gagnagreining
data format,skráarform
data set,gagnasafn
dataset,gagnasafn
brill type tagger,markari brills
categorial grammar,frumflokkamálfræði
categorization,frumflokkun
chunk,bagga
chunk,baggi
chunking,böggun
clustering,þyrpa
clustering,þyrping
code ,kóti
code ,kóta
code ,kóða
coding and compression,kótun og þjöppun
coding and compression,kóðun og þjöppun
coding tool,kótunartól
coding tool,kótunarverkfæri
coding tool,kóðunarverkfæri
coding tool,kóðunartól
collection of texts,textasafn
collocational features,orðastæðuþættir
collocational features,samstæðuþættir
collocations,orðastæður
collocations,samstæður
common voice,samrómur
compression,þjöppun
computational linguistics,tölvumálvísindi
computational linguistics,tölvumálfræði
computational linguistics,reiknifræðileg málfræði
computational linguistics,reiknifræðileg málvísindi
computational pragmatics,tölvufræðileg málnotafræði
computational pragmatics,reiknifræðileg málnotafræði
computational psycholinguistics,tölvufræðileg sálmálvísindi
computational psycholinguistics,reiknifræðileg sálmálvísindi
computational semantics,tölvufræðileg merkingarfræði
computational semantics,reiknifræðileg merkingarfræði
computational semantics,merkingartækni
computer-assisted language learning,tölvustutt tungumálanám
conditional random fields,skilyrt slembisvæði
connectionist techniques,samtengiaðferðir
content production,semja efni
context sensitive spell-checking,samhengisháð stafrýni
context sensitive spell-checking,samhengisháð ritvilluleit
conversation,samræða
conversational agent,talþjónn
co-occurrence features,grenndarþættir
coreference resolution,tilvísanagreining
coreference resolution,samvísunartúlkun
corpus,málheild
corpus linguistics,gagnamálfræði
corpus linguistics,gagnamálvísindi
cross validation,krossprófun
cross-lingual information retrieval,þvermála upplýsingaheimt
cross-lingual language capabilities,þvermálamöguleikar
data,gögn
data presentation method,gagnaframsetningaraðferð
data-driven methods,gagnaaðferðir aðferðir
data-driven methods,gagnastýrðar aðferðir
data-driven system,gagnakerfi
decision list classifiers,flokkaraskrá
decoder,afkótari
decoder,afkóðari
deep generation,djúp málgerving
deep parser performance evaluation,mat á frammistöðu djúpþáttara
deep parsing,full þáttun
deep parsing,djúp þáttun
full parsing,full þáttun
full parsing,djúp þáttun
denotation,aðalmerking
denotation,merkingarkjarni
denotation,umtak
denote,aðalmerkja
denote,hrámerkja (x denotes y)
dependency bank,venslabanki
dependency grammar,venslamálfræði
development sets,þróunarlistar
dialect,mállýska
dialects,mállýskur
dialects,framburðartilbrigði
dialogue,samræða
dialogue management ,samræðustjórnun
dialogue manager,samræðustjóri
dialogue modeling,gerð samræðulíkana
dialogue system,samræðukerfi
dictionary,orðalista
digital humanities,stafræn hugvísindi
digital library,stafrænt bókasafn
diphone,tvístæða
diphone concatenation system,tvístæðu samtengingarkerfi
disambiguation,einræðing
discourse ,orðræða
discourse and dialogue,orðræða og samræða
discourse capabilities,samræðuhæfileikar
discourse corpora,orðræðumálheild
discourse modeling,gerð orðræðulíkana
discourse capabilities,samræðuhæfileikar
discriminative tagger,aðgreiningarmarkari
document image analysis,greining skjalmyndar
dsp techniques,stafrænar merkjavinnsluaðferðir
dynamic programming,kvikleg forritun
dynamic programming,kvikbestun
dynamic programming,kvik bestun
e-learning,fjarnám
e-learning,fjarskiptanám
e-learning environments,fjarnámsumhverfi
embedding,greyping
emerging computing paradigms,uppkomandi viðmið í tölvun
emotion recognition,tilfinningakennsl
encode,kóta
encode,kóða
encoder,kótari
encoder,kóðari
error corpus,villumálheild
error categories,villuflokkar
evaluation,mat
example resource exercise,dæmaæfing
example-based translation,dæmabyggð þýðing
feature,sérkennaþáttur
feature,kenni
finite state technology,stöðuvélatækni
finite-state,stöðuferjöld
finite-state machine,stöðuvél
finite state machine,endanlegar stöðuvélar
finite state transducer,endandleg stöðuferjöld
formant synthesiser,formandagervill
foreign language,L2
foreign language,annars mál
foreign language speakers,annars málshafar
forward chaining,framkeðjun
front-end,framendi
full parsing,full þáttun
gold standard,gullstaðal
grammar checker,málrýnir/málfræðirýnir
grammar models and formalisms,málfræðilíkön og formalismar
grammars,mállýsingar
grammatical tagger,málfræðilegur markari
grapheme combinations,stafasambönd
grammar correction,málfræðileiðrétting
graphical user interface,myndrænt viðmót
GUI,myndrænt viðmót
harmonicity,samhljóm
head-driven phrase structure grammar,höfuðstýrð liðgerðarmálfræði
heuristics,brjóstvitsreglur
heuristics,þumalputtareglur
highlight,yfirstrikunarlitur
highlight,litun texta
high-level language,æðra forritunarmál
HMM methods,aðferðir hulinna markovslíkana
human aided machine translation,mannstudd vélræn þýðing
hyphenation programs,orðskiptiforrit
hyperparameter,ytri stiki
in-domain,innan óðals
indexing,lyklun
individual word speech recogniser,stakorðagreinir
individual word speech recognizer,stakorðagreinir
inductive logic programming,aðleiðsluleg rökforritun
information access,upplýsingaðgangur
information extraction,útdráttur upplýsinga
information processing,upplýsingavinnsla
information retrieval,upplýsingaheimt
information technology,upplýsingatækni
input,ílag
input,viðfang
input text,ílagstexti
input text,viðfangstexti
intensity,styrkleika
interface,viðmót
intranet,innranet
intuitive language-based interfaces,hugsýn málbundin skil
isolated word recognizer,stakorðagreinir
isolated word speech recognition,stakorðagreining
key enabling technology,lykilstuðningstækni
keyword-based search,lykilorðaleit
knowledge discovery,þekkingaröflun
knowledge exchange,þekkingarskipti
knowledge management,þekkingarstjórnun
knowledge technology,þekkingartækni
knowledge-driven,þekkingarstýrð
label,mark
label,merkja
lambda reduction,aflömbdun
lambda reduction,lambdaeinföldun
language analysis,málgreining
language checking,málrýni
language engineering,máltækni
language engineering,tungutækni
language generation,málmyndun
language generation,málgerving
language model,mállíkan
language modelling,gerð tungumálalíkana
language resources,málföng
language resources,málgögn
language technology,máltækni
language technology,tungutækni
language technology application,máltæknibúnaður
language tools,máltæki
language understanding,málskilningur
latent semantic analysis,hulin merkingargreining
lemma,nefnimynd
lemma,flettimynd
lexeme ,les
lexical analysis,lesgreining
lexical semantics,orðfræðileg merkingarfræði
lexicon acquisition tool,orðtökutól
lexicon coding tool,orðasafnskótunarverkfæri
lexicons,orðasöfn
library,forritasafn
library,forritseining
license,afnotaleyfi
linguistic computing,máltölvun
linguistically annotated corpora,málfræðilega markaðar málheildir
linked data,tengd gögn
localisation expert,staðfæringarsérfræðingur
machine code,vélarmál
machine learning,vélanám
machine learning algorithms,vélnámsreiknirit
machine translation,vélræn þýðing
machine translation,vélþýðing
machine-aided human translation,vélstudd þýðing
mappings,varpanir
mapping functions,vörpunarföll
mathematical methods,stærðfræðilegar aðferðir
maximum entropy methods,hámarksóreiðuaðferðir
meaning representation,merkingartáknun
memory based tagger,minnismarkari
milestones,vörðurnar
models,líkön
module,eining
module,mótull
morphological analysis,orðhlutafræðileg greining
morphological disambiguation,orðhlutafræðileg einræðing
morphological parsing,orðhlutafræðileg þáttun
multilingual corpora,margmála málheildir
multilingual generation,margmála málmyndun
multilingual information retrieval,margmála upplýsingaheimt
multilingual speech processing,margmála talvinnsla
multilingualism,margmæli
multilinguality,margmæli
multimedia documents,margmiðlunarskjöl
multimedia information extraction,útdráttur margmiðlunarupplýsinga
multimedia information retrieval,margmiðlunarupplýsingaheimt
multimedia retrieval,margmiðlunarheimt
multimodal communication,marghátta samskipti
multimodal communication,fjölhátta samskipti
multimodality,fjölskynjun
multimodality,fjölskynþátta
named entity,sérheitaeining
named entity,nefnd eining
named entity recognition,nafnakennsl
named entity recognition,nafnaþekkjari
natural language generation,gerving náttúrulegs máls
natural language generation,myndun náttúrulegs máls
natural language parsing,þáttun náttúrulegs máls
natural language processing,málvinnsla
neural networks,tauganet
noise,hávaði
noisy channel model,truflunarlíkan
non-textual format,á öðru sniði en sem texta
normalized,staðlaður
n-grams,n-stæður
OCR: handwriting,ljóskennsl: skrift
OCR: print,ljóskennsl: prent
ontological language resources ,veranleg málgögn
ontological resources,veranleg gögn
ontologies,verufræði
out-of-domain,utan óðals
output,frálag
output,skil
output,úttak
outsource,útvista
overlap,skörun
overlap,skarast
parallel corpus,hliðstæð málheild
parallel corpus,samhliða málheild
parameter,færibreyta
parameter,stiki
parse tree,þáttunartré
parser,þáttari
parser,þáttara
parsing,þáttun
parsing,þáttun greining
parsing,setningafræðileg greining
parsing techniques,þáttunaraðferðir
partial parsing,hlutþáttun
partial parsing,hlutaþáttun
partial/shallow parsing,hlutþáttun
partial/shallow parsing,grunnþáttun
partial/shallow parsing,hlutaþáttun
part-of-speech,orðflokk
part-of-speech tagging,orðflokksmörkun
part-of-speech tagging,mörkun
part-of-speech tagging,málfræðileg mörkun
perceptron,skynjunarnet
perceptron,skynjunganet
"performance (m.t.t. nákvæmni, heimt o.s.frv.)",hæfni
pitch,hljómfallseinkennin
phonetic transcription,hljóðritun
platform,verkvangur
pos tagging,orðflokksmörkun
pos tagging,mörkun
pos tagging,málfræðileg mörkun
pragmatics,málnotkunarfræði
pragmatics,málnotafræði
precision,nákvæmni
preliminary prototype,frumgerð
pre-processing,forvinnsla
presentation and visualization,kynning og framsetning (sjóngerving?)
principle of compositionality,blendilögmálið
probabilistic context-free grammars,líkindafræðilegar samhengisfrjálsar mállýsingar
probabilistic parsing,líkindaþáttun
probabilistic tagging,líkindamörkun
programming environment,forritunarumhverfi
pronunciation,framburður
pronunciation dictionary,framburðarorðabók
prosody information processing,úrvinnsla hljómfallsupplýsinga
pull requests,togbeiðnir
punctuation,greinarmerkjasetning
qualitative analysis,eigindargreining
quantitative analysis,megindargreining
query analysis,spurnargreining
query language,fyrirspurnarmál
question answering,spurnasvörun
recall,heimt
recognition task,kennslastarf
recognition task,kennslaverkefni
recurrent neural networks (RNN),endurkvæm tauganet
reference,vísun
reference,tilvísun
reference corpus,viðmiðunarmálheild
referent,merkingarmið
referring expression,vísandi segð
regular expression,regluleg segð
relation extraction,útdráttur vensla
relevance ranking,gildisröðun
representations of space and time,birtingarform rúms og tíma
resources,tilföng
resources,forðar
retrieve,sækja
retrieve,heimta
rule-based methods,regluaðferðir
rule-based tagger,reglumarkari
running word,lesmálsorð
search engine,leitarvél
search interface,leitarviðmót
search interface,leitarskil
second language,L2
second language,annars mál
second language speakers,annars málshafar
segmented,sneidd
selection module,valeining
selectional association,valtengsl
selectional restrictions,valhömlur
semantic corpora,merkingarmálheild
semantic drift,merkingarrek
semantic web,merkingarvefur
semantic web community,samfélag merkingarvefja
sentences,málsgreinir
sentence-based correction,setningaleiðrétting
shallow generation,grunn málgerving,
shallow generation,grunn myndun
shallow parsing,hlutaþáttun/grunn þáttun
shallow parsing,grunn þáttun
shallow parsing,hlutþáttun
signal analysis and representation,greining og framsetning merkja
signals,merki
smoothing,jöfnun
software application,hugbúnaður
software application,viðfang
source language,frummál
speaker,mælandi
speaker,málhafa
speaker diarization,samræðugreind
speaker recognition,mælandakennsl
speech analyser,talgreinir
speech analysis,talgreining
speech applications,taltæknihugbúnaði
speech coding,talkótun
speech coding,talkóðun
speech encryption,dulritun tals
speech enhancement,endurbót tals
speech input – assessment and evaluation,mat á talinntaki
speech interaction ,talsamskipti
speech interaction technology ,talsamskiptatækni
speech recognition,talkennsl
speech recognition,talgreining
speech recognition module,talkennsleining
speech recognizer,talþekkjari
speech retrieval,talheimt
speech synthesis,talgerving
speech synthesis evaluation,mat á talgervingu
speech synthesizer,talgervill
speech technology,taltækni
speech-to-text,talgreining
spell checking,stafrýni
spell-checker,stafrýnir
spoken dialogue systems,samræðukerfi fyrir tal
spoken language corpora,málheildir talmáls
spoken language dialogue,samræða í tali
spoken language generation,gerving talmáls
spoken language input,talmálsílag
spoken language understanding,skilningur talmáls
spoken output technologies,talmálsfrálagsaðferðir
statistical language model,tölfræðilegt mállíkan
statistical machine translation,tölfræðistudd vélþýðing
statistical modeling and classification,tölfræðileg líkanagerð og flokkun
stemming,stofngreining
stochastic taggers,líkindamarkarar
stopword,váorð
stopword,stopporð
statistical taggers,líkindamarkarar
stochastic taggers,tölfræðimarkarar
statistical taggers,tölfræðimarkarar
strategic,skipulegur
strategic,skipulagt"
sub task,undirverk
subword,orðaflís
summarization,samantekt
summarizer,samantektir
summarizer,samantektartól
summarizer,samantaki
supervised learning,leiðbeint nám
supervised learning,"viðgjafarnám (helst ekki, viðgjöf á frekar heima innan reinforcement learning)"
support vector machines,stoðvigravélar
syntactic generation,setningagerving
syntactic parsing,setningaleg þáttun
syntax-driven semantic analysis,merkingarfræði í setningarlegu ljósi
system,kerfi
tag,mark
tagged corpus,mörkuð málheild
tagger,markari
tagging,mörkun
tagset,markaskrá
tagset,markamengi
target language,markmál
task-specific module,sérhæfð forritseining
technical product,tæknileg afurð
temporal expression,tímasegð
term base,íðorðagrunnur
terminal,útstöð
terminal,skipanalína
terminal,skjáhermi
terminology,íðorðaforði
terminology restrictions,íðorðahömlur
test corpus,prófunarsafn
test set,prófunarsett
test set,prófunarmengi
test set,prófunarhluti
test sets,prófunarlistar
text collection,textasafn
text compression,textaþjöppun
text data mining,gagnanám úr texta
text encryption,dulritun texta
text indexing,lyklun texta
text mining/analytics,textanám
text processing system,ritvinnslukerfi
text technology,textatækni
text-to-speech ,talgerving
text-to-speech synthesis,talgerving texta
thesauri wordnets,orðtengslanet
token (information),tóki
token (frame),tóki
tokenization,tilreiðsla
tokenization,tókun
tokenization and segmentation,tákngreining og sneiðing,
tokenization and segmentation,tókun og skipting"
tokenizer,tilreiðari
tokenizer,tókari
top-down,ofansækinn
goal-directed,ofansækinn
token-based errors,stakorðavillur
topic detection,efnissviðsgreining
toy system,leikfangakerfi
training corpus,þjálfunarsafn
training corpus,þjálfunarmálheild
training set,þjálfunarsett
transformation-based taggers,námsfúsir markarar
transformation-based taggers,leiðréttingamarkari
translation memory,þýðingarminni
tree adjoining grammar,trjátengslamálfræði
treebank,trjábanki
trigram,þrístæða
trigram,þrennd"
triphones,þrístæða
truth-conditional semantics,sannkjaramerkingarfræði
under-resourced,gagnanaumur
unigram,einstæða
unigram,eind
unknown word guesser,giskari
unsupervised learning,óleiðbeint nám
unsupervised learning,viðgjafarlaust nám (helst ekki)
usability and interface design,nytsemi og viðmótshönnun
user interface,notendaviðmót
user modeling,notendahönnun
user query,fyrirspurn notanda
utterance,yrðingar
video file,myndbandsskrá
voice recognition,raddþekking
voice recognition,raddkennsl
voice user interface,raddstýrt notendaviðmót
voice,rómur (usually used as a suffix)
web application,vefviðmót
web search,vefleit
web search engine,vefleitarvél
web user interface,vefviðmót
wildcard character,algildisstafur
word embedding,orðaívefja
word embedding,orðagreyping
word errror rate,villutíðni orða
word form,orðmynd
word processing tools,ritvinnsluforrit/-tól
word sense,merkingarsvið orða
word sense,merkingarþættir orða
word sense,merkingarkjarni"
word sense disambiguation,einræðing orða
word sense disambiguation,merkingareinræðing"
word speech recogniser,stakorðagreinir
wordnet,orðanet
written language corpora,málheildir ritmáls
written language input,ritmálsílag