
Quantitative and Qualitative Evaluation of Darpa Communicator Spoken Dialogue Systems

Scientific report: "Quantitative and Qualitative Evaluation of Darpa Communicator Spoken Dialogue Systems" pdf

Scientific report

... user’s travel plans both at the beginning of the dialogue and also after Quantitative and Qualitative Evaluation of Darpa Communicator Spoken Dialogue Systems Marilyn A. Walker AT&T Labs – ... labels and summed the total effort expended on each type of dialogue act over the dialogue or the percentage of a dialogue given over to a particular type of dialogue behavior. These sums and ... achieve a better understanding of the role of qualitative aspects of each system’s dialogue behavior. We quantify the extent to which the dialogue act metrics improve our understanding by applying the...
Quality of Telephone-Based Spoken Dialogue Systems docx

Programming techniques

... number of system words uttered in a dialogue; average number of time-out prompts in a dialogue; average number of turns in a dialogue; average number of user questions in a dialogue; average number of ... be observed, and is thus a prerequisite for setting up better theories and systems. The development of spoken dialogue systems requires not only a change in the focus of speech and language research ... determine the quality of the developed systems, and the resulting satisfaction of their users. As a wide range of novice users is the target group of current state-of-the-art systems and services, the...
Quality of Telephone-Based Spoken Dialogue Systems part 1 ppsx

Programming techniques

... internal correction, anticipation, and prediction. Examples of such systems are given in Section 2.1.3.7. Multimodal dialogue systems including speech: Systems of ... be observed, and is thus a prerequisite for setting up better theories and systems. The development of spoken dialogue systems requires not only a change in the focus of speech and language research ... determine the quality of the developed systems, and the resulting satisfaction of their users. As a wide range of novice users is the target group of current state-of-the-art systems and services, the...
Quality of Telephone-Based Spoken Dialogue Systems part 2 potx

Programming techniques

... outcome of an assessment or evaluation experiment5. Spoken language systems are relatively complex systems which offer a number of different (and ill-defined) functions. The functions of the ... provision of help to the user, the correction of errors and misunderstandings, the interpretation of complex discourse phenomena like ellipses and anaphoric references, and the organization of information ... by the dialogue manager are the collection of all information from the user which is needed for the task, the distribution of dialogue initiative, the provision of feedback and verification of information...
Scientific report: "You Can’t Beat Frequency (Unless You Use Linguistic Knowledge) – A Qualitative Evaluation of Association Measures for Collocation and Term Extraction" pot

Scientific report

... for CE (Wermter and Hahn, 2004) and for ATR (Wermter and Hahn, 2005), which have been shown to outperform several of the statistics-only metrics. 3 Methods and Experiments 3.1 Qualitative Criteria Because ... in CE and ATR) because it has been shown to be the best-performing statistics-only measure for CE (cf. Evert and Krenn (2001) and Krenn and Evert (2001)) and also for ATR (see Wermter and Hahn ... Press. Stefan Evert and Brigitte Krenn. 2001. Methods for the qualitative evaluation of lexical association measures. In ACL’01/EACL’01 – Proceedings of the 39th Annual Meeting of the Association...
Scientific report document: "Methods for the Qualitative Evaluation of Lexical Association Measures" doc

Scientific report

... curves (Figures 3 and 4), we find: (i) Examination of 50% of the data in the SLs leads to identification of between 75% (AdjN) and 80% (PNV) of the TPs. (ii) For the first 40% of the SLs, and lead to ... discussion of the excluded low-frequency candidates). 4 Experimental Setup After extraction of the base data and manual identification of TPs, the AMs are applied, resulting in an ordered candidate ... instance, 80% of the full set of PNV data and 58% of the AdjN data are hapaxes. Thus it is important to know how many (and which) true collocations there are among the excluded low-frequency candidates. 5.1...
Scientific report: "Correlation between ROUGE and Human Evaluation of Extractive Meeting Summaries" pptx

Scientific report

... ICASSP. X. Zhu and G. Penn. 2005. Evaluation of sentence selection for speech summarization. In ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for MT and/or Summarization. X. Zhu and G. ... those of the authors and do not necessarily reflect the views of NSF. References J. Carbonell and J. Goldstein. 1998. The use of MMR, diversity-based reranking for reordering documents and producing summaries. ... Informative Coverage (IC): S2 and S9; Informative Relevance (IRV): S3 and S8; and Informative Redundancy (IRD): S4 and S7. 4 Results 4.1 Correlation between Human Evaluation and Original ROUGE Score Similar...
Scientific report: "Correlating Human and Automatic Evaluation of a German Surface Realiser" doc

Scientific report

... are. Belz and Reiter (2006) and Reiter and Belz (2009) describe comparison experiments between the automatic evaluation of system output and human (expert and non-expert) evaluation of the same ... evaluation of a string realisation system usually involves string comparisons between the output of the system and some gold standard set of strings. Typically automatic metrics from the fields of ... corpus of 200 million words of newspaper and other text. Cahill and Forst (2009) describe a number of experiments where they collect judgements from native speakers about the three systems...
Scientific report: "Comparing Automatic and Human Evaluation of NLG Systems" potx

Scientific report

... Background 2.1 Evaluation of NLG systems NLG systems have traditionally been evaluated using human subjects (Mellish and Dale, 1998). NLG evaluations have tended to be of the intrinsic type (Sparck Jones and ... communicative goal; and that corpus texts are often not of high enough quality to form a realistic test. 2.2 Automatic evaluation of generated texts in MT and Summarisation The MT and document summarisation ... algorithms, and data sets. BLEU and related metrics work by comparing the output of an MT system to a set of reference (‘gold standard’) translations, and in principle this kind of evaluation...
DELIVERING HEALTH EDUCATION VIA THE WEB: DESIGN AND FORMATIVE EVALUATION OF A DISCOURSE-BASED LEARNING ENVIRONMENT pot

Sexual health

... structure and navigation, readability of text, appropriateness of graphics and icons, clarity and quality of information, suitability of external links, and clarity and perceived motivating and discussion ... environment. A variety of data collection protocols and tools were developed to collect quantitative and qualitative data. Pre- and post-tests related to HIV/AIDS and nutrition will allow for quantitative comparison ... formative evaluation of the Web site have a varied teaching background in terms of level and content and are focusing their postgraduate studies on the design, development and evaluation of technology-based...
