Please use this identifier to cite or link to this item:
https://hdl.handle.net/2440/3244
Citations | ||
Scopus | Web of Science® | Altmetric |
---|---|---|
?
|
?
|
Type: | Journal article |
Title: | Sequential sampling models of human text classification |
Author: | Lee, M. Corlett, E. |
Citation: | Cognitive Science, 2003; 27(2):159-193 |
Publisher: | Elsevier Science Inc |
Issue Date: | 2003 |
ISSN: | 0364-0213 1551-6709 |
Abstract: | <jats:title>Abstract</jats:title><jats:p>Text classification involves deciding whether or not a document is about a given topic. It is an important problem in machine learning, because automated text classifiers have enormous potential for application in information retrieval systems. It is also an interesting problem for cognitive science, because it involves real world human decision making with complicated stimuli. This paper develops two models of human text document classification based on random walk and accumulator sequential sampling processes. The models are evaluated using data from an experiment where participants classify text documents presented one word at a time under task instructions that emphasize either speed or accuracy, and rate their confidence in their decisions. Fitting the random walk and accumulator models to these data shows that the accumulator provides a better account of the decisions made, and a “balance of evidence” measure provides the best account of confidence. Both models are also evaluated in the applied information retrieval context, by comparing their performance to established machine learning techniques on the standard Reuters‐21578 corpus. It is found that they are almost as accurate as the benchmarks, and make decisions much more quickly because they only need to examine a small proportion of the words in the document. In addition, the ability of the accumulator model to produce useful confidence measures is shown to have application in prioritizing the results of classification decisions.</jats:p> |
DOI: | 10.1016/S0364-0213(02)00117-9 |
Published version: | http://dx.doi.org/10.1207/s15516709cog2702_2 |
Appears in Collections: | Aurora harvest Environment Institute publications Psychology publications |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.