Transcription of Finding scientific topics
{{id}} {{{paragraph}}}
Colloquium Finding scientific topics Thomas L. Griffiths* and Mark Steyvers . *Department of Psychology, Stanford University, Stanford, CA 94305; Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139-4307; and Department of Cognitive Sciences, University of California, Irvine, CA 92697. A first step in identifying the content of a document is determining our algorithm to a corpus consisting of abstracts from PNAS. which topics that document addresses. We describe a generative from 1991 to 2001, determining the number of topics needed to model for documents, introduced by Blei, Ng, and Jordan [Blei, account for the information contained in this corpus and ex- D. M., Ng, A. Y. & Jordan, M. I. (2003) J. Machine Learn. Res. 3, tracting a set of topics . We use these topics to illustrate the 993-1022], in which each document is generated by choosing a relationships between different scientific disciplines, assessing distribution over topics and then choosing each word in the trends and hot topics '' by analyzing topic dynamics and using document from a topic selected according to this distribution.
Apr 06, 2004 · Viewing documents as mixtures of probabilistic topics makes it possible to formulate the problem of discovering the set …
Domain:
Source:
Link to this page:
Please notify us if you found a problem with this document:
{{id}} {{{paragraph}}}