D 2011

Subspace Tracking for Latent Semantic Analysis

ŘEHŮŘEK, Radim

Basic information

Original name

Subspace Tracking for Latent Semantic Analysis

Authors

ŘEHŮŘEK, Radim (203 Czech Republic, guarantor, belonging to the institution)

Edition

Heidelberg, Proceedings of the 33rd European Conference on Information Retrieval (ECIR), p. 289-300, 12 pp. 2011

Publisher

Springer

Other information

Language

English

Type of outcome

Proceedings paper

Field of Study

10000 1. Natural Sciences

Country of publisher

Ireland

Confidentiality degree

is not subject to a state or trade secret

Publication form

printed version "print"

References:

Impact factor

Impact factor: 0.402 in 2005

RIV identification code

RIV/00216224:14330/11:00067252

Organization unit

Faculty of Informatics

ISBN

978-3-642-20160-8

ISSN

UT WoS

000301968000029

Keywords in English

scalability svd subspace tracking

Tags

International impact, Reviewed
Changed: 30/4/2014 04:36, RNDr. Pavel Šmerk, Ph.D.

Abstract

V originále

Modern applications of Latent Semantic Analysis (LSA) must deal with enormous (often practically infinite) data collections, calling for a single-pass matrix decomposition algorithm that operates in constant memory w.r.t. the collection size. This paper introduces a \emph{streamed distributed algorithm for incremental SVD updates}. Apart from the theoretical derivation, we present experiments measuring numerical accuracy and runtime performance of the algorithm over several data collections, one of which is the whole of the English Wikipedia.

Links

LC536, research and development project
Name: Centrum komputační lingvistiky
Investor: Ministry of Education, Youth and Sports of the CR, Centrum komputační lingvistiky