Language Modeling for Information Retrieval edited by W. Bruce Croft, John Lafferty.

A statisticallanguage model, or more simply a language model, is a prob­ abilistic mechanism for generating text. Such adefinition is general enough to include an endless variety of schemes. However, a distinction should be made between generative models, which can in principle be used to synthesize...

Full description

Saved in:
Bibliographic Details
Corporate Author: SpringerLink (Online service)
Other Authors: Croft, W. Bruce (Editor), Lafferty, John (Editor)
Format: eBook
Language:English
Published: Dordrecht : Springer Netherlands : Imprint: Springer, 2003.
Edition:1st ed. 2003.
Series:The Information Retrieval Series, 13
Springer eBook Collection.
Subjects:
Online Access:Click to view e-book
Holy Cross Note:Loaded electronically.
Electronic access restricted to members of the Holy Cross Community.

MARC

LEADER 00000nam a22000005i 4500
001 b3243207
003 MWH
005 20191022141502.0
007 cr nn 008mamaa
008 130321s2003 ne | s |||| 0|eng d
020 |a 9789401701716 
024 7 |a 10.1007/978-94-017-0171-6  |2 doi 
035 |a (DE-He213)978-94-017-0171-6 
050 4 |a E-Book 
072 7 |a UMB  |2 bicssc 
072 7 |a COM031000  |2 bisacsh 
072 7 |a UMB  |2 thema 
072 7 |a GPF  |2 thema 
245 1 0 |a Language Modeling for Information Retrieval  |h [electronic resource] /  |c edited by W. Bruce Croft, John Lafferty. 
250 |a 1st ed. 2003. 
264 1 |a Dordrecht :  |b Springer Netherlands :  |b Imprint: Springer,  |c 2003. 
300 |a XIV, 246 p.  |b online resource. 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |b PDF  |2 rda 
490 1 |a The Information Retrieval Series,  |x 1387-5264 ;  |v 13 
490 1 |a Springer eBook Collection 
505 0 |a 1 Probabilistic Relevance Models Based on Document and Query Generation -- 2 Relevance Models in Information Retrieval -- 3 Language Modeling and Relevance -- 4 Contributions of Language Modeling to the Theory and Practice of IR -- 5 Language Models for Topic Tracking -- 6 A Probabilistic Approach to Term Translation for Cross-Lingual Retrieval -- 7 Using Compression-Based Language Models for Text Categorization -- 8 Applications of Score Distributions in Information Retrieval -- 9 An Unbiased Generative Model for Setting Dissemination Thresholds -- 10 Language Modeling Experiments in Non-Extractive Summarization. 
520 |a A statisticallanguage model, or more simply a language model, is a prob­ abilistic mechanism for generating text. Such adefinition is general enough to include an endless variety of schemes. However, a distinction should be made between generative models, which can in principle be used to synthesize artificial text, and discriminative techniques to classify text into predefined cat­ egories. The first statisticallanguage modeler was Claude Shannon. In exploring the application of his newly founded theory of information to human language, Shannon considered language as a statistical source, and measured how weH simple n-gram models predicted or, equivalently, compressed natural text. To do this, he estimated the entropy of English through experiments with human subjects, and also estimated the cross-entropy of the n-gram models on natural 1 text. The ability of language models to be quantitatively evaluated in tbis way is one of their important virtues. Of course, estimating the true entropy of language is an elusive goal, aiming at many moving targets, since language is so varied and evolves so quickly. Yet fifty years after Shannon's study, language models remain, by all measures, far from the Shannon entropy liInit in terms of their predictive power. However, tbis has not kept them from being useful for a variety of text processing tasks, and moreover can be viewed as encouragement that there is still great room for improvement in statisticallanguage modeling. 
590 |a Loaded electronically. 
590 |a Electronic access restricted to members of the Holy Cross Community. 
650 0 |a Data structures (Computer science). 
650 0 |a Information storage and retrieval. 
650 0 |a Computer science. 
650 0 |a Artificial intelligence. 
690 |a Electronic resources (E-books) 
700 1 |a Croft, W. Bruce.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
700 1 |a Lafferty, John.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
710 2 |a SpringerLink (Online service) 
773 0 |t Springer eBooks 
830 0 |a The Information Retrieval Series,  |x 1387-5264 ;  |v 13 
830 0 |a Springer eBook Collection. 
856 4 0 |u https://holycross.idm.oclc.org/login?auth=cas&url=https://doi.org/10.1007/978-94-017-0171-6  |3 Click to view e-book  |t 0 
907 |a .b32432070  |b 04-18-22  |c 02-26-20 
998 |a he  |b 02-26-20  |c m  |d @   |e -  |f eng  |g ne   |h 0  |i 1 
912 |a ZDB-2-SCS 
912 |a ZDB-2-BAE 
950 |a Computer Science (Springer-11645) 
902 |a springer purchased ebooks 
903 |a SEB-COLL 
945 |f  - -   |g 1  |h 0  |j  - -   |k  - -   |l he   |o -  |p $0.00  |q -  |r -  |s b   |t 38  |u 0  |v 0  |w 0  |x 0  |y .i2156372x  |z 02-26-20 
999 f f |i eb6ef3fb-0fb0-52ba-b0be-a7413f3bc274  |s e15da4c1-c1ba-5a65-afe2-52ae981c2d52  |t 0 
952 f f |p Online  |a College of the Holy Cross  |b Main Campus  |c E-Resources  |d Online  |t 0  |e E-Book  |h Library of Congress classification  |i Elec File