2 options

Russian through switched telephone network (RuSTeN) / [Authors, Anrey Raev ... and others].

Loading location information...

Available from offsite location This item is stored in our repository but can be checked out.

Loading location information...

Available from offsite location This item is stored in our repository but can be checked out.

Format:: Datafile
Contributor:: Raev, Anrey.; Linguistic Data Consortium.
Language:: English; Russian
Subjects (All):: Russian language--Data processing--Databases.; Russian language.; Speech perception--Data processing--Databases.; Speech perception.; Automatic speech recognition--Databases.; Automatic speech recognition.; Natural language processing (Computer science).; Russian language--Data processing.; Speech perception--Data processing.
Genre:: Databases.; Academic theses.
Physical Description:: 1 DVD-ROM : sound ; 4 3/4 in.; 4 3/4 in.
Other Title:: RuSTen
Place of Publication:: [Philadelphia, Pa.] : Linguistic Data Consortium, [2006]
Language Note:: Sound files in Russian, instructional documents in English.
System Details:: digital; optical; data file
Summary:: "This file contains documentation on the Russian through Switched Telephone Network (RuSTeN), Linguistic Data Consortium (LDC) catalog number LDC2006S34 and isbn 1-58563-388-7. This corpus was developed as part of`"Trawl" (Automatic Voice Identification System in Telephone Channel). The purpose of the project was to develop software for automatic identification of speakers based on voice samples acquired through telephone channels. The training of the system was performed with the telephone speech corpus RuSTeN. The RuSTeN (Russian through Switched Telephone Network) database was recorded between March 2001 and February 2003 by Speech Technology Center using the "Forget-me-not" professional telephone recording and archiving software package developed by STC. Please see file.tbl for the directory structure of this publication, as well as a complete list of files. Please go to data for a listing of data files. The files were recorded with sample frequency 11025 Hz, 1-channel, 16-bit linear. Each of the speakers made at least 5 calls from different locations and/or telephone sets. Most of the calls were made from home or office environment with uncontrolled noise level. Besides, one call per speaker was made from a public telephone (with either street or metro station noise in the background). The recordings are spontaneous (sometimes guided by the near-end speaker) conversations between the caller and the speech database collector on various subjects (the weather, the caller's biography, hobbies etc.) and include approximately 150 seconds of the far-end and at least 5 seconds of the near-end speaker. Besides, each time the caller was asked to utter the usual digits set (0-9) and the words "yes" and "no". The time interval between 2 successive sessions is at least 2 days. The database contains 125 speakers (far-end), 58 male and 67 female. Each far-end speaker is represented by at least 5 speech files. The sound files are in the wav-format. The speech filenames contain the following information: FFF (far-end speaker number), SS (session number)."--index.html.
Notes:: Title from index.html file on DVD-ROM.; "LDC2006S34"
ISBN:: 1585633887; 9781585633883
OCLC:: 71558340
Online:: LDC catalog entry; Using LDC Data general information

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

2 options

Russian through switched telephone network (RuSTeN) / [Authors, Anrey Raev ... and others].

Find

My Account

Guides