
Sound Archive

Acoustic database management system

First version:   Last version: 1999  

Application:  
  The team and its partners use the system to convert valuable archival recordings (made on wax cylinders, discs, magnetic tape, etc.) into electronic files, to store them in acoustic databases, to study them, and to publish them on CD-ROM and via the Internet. Several collections of old recordings from the Sound Archive of the Institute of Russian Literature (Pushkinsky Dom) have been processed with the system and stored in acoustic databases. At present, the system is used in INTAS project N° 1705, "Sound Archives on the World Wide Web with Sound Recordings from Saint-Petersburg Collections".  
Description:  
  The system is designed to store and manage sound realisations of large text corpora. It provides storage of and access to the digitised sound material together with the text of the recording (in standard orthography), its transcription, experts' comments, and the archival attribution (the archival number of the original recording, the time and genre of the recording, the name, age, and nationality of the performer, etc.).
While working with the displayed text, its transcription, and the comments, the user can listen to any fragment of the selected material, from the whole text or several phrases down to a single word. Marking part of the text selects the fragment of the sound file, delimited by boundary labels, that will be played. The marked fragment can be saved to a separate sound file for further analysis in any sound editor. The user can also edit the text or its transcription and add comments to the material.
Any archival attribution element (e.g., the place or time of the recording) may be used as a search criterion.
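Searching by attribution can be pictured with a minimal C sketch. It is not the system's actual code: the Attribution record, its field names, and the functions attribution_matches and search_records are hypothetical, and the wildcard convention (NULL string or year 0 means "any") is an assumption made for illustration.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical attribution record; field names are illustrative only. */
typedef struct {
    const char *archive_number; /* archival number of the original recording */
    const char *genre;
    const char *performer;
    const char *place;
    int year;                   /* 0 means "unknown" or, in a query, "any" */
} Attribution;

/* A query reuses the record layout; NULL strings and year 0 are wildcards. */
typedef Attribution Query;

static int field_matches(const char *value, const char *wanted)
{
    return wanted == NULL || (value != NULL && strcmp(value, wanted) == 0);
}

/* Returns 1 if every non-wildcard field of the query equals the record's. */
int attribution_matches(const Attribution *rec, const Query *q)
{
    assert(rec != NULL && q != NULL);
    return field_matches(rec->archive_number, q->archive_number)
        && field_matches(rec->genre, q->genre)
        && field_matches(rec->performer, q->performer)
        && field_matches(rec->place, q->place)
        && (q->year == 0 || rec->year == q->year);
}

/* Stores the indices of matching records in out (at most max_out of them)
   and returns how many were stored. */
size_t search_records(const Attribution *recs, size_t n, const Query *q,
                      size_t *out, size_t max_out)
{
    size_t i, found = 0;
    for (i = 0; i < n && found < max_out; i++) {
        if (attribution_matches(&recs[i], q))
            out[found++] = i;
    }
    return found;
}
```

Treating every field as an optional criterion lets one query structure cover single-field and combined searches alike.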
 
Type of processing:  
  Preliminary segmentation of the sound file into sentences, phrases, and phonetic words (accent groups); special tagging; storage of the results in a special file.
The type of each label corresponds to the segmentation level (sentence, phrase, or phonetic word) of the boundary it marks in the sound stream.
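The relation between label types and segmentation levels can be sketched in C as follows. The format of the real label file is not described here, so the Label structure, the SegLevel enumeration, and the function unit_bounds are hypothetical. The sketch also assumes that labels are sorted by position and that a boundary of a coarser level (e.g., a sentence) also closes the finer units (phrases, words) before it.

```c
#include <assert.h>
#include <stddef.h>

/* Segmentation levels, coarsest first (illustrative names). */
typedef enum { SEG_SENTENCE = 0, SEG_PHRASE = 1, SEG_WORD = 2 } SegLevel;

/* Hypothetical boundary label: the coarsest unit that starts here,
   and its position in the sound file, counted in samples. */
typedef struct {
    SegLevel level;
    long sample;
} Label;

/* Assumption: a coarser boundary is also a boundary for finer levels. */
static int is_boundary(const Label *l, SegLevel level)
{
    return l->level <= level;
}

/* Finds the index-th (0-based) unit of the given level in a label list
   sorted by position. On success returns 1 and sets *start and *end
   (end is exclusive: the next boundary, or total_samples for the last
   unit). Returns 0 if there is no such unit. */
int unit_bounds(const Label *labels, size_t n, SegLevel level, size_t index,
                long total_samples, long *start, long *end)
{
    size_t i, j, seen = 0;
    assert(labels != NULL && start != NULL && end != NULL);
    for (i = 0; i < n; i++) {
        if (!is_boundary(&labels[i], level))
            continue;
        if (seen == index) {
            *start = labels[i].sample;
            *end = total_samples;
            for (j = i + 1; j < n; j++) {
                if (is_boundary(&labels[j], level)) {
                    *end = labels[j].sample;
                    break;
                }
            }
            return 1;
        }
        seen++;
    }
    return 0;
}
```

With such a list, selecting "the second phrase" or "the fifth word" reduces to finding a pair of sample positions, which is all that playback or export of a fragment needs.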
 
Hardware/software requirements:
  IBM PC or compatible, Windows. The system is implemented in ANSI C.
Text files are stored in the Windows ANSI (Text Only) format; sound files are saved as 16-bit raw PCM at a 16 kHz sample rate.
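Because raw PCM has no header, a time position maps directly to a byte offset, which makes saving a fragment to a separate file straightforward. The following C sketch illustrates this for the format given above; it is not the system's own code. The functions ms_to_byte_offset and copy_fragment are hypothetical, and a single (mono) channel is an assumption, since the channel count is not stated.

```c
#include <assert.h>
#include <stdio.h>

#define SAMPLE_RATE_HZ   16000L /* 16 kHz, as stated in the requirements */
#define BYTES_PER_SAMPLE 2L     /* 16-bit samples; mono is an assumption */

/* Byte offset of a time position in a headerless PCM file.
   Exact because the sample rate is a multiple of 1000. */
long ms_to_byte_offset(long ms)
{
    assert(ms >= 0);
    return ms * (SAMPLE_RATE_HZ / 1000L) * BYTES_PER_SAMPLE;
}

/* Copies the fragment [start_ms, end_ms) of src_path into dst_path.
   Returns the number of bytes copied (fewer than requested if the
   source ends early), or -1 on error. */
long copy_fragment(const char *src_path, const char *dst_path,
                   long start_ms, long end_ms)
{
    char buf[4096];
    long remaining, copied = 0;
    FILE *src, *dst;

    if (start_ms < 0 || end_ms < start_ms)
        return -1;
    src = fopen(src_path, "rb");
    if (src == NULL)
        return -1;
    if (fseek(src, ms_to_byte_offset(start_ms), SEEK_SET) != 0) {
        fclose(src);
        return -1;
    }
    dst = fopen(dst_path, "wb");
    if (dst == NULL) {
        fclose(src);
        return -1;
    }
    remaining = ms_to_byte_offset(end_ms) - ms_to_byte_offset(start_ms);
    while (remaining > 0) {
        size_t want = remaining < (long)sizeof buf ? (size_t)remaining
                                                   : sizeof buf;
        size_t got = fread(buf, 1, want, src);
        if (got == 0)
            break;              /* end of file or read error */
        if (fwrite(buf, 1, got, dst) != got) {
            copied = -1;
            break;
        }
        copied += (long)got;
        remaining -= (long)got;
    }
    fclose(src);
    if (fclose(dst) != 0)
        copied = -1;
    return copied;
}
```

At these settings one second of sound occupies 32,000 bytes, so a fragment from 1.0 s to 1.5 s starts at byte 32,000 and is 16,000 bytes long.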
 
Distribution: Internal use  

Developer team: Department for phonetics and methods of teaching foreign languages, SPbGU
Contact person: Pavel A. Skrelin
Phone: (+7 812) 328 95 65
E-mail: paul@PS1089.spb.edu, paul@phonet.lang.pu.ru