Damien Douxchamps, Nick Campbell (auth.), Andrei Popescu-Belis, Steve Renals, Hervé Bourlard (eds.)3540781544, 9783540781547
The 25 revised full papers presented together with 1 invited paper were carefully selected during two rounds of reviewing and revision from 60 workshop presentations. The papers are organized in topical sections on multimodal processing, HCI, user studies and applications, image and video processing, discourse and dialogue processing, speech and audio processing, as well as the PASCAL speech separation challenge.
Table of contents :
Front Matter….Pages –
Robust Real Time Face Tracking for the Analysis of Human Behaviour….Pages 1-10
Conditional Sequence Model for Context-Based Recognition of Gaze Aversion….Pages 11-23
Meeting State Recognition from Visual and Aural Labels….Pages 24-35
Object Category Recognition Using Probabilistic Fusion of Speech and Image Classifiers….Pages 36-47
Automatic Annotation of Dialogue Structure from Simple User Interaction….Pages 48-59
Interactive Pattern Recognition….Pages 60-71
User Specific Training of a Music Search Engine….Pages 72-83
An Ego-Centric and Tangible Approach to Meeting Indexing and Browsing….Pages 84-95
Integrating Semantics into Multimodal Interaction Patterns….Pages 96-107
Towards an Objective Test for Meeting Browsers: The BET4TQB Pilot Experiment….Pages 108-119
Face Recognition in Smart Rooms….Pages 120-131
Gaussian Process Latent Variable Models for Human Pose Estimation….Pages 132-143
Automatic Labeling Inconsistencies Detection and Correction for Sentence Unit Segmentation in Conversational Speech….Pages 144-155
Term-Weighting for Summarization of Multi-party Spoken Dialogues….Pages 156-167
Automatic Decision Detection in Meeting Speech….Pages 168-179
Czech Text-to-Sign Speech Synthesizer….Pages 180-191
Using Prosodic Features in Language Models for Meetings….Pages 192-203
Posterior-Based Features and Distances in Template Matching for Speech Recognition….Pages 204-214
A Study of Phoneme and Grapheme Based Context-Dependent ASR Systems….Pages 215-226
Transfer Learning for Tandem ASR Feature Extraction….Pages 227-236
Spoken Term Detection System Based on Combination of LVCSR and Phonetic Search….Pages 237-247
Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding….Pages 248-258
Modeling Vocal Interaction for Segmentation in Meeting Recognition….Pages 259-270
Binaural Speech Separation Using Recurrent Timing Neural Networks for Joint F0-Localisation Estimation….Pages 271-282
To Separate Speech….Pages 283-294
Microphone Array Beamforming Approach to Blind Speech Separation….Pages 295-305
Back Matter….Pages –
Reviews
There are no reviews yet.