Machine Learning for Multimodal Interaction: 4th International Workshop, MLMI 2007, Brno, Czech Republic, June 28-30, 2007, Revised Selected Papers

Free Download

Authors:

Edition: 1

Series: Lecture Notes in Computer Science 4892 : Information Systems and Applications, incl. Internet/Web, and HCI

ISBN: 3540781544, 9783540781547

Size: 9 MB (9320286 bytes)

Pages: 308/318

File format:

Language:

Publishing Year:

Category: Tags: , , , ,

Damien Douxchamps, Nick Campbell (auth.), Andrei Popescu-Belis, Steve Renals, Hervé Bourlard (eds.)3540781544, 9783540781547

This book constitutes the thoroughly refereed post-proceedings of the 4th International Workshop on Machine Learning for Multimodal Interaction, MLMI 2007, held in Brno, Czech Republic, in June 2007.

The 25 revised full papers presented together with 1 invited paper were carefully selected during two rounds of reviewing and revision from 60 workshop presentations. The papers are organized in topical sections on multimodal processing, HCI, user studies and applications, image and video processing, discourse and dialogue processing, speech and audio processing, as well as the PASCAL speech separation challenge.


Table of contents :
Front Matter….Pages –
Robust Real Time Face Tracking for the Analysis of Human Behaviour….Pages 1-10
Conditional Sequence Model for Context-Based Recognition of Gaze Aversion….Pages 11-23
Meeting State Recognition from Visual and Aural Labels….Pages 24-35
Object Category Recognition Using Probabilistic Fusion of Speech and Image Classifiers….Pages 36-47
Automatic Annotation of Dialogue Structure from Simple User Interaction….Pages 48-59
Interactive Pattern Recognition….Pages 60-71
User Specific Training of a Music Search Engine….Pages 72-83
An Ego-Centric and Tangible Approach to Meeting Indexing and Browsing….Pages 84-95
Integrating Semantics into Multimodal Interaction Patterns….Pages 96-107
Towards an Objective Test for Meeting Browsers: The BET4TQB Pilot Experiment….Pages 108-119
Face Recognition in Smart Rooms….Pages 120-131
Gaussian Process Latent Variable Models for Human Pose Estimation….Pages 132-143
Automatic Labeling Inconsistencies Detection and Correction for Sentence Unit Segmentation in Conversational Speech….Pages 144-155
Term-Weighting for Summarization of Multi-party Spoken Dialogues….Pages 156-167
Automatic Decision Detection in Meeting Speech….Pages 168-179
Czech Text-to-Sign Speech Synthesizer….Pages 180-191
Using Prosodic Features in Language Models for Meetings….Pages 192-203
Posterior-Based Features and Distances in Template Matching for Speech Recognition….Pages 204-214
A Study of Phoneme and Grapheme Based Context-Dependent ASR Systems….Pages 215-226
Transfer Learning for Tandem ASR Feature Extraction….Pages 227-236
Spoken Term Detection System Based on Combination of LVCSR and Phonetic Search….Pages 237-247
Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding….Pages 248-258
Modeling Vocal Interaction for Segmentation in Meeting Recognition….Pages 259-270
Binaural Speech Separation Using Recurrent Timing Neural Networks for Joint F0-Localisation Estimation….Pages 271-282
To Separate Speech….Pages 283-294
Microphone Array Beamforming Approach to Blind Speech Separation….Pages 295-305
Back Matter….Pages –

Reviews

There are no reviews yet.

Be the first to review “Machine Learning for Multimodal Interaction: 4th International Workshop, MLMI 2007, Brno, Czech Republic, June 28-30, 2007, Revised Selected Papers”
Shopping Cart
Scroll to Top