Publication:

Multi-stream ASR: Oracle Test and Embedded Training

cris.legacyId

83195

cris.virtual.department

PH-STI

cris.virtual.parent-organization

IEM

cris.virtual.parent-organization

STI

cris.virtual.parent-organization

EPFL

cris.virtual.sciperId

117014

cris.virtual.unitId

10381

cris.virtual.unitManager

Cavallaro, Andrea

cris.virtual.unitManager

Sayed, Ali H.

cris.virtualsource.author-scopus

91518bbf-4d04-4822-aa00-6bfcbb690918

cris.virtualsource.department

91518bbf-4d04-4822-aa00-6bfcbb690918

cris.virtualsource.orcid

91518bbf-4d04-4822-aa00-6bfcbb690918

cris.virtualsource.parent-organization

698d568c-939f-4ee2-b445-0b4cb7d94a7d

cris.virtualsource.parent-organization

698d568c-939f-4ee2-b445-0b4cb7d94a7d

cris.virtualsource.parent-organization

698d568c-939f-4ee2-b445-0b4cb7d94a7d

cris.virtualsource.parent-organization

698d568c-939f-4ee2-b445-0b4cb7d94a7d

cris.virtualsource.rid

91518bbf-4d04-4822-aa00-6bfcbb690918

cris.virtualsource.sciperId

91518bbf-4d04-4822-aa00-6bfcbb690918

cris.virtualsource.unitId

698d568c-939f-4ee2-b445-0b4cb7d94a7d

cris.virtualsource.unitManager

698d568c-939f-4ee2-b445-0b4cb7d94a7d

cris.virtualsource.unitManager

49c3c840-fa98-4487-8d6d-b6cef08ee781

datacite.rights

openaccess

dc.contributor.author

Misra, Hemant

dc.contributor.author

Vepa, Jithendra

dc.contributor.author

Bourlard, Hervé

dc.date.accessioned

2006-03-10T16:36:53

dc.date.available

2006-03-10T16:36:53

dc.date.created

2006-03-10

dc.date.issued

2005

dc.date.modified

2024-10-18T07:25:48.324132Z

dc.description.abstract

Multi-stream based automatic speech recognition (ASR) systems outperform their single stream counterparts, especially in the case of noisy speech. However, the main issues in multi-stream systems are to know a) Which streams to be combined, and b) How to combine them. In order to address these issues, we have investigated an `Oracle' test, which can tell us whether two streams are complimentary. Moreover, the Oracle test justifies our previously proposed inverse entropy method for weighting various streams. We have carried out experiments on two multi-stream systems and results indicate that in clean speech around 80% of the time Oracle selected the stream which had the minimum entropy. In this paper, we have also presented an embedded iterative training for multi-stream systems. The results of the recognition experiments on Numbers95 database showed that we can improve the performance significantly by multi-stream iterative training, not only for clean speech but also for various noise conditions.

dc.description.notes

{in Proceedings of ISCA International Conference on Spoken Language Processing (ICSLP), 2006}

dc.description.sponsorship

LIDIAP

dc.identifier.uri

https://infoscience.epfl.ch/handle/20.500.14299/228646

dc.publisher

IDIAP

dc.publisher.place

Martigny, Switzerland

dc.relation

https://infoscience.epfl.ch/record/83195/files/rr05-62.pdf

dc.subject

speech

dc.subject

misra

dc.subject

vepa

dc.subject

bourlard

dc.title

Multi-stream ASR: Oracle Test and Embedded Training

dc.type

text::report

dspace.entity.type

Publication

dspace.legacy.oai-identifier

oai:infoscience.tind.io:83195

epfl.legacy.fileversion

n/a

epfl.legacy.itemtype

Reports

epfl.legacy.submissionform

REP_WORK

epfl.oai.currentset

report

epfl.oai.currentset

OpenAIREv4

epfl.oai.currentset

STI

epfl.publication.version

http://purl.org/coar/version/c_970fb48d4fbd8a85

epfl.url

http://publications.idiap.ch/downloads/reports/2005/rr05-62.pdf

epfl.url.description

URL

epfl.writtenAt

EPFL

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
rr05-62.pdf
Size:
214.76 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: