移动平台上的DSR：Java ME与Symbian的比较

需积分: 0 176 浏览量更新于2024-11-03 收藏 1.68MB PDF 举报

"Java与Symbian：手机上软件实现的分布式语音识别对比" 这篇论文主要探讨了在当今具有强大处理能力的移动电话上，如何合理利用分布式语音识别（DSR）技术。作者Dmitry Zaykovskiy和Alexander Schmitt来自德国乌尔姆大学的信息技术研究所，他们对比了两种主流的移动操作系统——Java ME和Symbian上DSR前端标准的软件实现。首先，文章介绍了ETSI（欧洲电信标准化协会）的DSR前端标准，并在Java ME和Symbian平台上进行了软件实现的性能比较。这一部分可能涉及了各种语音处理算法、数据结构以及系统优化策略，以确保在不同平台上的高效运行。接着，论文讨论了执行时间这一关键指标，这是衡量DSR在实际设备上是否能实时运行的重要因素。通过对当前移动设备的测试，作者证明了在这些设备上进行实时特征提取是可行的。特征提取是DSR过程中的重要步骤，它涉及到对语音信号的预处理，如滤波、分帧、加窗、傅立叶变换等，以提取出有助于识别的关键信息。然而，论文也揭示了在DSR开发和部署过程中可能遇到的挑战和难题。这些可能包括但不限于内存限制、计算资源的优化、平台兼容性问题、语音数据库的管理和更新，以及由于手机硬件限制导致的识别率下降。特别是在词汇量增加时，由于内置的低复杂度模型和有限的处理能力，识别率可能会显著降低。此外，关键词还提到了“特征提取”和“分布式语音识别”，这表明论文深入研究了这两个领域。特征提取不仅关乎算法选择，还可能涉及到特征维度的压缩、特征选择以及特征学习等技术，以减少计算负担并提高识别准确率。而分布式语音识别则关注如何在多个设备或服务器之间分布计算任务，以实现大规模、高效的语音处理。这篇论文对于理解Java ME和Symbian平台在DSR技术上的差异和实际应用提供了深入的见解，对于移动设备上的语音识别开发者和研究者具有很高的参考价值。它提醒我们在设计和实施DSR解决方案时，必须充分考虑设备的硬件限制和实时性需求，同时解决开发和部署过程中可能出现的各种问题。

Java vs. Symbian: A Comparison

Software-based DSR

Implementations on Mobile Phones

Dmitry Zaykovskiy and Alexander Schmitt

Institute

Information Technology, University

ofUlm,

Ulm, Germany

{dmitry.zaykovskiy,

alexander.schmitt}@uni-ulm.de

Abstract

With increasing processing power

today's mobile

phones, a reasonable employment

Distributed Speech

Recognition (DSR) Technology comes into reach. This pa-

per presents the ETSI DSR front-end standards as software-

based implementations on the two most popular mobile

phone platforms: Java ME and Symbian.

present per-

formance comparisons between the different front-end stan-

dards as well as the execution times on current mobile

phones. After showing that real-time feature-extraction on

current devices is possible, we furthermore point out hin-

drances and pitfalls during development and deployment.

Keywords: Java ME, Symbian, mobile phones, feature

extraction, distributed speech recognition (DSR).

also restricted to a very limited quantity

words. With a

growing number

entries in the phone book, the recog-

nition rate suffers severely due to built-in low-cost proces-

sors, limited storage and RAM. With this embedded strat-

egy applications such as SMS dictation or the use

nat-

ural language are far out

reach for the near future. The

most vividly discussed proposal to overcome this challenge

is the principle

Distributed Speech Recognition. In this

approach, the speech recognition process is separated into

two parts: a

front-end on the client-side and a back-end on

the server-side.

The front-end extracts characteristic features out

the

speech signal, whereas the back-end, making use

the lan-

guage and acoustic models performs the computationally

costly recognition.

1 Introduction

Figure

Architecture

a client-server DSR system.

2 Related Work

Figure 1 shows a system architecture for DSR. The client

captures the speech signal using a microphone and extracts

features out

the signal. The features are compressed in

order to obtain low data rates and transmitted to the server.

At the server back-end, the features are decompressed and

subjected to the actual recognition process.

Bit-Stream Decoding

and

Error Mitigation

Bit-stream

Server

Client

Recognit~on

Result:

While there have been a number

studies on the the-

ory

DSR, little research has been done on deploying

this technology to real mobile devices. The authors

[1] proposed a modified version

the widespread ETSI

FE-standard for DSR, implemented as a hardware-based

front-end solution for a Motorola Digital Signal Processor

(DSP). A software-based implementation

the ETSI AFE-

standard for PDAs using Sphinx-4 [10] as speech recog-

• the number

recognizable words is very limited,

• usually the words have to be recorded by the user be-

forehand,

• the recognition system

speaker dependent.

Some recent devices spare the user the necessity to pre-

record the commands and feature speaker-independent, em-

bedded speech recognition. This functionality, however, is

The days are numbered where we used our mobile phones

exclusively for telephone conversation. Today we have ac-

cess to thousands

different applications and services for

our mobile companions and their number is rapidly grow-

ing. Although the devices have stopped getting smaller and

smaller, we see ourselves confronted with a limited user

interface consisting

tiny keys and a miniature display,

which suffices for making phone calls, but which is un-

suited to control applications. The most promising solution

to this challenge

the use

speech recognition.

we take a glance at modem mobile phones, we indeed

discover basic speech recognition functionality: most mo-

bile devices on the market support voice control features

such as voice dialling or hands-free commands. Instead

searching names in the telephone book, the user can dictate

the name

a person he wants

call or, on some devices,

use a voice command to launch a particularphone function.

Although this technology points out new ways to improve

user interfaces on mobile phones, it still has severallimita-

tions:

Authorized licensed use limited to: IEEE Xplore. Downloaded on March 17, 2009 at 15:53 from IEEE Xplore. Restrictions apply.

下载后可阅读完整内容，剩余5页未读，立即下载

chen715660403

粉丝: 0
资源: 4

移动平台上的DSR：Java ME与Symbian的比较

"基于单片机的多功能电能表设计及智能算法优化研究

JavaScript图片比较工具：photos-comparison-js详细解析

Exam-Papers-Comparison：智能应用提高学生查找重复试题效率

Computer support in a rural setting: A comparison of computer-assisted and handwritten evaluation reports

Tutoring interventions within special education settings: A comparison of cross-age and peer tutoring

Error querying database. Cause: java.lang.IllegalArgumentException: invalid comparison: java.util.Date and java.lang.String

database. Cause: java.lang.IllegalArgumentException: invalid comparison: java.util.Date and java.lang.String 这个报错 怎么解决

java.lang.IllegalArgumentException: invalid comparison: java.util.Date and java.lang.String

最新资源

database. Cause: java.lang.IllegalArgumentException: invalid comparison: java.util.Date and java.lang.String 这个报错怎么解决