A54 Maheswari

An Interactive RTOS Based Machine Controller through Speech using ESTEREL
R. Maheswari
Asst Professor (Senior), Vellore Institute of Technology, +919042187469, maheswari_20@hotmail.com
S. Sheeba Rani
Asst Professor, Vels University, +919841835050, sheeba_rani@yahoo.com
ABSTRACT
An embedded system architecture is developed in a mobile-like handset device for controlling machinery in industries through a wireless mode of communication. Speech recognition is the ability of machines to respond to spoken commands. The mechanism is activated in two ways: (1) by speech and (2) manually by hand. The speech mode is used when speech input is given to the device. A software program is developed for recognizing the commands; it acts according to the conditions specified in the code, and the corresponding machinery actions are controlled. The manual mode is used when the user presses a button, which is designed to control all necessary actions of the machine. This method of automating machines in industry serves as a communication and control aid for differently-abled people working in industries.
Andreas Gerstlauer, Haobo Yu, and Daniel D. Gajski (2003) proposed the implementation of high-level models in an RTOS. Certain RTOS features have been incorporated to perform real-time tasks: scheduling (deterministic algorithms, usually with some sort of priority, and a predictable worst-case task fly back time), resource management (low overhead, aware of priority inversions), and interrupt handling (guaranteed worst-case interrupt latency). Cyprian F. Ngolah, Yingxu Wang, and Xinming Tan (2004) proposed a formal description of task scheduling algorithms that avoids multiple blocking and deadlock problems.
Keywords
Speech recognition circuit, mode control button, sound acquisition
2. PROCESS:
A typical complete speech recognition process consists of the following parts: (1) sound conversion, (2) fragmentation, (3) recognition. Sound Acquisition: The user's voice is captured with the help of the mic in the handset. Sound Conversion: The digital sound captured by the sound card through the mic is converted into a more manageable format. The converter translates the stream of amplitudes that form the digital sound wave into its frequency components. It is still a digital representation, but more akin to what a human ear really perceives.
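The conversion from a stream of amplitudes to frequency components can be illustrated with a discrete Fourier transform. The sketch below is only an illustration: the paper does not say which transform the converter uses, and the function names, the 8000 Hz sample rate, and the 1000 Hz test tone are assumptions made here.

```python
import cmath
import math


def dft_magnitudes(samples):
    """Return the magnitude of each frequency bin (0 .. Nyquist) of a sample list."""
    n = len(samples)
    magnitudes = []
    for k in range(n // 2 + 1):
        acc = 0j
        for t, x in enumerate(samples):
            # Correlate the signal with a complex sinusoid at bin k.
            acc += x * cmath.exp(-2j * math.pi * k * t / n)
        magnitudes.append(abs(acc))
    return magnitudes


def dominant_frequency(samples, sample_rate):
    """Return the frequency (Hz) of the strongest non-DC component."""
    mags = dft_magnitudes(samples)
    peak_bin = max(range(1, len(mags)), key=lambda k: mags[k])  # skip the DC bin
    return peak_bin * sample_rate / len(samples)


# Hypothetical input: 64 samples of a 1000 Hz tone sampled at 8000 Hz.
SAMPLE_RATE = 8000
tone = [math.sin(2 * math.pi * 1000 * t / SAMPLE_RATE) for t in range(64)]
```

Calling dominant_frequency(tone, SAMPLE_RATE) recovers the frequency of the test tone. A practical converter would more likely use a fast Fourier transform over short overlapping windows.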
1. INTRODUCTION
The main aim of designing this handset is to provide an accessibility tool for individuals working in industries who have physical or cognitive difficulties, impairments, or disabilities. A software program is developed for recognizing the commands. It takes the input from the user in the form of speech, recognizes it, and acts according to the conditions specified in the code, and the corresponding machinery actions are controlled. The proposed handset is a speaker-independent controller for industry applications that uses the ESTEREL language to control certain real-time tasks.
3. FRAGMENTATION:
The next stage is the identification of phonemes, the elementary sounds that are the building blocks of words. Each frequency component of the sound is mapped to a specific phoneme. This process completes the conversion from sounds to words. The recognition procedure is divided into two consecutive stages, depending on the data set and the test set: (1) Training (for the data set) (2) Comparison and Classification (for the test set).
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. A2CWiC 2010, September 16-17, 2010, India Copyright 2010 978-14503-0194-7/10/0009 $10.0
3.1 Training: The words that have to be recognized need to be added to the database provided in the software. The words can be added to the database dynamically. Sushil Kumar Podder, Khaled Shaban, Jiping Sun, Fakhri Karray, Otman Basir, and Mohamed Kamel (2003) proposed a prototype for a large-vocabulary continuous speech recognition system and increased performance using Sphinx3. Some level of training is required for accurate recognition.
Ameet Patil and Neil Audsley (2004) proposed an RTOS kernel built on a module-based reflective mechanism that allows applications to dynamically change the behaviour of the RTOS. If the remote receives two or more inputs at the same time, the aperiodic scheduling algorithm in the RTOS resolves the conflict and activates or deactivates the machine that has the highest priority. Melissa Vetromille, Luciano Ost, César A. M. Marcon, Carlos Reif, and Fabiano Hessel (2006) proposed three different RTOS scheduler implementation approaches: (i) implemented in software running on the same processor as the application tasks, (ii) implemented in software running on a co-processor, and (iii) implemented in hardware, while the application tasks run on a processor.
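The conflict resolution described above, where the highest-priority machine is served first when several inputs arrive together, can be sketched with a priority queue. This is a simplified illustration and not the aperiodic scheduling algorithm of any particular RTOS; the class name, the machine names, and the convention that a lower number means a higher priority are all assumptions.

```python
import heapq


class CommandScheduler:
    """Serve pending machine commands in priority order (hypothetical helper)."""

    def __init__(self):
        self._queue = []
        self._arrival = 0  # tie-breaker so equal priorities stay first-come, first-served

    def submit(self, priority, machine, action):
        # Assumption: a lower number means a higher priority.
        heapq.heappush(self._queue, (priority, self._arrival, machine, action))
        self._arrival += 1

    def next_command(self):
        """Return the (machine, action) pair to run next, or None if idle."""
        if not self._queue:
            return None
        _, _, machine, action = heapq.heappop(self._queue)
        return machine, action


# Two inputs arrive at the same time; the press has the higher priority.
scheduler = CommandScheduler()
scheduler.submit(2, "conveyor", "activate")
scheduler.submit(1, "press", "deactivate")
first = scheduler.next_command()
```

Here first is the command for the press, because it was submitted with the higher priority even though it arrived second.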
3.2 Comparison And Classification: At this stage, the generated word is compared with the words in the program. Based on the result, the appropriate function is performed.
4. SYSTEM ARCHITECTURE:
The proposed architecture for building the speech recognition system has two modes of command: speech command and manual command. The system processes the command given by the user to activate the desired appliance.
Overview:
It is an architecture for controlling the home appliance through speech commands. The user gives the command to a handset that contains the speech recognition software. On receiving the sound signal, the software performs the recognition process described above.
5.1.1 Alarm / Timer
In case of an emergency, such as a machine out of control, mishandling of machines, or a machine jam, the ESTEREL program for the alarm is activated to produce an alarm sound. The timer program is written to set a time duration for the machine to work and to bring down the machine when the time elapses.
5.2 User Program:

Depending on the user's input mode, the user program is invoked and receives the input signal. If the input is speech, the speech recognition component in the program sends the phrase spoken by the user to the speech recognition engine for recognition.
5. WORKING PRINCIPLE:
The proposed model consists of three main units: (1) Remote (2) Receiving Unit (3) Appliances
5.3 Speech Recognition Engine:

The proposed recognition software is speaker independent. Yong-Qian Ying and Peng-Yung Woo (1999) proposed speech recognition in digital speech processing, in which models such as the Markov model and dynamic time warping are compared with the newly developed technology. The engine splits the received string into phonemes and tries to group the homophones together from the vast collection of words in the database. These grouped words are then compared with a set of words that are already saved in the database. If a match occurs, that word is returned to the user program. The string returned from the engine is then checked against a set of options in the user program. If the condition is satisfied, the signal containing the data is transferred to the microcontroller. If there is a mismatch, it is brought to the knowledge of the user in both modes.
The circuit is a stand-alone trainable speech recognition circuit that may be interfaced to control just about anything electrical, such as appliances, robots, test instruments, VCRs, and TVs. The circuit is trained (programmed) to recognize the words you want it to recognize. The unit can be trained in any language and even in non-languages such as grunts, birdcalls, and whistles. The entire speech recognition circuit is available as a kit (SR-07) or may be hardwired together in accordance with the schematic.
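The check between the word returned by the engine and the options in the user program can be sketched as a table lookup. The command words, the machine names, and the shape of the returned record are hypothetical; the paper does not list the actual command set.

```python
# Hypothetical command table: recognized word -> (target machine, value to send).
COMMANDS = {
    "start": ("machine_1", 1),
    "stop": ("machine_1", 0),
}


def handle_recognized_word(word, commands=COMMANDS):
    """Return the signal data for a known command, or a mismatch notice."""
    key = word.strip().lower()
    if key in commands:
        target, value = commands[key]
        # Condition satisfied: this data would be passed on to the microcontroller.
        return {"status": "ok", "target": target, "value": value}
    # No matching option: the user is informed of the mismatch.
    return {"status": "mismatch", "word": word}
```

For example, handle_recognized_word("Start") yields the record for machine_1 with value 1, while an unknown word yields a mismatch record that the handset could report to the user.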
REMOTE: It consists of (1) Dynamic scheduling (2) Alarm / Timer (3) User program (4) Speech recognition engine (5) Transmitter unit
Figure 1. Architecture of Machine Controller
5.1 Dynamic Real Time Scheduling:
The speech recognition circuit (SR-07) uses a simple keypad and digital display to communicate with and program the HM2007 chip.
This chip provides the option of recognizing either forty 0.96-second words or twenty 1.92-second words. A jumper on the main circuit board selects either the 0.96-second word length (40-word vocabulary) or the 1.92-second word length (20-word vocabulary). For accuracy, the 1.92-second option is used.
Use four word spaces for each target word. Arrange the words so that they can be recognized by decoding just the least significant digit (number). To accomplish this, word spaces 01, 11, 21 and 31 are allocated to the first target word. By decoding only the least significant digit, in this case the 1 of "X" "1" (where X is any number 0 - 3), we can recognize the target word. The effective vocabulary drops from forty words to ten words. The decoding circuit that recognizes the word number and performs a function must be designed to recognize error codes 55, 66 and 77 and not confuse them with word spaces 5, 6 and 7.
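The decoding rule above can be sketched in software as follows. The paper describes a decoding circuit, so this function is only an illustration of the logic; its name and return format are assumptions. The error codes are checked first so that 55, 66 and 77 are never mistaken for word spaces 5, 6 and 7.

```python
# Error codes reported by the chip, as listed in this paper.
ERROR_CODES = {55: "word too long", 66: "word too short", 77: "no match"}


def decode_word_space(code):
    """Map a two-digit chip output to ("error", reason) or ("word", target_digit)."""
    if code in ERROR_CODES:
        # Must be tested before taking the last digit.
        return ("error", ERROR_CODES[code])
    if not 1 <= code <= 40:
        raise ValueError(f"unexpected word space: {code}")
    # Word spaces X1 (X = 0..3) all refer to target word 1, and so on.
    return ("word", code % 10)
```

With this rule, word spaces 01, 11, 21 and 31 all decode to target word 1, while code 55 is reported as an error.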
5.4.1 Testing Recognition:
The circuit is continually listening. Repeat a trained word into the microphone. The number of the word should be displayed on the digital display. For instance, if the word "directory" was trained as word number 5, saying the word "directory" into the microphone will cause the number 5 to be displayed on the digital display.
Error Codes:
The chip provides the following error codes.
Figure 2. Speech Recognition Circuit

The keypad is made up of 12 switches:
1 2 3
4 5 6
7 8 9
* 0 #
When the circuit is turned on, the HM2007 checks the memory (static RAM) status. If successful, the board displays "00" on the digital display and lights the red LED (READY). In the "Ready" state, the circuit is listening for a verbal word to recognize or may be programmed (trained).
55 = word too long
66 = word too short
77 = no match

5.4.2 Clearing The Memory:
To erase all the words in the RAM (training), press "99" on the keypad and then press the "*" key. The display will scroll through the numbers 1-40 (or 1-20) quickly, clearing out the memory. To erase a single word space, press the number of the word you want to clear, then press the "*" key.
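The keypad commands described in this paper (a word number followed by "#" to train, a word number followed by "*" to clear, and "99" followed by "*" to clear everything) can be modelled as a small state machine. This is a software model for illustration only, not the chip's firmware; the class name, the returned status strings, and the default capacity of 40 words are assumptions.

```python
class KeypadTrainer:
    """Toy model of the keypad command sequences (hypothetical helper)."""

    def __init__(self, capacity=40):
        self.capacity = capacity  # 40 or 20, depending on the jumper setting
        self.trained = set()      # word numbers that currently hold a word
        self._digits = ""         # digits typed since the last command key

    def press(self, key):
        if key.isdigit():
            self._digits += key
            return None
        if key not in ("#", "*"):
            raise ValueError(f"unknown key: {key!r}")
        digits, self._digits = self._digits, ""
        if not digits:
            return "ignored"
        number = int(digits)
        if key == "*" and number == 99:
            self.trained.clear()
            return "cleared all"
        if not 1 <= number <= self.capacity:
            return "invalid word number"
        if key == "#":
            # The real circuit would now listen for the spoken training word.
            self.trained.add(number)
            return f"training word {number}"
        self.trained.discard(number)
        return f"cleared word {number}"
```

Pressing "1" then "#" marks word 1 as trained, "1" then "*" clears it again, and "9", "9", "*" empties the whole memory.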
6. TRANSMITTER UNIT:
6.1 Signal Processing:

According to the data it receives, the microcontroller generates a signal containing the port number of the specific appliance and a value. Colt R. Correa and Dr. Selim Awad (2001) proposed a prototype of a hardened version of the 8051 microcontroller, able to assure reliable operation in the presence of bit flips caused by radiation. The AT89C51 is a low-power, high-performance CMOS 8-bit microcomputer with 4K bytes of Flash programmable and erasable read only memory (PEROM). The AT89C51 provides the following standard features: 4K bytes of Flash, 128 bytes of RAM, 32 I/O lines, two 16-bit timer/counters, a five-vector two-level interrupt architecture, a full duplex serial port, an on-chip oscillator, and clock circuitry. Chao-Lin Wu, Wei-Chen Wang, and Li-Chen Fu (2004) proposed a method for reducing the network traffic and computation load by delegating the management function to each control node. This signal is then transferred to the transmitter.
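A signal that carries a port number and a value could be packed into a single byte as sketched below. The paper does not specify the signal format, so the layout (port number in the upper bits, an on/off value in the lowest bit), the 0 to 15 port range, and the function names are all assumptions made for illustration.

```python
def encode_signal(port, value):
    """Pack a port number (0-15) and an on/off value (0 or 1) into one byte."""
    if not 0 <= port <= 15:
        raise ValueError("port must be in 0..15")
    if value not in (0, 1):
        raise ValueError("value must be 0 or 1")
    # Assumed layout: bits 4..1 hold the port number, bit 0 holds the value.
    return (port << 1) | value


def decode_signal(byte):
    """Recover (port, value) from a byte produced by encode_signal."""
    return byte >> 1, byte & 1
```

The receiving unit would apply decode_signal to recover which appliance port to drive and whether to switch it on or off.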
5.4 Training
To train the circuit, press the number of the word that needs to be trained on the keypad. The circuit can be trained to recognize either 40 (one-second) words or 20 (two-second) words. This option is selected by setting a jumper on the main circuit board. Any number between 1 and 20 (or 40) may be used. For example, press the number "1" to train word number 1. When the number is pressed on the keypad, the red LED turns off and the number is shown on the digital display. Next, press the "#" key to train the word. Pressing the "#" key signals the chip to listen for a training word, and the red LED turns back on. Now speak the word you want the circuit to recognize into the microphone. The LED should blink off momentarily; this signals that the word has been accepted. Continue training new words using the procedure outlined above: press the "2" key and then the "#" key to train the second word, and so on. For speaker-independent (multiuser) recognition, four word spaces are used for each target word.
6.2 Signal Transmission:

The transmitter converts the digital signal to an RF signal at a specific frequency (less than 1 GHz), and control transfers to the receiving unit.
Hiroyuki Tomiyama, Shin-Ichiro Chikada, Shinya Honda, and Hiroaki Takada (2005) proposed an RTOS-oriented methodology for the design and validation of embedded systems. The programming is done in ESTEREL, an imperative synchronous language, and the target processor is the RTX2000, a high-performance real-time microcontroller based on RISC architecture. Several of its characteristics are valuable for real-time applications.
8. CONCLUSION:
An architecture for automating the machines in industry with the help of speech recognition and a manual mode is proposed, which will be helpful for visually impaired and physically challenged people.
9. REFERENCES
6.4 Transmitter & Receiver
The TWS-434 and RWS-434 are extremely small, and are excellent for applications requiring short-range RF remote controls.
[1]. Ameet Patil and Neil Audsley (2004). Implementing Application Specific RTOS Policies Using Reflection, Proceedings of 11th IEEE Real Time and Embedded Technology and Applications Symposium.
[2]. Andreas Gerstlauer, Haobo Yu, Daniel D. Gajski (2003). RTOS Modeling for System Level Design, IEEE Transaction on Computer, Vol:21, No:1.
[3]. Arno Wacker, Timo Heiber, and Holger Cermann (2004). A Key Distribution Scheme for Wireless Home Automation Networks, IEEE Transaction on Consumer Electronics, pp 47-52.
[4]. Chao-Lin Wu, Wei-Chen Wang, and Li-Chen Fu (2004). Mobile Agent based Integrated Control Architecture for Home Automation System, IEEE transaction of Intelligent Robots.
Figure 3. TWS-434 & RWS-434 Pin Diagram

The same sequence of operations is carried out for manual mode input, except that control of the input signal never transfers to the speech recognition engine.
[5]. Colt R. Correa, Dr. Selim Awad (2001). Embedded Controller Software and Algorithm Development Tool, IEEE Instrumentation and Measurement.
[6]. Cyprian F. Ngolah, Yingxu Wang, and Xinming Tan (2004). Implementing Task Scheduling and Event Handling in RTOS, IEEE Transaction on Computers.
[7]. Hiroyuki Tomiyama, Shin-Ichiro Chikada, Shinya Honda, Hiroaki Takada (2005). An RTOS-Based Approach To Design And Validation Of Embedded Systems, IEEE Transactions on Consumer Electronics.
[8]. Melissa Vetromille, Luciano Ost, César A. M. Marcon, Carlos Reif, Fabiano Hessel (2006). RTOS Scheduler Implementation In Hardware And Software For Real Time Applications, Proceedings of the 7th IEEE on Rapid System Prototyping.
[9]. Sushil Kumar Podder, Khaled Shaban, Jiping Sun, Fakhri Karray, Otman Basir, and Mohamed Kamel (2003). Performance Improvement Of Automatic Speech Recognition Systems Via Multiple Language Models Produced By Sentence-Based Clustering, IEEE Transactions On Consumer, pp 362-367.
[10]. Yong-Qian Ying, Peng-Yung Woo (1999). Speech Recognition Using Fuzzy Logic, IEEE Transactions on Computers.
6.5 Unique Features:

Some features are proposed uniquely for visually challenged people: (1) power failure notification (2) signal retransmission.
7. SIGNAL RETRANSMISSION:
The signal is retransmitted to the transmitter unit for acknowledgement in both modes. This enhances the efficiency of this method of automation. If the input is in button mode, the user presses the button, the signal is sent to the program, and the same procedure as in the other mode is performed. The proposed handset model would look like Figure 4:
Figure 4. Remote
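The retransmission of the signal back to the transmitter unit for acknowledgement can be sketched as an echo check. The paper does not describe what happens when the echo does not match, so the retry loop, the max_attempts parameter, and the channel callable (which stands in for the RF link and the receiving unit) are assumptions made for this illustration.

```python
def send_with_acknowledgement(signal, channel, max_attempts=3):
    """Send a signal and wait for the receiver to echo it back.

    channel is a hypothetical callable that transmits the signal and returns
    whatever the receiving unit retransmits. The function returns the number
    of attempts used, or None if no matching echo was ever received.
    """
    for attempt in range(1, max_attempts + 1):
        echoed = channel(signal)
        if echoed == signal:
            return attempt  # acknowledged: the receiver got the same data
    return None


def perfect_channel(signal):
    """Stand-in link that always echoes the signal unchanged."""
    return signal
```

With perfect_channel the signal is acknowledged on the first attempt. A None result could be used to notify the user that the machine did not confirm the command.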
