Application of MIDI Technique For Medical Audio Signal Coding
Application of MIDI Technique For Medical Audio Signal Coding
USA
sounds.[3]
Through our implementation of this proposed method, we
are also applying a MIDI encoding to other types of sound
materials such as animal voices, human speeches and songs.
But in this paper, we are focusing on heart sounds and lung
sounds, present several examples selected from our coding
experiments, and before that we are going to briefly describe
our proposed encoding and decoding algorithms.
11. BENEFITS OF MIDI FOR MEDICINE
For encoding a given source signal into MIDI, we must analyze the signal and extract its features, therefore we can use a
encoding process as a signal analyzing tool which is capable of
inspecting, diagnosing and classifying audio data. We can also
use a MIDI encoding as a signal visualization tool because it
can present a signal as a musical score. Moreover, We can use
a MIDI encoding as a database authoring tool, and we can
construct high-efficient audio databases and provide MIDI
keyword retrieval functions to it. The following sections are
listed up features of MIDI encoding and those merits especially
for medical applications.
A. Low Bit Rate
Suppose a l-second sound stream which is composed of 8
musical notes played by an instrument, its PCM coded size in
CD quality becomes about 80 kbytes, while its MIDI coded
size is only about 80 bytes, where the coded size of each note
is about 10 bytes. This feature especially gives benefits to
applications of tele-medicine and archiving of electronic medical records.
B. Score Presentation
Using a conventional 5-line musical score or a MIDI specialized editor, we can view content of signals and also modify it.
Moreover a MIDI encoding process includes feature extractions, and each note on a score may represent some diagnosis
point. This feature can be used as a new type of medical presenting medium for auscultators to explain to patients their
conditions, while the conventional presentation methods are
limited to a wave form PCG (phono-cardiogram) and a sound
spectrogram.
C.
1417
(0-7803-4262-3/97/$10.00 (C) 1997 IEEE)
Multi-Track Coding
Proceedings - 19th International Conference - IEEE/EMBS Oct. 30 - Nov. 2, 1997 Chicago, IL. USA
Detection
Converted M D I data
A. Detection of Peaks
This process is extracting local peaks in a PCM sampled signal
after removing its DC level, on condition that two values of
each pair of neighbor peaks become an opposite polar. M e r
that a basic frequency value of f(p) is given to each extracted
peak p by the following formula:
(1)
f(p)=Fs*n / { x(p+2n) - x(p) }
where x(p) is the sampled location of peak p, Fs is the sampling frequency and n is an appropriate natural number determined by an encoding operator.
B. Detection of Signal Sections
A signal section is at first extracted by an amplitude slicing as a
group of consecutive peaks whose absolute levels ]v(p)] are
more than the specified value S1. Next each section is subdivided by a basic frequency of each peak included in this section
in order that every peak of the section have a similar basic
1418
Proceedings - 19th International Conference - IEEE/EMBS Oct. 30 - Nov. 2, 1997 Chicago, IL. USA
1419
Proceedings - 19th International Conference - IEEE/EMBS Oct. 30 - Nov. 2, 1997 Chicago, IL. USA
D. Evaluation Methods
Our goal of encoding technique is reproducing not the same
signal but similar signal to the original. Using our developed
encoding software, we can compare decoded sounds to the
original by playback both sounds, showing two wave patterns
and two spectrograms.
VI. RESULTS OF EXPERIMENTS
In this paper we present 3 coded examples of both heart
sounds and lung sounds whose source materials are stored in
reference[5][6]. All of the source PCM data, whose length is
about 20 seconds, is sampled at frequency 44.1 [kHz] and each
sample is quantized to 16-bit resolution, therefore the source
bit rate is about 690 [kbps]. By trial and error, the encoding
parameters described in section V-C are determined as n=3,
S1=30[%], Sn=6, Lgap=lO and Lmin=20 for all of heart
sounds, whereas n=3, Sl=lOy/.], Sn=3, Lgap=3 and Lmin=6
for all of lung sounds. As a result, the converted bit rate becomes 206[bps] to 1.7 [kbps], and the compression ratio of all
cases is less than 1/400, and the ratio of the heart sounds is
better than the lung sounds.
In Fig.3, two drawings at each case shows two types of
musical scores, and we must explain the upper one designed by
us, which can present all of parameters in MIDI data. Each
rectangle indicates one note, its horizontal position is time and
vertical position is pitch, and its width is length and its height is
strength. The similar score is used for lung sounds in Fig.4.
In Fig.3, comparing three sets of scores, we can easily find
additional notes called third sound in the second DCM case,
additional harmony of notes called murmur in the third AS case,
and these two must be familiar to medical experts.
In Fig.4-(2), we can extract three significant notes whose
pitch are C3, which are called rhonchus known as a feature of
bronchial athma.
2-Beat rhyibm
Bit-rate: 206 bps
I
. . . .
Summation
Gallop Rhythm
Bit-rate: 460 bps
(3) AS Case
(Aortic Slenosis)
Mid-systolic Murmur
Bit-rate 440 bps
.......
......
.. . , ..
.....
......
....
....
c3
c3
c3
i.........
......
........
........
.... ........
......
REFERENCES
[l] K. Kashino, K. Nakadai, T. Kinoshita, and H.
Tanaka, Note recognition mechanisms in the OPTIMA
processing architecture for music scene analysis, IEICE
Transactions, Vol.J79-D-II, No.11, pp.1751-1761, 1996.
[Z] H. Kanai, N. Chubachi, and Y. Koiwa, Acoustical diagnosis in medical engineering, Journal of IPU, Vo1.36,
No.3, 1995.
[3] T. Modegi and S. Iisaku, A MIDI encoding algorithm for
physiologic rhythmical sound, IEICE Proceedings of
General Conference, No.SD-5-1, 1997.
[4] J. Arai, D71MHandbooks - SMF Reference Book, Tokyo,
Japan,: Rittor Music, 1996.
[5] T. Sawayama, Cardiac Auscultations - Exercise with
Compact Disc, Tokyo, Japan,: Nankodo, 1994.
[6] T. Ishihara, Pulmonary Auscultation, Lung Sounu5 on
Compact Disc, Tokyo, Japan, :Nankodo, 1993,
1420