Аппаратные и программные средства реального времени для одно-и двумерных микрофонных решеток
Диссертация
Предложен способ компенсации разницы фаз для сигналов различных каналов микрофонной решетки, предполагающий использование аналоговых и цифровых узлов реального времени с программируемой задержкой. Предложенный способ компенсации разницы фаз предполагает эффективную аппаратную реализацию на основе современных ПЛИС, что позволяет снизить потребляемую мощность встраиваемых многоканальных систем… Читать ещё >
Список литературы
- Akagi М., Mizumachi M. Noise reduction by paired microphones I I 5th European conf. on speech communication and technology. Rhodes, Greece, 22−25 Sept, 1997. Vol.1, pp. 335−338.
- Akaike H. A new look at the statistical model identification // IEEE Transactions on Automation Control, December 1974. Vol. AC-19, pp. 716 723.
- Allen J. В., Berkley D. A. Image method for efficiently simulating small-room acoustics // J.Acoust. Soc. Am., 1979. Vol. 65, pp. 943−950.
- Aoshima, N. Computer-generated pulse signal applied for sound measurement // J. Acoust. Soc. Am., May 1981. Vol. 69, pp. 1484−1488.
- Applebaum S. P., Chapman D. J. Adaptive arrays with main beam constraints // IEEE Transactions on Antennas and Propagation, September 1976. Vol. AP-24, pp. 650−662.
- Bell A.J., Sejnowski T. J. An information-maximization approach to blind separation and blind deconvolution // Neural Computation. 1995. Vol. 7, pp. 1129−1159.
- Bogdanovich M. В., Zhur A. V., Malevich I. Yu. A method of increasing the effectiveness of structural adaptive nonlinear interference protection for receiver amplifier channels // Radio Engineering, Mar. 1991. Vol. 46. No. 3, pp. 45−49.
- Boll S. F. Suppression of acoustic noise in speech using spectral subtraction // IEEE Trans, on Acoustics, Speech and Signal Processing, April 1979, Vol.27, pp. 113−120.
- Brandstein M. S, and Silverman H. F. A practical methodology for speech source localization with microphone arrays // Computer Speech and Language. April 1997. Vol. 11, pp. 91−126.
- Burnham D., Ciocca V., Stokes S. Auditory perception of lexical tone // Proceedings of 7th European conference on speech communication and technology. Eurospeech 2001 Scandinavia. Vol. 1, pp.395−398.
- Capon J. High resolution frequency-wavenumber spectrum analysis // Proceedings of the IEEE. August 1969. Vol. 57, pp. 1408−1418.
- Castelli E., Istrate D. Everyday life sounds analysis for a medicalthtelemonitoring system // Proceedings of 7 European conference on speech communication and technology. Eurospeech 2001 Scandinavia. Vol.4, pp. 2417−2420.
- Chase L. Word and acoustic confidence annotation for large vocabularyLspeech recognition // Proceedings of 5 European conference on speech communication and technology. Eurospeech '97, 1997, Rhodes, Greece, pp. 815−818.
- Cohen I., Berdugo B. Microphone array post-filtering for non-stationary•Lnoise suppression // In Proceedings 27 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2002, pp. 901−904.
- Compernolle V. D. Switching adaptive filters for enhancing noisy and reverberant speech from microphone array recordings // Proc. ICASSP '90, 1990, pp. 833−836.
- Compton R. T. The relationship between tapped delay-line and FFT processing in adaptive arrays // IEEE Transactions on Antennas and Propagation, 1988. Vol. 36. No. 1, pp. 15−26.
- Cosi P., Tesser F., Gretter R., Avesani C., Marcon Mike. Festival speaksth1. alian // Proceedings of 7 European conference on speech communication and technology. Eurospeech 2001 Scandinavia. Vol. 1, pp.509−512.
- Dorbecker M. Small microphone arrays with optimized directivity for speech enhancement // Proceedings of 5th European conference on speech communication and technology. Eurospeech 1997 (ESCA). Vol. 5, pp. 327 330.
- Er M., Cantoni A. Derivative constraints for broad-band element space antenna array processors // IEEE Trans. Acoustics, Speech, and Signal Processing, 1983. Vol. 31. No. 6, pp. 1378−1393.
- Epps J., Dowd A., Smith J., Wolfe J. Real time measurements of the vocal track resonances during speech // Proceedings of 5lh European conference on speech communication and technology. Eurospeech 1997 (ESCA). Vol.2, pp. 721−724.
- Falconer D. D. Adaptive reference echo cancellation // IEEE Trans. Commun., Sept. 1982. Vol. COM-30. No. 9, pp. 2083−2094.
- Fernandez J., Lleida E., Masgrau E. Microphone array design for robust speech acquisition and recognition // Proceedings of 6th European conference on speech communication and technology. Eurospeech 1999 -(ESCA). Vol.5, pp.2363−2366.
- Fernandez D. L., Cgarcia M. C. Application of several channel and noise compensation techniques for robust speaker recognition // Proceedings of 5th European conference on speech communication and technology. Eurospeech 1997(ESCA). Vol.3, pp.1115−1118.
- Fissore L., Micca G., Vair C. Methods for microphone equalization in speech recognition // Proceedings of 5th European conference on speech communication and technology. Eurospeech 1997, Vol, pp. 2415−2418.
- Flanagan J. L. Bandwidth design of speech-seeking microphone arrays // In Proceedings of 1985 ICASSP, March 1985. Tampa, Florida, pp. 732−735.
- Flanagan J. L. Use of acoustic filtering to control the beamwidth of steered microphone arrays // Journal of the Acoustical Society of America. August 1985. Vol. 78, pp. 423−428.
- Flanagan J. L., Berkley D. A., Elko D. W., Sondhi M. M. Autodirective microphone systems//Acoustica, 1991. Vol. 73, pp. 58−71.
- Flanagan J. L., Johnston J. D, Zahn R, Elko G. W. Computer-steered microphone arrays for sound transduction in large rooms // JASA, Nov 1985. Vol. 78, pp. 1508−1518.
- Flanagan J. L, Mammone R, Elko G. W. Autodirective microphone systems for natural communication with speech recognizers // Proceedings of the Workshop on Speech and Natural Language. Pacific Grove, California, February 19 22, 1991, pp. 170−175.
- Flanagan J. L., Surendran A. C., Jan E. E. Spatially selective sound capture for speech and audio processing // Speech Communication, 1993. Vol. 13, pp. 207−222.
- Forssen U. Adaptive bilinear digital filters // IEEE Trans. Circuits Syst. II Analog and Digital Signal Proc., Nov. 1993. Vol. 40. No. 11, pp. 729−735.
- Friedman D. H. Pseudo-maximum-likelihood speech pitch extraction // IEEE Transactions on Acoustics, Speech, and Signal Processing. June 1977. Vol. ASSP-25, pp. 213−221.
- Fris H., Feldman C. A multiple unit steerable antenna for shortwave reception//Bell System Technical Journal, 1937. Vol. 16, pp. 337−419.
- Frost O. L. An algorithm for linear constrained adaptive beamforming // Proc. of IEEE, 1972. Vol. 60, pp. 926−935.
- Furui S. Cepstral Analysis techique for automatic speaker verification // IEEE Transactions on Acoustics, Speech, and Signal Processing, April 1981. Vol. ASSP-29, pp. 254−272.
- Godara L. C. Applications of antenna arrays to mobile communications, part i: Performance improvement, feasibility, and system considerations // Proceedings ofthe IEEE, 1997. Vol. 85. No. 7, pp. 1031−1060.
- Godara L. C. Applications of antenna arrays to mobile communications, part ii: Beamforming and direction-of-arrival considerations // Proceedings of the IEEE, 1997. Vol. 85, No. 8. pp. 1195−1245.
- Griffiths L. J. Jim C. W. An alternative approach to linearly constrained adaptive beamforming // IEEE Trans, on Antennas and Propagation, Jan. 1982. Vol. 30. No. 1, pp. 27−34.
- Hirsch H. G., Hellwig K., Dobler S. Speech recognition at multiple samplingthrates // Proceedings of 7 European conference on speech communication and technology. Eurospeech 2001- Scandinavia Vol. 3, pp. 1837−1840.
- House D., Beskow J., Granstrom B. Timing and interaction of visual cues for prominence in audiovisual speech perception // Proceedings of 7th European conference on speech recognition and technology. Eurospeech 2001 Scandinavia. Vol. 1, pp.387−390.
- Howells P.W. Intermediate frequency sidelobe canceler. U.S. Patent 3 202 990, August 24,1965.
- Hughes Т. В., Kim H. S., DiBiase J. H., Silverman H. F. Performance of an HMM speech recognizer using a real-time tracking microphone array as input // IEEE Trans, on Speech and Audio Proc. May 1999. Vol. 7, pp.346 349.
- Inore M., et al. Microphone array design measures for hands- free speech recognition // 5th European conf. on speech communication and technology. Rhodes, Greece, 22−25 Sept. 1997. Vol. l, pp. 331−334.
- Jiri S., Vratislav D. Multi-channel noise reduction using wavelet filter bank // Proceedings of 7th European conference on speech communication and technology. Eurospeech 1997. Vol, pp. 2591−2594.
- John F. D., Richard J. M. A fast method for regularized adaptive filtering // Digital Signal Processing, Jan. 1992. Vol. 2. No. 1, pp. 14−26.
- Juang В. H., Rabiner L.R., Wilpon J.G. On the Use of bandpass filtering in speech recognition // IEEE Transactions on Acoustics, Speech, and Signal Processing. July 1987. Vol. ASSP-35, pp. 947−954.
- Karjalainen M., Paatero T. Generalized source-filter structures for speech synthesis // Proceedings of 7th European conference on speech communication and technology. Eurospeech 2001- Scandinavia Vol. 4, pp. 2271−2274.
- Kellermann W. A self-steered digital microphone array // In Proceedings of 1992 ICASSP, March 1991. Toronto, Canada, pp. 3581−3584.
- Kleban J, Gong Y. HMM adaptation and microphone array processing for distant speech recognition // Proc. ICASSP '00, 2000, Istanbul, Turkey, pp. 1411−1414.
- Knight W. C., Pridham R. G., Kay S. M. Digital signal processing for sonar //Proceedings of the IEEE, 1981. Vol. 69. No. 11, pp. 1451−1507.
- Koutras A., Dermatas E., Kokkinakis G. Blind speech separation of moving speakers using hybrid neural networks // Proceedings of 7th European conference on speech communication and technology. Eurospeech 2001-Scandinavia Vol. 2, pp. 997−1000.
- Kurita S., Sauwatari H., Kajita S., Takeda K., Itakura F. Evaluation of blind signal separation method using directivity pattern under reverberant conditions // Proc. ICASSP '00. 2000, Istanbul, Turkey, pp. 3140−3143.
- Laakso Т., Vlimki V., Karjalainen M., Laine U. Splitting the unit delay -tools for fractional delay filter design // IEEE Signal Processing Magazine, Jan 1996. Vol. 13, pp. 30−60.
- Lee J., Kim J. Y. An Efficient lipreading method using the symmetry of lip // Proceedings of 7th European conference on speech communication and technology. Eurospeech 2001- Scandinavia Vol. 2, pp. 1019−1022.
- Liu Q. G, Champagne B, Kabal P. A microphone array processing technique for speech enhancement in a reverberant space // Speech Communication, 1996. Vol. 18, pp. 317−334.
- Lockwood P., Boudy J. Experiments with a nonlinear spectral subtractor (NSS), Hidden Markov Models and the projection, for robust speech recognition in cars // Speech Communication, 1992. Vol. 11, pp. 215−228.
- Lorenzelli F., Wang A., Korompis D., Hudson R., Yao K. Optimization and performance of broadband microphone arrays // Proceedings, SPIE. February 1995. Vol, pp. 158−168.
- Ludwing Т., Heute U. Detection of digital transmission system for voice quality measurements // Proceedings of 7th European conference on speech communication and technology. Eurospeech 2001- Scandinavia Vol. 3, pp. 1699−1702.
- Mahmoudi D. A microphone array for speech enhancement using multi-resolution wavelet transform // 5lh European conf. on speech communication and technology. Rhodes, Greece, 22−25 Sept. 1997. Vol.1, pp.339−342.
- Martyn C. J., Singh S. D. Automated lip synchronization for human-computer interaction and special effect animation // Proceedings of 5th European conference on speech communication and technology. Eurospeech 1997 (ESCA). Vol. 2, pp.891−894.
- Masgrau E., Aguilar L, Lleida E. Performance comparison of several adaptive schemes for microphone array beamforming // Proceedings of 6th European conference on speech communication and technology. Eurospeech 1999 (ESCA). Vol.6, pp. 2615−2618.
- Matousek J., Psutka J., Kruta J. Design of speech corpus for text-to-speech synthesis // Proceedings of 7th European conference on speech communication and technology. Eurospeech 2001- Scandinavia Vol. 3, pp. 2047−2050.
- Michael L. S., Raj B. Calibration of microphone arrays for improved speech recognition // Proceedings of 7th European conference on speech communication and technology. Eurospeech 2001- Scandinavia Vol. 2, pp. 1005−1008.
- Miyoshi M., Kaneda Y. Inverse filtering of room acoustics // IEEE Trans, on Acoustics Speech and Signal Processing, Feb. 1988. Vol. 36, pp. 145−152.
- Montacie C., Jose M. C. Sound channel video indexing // Proceedings of 7th European conference on speech communication and technology. Eurospeech 1997. Vol, pp. 2359−2362.
- Moses R.L., Beex A.A. Instrumental variable adaptive array processing // IEEE Transactions on Aerospace and Electronic Systems. March 1988.Vol. 24, pp. 192−201.
- Nagana Y., Tsuboi H. A two-channel adaptive microphone array with target tracking // 5th European conf. on speech communication and technology. Rhodes, Greece, 22−25 Sept. 1997. Vol.1, pp.343−346.
- Nakadai K., Okuno H. G., Kitano H. Real-time sound source localization and separation for robot audition // Proceedings IEEE International Conference on Spoken Language Processing, 2002, pp. 193−196.
- Neely S. Т., Allen J. B. Invertibility of a room impulse response // J. Acoust. July 1979. Soc. Am. Vol. 66. pp. 165−169.
- Nordholm S., Claesson I., Dahl M. Adaptive microphone array employing calibration signals: an analytical evaluation // IEEE Trans, on Speech and Audio Proc., May 1999. Vol. 7, pp. 241−252.
- Omologo M, and Svaizer P. Acoustic event localization using crosspower-spectrum phase based technique // Proc. ICASSP '94, pp. 273−276.
- Parra L. C., Alvino С. V. Geometric source separation: Merging convolutive source separation with geometric beamforming // IEEE Transactions on Speech and Audio Processing, 2002. Vol. 10. No. 6, pp. 352−362.
- Peter J. M. Spectral tilt as a perturbation-free measurement of noise levels in voice signals // Proceedings of 7th European conference on speech communication and technology. Eurospeech 2001- Scandinavia Vol. 2, pp. 1495−1498.
- Putnam W., Rocchesso D., Smith J. A numerical investigation of the invertibility of room transfer functions // Proc. IEEE ASSP Workshop on App. of Sig. Proc. to Audio and Acoust. '95, Mohonk, NY, pp. 249−252.
- Qureshi S. U. H. Adaptive equalization // Proc. IEEE, Sept. 1985. Vol. 73. No. 9, pp. 1349−1387.
- Rabiner L. R. A tutorial on Hidden Markov Models and selected applications in speech recognition // Proceedings of the IEEE, Feb. 1989. Vol. 77, pp. 257−286.
- Renevey P., Drygajlo A. Entropy based voice activity detection in very noisy conditions // Proceedings of 7th European conference on speech communication and technology. Eurospeech 2001- Scandinavia Vol. 3, pp. 1887−1890.
- Rissanen J. Modeling by shortest data description // Automatica. 1978. Vol. 14, pp. 465−471.
- Shukla P. K., Turner L. F. Channel-estimation-based adaptive DFE for fading multipath radio channels // IEE Proc., Dec. 1991. Vol. 138. No. 6, pp. 525−543.
- Sharma, S., et al. Feature extraction using non-linear transformation for robust speech recognition on the Aurora database // Int. Conf. on Acoustics, Speech and Signal Processing, 2000, Istanbul, Turkey, pp.1117−1120.
- Shdaifat I., Grigat R., Lutgert S. Viseme recognition using multiple feature matching // Proceedings of 7th European conference on speech communication and technology. Eurospeech 2001 Scandinavia. Vol. 4, pp. 2431−2434.
- Stadermann J., Stahl V., Rose G. Voice activity detection in noisy environments // Proceedings of 7th European conference on speechcommunication and technology. Eurospeech 2001- Scandinavia Vol. 3, pp. 1851−1854.
- Steele A. Comparison of directional and derivative constraints for beamformers subject to multiple linear constraints // IEEE Proceedings, 1983. Vol. 130. No. 1, pp. 41−45.
- Silverman H. F., Kirtman S. E. A two-stage algorithm for determining talker location form linear microphone array data // In Computer Speech and Language, 1992. Vol. 6, pp. 129−152.
- Sullivan Т. M. Multi-microphone correlation-based processing for robust automatic speech recognition. Ph.D. Dissertation. Carnegie Mellon University, August, 1996.
- Takao K., Fujita M., Nishi T. An adaptive antenna array under directional constraint // IEEE Transactions on Antennas and Propagation. 1976. Vol. 24, no. 5, pp. 662−669.
- Thiran J. Recursive digital filters with maximally flat group delay // IEEE Trans. Circuit Theory, 1971. Vol. 18, no. 6, pp. 659−664.
- Valin J.M., Rouat J., Michaud F. Microphone array post-filter for separation of simultaneous non-stationary sources // Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004, pp. 452 462.
- Wester M. Automatic classification of voice quality: Comparing regression models and hidden Markov models // In Proc. of VOICEDATA98, Symposium on Databases in Voice Quality Research and Education, Utrecht, 1998, pp. 92−97.
- Wax M., Kailath T. Detection of signals by information theoretic criteria // IEEE Transactions on Acoustics, Speech, and Signal Processing, April 1985. Vol. ASSP-33, pp. 387−392.
- Yoh’ichi Tohkura. A weighted cepstral distance measure for speech recognition // IEEE Transactions on Acoustics, Speech, and Signal Processing, October 1987. Vol. ASSP-35, pp. 1414−1422.
- Антонью А. Цифровые фильтры: анализ и проектирование. М.: Радио и связь, 1983.-320 с.
- Балякин И.А., Егоров Ю. М., Родзивалов В. А. Приборы с переносом заряда в радиотехнических устройствах обработки информации. М.: Радио и связь, 1987. -173 с.
- Вемян Г. В. Передача по сетям электросвязи. М.: Радио и связь, 1985. -272с.
- Блейхут Р. Быстрые алгоритмы цифровой обработки сигналов. М.: Мир, 1989.-448 с.
- Васильев Д.В. Радиотехнические цепи и сигналы: Учебное пособие для вузов. М.: Радио и связь, 1982. 528 с.
- Гутников B.C. Фильтрация измерительных сигналов. Л.: Энергоатомиздательство, 1990. 192 с.
- Даджион Д., Мерсеро Р. Цифровая обработка многомерных сигналов. М.: Мир, 1988.- 488 с.
- Данилов Р.В., Ельцова С. А., Иванов Ю. П. и др. Применение интегральных микросхем в электронной вычислительной технике: Справочник. М.: Радио и Связь, 1987. -384 с.
- Долуханов М.П. Распространение радиоволн. М.: Радио и Связь, 1972. -С.33−49.
- Жодзишский М.И., Мазепа Р. Б., Овсянников Е.П и др. Цифровые радиоприемные системы:. М.: Радио и связь, 1969. -320 с.
- Калабеков Б.А. Микропроцессоры и их применение в системах передачи и обработки сигналов. М.: Радио и связь, 1988. -104 с.
- Кловский Д.Д. М. Теория передачи сигналов: учебник для институтов связи. Радио и Связь, 1973. — 376 с.
- Кузнецов Ю.А., Шилин В. А. Микросхемотехника БИС на приборах с зарядовой связью. М.: Радио и связь, 1988. -160 с.
- Кузьмин С.З. Цифровая обработка радиолокационной информации. Современное радио, 1967. 400 с.
- Меерзон Б. Многодорожечные рекордеры. Обзор «Звукорежисер», № 7 (сентябрь) 2000 г, -С.5−37.
- Мьо Ти Ха, Илиницкий А. А., Алюшин А. В., Павленко А. Н. Модуль сигнальной обработки на основе процессора ADSP21061L. для опытного образца отечественного гамма-томографа // Электроника, микро- и наноэлектроника. Сб. научн.трудов.-МИФИ, 2004.-С.250−251.
- Мьо Ти Ха. Разработка системы разделения нескольких источников звуковых сигналов на основе микрофонной решетки для мобильногоробота // Электроника, микро и нано электроника. Сборник научных трудов / Под ред. В. Я. Стенина. -М.:МИФИ, 2005.-С.183−185.
- Мьо Ти Ха, Алюшин М. В. Многоканальный усилитель для 2D и 3D микрофонных решеток // Известия вусов. Электроника.№ 4. 2007.-С.91−93.
- Мьо Ти Ха, Алюшин М. В. Аналого-цифровой способ компенсации разницы фаз в многоканальных акустических решетках // Электроника, микро- и наноэлектроника. Сборник научных трудов / Под ред. В. Я. Стенина. -М.: МИФИ, 2007.-С.93−95.
- Мьо Ти Ха, Алюшин М. В. Аппаратные и программные средства реального времени для одно- и двумерных микрофонных решеток // Электроника, микро- и наноэлектроника. Сборник научных трудов / Под ред. В. Я. Стенина. -М.: МИФИ, 2007.-С.96−98.
- Оппенгейм А.В., Шафер Р. В. Цифровая обработка сигналов. Перевод с английского. М.: Радио и Связь, 1979. 416 с.
- Покровский Н.Б. Расчет и измерение разборчивости речи. М.: Радио и Связь, 1962.-391с.
- Рабинер JL, Гоулд Б. Теория и применение цифровой обработки сигналов. -М.: Мир, 1978. 848 с.
- Ушкар М.Н. Микропроцессорные устройства в радиоэлектронной аппаратуре. М.: Радио и Связь, 1988. -128 с.
- Фланаган Д.Л. Анализ, синтез и восприятие речи: Перевод с английского. М.: Радио и Связь, 1968. -392с.
- Хемминг Р.В. Цифровые фильтры. М.: Недра, 1987. -221 с.