Publications
Books:
[1] Digital Speech Signal Processing, Electronic Industry Press,
Aug 1995
[2] Signal Detection in the Noise, A.D. Whalen, Translated jointly,
Science Press, Nov 1977
[3] Digital Transmission System, P. Bylanski, D.G. W. Ingram,
Translated jointly, Posts & Telecom Press, Jun 1979
[4] Spread Spectrum System, R.C. Dixon, Translated jointly, National
Defense Industry Press, Feb 1982
[5] Electronic Computing Manual, M. Kaufman, Translated jointly,
National Defense Industry Press, Oct 1985
Main Selected Papers:
[1] Wu X.H.., et al., The Effect of Perceived Spatial Separation
on Informational Masking of Chinese Speech, (Accepted by Hearing
Research.)
[2] Tianshu Qu, Zheng Xiao, Mei Gong, Ying Huang, Xiaodong Li,
Xihong Wu, “Distance-dependent Head-related Transfer Functions
Measured with High Spatial Resolution Using a Spark Gap”, IEEE
Trans. on Audio, Speech and Language Processing, vol.17(6), 2009
(PKU&IOA HRTF Database)
[3] Li L., Wu X.H., et al., Perceived spatial separation releases
Chinese speech from informational masking, The 18th International
Congress on Acoustics, ICA 2004.
[4] He S.C., Huang J., Wu X.H., Li L., The effects of activation
of the auditory thalamus and cortex on the startle reflex are
regulated by glutamate and GABAB receptors in the lateral amygdala,
International congress of psychology (ICP 2004).
[5] Li L., Wu X.H., Wang C., et al, Release from Informational
Masking due to Perceived Spatial Separation in Chinese Speech
Recognition, International congress of psychology (ICP 2004).
[6] Luo D.S., Wu X.H. and Chi H.S. (2004): On outlier problem
of statistical ensemble learning. Proceedings of the IASTED International
Conference on Artificial Intelligence and Applications (AIA-2004),
pp.281-286, Innsbruck, Austria.
[7] Wu X.H., Ma C., Chen J., Li L. and Chi H.S.(2003): A new
method of measuring auditory temporal resolution. Chinese Scientific
Journal of Hearing and Speech Rehabilitation 1:13-15.
[8] Wu X.H., Luo D.S., Chi H.S. and Szu H.(2003): Biomimetics
speaker identification systems for network security gatekeepers,
International Joint Conference on Neural Networks (IJCNN'03),
pp.3189-3194, Oregon, U.S.A.
[9] Luo D.S. and Chen K. (2003): Refine decision boundaries of
a statistical ensemble by active learning, International Joint
Conference on Neural Networks (IJCNN'03), pp.1523-1528, Oregon,
U.S.A.
[10] Luo D.S. and Chen K. (2003): On the generalization of statistical
ensemble learning on mismatch conditions: an empirical study,
International Conference on Neural Information Processing (ICONIP'2003),
Istanbulm, Turkey.
[11] Wu T. Y., Lu L., Chen K, Zhang H. J. (2003),UBM-Base Real-Time
Speaker Segmentation For Broadcasting News, Proceedings of International
Conference on Acoustic, Speech and Signal Processing (ICASSP'03),
2003
[12] Wu T. Y., Lu L., Chen K, Zhang H. J. (2003): Incremental
Speaker Adaptation Based on Universal Background Model and Its
Applications on Speaker Segmentation, International Conference
on Multimedia & Expo (ICME'03), Vol. II, pp. 721-724, Baltimore,
MD, July 6-9, 2003
[13] Wu T. Y., Lu L., Chen K, Zhang H.J.(2003), Universal Background
Models for Real-Time Speaker Change Detection, Proceedings of
International Conference on Multimedia Modeling, 2003
[14] Luo D.S., Wu X.H. and Chi H.S. (2003): Unit-weighting strategy
to restrain outliers in ensemble learning. Proceedings of the
10th Annual Conference of Chinese Association of Artificial Intelligence
(CAAI-10), pp.401-406, Guangzhou, China.
[15] Wang L., Chen K., and Chi H.S. (2002): Capture inter-speaker
information with a neural network for speaker identification.
IEEE Transactions on Neural Networks 13(2): 436-445. IEEE Press.
[16] Chen K., Wu T.Y., and Zhang H.J. (2002): On the use of nearest
feature line for speaker identification. Pattern Recognition Letters
-- An Official Publication of the International Association for
Pattern Recognition 23(14): 1735-1746.
[17] Zhen B., Wu X. H., Liu Z. M., and Chi H.S. (2002): An enhanced
relative spectral processing of speech, Chinese Journal of Acoustics
21(1): 86-96.
[18] Luo D.S., Wu X.H. and Chi H.S. (2002): A relative entropy
based confidence method on using boosting to handwritten digit
recognition problem. Proceedings of the 12th Chinese National
Conference on Neural Networks, pp.440-443, Beijing, China. (Best
Paper Award)
[19] Luo D.S. and Chen K. (2002): A comparative study of statistical
ensemble methods on mismatch conditions. Proceedings of World
Congress on Computational Intelligence - International Joint Conference
on Neural Networks (WCCI'02-IJCNN'02), pp. 59-64, Honolulu, U.S.A.
[20] Luo D.S. and Chen K. (2002): On the use of statistical ensemble
methods for telephone-line speaker identification. Proceedings
of IEEE International Conference on Communications, Circuits and
Systems (ICCCAS'02), pp. II904-II908, Chengdu, China.
[21] Zhen B., Wu X.H., Liu Z. M., and Chi H. S.(2001): An enhanced
RASTA processing for speech signal. Chinese Journal of Acoustics
26(3): 252-258.
[22] Liu Z. M., Wu X.H., Zhen B., and Chi H. S.(2001): Forward
masking auditory model and its application in speaker identification
and speech recognition. Chinese Journal of Electronics 10(2):
196-199.
[23] Meng H., Chan S.F., Wong Y.F., Chan C.C., Wong Y.W., Fung
T.Y., Tsui W.C., Chen K., Wang L., Wu T.Y., Li X.L., Lee T., Choi
W.N., Ching P.C., and Chi H.S. (2001):ISIS: A learning system
with combined interaction and delegation dialogs. Proceedings
of the 7th European Conference on Speech Communication and Technology
(Eurospeech), Vol.3, pp.1551-1554, Aalborg, Denmark.
[24] Wang L., Chen K., and Chi H.S. (2001): Towards better capturing
inter-speaker information by active learning for speaker identification.
Proceedings of International Joint Conference on Neural Networks
(IJCNN'2001), pp. 2985-2990, Washington D.C., U.S.A.
[25] Wu T.Y. and Chen K. (2001): On the use of nearest feature
line for speaker identification. Proceedings of International
Conference on Neural Information Processing (ICONIP'2001), Shanghai,
China. (CD-ROM, No. 41)
[26] Wang L., Chen K., and Chi H.S. (2000): Capture inter-speaker
information by a neural network for speaker identification. Proceedings
of International Joint Conference on Neural Networks (IJCNN'2000),
pp. V247-V252, Como, Italy.
[27] Chi H. S., and Wu X.H. (2000): Roles of computational auditory
model in automatic speech recognition. Progress of Nature Science
10(12): 887-896.
[28] Wu X.H., Chi H.S., Auditory model - based speech feature
extraction and its application to speaker identification, Chinese
Journal of Electronics (English version), vol.8, No.4, 413-418,1999.
[29] Xiang B., Wu X.H., Liu Z. M., and Chi H. S.(1999): Auditory
model based speech feature extraction and its application to speaker
identification, Proceedings of International Joint Conference
on Neural Networks (IJCNN'99), pp. 284-287. Washington D. C.,
U.S.A.