Fumitada Itakura

Fumitada Itakura (板倉 文忠, Itakura Fumitada, born 6 August 1940) is a Japanese scientist. He did pioneering work in statistical signal processing, and its application to speech analysis, synthesis and coding, including the development of the linear predictive coding (LPC) and line spectral pairs (LSP) methods.

Biography

Itakura was born in Toyokawa, Aichi Prefecture, Japan. He received undergraduate and graduate degrees from Nagoya University in 1963 and 1965, respectively.[1] In 1966, while studying for his PhD at Nagoya, he developed the earliest concepts for what would later become known as linear predictive coding (LPC), along with Shuzo Saito from Nippon Telegraph and Telephone (NTT). They described an approach to automatic phoneme discrimination that involved the first maximum likelihood approach to speech coding.[2] In 1968, he joined the NTT Musashino Electrical Communication Laboratory in Tokyo.[1] The same year, Itakura and Saito presented the Itakura–Saito distance algorithm.[3] The following year, Itakura and Saito introduced partial correlation (PARCOR) to LPC.[2]

Itakura completed his D.Eng. degree in speech processing in 1972, writing his dissertation on "Speech Analysis and Synthesis based on a Statistical Method."[1] From 1973 to 1975, he worked at the Acoustics Research Department of Bell Labs, having been invited to work there on fundamental problems by James Flanagan, who had been impressed by one of Itakura's papers on low bit-rate encoding.[4]

In 1975, Itakura developed the line spectral pairs (LSP) method for high-compression speech coding, while at NTT.[5][6][1] From 1975 to 1981, he studied problems in speech analysis and synthesis based on the LSP method.[1] In 1980, his team developed an LSP-based speech synthesizer chip. LSP is an important technology for speech synthesis and coding, and in the 1990s was adopted by almost all international speech coding standards as an essential component, contributing to the enhancement of digital speech communication over mobile channels and the internet worldwide.[6]

In 1981, he was appointed as Chief of the Speech and Acoustics Research Section at NTT. He left this position in 1984 to take a professorship in communications theory and signal processing at Nagoya University. He currently teaches at Meijo University.[7]

Itakura's work on spectral and formant estimation laid the foundation for much of the early progress in speech signal processing.[8] His work on autoregressive modeling of speech is used in nearly every modern low-to-medium bit-rate speech transmission system, and the line spectral pair representation he developed is now found in nearly all cellular telephone systems.[8]

Awards

His awards include the IEEE ASSP 1975 Senior Award, an award from Japan's Ministry of Science and Technology in 1977, the IEEE 1986 Morris N. Liebmann Award[9] (with B. S. Atal), the IEEE Signal Processing 1996 Society Award, the IEEE Third Millennium Medal, the IEICE 2002 Distinguished Achievement and Contributions Award, and the 2003 Purple Ribbon Medal from the Japanese government. In 2005, he received the Asahi Prize and the IEEE Jack S. Kilby Signal Processing Medal.[10][11] In 2009, he received the NEC C&C Prize for his pioneering research and the development of highly efficient voice-coding technology with analysis-synthesis methods for speech. He is a Fellow of the IEEE for pioneering contributions to speech processing,[12] and an honorary member of the Institute of Electronics, Information and Communication Engineers of Japan.

References

  1. ^ a b c d e "Fumitada Itakura Oral History". IEEE Global History Network. 20 May 2009. Retrieved 2009-07-21.
  2. ^ a b Gray, Robert M. (2010). "A History of Realtime Digital Speech on Packet Networks: Part II of Linear Predictive Coding and the Internet Protocol" (PDF). Found. Trends Signal Process. 3 (4): 203–303. doi:10.1561/2000000036. ISSN 1932-8346.
  3. ^ Itakura, F.; Saito, S. (1968). "Analysis synthesis telephony based on the maximum likelihood method". Proceedings of the 6th International Congress on Acoustics. Los Alamitos, CA: IEEE. pp. C-17–C-20.
  4. ^ "James L. Flanagan Oral History". IEEE Global History Network. 20 May 2009. Archived from the original on 31 December 2009. Retrieved 2009-07-21.
  5. ^ Zheng, F.; Song, Z.; Li, L.; Yu, W. (1998). "The Distance Measure for Line Spectrum Pairs Applied to Speech Recognition" (PDF). Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP'98) (3): 1123–6.
  6. ^ a b "List of IEEE Milestones". IEEE. Retrieved 15 July 2019.
  7. ^ "視聴覚情報研究室". Meijo University.
  8. ^ a b "Fumitada Itakura". IEEE. Archived from the original on December 5, 2008. Retrieved 2009-07-21.
  9. ^ "IEEE Morris N. Liebmann Memorial Award Recipients". Institute of Electrical and Electronics Engineers (IEEE). Archived from the original on June 6, 2008. Retrieved 2008-02-15.
  10. ^ "IEEE Jack S. Kilby Signal Processing Medal Recipients" (PDF). IEEE. Archived from the original (PDF) on December 16, 2021. Retrieved February 27, 2011.
  11. ^ "IEEE Jack S. Kilby Signal Processing Medal Recipients – 2005 – Fumitada Itakura". IEEE. Archived from the original on September 5, 2012. Retrieved February 27, 2011.
  12. ^ "IEEE Fellows 2003". IEEE Communications Society. Retrieved September 7, 2024.