Learning of audio codec technology for instant messaging audio and video development

People always ask me what books I should read to study audio codecs. In fact, this is a difficult question to answer, for many reasons:

    The first problem of doing engineering is that it is different from textbook learning, and it cannot be solved by reading books.
    There are very few people who study audio codec technology in China, including the overall audio technology research abroad. (It can be seen from the fact that China's submarine noise technology has not been solved well).
    Audio codec technology is an application, and general books are about theoretical basis.


Just reading theoretical books will be too far away from the application, and you will forget it without practice. When I first read books, I also started from engineering. I read a lot of standards in practical work and personal interests, and then looked for papers on places I didn’t understand, and then looked for books to supplement knowledge. This is a typical reverse learning. Usually graduate students use a learning method and process such as "textbook -> reading paper -> doing engineering".

 

What kind of thinking can we follow to find books or papers?

Audio codec technology is a relatively complex application, and ordinary books are theoretical books, especially in China. Audio codec technology is different from general audio technology, such as AEC, HRTF, the latter is the application technology of voice and audio respectively, it should be said that it is a specific application technology, relatively speaking, it is easy to search for information.

Specifically, audio codec technology is also a specific application technology, but the system may be relatively complex and the purpose is relatively basic. It is a branch of source codec technology, the purpose is to compress data.

So what are the aspects of audio codec technology? Since it is a source coding technology (Source Coding Technology), you can read books on source coding technology as a theoretical basis.

In addition, I actually divide the audio codec technology into 5 major technologies, referred to as: EQTPM. E-entropy coding, Q-quantization coding, T-transform coding, P-predictive coding, M-audio modeling (perceptual modeling, BCC modeling, sinusoidal modeling, etc.). In contrast, when studying, we can consult and study related books according to these technologies. Instant messaging chat software app development can add Wei Keyun's v: weikeyun24 consultation

 

Because there are very few books on wideband audio codecs in China, you can read some books on voice codecs, which also talk about EQTP technology. For example: "Speech Processing Technology", "Speech Coding", "Low Bit Rate Audio Coding", "Principles of Digital Speech Coding", "Variable Rate Speech Coding", "Low Rate Speech Coding", "Digital Speech Coding", "Data Compression".

"Basics of JPEG2000 Image Compression": I think this book is good, and the translation is good, and many basic principles are explained thoroughly.

"Information Theory and Coding", "Signals and Systems", and advanced mathematics, I will not only list them, but they will also be repeated frequently.

For example, I think it is very classic "Audio Signal Processing and Coding" by ANDREAS SPANIAS. and his 63-page dissertation, "Perceptual Coding of Digital Audio".

Other books to read include:

    The father of MP3 - K. Brandenburg:
    "Applications of Digital Signal Processing to Audio and Acoustics"
    "A Digital Signal Processing Primer, with Applications to Digital Audio and Computer Music"
    "Auditory Perception and the MPEG Audio Standard"
    "Foundation and Evolution" of Standardized Coders (Wiley,2003)(ISBN 0471373125)(578s)"
    Professor Udo Zolzer of the Hamburg Federal Defense Force University:
    "Digital Audio Signal Processing"
    "High-Fidelity Multichannel Audio Coding"
    "Speech Coding Algorithms"

I highly recommend reading the SPANIAS book. At least take a closer look at the paper by SPANIAS. This will help you a lot in understanding audio coding. But there will be a lot of physiological words about the ear in it, so you have to hold a dictionary and look through it slowly. I am bored with this paper, and it has benefited me a lot.

I have not systematically read the following books, but they all have PDF versions, and I also occasionally look through them. Because you don't spend a lot of time understanding these classic books, it will create the illusion that they all talk similarly at first glance, but in actual understanding, you find that they are elaborations of different details.

Special technical books:

    Such as "Adaptive Signal Processing", because of audio coding or other audio technologies, adaptive technology is often used.
    For example: Lossless coding Wavpack, MPEG4 ALS, both use adaptive technology.
    Books on filter design.
    "Multiple Sampling Rate Digital Signal Processing Theory and Its Application": Explain Transform technology.
    HE-AAC and ATRAC3 using QMF,
    MP3 using PQF
    AAC, MP3 using MDCT,
    AC3 using TDAC (MDCT)
    WMA and G722.1 (MLT)


That's why, choose these transformation tools. What's the difference. It would be more helpful if you could have a look at Vaidyananthan PP's book.

Guess you like

Origin blog.csdn.net/weikeyuncn/article/details/128384688