Multimedia and Communications

classroom: R108 of technology building III, Tuesday 9:10 ~12:00 a.m.

Associate Professor Herng-Yow Chen
Email: hychen@csie.ncnu.edu.tw,
phone: 049-2910960#4843
office: R401, technology building III

 Lectures ||  Resource ||  Homework


Course Objective

  1. This course investigates multimedia (speech, audio, and video) compression; multimedia processing and retrieval; and relevant important issues on communication.
  2. By the end of semester, you should be able to master the technical underpinnings of multimedia system and related communication applications.
  3. Students should learn to present paper or their own work.

Evaluation

Homework regulations

  • In addition to print-out version, you should email me your report and the paper (or article) you choose before deadline. The email subject:  multimedia, homework#, your_student-id
  • All reports should be accessible via your web entry.


http://www.ncnu.edu.tw/~your_account/multimedia

References

  1. IEEE Multimedia Magazine (paper reading)
  2. IEEE Internet Computing Magazine (paper reading)
  3. IEEE Communication (paper reading)
  4. other relevant papers selected from IEEE, ACM multimedia conference, journal, and magazine
  5. Multimedia: Computing, Communications, and Applications, Ralf Steinmetz and Klara Nahrstedt, Prentice Hall, 1995, ISBN 0-13-324435-0
  6. Understanding Networked Multimedia applications and technology, Francois Fluckiger, Prentice Hall, 1995, ISBN 0-13-190992-4
  7. Handbook of Multimedia Computing, Borko Furht (editor-in-chief), CRC press, 1999, ISBN 0-8493-1825-4
  8. Handbook of Internet and Multimedia: Systems and Applications, Borko Furht (editor-in-chief), CRC press, 1999, ISBN 0-8493-1858-0

Course Schedule

  1. Introduction to Multimedia and Communication
  2. Multimedia System Overview (paper study)
  3. Multimedia Communication (paper study)
  4. Speech and Audio Compression
  5. Sampling and Interpolation
  6. Quantization
  7. Audio Coding
  8. MPEG Audio Coding (paper study)
  9. *** Midterm *** (paper oral report)
  10. Graphics and Image Compression
  11. Digital Image Processing, Basics of Color Images
  12. Image Filtering, Image Resizing
  13. Transform Coding, Discrete Cosine Transform (DCT)
  14. Image Coding Standards: JPEG, JPEG2000
  15. Video Compression
  16. Motion Estimation and Compensation: MPEG-1, MPEG-2, MPEG-4
  17. Multimedia Systems & Information Retrieval
  18. Final examination (project report)

 Lecture slides

        Use these as study guides--not as an excuse to skip class. I may be changing/refining these files right up until class time!

Reading: (print this) (reading assignment)

Borko Furht, "Multimedia Systems: An Overviews," IEEE Multimedia Magazine, pp. 47-59, Spring 1994.

Reading:

Lars C. Wolf, Carsten Griwodz, and Ralf Steinmetz, "Multimedia Communication," Proceedings of the IEEE, Vol. 85, No. 12, pp. 1915-1993, December 1997.

Reading:

Jayant, N.; Johnston, J.; Safranek, R., "Signal compression based on models of human perception," Proceedings of the IEEE, Volume: 81,   Issue: 10,   Oct 1993, Page(s): 1385-1422.

Reading:

Fundamental of Psychoacoustics, http://sound.eti.pg.gda.pl/SRS/psychoacoust.html

Reading:

Zwicker E., Zwicker T., "Audio Engineering and Psychoacoustics: Matching Signals to the Final Receiver, the Human Auditory System", Journal of Audio Engineering Society, vol. 39, No. 3, March 1991, pp. 115-126.

Reading:

Painter and Spanias, " Perceptual Coding of Digital Audio", Proc. IEEE, April 2000.

Spanias, A. S. (1994). Speech Coding: A Tutorial Review.

Portions published in Proceedings of the IEEE, Oct. 1994.

Allen Gersho, "Advances in Speech and Audio Compression," Proceedings of the IEEE , Volume: 82 Issue: 6 , Jun 1994, Page(s): 900 -918.

Reading:

Davis Pan, "A tutorial on MPEG/audio compression," Multimedia, IEEE , Volume: 2 Issue: 2 , Summer 1995 , Page(s): 60 -74.

Reading: (print this)

Peter Noll, "MPEG Digital Audio Coding Standards," 2000 CRC Press LLC, http://www.ff.vu.lt/studentams/tekstai/vizualizavimas/mpeg%20audio%20coding.pdf

Reading:

Eric D. Scheirer, "Structured audio and effects processing in the MPEG-4 multimedia standard," Multimedia System, Vol 7, Page(s): 11-22, 1999.

Reading:

Chris Kyriakakis, "Fundamental and technological limitations of immersive audio systems," Proceedings of the IEEE , Volume: 86 Issue: 5 , May 1998, Page(s): 941 -951.

Reading: (reading assignment)

Gregory K. Wallace, "The JPEG still picture compression standard," Communications of the ACM, Volume: 34, Issue 4 (April 1991), Page(s): 30 - 44.  You can access the revised version of the same paper (clearer for print) published in IEEE Transaction on Consumer Electronic. 1991.

Reading: (print this)

A. Skodras, C. Christopoulos, T. Ebrahimi, "The JPEG2000 Still Image Compression Standard", IEEE Signal Processing Magazine, pp. 36-58, Sept. 2001.

Reading:

 Didier Le Gall, "MPEG: a video compression standard for multimedia applications," Communications of the ACM, Volume: 34, Issue 4 (April 1991), Page(s): 46 - 58.

Reading:

Multimedia compression techniques and standards: JPEG and MPEG

Reading:

B.G. Haskell, P.G. Howard, Y.A. LeCun, A. Puri, J. Ostermann, M.R. Civanlar, L. Rabiner, L. Bottou, and P. Haffner,  "Image and Video Coding -- Emerging Standards and Beyond," IEEE Transactions on Circuits and Systems for Video Technology, vol. 8, no. 7, pp. 814-837. 1998.

Reading:

Thomas Sikora,  "MPEG Digital Video-Coding Standards," IEEE Signal Processing Magazine, Volume: 14 Issue: 5 , Sept. 1997, Page(s): 82 -100.

  • Lecture 5: Audio Retrieval and Navigation Interfaces

Reading:

Asif Ghias , Jonathan Logan , David Chamberlin , Brian C. Smith, "Query by humming: musical information retrieval in an audio database," Proceedings of the third ACM international conference on Multimedia, January 1995, page(s) 231-236.

Reading:

Wold, E.; Blum, T.; Keislar, D.; Wheaten, J., "Content-based classification, search, and retrieval of audio," Multimedia, IEEE , Volume: 3 Issue: 3 , Fall 1996, Page(s): 27 -36.

Reading:

Lonce Wyse, Stephen W. Smoliar, "Toward content-based audio indexing and retrieval and a new speaker discrimination technique," Computational Auditory Scene Analysis: Proceedings of the IJCAI-95 Workshop. Erlbaum (1998), Page(s): 351-360.

Reading:

M. G. Brown, J. T. Foote, G. J. F. Jones, K. Sparck Jones, S. J. Young, "Open-vocabulary speech indexing for voice and video mail retrieval," Proceedings of the fourth ACM international conference on Multimedia, 1996, page(s) 307-316. (best paper award)

Reading:

Barry Arons, "SpeechSkimmer: a system for interactively skimming recorded speech,"  ACM Transactions on Computer-Human Interaction (TOCHI) March 1997.
.

  • Lecture 6: Video Analysis, Structuring, Indexing, and Browsing

Reading:

Tonomura, Y.; Akutsu, A.; Taniguchi, Y.; Suzuki, G.,  "Structured Video Computing," Multimedia, IEEE , Volume: 1 Issue: 3 , Autumn/Fall 1994, Page(s): 34-

Reading:

Minerva Yeung; Boon-Lock Yeo; Bede Liu, "Extracting story units from long programs for video browsing and navigation," Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems, 1996, Page(s): 296 -305.

Reading:

Smith, M.A.; Kanade, T.,"Video skimming and characterization through the combination of image and language understanding techniques

," Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, 1997, Page(s): 775 -781.


Reading:  Chen at al, "RAID: High-Performance, Reliable Secondary Storage," ACM Computing Surveys, Vol. 26, No.2, June 1994, Page(s): Pages: 145 - 185. (homework due: May 13th)


Reading:   Steinmetz R (1996) Human Perception of Jitter and Media Synchronization. IEEE journal on Selected Areas in Communications, 14(1), pp. 61-72.
Reading:   Blakowski, G.; Steinmetz, R, (1996) A media synchronization survey: reference model, specification, and case studies. IEEE journal on Selected Areas in Communications, 14(1), pp. 5-35.

Topic Presentation (by 2004 students)


"Teaching and learning as multimedia authoring the classroom 2000 project," Proceedings of the fourth ACM international conference on Multimedia,
1996, pp. 187 - 198.

Software

Resource

References:

  1. TCP Congestion control
  2. SNR and PSNR
  3. Image and Video Compression  Learning Tool, http://www-it.et.tudelft.nl/~inald/vcdemo/
  4. Special Issues on MPEG-7, IEEE Circuits and Systems for Video Technology, Volume: 11,   Issue: 6,   Jun 2001
  5. http://www.williamson-labs.com