Technical Report 2004-06-01
Internetworking and Media Communications Research Laboratories
Department of Math & Computer Science,
http://medianet.kent.edu/technicalreports.html
Predictive Perceptual Compression for Real Time Video Communication
Oleg Komogortsev and Javed I. Khan
Internetworking and Media Communications Research Laboratories
Department of Computer Science
Kent State University, 233 MSB, Kent, OH 44242
Last Revised
Predictive Perceptual Compression for Real Time Video Communication
Approximately 2 degrees in our 140 degree vision span has sharp vision. Many researchers have been fascinated by the idea of eye-tracking integrated perceptual compression of an image or video, yet any practical system has yet to emerge. The unique challenge presented by real time perceptual video streaming is how to handle the fast nature of the human eye and provide its integration with computationally intensive video transcoding scheme. Difficulty arises due to the delay introduced by video transmission in the network. This delay creates a problem when we try to use information about eye movements for perceptual encoding. In this paper we discuss a new approach to the eye-tracker based video compression. Rather than relying on the point of gaze, this novel scheme tracks a vicinity of interest and offers a prediction mechanism for eye movements. The described system compensates the interim eye movements between the sampling and actual coding. The proposed scheme can be applied to a large variety of today’s video compression standards. We have developed an eye gaze-aware MPEG-2 transcoder that can perceptually re-encode a live video stream in real time. The experiments we have conducted illustrate the substantial impact this integrated prediction method has on perceptual video compression and bit-rate reduction.
This report contains experiment clips used to test the performance of this system. The videos are MPEG-2 ISO 13818-2 streams. We recommend using MPlayer found at www.mplayerhq.hu or Winamp to view these video samples.
Video Samples:
Sample Name |
Original Sample |
Saccade Window with eye-gaze displayed |
Perceptually Encoded |
Car |
|||
Shamu |
|||
Airplanes |