MIME-Version: 1.0
Server: CERN/3.0
Date: Monday, 25-Nov-96 00:30:32 GMT
Content-Type: text/html
Content-Length: 11296
Last-Modified: Tuesday, 29-Oct-96 01:32:17 GMT
Zeno Papers
 |
Papers from Zeno Research |
Compressed Domain Transcoding of MPEG
Brian C. Smith, Soam Acharya
Abstract
Current compression formats optimize for either compression or editing.
For example, motion JPEG (MJPEG) provides excellent random and moderate
overall compression, while MPEG optimizes for compression at the expense
of random access. Converting from one format to another, a
process called transcoding, is often desirable over the life of a
video segment. In this paper, we show how to transcode MPEG video to
motion-JPEG without fully decompressing the MPEG source. Our
compressed domain transcoding technique differs from previous work
because it uses a new technique that is optimized for software
implementation and because we compare the performance of a working
implementation of our compressed domain transcoder, instead of just
counting the number of multiplies needed to transcode. Our
experiments show that our compressed domain transcoder is 1.5
to 3 times faster than an optimized spatial domain transcoder,
and offers another benefit: a single parameter can improve the
speed of transcoding at the expense of the quality of the
resulting images. This speed/quality trade-off is important to
many real-time applications.
Acrobat (280K)
Gzipped postscript (281K)
CU-SeeMe VR: Immersive Desktop Teleconferencing
To appear in ACM Multimedia '96
Jefferson Han, Brian C. Smith
Abstract
Current video-conferencing systems provide a
video-in-a-window user interface. This paper
presents a video-conferencing application called CU-SeeMe VR that
provides a richer interface. CU-SeeMe VR is a distributed
video-conferencing system that allows users to connect to 3D worlds
and interact with other using live video and audio embedded in a
virtual space. This paper describes a prototype implementation of
CU-SeeMe VR, including the user interface, system architecture, and
a detailed look at the enabling technologies. Future directions
and metaphors for this space are discussed.
HTML version
Acrobat version (211K)
Compressed Domain Processing of JPEG-encoded Images
To appear in Real-Time Imaging Journal
Brian C. Smith, Lawrence A. Rowe, July, 1996
Abstract
This paper addresses the problem of processing motion-JPEG video
data in the compressed domain. The operations covered are those
where a pixel in the output image is an arbitrary linear
combination of pixels in the input image, which includes
convolution, scaling, rotation, translation, morphing,
de-interlacing, image composition, and transcoding. This paper
further develops an approximation technique called condensation to
improve performance and evaluates condensations in terms of
processing speed and image quality. Using condensation, motion-JPEG
video can be processed at near real-time rates on current
generation workstations.
Acrobat version (931K)
Massively Distributed Video File Server Simulation: Investigating
Intelligent Caching Schemes
Alexander Castro, C. Edward Lazzerini, Vivekananda Kolla
December, 1995
Abstract
This paper, the final report in
CS631,
a graduate multimedia systems course, presents the results of a
simulation study that compares the effectivesness of different caching
schemes within the DVFS architecture.
HTML version
Acrobat version (34K)
A Survey of Compressed Domain Processing Techniques
Reconnecting Science and Humanities in Digital Libraries,
University of Kentuky
Brian C. Smith, Oct 1995
Abstract
This short paper surveys current techniques for compressing
compressed multimedia data, including compressed audio, video, and
images.
HTML version
Acrobat version (160K)
A Resolution Independent Video Language
Presented at ACM Multimedia 95.
Jonathan Swartz, Brian C. Smith,
November, 1995
Abstract
As common as video processing is, programmers still implement video
programs as manipulations of arrays of pixels. This paper presents a
language extension called Rivl (pronounced "rival") where video is a
first class data type. Programs in Rivl use high level operators that
are independent of video resolution and format, increasing a program's
portability, simplifying code reuse, and reducing development time.
This paper also describes a Rivl interpreter and the strategies the
interpreter uses to optimize Rivl programs. These optimizations include
classical programming language optimizations, such as common
subexpression elimination and out of order execution, image and video
specific optimizations, such as computing only those images that will
affect the output, and an optimized memory manager.
HTML version
Acrobat version (822K)
Query By Humming -- Musical Information Retrieval in an Audio Database
Presented at ACM Multimedia 95.
Asif Ghias, Jonathan Logan, David Chamberlin, Brian C. Smith,
November, 1995
Abstract
The emergence of audio and video data types in databases will
require new information retrieval methods adapted to the specific
characteristics and needs of these data types. An effective and
natural way of querying a musical audio database is by humming the
tune of a song. In this paper, a system for querying an audio
database by humming is described along with a scheme for
representing the melodic information in a song as relative pitch
changes. Relevant difficulties involved with tracking pitch are
enumerated, along with the approach we followed, and the
performance results of system indicating its effectiveness are
presented.
HTML version
Acrobat version (82K)
Tcl-DP Name Server
Presented at the
1995 Tcl/Tk Workshop.
Peter T. Liu, Brian Smith, Lawrence Rowe
July, 1995
Abstract
This paper describes a general purpose name server for Tcl-DP. This
name server maintains host addresses and port numbers of services
running in a distributed en- vironment and allows clients to query
about them. It starts services on demand so services are guaranteed
to be available, and it provides a simple authentication protocol
for better security. The Tcl-DP name server is also designed to be
fault- tolerant. Multiple backup servers can be started on
different hosts, and a failover occurs when the main server goes
down. In addition, the name server provides mechanisms to interface
with external modules for extending its functionality.
Acrobat version (90K)
Tcl Distributed Programming
Presented at the 1993 Tcl/Tk Workshop.
Brian Smith, Lawrence A. Rowe, Stephen C. Yen,
May, 1993
Abstract
This paper describes Tcl-DP, an extension to Tcl/Tk
that introduces a suite of commands for creating client/server systems.
This extended abstract provides examples of using Tcl-DP, the Tcl-DP
RPC abstraction, and the Tcl-DP distributed object system.
Acrobat version (9K)
Tcl-DP Tutorial
Unpublished manuscript
Brian Smith, Lawrence A. Rowe, September, 1995
Abstract
This document provides a tutorial introduction to Tcl-DP.
HTML version
Acrobat version (105K)
Rivl Tutorial
Unpublished manuscript
Brian Smith, Jon Swartz, December 1995
Abstract
This document provides a tutorial introduction to Rivl.
HTML version
Acrobat version (632K)
Cyclic-UDP: A Priority-Driven Best-Effort Protocol
Unpublished manuscript
Brian C. Smith, May, 1994
Abstract
This paper describes Cyclic-UDP, a best-effort protocol for
audio and video file servers. Cyclic-UDP is the transmission protocol
used in CMT, the Continuous Media Toolkit.
Acrobat version (34K)
Experiments with a Lossless JPEG Codec
Master's Report
Kongji Huang, May, 1994
Abstract
This master's report describes a set of experiments conducted to
test the effectiveness of the lossless JPEG encoding method.
Acrobat version (712K)
