Publication detail

Komprimace obrazových signálů pomocí transformace 3D DCT

FRÝZA, T.

Original Title

Komprimace obrazových signálů pomocí transformace 3D DCT

English Title

Video Signal Compression Using 3D DCT Transform

Type

book

Language

Czech

Original Abstract

This dissertation deals with the possibility of compressing color video signals using the three-dimensional discrete cosine transform (3D DCT). The basic aim of all compression methods is to suppress redundancy within individual frames and also in time, between neighboring frames. The 3D DCT method encodes a group of frames at once and thereby merges both requirements into a single transform coding. The resulting 3D DCT coding chain is derived from the JPEG standard, which is intended for the compression of still images. The modifications that had to be made to the structure of the 3D DCT encoder and decoder are described in detail in the text. They chiefly concern the use of a multidimensional discrete cosine transform, a different way of quantising the frequency coefficients, and the development of new Huffman tables for entropy coding of the 3D DCT coefficients. The practical capabilities of the compression method were verified on a set of test video sequences whose content covers a wide range of applications. The 3D DCT method was found to achieve its best compression performance when coding scenes with slow motion and large areas of uniform color. For this category of scenes, a compression ratio exceeding 100 is not unusual. The primary application areas of the 3D DCT method are therefore video conferencing and video telephony.

English abstract

This thesis presents the possibilities of the three-dimensional discrete cosine transform (3D DCT) in the video compression domain. All video compression methods aim to remove redundancy in both the spatial and temporal dimensions. The 3D DCT combines these principles in a single transform coding. The proposed structure of the 3D DCT coder is based on the JPEG standard, which is intended for the compression of still pictures. The necessary modifications concern mainly the use of the three-dimensional transform, the quantisation of the frequency coefficients, and the code word dictionary used in entropy coding. The practical capabilities of the compression method were tested on several color video sequences, each representing a different type of visual scene, from static scenes to sequences with dynamic changes in the temporal dimension. The best compression properties of the 3D DCT were found to be obtained when the input video sequence contains slow motion accompanied by large areas of the same color. In that case, compression ratios higher than 100 can be reached repeatedly. The main application domain of the 3D DCT is therefore video conferencing and video telephony.

Keywords

Still images, moving images, video signal compression, JPEG, MPEG-1, MPEG-2, DCT, 2D DCT, 3D DCT, quantisation, entropy coding, MATLAB, C/C++, video sequence editing, DSP, TMS320C6000, Code Composer Studio

Key words in English

Images, video signals, compression techniques, JPEG, MPEG-1, MPEG-2, DCT, 2D DCT, 3D DCT, quantisation, entropy coding, MATLAB, C/C++, video sequence editing, DSP, TMS320C6000, Code Composer Studio

Authors

FRÝZA, T.

RIV year

2007

Released

3 September 2007

Publisher

Nakladatelství VUTIUM

Location

Brno

ISBN

978-80-214-3467-7

Edition

PhD Thesis

Edition number

1

Pages from

1

Pages to

32

Pages count

32

URL

http://www.vutium.vutbr.cz/

BibTeX

@book{BUT61911,
  author="Tomáš {Frýza}",
  title="Komprimace obrazových signálů pomocí transformace 3D DCT",
  year="2007",
  publisher="Nakladatelství VUTIUM",
  address="Brno",
  series="PhD Thesis",
  edition="1",
  pages="1--32",
  isbn="978-80-214-3467-7",
  url="http://www.vutium.vutbr.cz/"
}
